The historic surge of interest in large language models (LLMs) since ChatGPT launched to the public late last year has made the topic inescapable. Not only is the technology improving at an unparalleled cadence, but companies are also building their own models like never before. Now, predictive models are underpinning mission-critical tasks, giving organizations a window into the future instead of just a review of the past, and helping them operate quicker and leaner.
On the cusp of this new computing revolution, we were eager to learn exactly where enterprises stand in this transformation, as well as the platforms and tools they're using to take advantage of it. By analyzing anonymized usage data from more than 9,000 global Databricks customers, we've compiled the 2023 State of Data + AI, a comprehensive look at organizations' data and AI initiatives.
Here's a glimpse at what we discovered:

- The hype around LLMs is real: From the end of November 2022 to the beginning of May 2023, the usage of SaaS LLMs, which are used to access models like those from OpenAI, grew 1,310% among Lakehouse customers. Transformer-related libraries like Hugging Face (an NLP toolkit and model hub), which are used to train homegrown LLMs and were in demand even before the launch of ChatGPT, grew 82% during the same time frame.

- Data transformation and integration are more vital than ever: The fastest-growing tools on Databricks are dbt (206% YoY) and Fivetran (181%). Of the 10 most popular data and AI products, six are data integration tools, including Informatica and Qlik, making data integration the fastest-growing market on the Databricks Lakehouse.

- Companies eye open source: Among the most popular data and AI products, Microsoft Power BI and Plotly reign above the rest. But organizations are showing a strong pull toward open technologies; 8 of the 10 most popular data and AI products are based on open source software, including dbt, Hugging Face and GeoPandas.

- Enterprises are doing more AI projects than ever before, and getting better at it: The number of models that are candidates for production (used in operations) grew 411% year-over-year, while the number of experimental projects grew 54%. Our data also shows that, on average, one in three experimental models is a candidate for real-world use, compared to one in five last year, suggesting organizations are getting better at building and scaling these projects.

- AI is growing, but don't forget traditional data analytics: Last year, Power BI was the most popular program running on top of the Lakehouse. The Lakehouse is increasingly being used for data warehousing, including serverless data warehousing with Databricks SQL, which grew 144% YoY.

While it's still early days, these emerging trends are sure to define the future of AI, and business leaders need to pay attention. It's never been clearer: the companies that harness the power of DS/ML will lead the next generation of data.
Download the full report here to learn more!