Databricks, the Data and AI company, is introducing DBRX, a general-purpose large language model (LLM) that enables organizations around the world to cost-effectively build, train, and serve their own custom LLMs.
DBRX democratizes the training and tuning of custom, high-performing LLMs for every enterprise so they no longer need to rely on a small handful of closed models, according to the company.
"At Databricks, our vision has always been to democratize data and AI. We're doing that by delivering data intelligence to every enterprise—helping them understand and use their private data to build their own AI systems. DBRX is the result of that aim," said Ali Ghodsi, co-founder and CEO at Databricks. "We're excited about DBRX for three key reasons: first, it beats open source models on state-of-the-art industry benchmarks. Second, it beats GPT-3.5 on most benchmarks, which should accelerate the trend we're seeing across our customer base as organizations replace proprietary models with open source models. Finally, DBRX uses a mixture-of-experts architecture, making the model extremely fast in terms of tokens per second, as well as being cost effective to serve. All in all, DBRX is setting a new standard for open source LLMs—it gives enterprises a platform to build customized reasoning capabilities based on their own data."
According to Databricks, DBRX outperforms existing open source LLMs like Llama 2 70B and Mixtral-8x7B on standard industry benchmarks, such as language understanding, programming, math, and logic. DBRX also outperforms GPT-3.5 on relevant benchmarks.
DBRX was developed by Mosaic AI and trained on NVIDIA DGX Cloud. Databricks optimized DBRX for efficiency with a mixture-of-experts (MoE) architecture, built on the MegaBlocks open source project. The resulting model has leading performance and is up to twice as compute-efficient as other available leading LLMs, as per the company.
Paired with Databricks Mosaic AI’s unified tooling, DBRX helps customers rapidly build and deploy production-quality generative AI applications that are safe, accurate, and governed without giving up control of their data and intellectual property.
Customers benefit from built-in data management, governance, lineage, and monitoring capabilities on the Databricks Data Intelligence Platform, according to Databricks.
DBRX is available on GitHub and Hugging Face for research and commercial use. It is also available on the Databricks Platform, where enterprises can interact with DBRX, leverage its long context abilities in retrieval augmented generation (RAG) systems, and build custom DBRX models on their own unique data. DBRX is also available on AWS and Google Cloud, as well as directly on Microsoft Azure through Azure Databricks.
DBRX is also expected to be available through the NVIDIA API Catalog and supported on the NVIDIA NIM inference microservice.
For more information about this news, visit www.databricks.com.