Extending Databricks Unity Catalog with an Open Apache Hive Metastore API



Immediately, we’re excited to announce the preview of a Hive Metastore (HMS) interface for Databricks Unity Catalog, which permits any software program suitable with Apache Hive to connect with Unity Catalog. Apache Hive is essentially the most extensively supported catalog interface within the {industry}, usable in just about each main computing platform. This characteristic lets organizations centralize their knowledge administration, discovery and governance in Unity Catalog and connect with it from a variety of computing platforms, together with Amazon Elastic MapReduce (EMR), open supply Apache Spark, Amazon Athena, Presto, Trino, and others. It additionally ensures constant knowledge governance throughout these platforms.

To affix this preview, contact your Databricks consultant.

Hive Metastore interface for Unity Catalog
Hive Metastore interface for Unity Catalog

In right this moment’s quickly evolving knowledge administration panorama, many organizations run a number of computing platforms and face the problem of constantly implementing knowledge discovery and governance throughout them. This typically requires knowledge groups to juggle a number of knowledge catalogs and governance instruments, resulting in excessive operational overhead and difficulties in knowledge discovery, entry administration, and auditing.

Databricks Unity Catalog is a unified governance resolution for knowledge, analytics and AI with easy options to find knowledge, handle permissions, audit accesses, observe knowledge lineage and high quality, and share knowledge throughout organizations. With the HMS interface, now you can join any software program that helps the industry-standard Apache Hive API to Unity, tremendously simplifying governance throughout computing platforms.

On this weblog, we are going to discover the benefits of this characteristic and the way it can improve your knowledge administration practices.

Why we constructed a Hive Metastore interface for Unity Catalog


In a various knowledge ecosystem, openness performs an important function in seamless knowledge integration and collaboration. Apache Hive is essentially the most extensively supported catalog API within the {industry}. Unity Catalog’s open HMS interface aligns with our open lakehouse platform technique and offers a unified and standardized method for accessing enterprise knowledge and avoiding vendor lock-in. Through the use of Unity Catalog with this open interface, organizations can simplify their knowledge administration structure and guarantee it could possibly help each present and future instruments.

Constant Knowledge Governance

Managing knowledge throughout a number of platforms poses a big problem in sustaining constant governance. The HMS interface in Unity Catalog successfully tackles this problem, by extending enterprise-grade governance offered by Unity Catalog to a various vary of compute platforms. This integration ensures constant knowledge compliance, enhances safety measures, enforces sturdy entry controls, and facilitates centralized auditing.

Simple Path to Modernize Legacy Workloads

For purchasers who’ve lengthy standing legacy workloads working on completely different computing platforms, the HMS interface integration allows them to centralize metadata and entry controls in Unity Catalog, encompassing each their legacy and Databricks workloads. This integration gives an easy path for migrating these workloads working on platforms reminiscent of Amazon EMR, whereas making certain consistency all through the migration course of.

Price Optimization

The HMS interface for Unity Catalog brings vital value optimization advantages to organizations. Historically, organizations needed to make investments vital time and assets into managing a number of catalogs, which not solely incurred extra prices but additionally launched complexity and potential inconsistencies. Syncing knowledge and insurance policies between a number of catalogs is fragile and error-prone. This integration eliminates the necessity to handle, keep, and synchronize separate knowledge catalogs.


The HMS interface for Unity Catalog brings collectively the perfect of each worlds—open interoperability and enterprise-grade governance. By connecting Unity Catalog with numerous compute platforms, organizations can unlock enhanced knowledge accessibility, improved governance, scalability, value optimization, interoperability, and future-proofing, and get rid of the excessive operational value of managing a number of knowledge platforms right this moment. This lets organizations deal with profiting from their knowledge belongings as an alternative.

Contact your Databricks Consultant to hitch this thrilling preview!

Additionally, don’t miss extra thrilling updates about knowledge and AI governance and register for our periods on the Knowledge and AI Summit.