Databricks Unity Catalog connector

14th February, 2024
Applies to: Dataedo 23.x versions, Article available also for: 24.x (current)
You are looking at documentation for an older release.
Switch to the documentation for Dataedo 24.x (current).

Overview

Databricks is a data processing cloud-based platform. It simplifies collaboration of data analysts, data engineers, and data scientists. Databricks is available in Microsoft Azure, Amazon Web Services, and Google Cloud Platform.

Dataedo will connect to single catalog Unity Catalog via API, and document objects and data lineage within the connected catalog,

Connector features

Data Source Support Schema Lineage Profiling Classification Export comments FK tester DDL import
Databricks Unity Catalog Native Object Level NA NA

Data Catalog

Dataedo will document following objects and their respective properties from Databricks:

Object Name Metadata Lineage
Delta Live Tables
Pipelines Limited
Tables
Views
Columns
External Tables

Objects Properties Configuration & Support

Documentation is created for selected Unity Catalog. If there is a need want to connect multiple catalogs,

Image title

Known Limitations