Purview

4th December, 2024

Azure Purview is a unified data governance solution that enables organizations to catalog, manage, and discover data across their cloud and on-premises environments. It provides tools for organizing and tracking data assets like databases, tables, and more, helping users understand and control their data landscape effectively. Starting with Dataedo 24.4, we now support an Azure Purview connector, allowing users to import single databases stored within Purview, streamlining data cataloging and enhancing data governance.

Documenting Azure Purview

Dataedo imports following Purview elements:

Purview Dataedo
Asset Object
Technical Metadata Fields + Custom Fields

Data Lineage

  • Purview connector builds lineage within single import (it won't create lineage to other technologies/Purview imports).
  • Purview connector builds Object-level lineage

Connecting to Purview

Add new connection

To connect to Purview instance create new documentation by clicking Add and choosing New connection.

On the connection screen choose Purview (it can be found under Catalogs Folder).

Purview connection details

Connection details:

  • Sign in - Authenticate with your Azure Account
  • Subscription - Azure subscription assigned to your Purview instance.
  • Resource group - Resource group where your Purview instance sits in
  • Account name - Account which has access to Purview

Setting Service and Objects Types

In order to correctly import Purview, Dataedo needs to know three things:

  • Service - the technology you would like to import e.g. Hive, HBase
  • Object Types - which object from picked Service is the main one e.g. Hive Database (Due to graph structure of Purview, Dataedo needs information which object type should be treated as Database/Storage)
  • Database/Storage - object instance of picked object type the user would like to import e.g. Database AdventureWorks

After setting this up, Purview connection is ready to begin

Mapping Purview Service to Dataedo

This is crucial step in importing Purview in Dataedo. It allows users to do the following:

  • Pick Dataedo object type for every Purview asset type imported
  • Select whether to import certain Purview assets
  • Map attributes

Object Mapping

Mapping attributes

Mapping attributes is a separate step which can be accessed by clicking Map attributes. In this form you can browse through Purview attributes and set Dataedo attribute/field it should be loaded to. You can also create new Custom Fields and use them for mapping.

Mapping attributes

Specifications

Imported objects

Imported Editable
Assets
Technical properties

Supported features

Feature Is supported
Writing changes back
Data Profiling
CMD Import
PK/FK relationship tester
Linked Sources

Data Lineage

Source Method Version
Internal lineage (object-level) SDK 24.4 (2024)

Plans for future releases

  • automatic mapping (predefined mapping for Purview services)
  • import user defined properties
  • import multiple services at once
  • lineage improvements, lineage cross multiple Purview documentations