Delta Lake

31st August, 2021
Applies to: 9.x (current) versions, Article available also for: 10.x

Dataedo 9.3 added support for Delta Lake files. Dataedo scans Delta Lake file and builds a structure that includes:

  • Primitive data type fields,
  • Nested structs,
  • Arrays,
  • Maps

Each field contains:

  • Name,
  • Data type,
  • Nullability.

To add Delta Lake file:

  • right click on any database or Structures folder, choose Add Object, then Add/Import Structure, or
  • on main ribbon select Add Object then Structure/File, or
  • select Structures folder and on main ribbon select Add Structure/File.

Then select Import from file and Delta Lake format. To read the file, point to a Delta Lake catalog on the disk and click Next. This will scan the content and open Structure designer with a parsed structure. You can use this window to edit names, data types and field types and save with Save button.

If Delta Lake catalog contains partitions Dataedo will select partition with last modification date and show the name of partiton in Location box. Name of the documentation will remain the name of Delta Lake catalog.

Multiple partitions structures designer

If there are no partitions, Location will point to selected Delta Lake:

No partitions structures designer

Guide: Adding files to the catalog

Found issue with this article? Comment below
0
There are no comments. Click here to write the first comment.