Data Lake Cartoons

Data Warehouse vs Data Lake #2: The gatekeeper

Any data can enter a Data Lake. No dress code. It can be out of shape, in any state. It doesn't have to meet any standards. It can be dirty. No one cares.

Data warehouses, on the other hand, are very selective about who (what) can enter, and in what shape. All visitors need to meet rigorous standards. The opening hours are also precisely scheduled. It's an exclusive club.
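The gatekeeper idea can be sketched in a few lines of Python. This is only an illustration, not how any particular product works: the schema, the field names, and the functions `lake_ingest` and `warehouse_ingest` are all made up for the example.

```python
from datetime import date

# Hypothetical warehouse schema: column name -> required Python type.
WAREHOUSE_SCHEMA = {"order_id": int, "amount": float, "order_date": date}

lake = []       # the lake keeps whatever shows up
warehouse = []  # the warehouse keeps only conforming rows


def lake_ingest(record):
    """Store the record as-is, whatever its shape."""
    lake.append(record)


def warehouse_ingest(record):
    """Admit the record only if its fields and types match the schema."""
    if not isinstance(record, dict) or set(record) != set(WAREHOUSE_SCHEMA):
        raise ValueError(f"rejected: wrong fields in {record!r}")
    for column, expected_type in WAREHOUSE_SCHEMA.items():
        if not isinstance(record[column], expected_type):
            raise ValueError(f"rejected: {column} is not {expected_type.__name__}")
    warehouse.append(record)


# Made-up sample visitors: one well dressed, two not.
clean = {"order_id": 1, "amount": 19.99, "order_date": date(2024, 1, 15)}
dirty = {"id": "1", "amt": "19,99 EUR"}

for record in (clean, dirty, "free text log line"):
    lake_ingest(record)
    try:
        warehouse_ingest(record)
    except ValueError as error:
        print(error)
```

All three records end up in the lake, while only the clean one gets past the warehouse door.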

Data Warehouse vs Data Lake #1: The wardrobe

A Data Warehouse requires you to keep your data in a particular order.

No rules for Data Lakes.

At the lake

Data lakes are a convenient way to store data. No need for design, no constraints. You just dump the data. The hard part comes when you try to find it.

Data Swamp

A Data Lake is a repository of data stored in raw format (CSV, JSON, XML, text, binary, documents, etc.), in contrast to a traditional (relational) database, which enforces a strict predefined schema (where data is arranged in tables and columns).
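To make the contrast concrete, here is a minimal sketch using Python's built-in `sqlite3` and `json` modules. The `customers` table, the sample rows, and the helper `read_json_customers` are assumptions made for the illustration. An in-memory list stands in for the lake's raw file storage.

```python
import json
import sqlite3

# Relational side: the schema is declared up front and enforced on write.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customers ("
    " id INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL,"
    " age INTEGER"
    ")"
)
conn.execute("INSERT INTO customers (id, name, age) VALUES (1, 'Ada', 36)")

try:
    # A row without a name violates the NOT NULL constraint.
    conn.execute("INSERT INTO customers (id, name, age) VALUES (2, NULL, 41)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

# Lake side: raw items are stored untouched, in whatever format they arrive.
raw_lake = [
    '{"id": 1, "name": "Ada", "age": 36}',  # JSON
    "2,Grace,41",                            # CSV
    "<customer><id>3</id></customer>",      # XML
]


def read_json_customers(raw_items):
    """Apply structure at read time: keep JSON objects, skip everything else."""
    parsed = []
    for item in raw_items:
        try:
            value = json.loads(item)
        except json.JSONDecodeError:
            continue
        if isinstance(value, dict):
            parsed.append(value)
    return parsed


print(rejected)                       # the database refused the bad row
print(read_json_customers(raw_lake))  # the lake kept everything; the reader sorts it out
```

The database rejects a bad row at the moment of writing. The lake accepts every item, and the work of making sense of the formats moves to whoever reads the data later.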

A Data Swamp is a Data Lake that is so messy that it is unusable: you can no longer find your data or get value from it.
