Data Glossary


    What is a Database Management System (DBMS)?

    Piotr Kononow - Dataedo Team, 2018-10-04 2018-10-08

    A Database Management System (DBMS) is software that helps you define, create, manage and control databases.

    It's like the guy (or entire staff) in the office who stores, sorts, archives, finds and presents your company's data. And you're the boss: using a query language such as SQL, either directly or through an application, BI tool or other software, you can ask it to store data or to return data, reports or analytics.

    Characters of the movie Office Space (1999) - Peter (left) and his boss Bill (right).

    Functions of DBMS

    DBMSs are widely used because they offer a number of useful functions:

    1. Definition of data models (tables, columns, relationships etc.),
    2. Access control,
    3. Handling CRUD operations (Create, Read, Update, Delete),
    4. Ensuring data consistency,
    5. Query access through an interface (e.g. SQL),
    6. Handling concurrent users,
    7. Providing and maintaining a data dictionary,
    8. Transaction management - making sure that either all operations in a transaction complete or none of them are applied.
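
    Several of these functions can be seen in a few lines of code. The sketch below uses SQLite through Python's standard sqlite3 module; the account table, its columns and the transfer and balances helpers are hypothetical names chosen for illustration. It shows data definition, CRUD operations, a consistency rule and an all-or-nothing transaction.

```python
import sqlite3

# In-memory SQLite database; table and column names are illustrative only.
conn = sqlite3.connect(":memory:")

# Data definition: a table with typed columns and a consistency rule.
conn.execute(
    "CREATE TABLE account ("
    " id INTEGER PRIMARY KEY,"
    " owner TEXT NOT NULL,"
    " balance INTEGER NOT NULL CHECK (balance >= 0))"
)

# CRUD: create, update and delete rows (reads happen in balances() below).
with conn:
    conn.execute("INSERT INTO account (id, owner, balance) VALUES (1, 'Peter', 100)")
    conn.execute("INSERT INTO account (id, owner, balance) VALUES (2, 'Bill', 50)")
    conn.execute("INSERT INTO account (id, owner, balance) VALUES (3, 'Temp', 0)")
    conn.execute("UPDATE account SET owner = 'Bill L.' WHERE id = 2")
    conn.execute("DELETE FROM account WHERE id = 3")


def transfer(conn, source_id, target_id, amount):
    """Move money between accounts; both updates apply or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE id = ?",
                (amount, target_id),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?",
                (amount, source_id),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn):
    """Read the current balance of every account, keyed by account id."""
    return dict(conn.execute("SELECT id, balance FROM account ORDER BY id"))


# An overdraft violates the CHECK constraint, so the whole transfer is rolled back.
print(transfer(conn, 1, 2, 500))
print(balances(conn))
```

    In the failed transfer, the first UPDATE has already credited the target account when the second one violates the balance rule. The rollback undoes that credit, which is what "either all or none" means in practice.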

    Types of DBMS

    There are hundreds of DBMS products out there, and you can group them into different categories.

    Data models

    Depending on the data model:

    1. Relational DBMS (SQL databases) store data organized in predefined tables and columns with strict data types and sizes.
    2. Document DBMS store data in JSON documents, which are hierarchical sets of key-value pairs.
    3. Key-value stores store data in simple key-value pairs (in the simplest case both are plain strings), meaning that you can store a value under a certain label.
    4. Graph DBMS represent data as networks built from nodes and relationships.
    5. Wide column stores organize data in columns and can handle large numbers of columns without a negative impact on storage (empty cells don't take up space).
    6. Time series DBMS are databases optimized for handling time series data - arrays of numbers indexed by time.
    7. Object-oriented DBMS represent data as objects, with their properties and relationships as defined in object-oriented programming, rather than as tables, columns and foreign keys.

    The first one (relational DBMS) is commonly referred to as SQL, and the rest of them as NoSQL.
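
    To make the differences more concrete, here is a rough sketch of how the same small piece of information could be shaped under four of these data models. It uses plain Python structures rather than any real DBMS, and the employee and department data and the helper function names are made up for illustration.

```python
import json

# Hypothetical example data: one employee and the department they work in.

# Relational: rows in predefined tables, linked by a foreign key.
department_table = [{"id": 10, "name": "Engineering"}]
employee_table = [{"id": 1, "name": "Peter", "department_id": 10}]

# Document: one hierarchical JSON document with nested key-value pairs.
employee_document = json.loads(
    '{"id": 1, "name": "Peter", "department": {"id": 10, "name": "Engineering"}}'
)

# Key-value: a value stored under a label (here both are plain strings).
key_value_store = {
    "employee:1:name": "Peter",
    "employee:1:department": "Engineering",
}

# Graph: nodes plus the relationships that connect them.
nodes = {
    "e1": {"label": "Employee", "name": "Peter"},
    "d10": {"label": "Department", "name": "Engineering"},
}
relationships = [("e1", "WORKS_IN", "d10")]


def department_relational(employee_id):
    """Follow the foreign key from the employee row to the department row."""
    employee = next(e for e in employee_table if e["id"] == employee_id)
    department = next(
        d for d in department_table if d["id"] == employee["department_id"]
    )
    return department["name"]


def department_graph(node_id):
    """Follow the WORKS_IN relationship from an employee node."""
    target = next(
        t for s, rel, t in relationships if s == node_id and rel == "WORKS_IN"
    )
    return nodes[target]["name"]


print(department_relational(1))
print(employee_document["department"]["name"])
print(key_value_store["employee:1:department"])
print(department_graph("e1"))
```

    All four lookups return the same department name. What changes is how the data is laid out and how you navigate it: a join on a key, a path into a nested document, a direct lookup by label, or a walk along a relationship.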

    Architectures

    You can also classify DBMS depending on the architecture:

    1. Client-server database is a database that runs as a separate server process, which client applications connect to, typically over a network.
    2. In-memory database is a database that keeps its data in RAM rather than on disk storage.
    3. Embedded database is a database that is embedded into an application (e.g. a mobile application) rather than run as a standalone service.
    4. Cloud database is a database available in private or public cloud infrastructure.
    5. Distributed database is a database whose storage is located on disks of separate computers in a network.
    6. Column-oriented database is a database that stores data physically organized by column, rather than by row as in regular databases. This can significantly improve the performance of analytical queries.
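
    The difference between row and column orientation is easier to see with a small example. The sketch below is only an illustration in plain Python, not how any particular DBMS lays out its files; the order data and function names are hypothetical.

```python
# Hypothetical order data used only for illustration.
rows = [
    {"order_id": 1, "customer": "Peter", "amount": 120},
    {"order_id": 2, "customer": "Bill", "amount": 80},
    {"order_id": 3, "customer": "Peter", "amount": 200},
]

# Row-oriented layout: all fields of one record are stored together.
row_store = [tuple(r.values()) for r in rows]

# Column-oriented layout: all values of one column are stored together.
column_store = {name: [r[name] for r in rows] for name in rows[0]}


def total_amount_row_store(store):
    """Visit every full record, even though only one field is needed."""
    return sum(record[2] for record in store)


def total_amount_column_store(store):
    """Read only the single column that the query needs."""
    return sum(store["amount"])


print(total_amount_row_store(row_store))
print(total_amount_column_store(column_store))
```

    Both functions return the same total. An analytical query such as a sum over one column only has to touch that column in the column-oriented layout, which is where the performance benefit for analytics comes from.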

    Popular DBMS

    Relational DBMS

    1. SQL Server
    2. MySQL
    3. Oracle
    4. PostgreSQL
    5. IBM Db2

    Document DBMS

    1. MongoDB
    2. DynamoDB
    3. Couchbase
    4. CouchDB

    Key-value stores

    1. Redis
    2. Berkeley DB

    Graph DBMS

    1. Neo4j
    2. Apache Giraph

    Wide column stores

    1. Apache Cassandra
    2. Apache HBase
    3. Google Bigtable

    Embedded databases

    1. SQLite
    2. Oracle Berkeley DB
    3. SAP SQL Anywhere

    In-memory databases

    1. Redis
    2. SAP HANA
    3. Memcached