Arenadata DB

Arenadata DB (ADB) is a distributed analytical data storage built on the open-source massively parallel Greenplum DBMS and designed to store and process large amounts of structured and semi-structured data (up to tens of petabytes).

TOP-10 popular articles

Partitioning is a way to increase query performance by logical dividing large tables (such as fact tables) into smaller, more manageable parts called partitions. Due to partitioning query optimizers can read the limited number of table rows (based on predicate conditions) instead of scanning all the table contents each time.

The article describes two ways to install ADB via ADCM: onine (on hosts with access to Internet) and offline (without Internet).

ADB 6 release notes. Learn about new features, improvements, bug fixes, etc.

Learn about software requirements for the Arenadata DB cluster installation.

Distribution is one of the most important concepts underlying ADB. It means storing table rows on different segments of the cluster. The more uniform the distribution of the data between segments is — the higher the whole cluster performance is.

Arenadata DB Backup Manager (ADBM) is a fault-tolerant system for ADB binary backups management, which is built on the base of pgBackRest. Learn about ADBM features, architecture, and some internal details.

Spill files (also known as work files) are created on disks if there is not enough memory to store temporary data when executing data queries. This temporary data may include JOIN hashes, sorting results, redistributed tables, etc.

ADB supports several types of tables with different data storage models: heap, append-optimized row-oriented, append-optimized column-oriented. The article describes features and possible use cases for each table type.

This article describes the parameters that can be configured for ADB services via ADCM.

Learn about hardware requirements (CPU, RAM, disk) for installation of ADB, ADB Control, ADBM.

Found a mistake? Seleсt text and press Ctrl+Enter to report it