Arenadata DB

Arenadata DB (ADB) is an open-source, massively parallel relational DBMS that is based on PostgreSQL and designed for columnar storage with flexible horizontal scalability. Thanks to its architecture and powerful query optimizer, ADB offers high reliability and fast SQL query processing on large data volumes, so it is widely used for Big Data analytics on an industrial scale.

To simplify operation and support practical tasks of any complexity, Arenadata DB comes with additional tools for integration with external data stores, binary backup management, and real-time query monitoring. With this functionality, users can build solutions that cover all processes related to business system maintenance.

Use cases

Advanced data analytics

The advanced analytics provided by ADB is used across many industries, including finance, manufacturing, automotive, government, energy, education, and retail, to address a wide variety of problems.

Arenadata DB analytics capabilities include analyzing a multitude of data types, leveraging existing SQL knowledge, and training more models in less time by using the MPP architecture.

Additionally, ADB provides in-database analytics, which lets you run analytics directly in the database instead of exporting your data and processing it in an external analytics engine.
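The difference between exporting data and analyzing it in the database can be illustrated with a small sketch. It is not ADB-specific: Python's built-in sqlite3 module is used only as a local stand-in so the example runs anywhere, and the sales table and its columns are hypothetical. Against ADB, the same SQL would typically be sent through a PostgreSQL-compatible driver.

```python
import sqlite3

# sqlite3 is only a stand-in database for illustration; the table is hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("north", 10.0), ("north", 30.0), ("south", 5.0)],
)

# Approach 1: export every row and aggregate on the client side.
rows = conn.execute("SELECT region, amount FROM sales").fetchall()
client_totals = {}
for region, amount in rows:
    client_totals[region] = client_totals.get(region, 0.0) + amount

# Approach 2: let the database aggregate and return one row per group.
db_totals = dict(
    conn.execute("SELECT region, SUM(amount) FROM sales GROUP BY region").fetchall()
)

# Both approaches give the same totals, but the second transfers fewer rows.
print(client_totals == db_totals, len(rows), len(db_totals))
```

In the second approach only the aggregated rows leave the database, which is the point of in-database analytics on large tables.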

Machine learning

Arenadata DB is an excellent database for machine learning – the study of computer algorithms that improve automatically through experience. Apache MADlib is an open-source, SQL-based machine learning library that runs in-database on ADB, as well as on PostgreSQL.

This combination helps to improve the parallelism, scalability, and predictive accuracy of a machine learning deployment. Data transformation and feature engineering capabilities are also available through MADlib for machine learning, including descriptive and inferential statistics, pivoting, sessionization, and categorical variable encoding.
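As a rough illustration of what SQL-based, in-database training looks like, the sketch below builds a call to MADlib's linregr_train function, which takes a source table, an output table, a dependent variable, and an array expression of independent variables. The table and column names (houses, price, tax, bath, size) are placeholders, and the Python helper itself is hypothetical. It only assembles the statement; running it requires a database with MADlib installed.

```python
def linregr_train_sql(source_table, out_table, dependent, independents):
    """Build a SQL call to MADlib's linear regression training function.

    Identifiers are interpolated without escaping, so this sketch is only
    suitable for trusted, hard-coded names.
    """
    # MADlib takes the features as a SQL array expression; the leading 1
    # adds an intercept term to the model.
    features = "ARRAY[1, " + ", ".join(independents) + "]"
    return (
        "SELECT madlib.linregr_train("
        f"'{source_table}', '{out_table}', '{dependent}', '{features}');"
    )


# Placeholder table and column names, for illustration only.
sql = linregr_train_sql("houses", "houses_model", "price", ["tax", "bath", "size"])
print(sql)
```

The resulting statement would be executed inside the database, so the training data never has to leave the cluster.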

Artificial intelligence

Because ADB can ingest large volumes of data at high speed, it is a powerful tool for smart applications that need to respond intelligently across a practically unlimited number of unique scenarios.

For example, a telecom company may use Arenadata DB AI capabilities in IoT (Internet of Things) systems with smart sensors to analyze and process events for maintenance, security, and operational efficiency purposes.

Enterprise

Community

Core Greenplum functionality

gpbackup/gprestore

PXF

Deploy & upgrade automation

Monitoring & Alerting

Offline installation

WAL Backup management

x86

Technical support 24/7

Corporate training courses

Tailored solutions

Available integrations

ADQM

ADS

ADPG

Kafka

Oracle

HBase

HDFS

JDBC

Hive

Operating systems

Alt Linux

CentOS

RedHat

Astra Linux

Ubuntu

RedOS

Features

Performance

ADB can scale horizontally without degrading query performance on petabytes of data

Safety

Built-in audit of user actions on a cluster: authentication, LDAP configuration, resource group configuration

Reliability

Mirroring, safe backup management, ddboost plugin for gpbackup/gprestore utilities

Convenience

Flexible deployment and configuration, upgrades with tested binaries and migrations for all the components

Contribution

Our team is one of the main Greenplum contributors. In addition, we maintain our own documentation and keep it up to date.

ADB Control

Arenadata DB query monitoring system

It is designed for in-depth analysis of how commands and utilities that work with ADB clusters are executed.

Monitoring is based on real-time information about query-level resource consumption and the progress of query plan execution. Additionally, it is possible to monitor the execution of queries in the context of transactions.

The monitoring system has a convenient user interface that lets you connect several Arenadata DB clusters, collect statistics, view them in graphical form, and export metrics.

Arenadata DB Backup Manager

Service for ADB binary backup management

The main feature is the asynchronous launch of binary backups on a running cluster.

There is a user interface built into ADB Control, from which you can work with several ADB clusters and for each of them:

configure backup schedules;
manage backup configurations;
create backups of different types (full, incremental, differential) on-demand;
restore cluster databases from existing backups;
perform audit of actions related to backups.

Connectors

ADB Spark Connector

Multifunctional connector with support for parallel read/write operations between Apache Spark and Arenadata DB. Based on it, you can easily build ETL solutions and perform in-memory data analysis.

Provides a flexible configuration and many features:

high data transmission speed;
automatic data schema generation;
flexible partitioning;
support for push-down operators;
support for batch operations.

ADB Kafka Connector

Special connector for Apache Kafka integration with Arenadata DB.

Features:

ability to read and write AVRO data from Kafka topics;
support for CSV and text formats in data read operations;
support for transactions when writing data to Arenadata DB.

ADB PXF Connector

Framework for parallel and high performance access to heterogeneous data sources from Arenadata DB based on built-in connectors.

The data is accessed through the mechanism of external tables, which allows you to build complex federated queries.

To connect external data storages, the following connectors are provided: JDBC, S3, Hive, HDFS, and HBase. Authentication may include Kerberos and/or SSL.
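As a minimal sketch of the external table mechanism, the helper below assembles a CREATE EXTERNAL TABLE statement that uses the pxf:// protocol in the form found in Greenplum PXF. The helper function, the table name, the columns, and the HDFS path are hypothetical, and the hdfs:text profile with the default PXF server is assumed. It only builds the DDL text and does not connect to a cluster.

```python
def pxf_external_table_ddl(table, columns, path, profile="hdfs:text", delimiter=","):
    """Build DDL for a readable external table backed by PXF.

    `columns` is a list of (name, sql_type) pairs. Values are interpolated
    without escaping, so use trusted, hard-coded names only.
    """
    cols = ", ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols})\n"
        f"LOCATION ('pxf://{path}?PROFILE={profile}')\n"
        f"FORMAT 'TEXT' (delimiter='{delimiter}');"
    )


# Hypothetical table, columns, and file path, for illustration only.
ddl = pxf_external_table_ddl(
    "ext_sales",
    [("id", "int"), ("amount", "numeric")],
    "data/sales.csv",
)
print(ddl)
```

Once such a table exists, it can be joined with regular ADB tables in a single query, which is what makes federated queries possible.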

ADB ClickHouse Connector

FDW connector for data transmission from Arenadata DB to Arenadata QuickMarts or ClickHouse.

Features:

transactionally load data by automatic creation of staging tables;
use multiple table engine families in ClickHouse;
flexibly distribute and parallelize the write load.

ADB to ADB Connector

Provides two-way interaction between ADB clusters. The connector is implemented on the basis of a foreign data wrapper and parallel retrieve cursors.

ADB to ADB Connector has the following features:

segments of a local cluster have the ability to select data directly from segments of a remote cluster in a parallel mode;
transactional data insertion from local to remote clusters in the "master-master" mode is available;
ADB to ADB Connector supports the ability to calculate the number of query executors (QE) automatically.

ADB 7 based on Greenplum 7

New Community version

With powerful enhancements and advanced capabilities, ADB 7 opens the door for a new era of Big Data analytics. The product update enhances data reliability, security, and control by giving organizations tools for quick and effective analysis. Companies can enhance their analytical capabilities and accelerate decision-making with the help of ADB 7, which offers new levels of performance and ease of management.

In addition, this version is perfect for those who are looking for a reliable and high-performance open-source solution. Join our community to access new capabilities for effective data management and analysis.

ADB 7 Features

Modern core based on PostgreSQL 12

ADB 7 is based on PostgreSQL 12, which improves compatibility and gives access to the features introduced in that PostgreSQL version.

Significant performance increase

The introduction of JIT compilation, support for new index types, improvements to AO/CO table indexing, the addition of optimizer hints and parameterized queries, as well as other improvements in GPORCA provide a significant increase in performance.

Advanced administration and resource management tools

The updated cluster management capabilities make the administration process more flexible and efficient, allowing you to better manage your resources and provide smooth operation even under high load.

Increased security measures

Access management and data protection tools in ADB 7 have been supplemented by enhanced access control, row-level security, and built-in audit mechanisms, which makes the system more secure.

Enhanced integration

Integration with different systems is improved by additional tools for foreign tables (FDW), resulting in more flexible and efficient interaction with external data sources.

Improved cloud support

ADB 7 offers enhanced cloud support, including optimization for clouds and hybrid infrastructure, which provides greater flexibility and scalability for modern enterprise applications.

Product comparison

Arenadata DB

Compare with

VMware Tanzu Greenplum

Teradata

Vertica

Oracle Autonomous Data Warehouse

Infrastructure

Management system, including easy installation, update, and upgrade

Yes

Cluster expansion

Yes

Only CLI.

Yes

Monitoring system

Yes

Backup/DR system with UI

Yes

IT landscape support

Ability to deploy various combinations of bare metal, cloud

Yes

PaaS support for cloud providers

VK Cloud;

Cloud.ru (in development).

AWS;

Azure;

Google Cloud Platform.

AWS;

Azure;

Google Cloud Platform.

AWS;

Azure;

Google Cloud Platform.

Oracle Cloud.

Domestic OS support

Alt Linux

Yes

Astra Linux

Yes

RED OS

Yes

Ubuntu

Yes

Features

Physical data backup/recovery

Yes

Cluster monitoring

Yes

Query monitoring

Yes

Possible performance degradation.

Yes

Integrations

Kafka

Yes

ADB Kafka Connector.

Yes

Possible performance degradation.

Yes

At extra charge (not available in Russia).

ClickHouse

Yes

ADB ClickHouse Connector.

Yes

PXF only, manual integration.

Yes

JDBC only.

Yes

JDBC only.

Yes

At extra charge, manual integration (not available in Russia).

Hadoop

Yes

ADB PXF Connector.

Yes

At extra charge (not available in Russia).

Spark

Yes

ADB Spark Connector.

Yes

JDBC only.

Yes

At extra charge (not available in Russia).

Cluster-to-cluster

Yes

ADB to ADB Connector.

Yes

At extra charge (not available in Russia).

Security settings

SSL between client and server

Yes

Synchronization with LDAP

Yes

External secret storage

In development

Yes

Additionally

Technical support 24x7

Yes

(not available in Russia).

Yes

(not available in Russia).

Fastest release of bug fixes, new features, and optimization

Yes

Training/workshops

Yes

(not available in Russia).

Yes

(not available in Russia).

Community version

Yes

With limitations.

Documentation in Russian

Yes

Registration in the register of domestic software

Yes

FSTEC certification

Yes

ADB Control & Backup Manager

Compare with

Tanzu Greenplum Command Center

Teradata Viewpoint

Oracle Enterprise Manager

Infrastructure

Management system

Arenadata Cluster Manager (ADCM)

Teradata Vantage

Oracle Enterprise Manager Cloud Control

Centralized upgrade

Yes

Via ADCM.

Yes

IT landscape support

Ability to deploy various combinations of bare metal, cloud

Yes

Domestic OS support

Alt Linux

Yes

Astra Linux

Yes

RED OS

Yes

Features

Integration with other products

ADB

Tanzu Greenplum

Teradata

Oracle

User interface with role-based access control

Yes

Work with multiple clusters of different types (with/without Standby and mirroring) via UI

Yes

Ability to load monitoring metrics to an external database via UI

Yes

Monitoring of SQL queries and transactions. Ability to view queries history via UI

Yes

Ability to track plans and execution of SQL queries via UI

Yes

Resource prioritization via UI

Yes

Launch of binary backups on a running cluster via UI

Yes

Backup management via UI: configure, launch on schedule and manually, view, delete, create restore points

Yes

Support for S3 and POSIX-compatible backups

Yes

Disaster recovery

Yes (asynchronous)

On-premises

Yes

Security settings

SSL encryption

Yes

Via ADCM.

Yes

Standard access separation based on Role-Based Access Control

Yes

Logging of user and system actions

Yes

Additionally

Technical support 24x7

Yes

Not available for Russia

Training/workshops

Yes

Full training on working with Arenadata products.

Not available for Russia

Community version

Documentation

Yes

Registration in the register of domestic software

Yes

Successful deployments

Yes

Release history with descriptions

Yes

Complete release history with service versions and description of the upgraded functionality is available in the open domain.

Yes

The “Product comparison” section is current as of 14.11.2024.

Releases

2023

ADB 6.27.1.63

ADBM 2.5.0

ADB Control 4.13.0

ADB 6.27.1.60

ADBM 2.4.0

ADB Control 4.12.0

ADB 6.27.1.59

ADB Control 4.11.0

ADBM 2.3.2

ADB 6.27.1.58

ADB Control 4.10.3

ADBM 2.2.3

ADB 7.2.0.1

ADB 6.27.1.57

ADB Control 4.9.1

ADBM 2.1.2

ADB 6.27.1.56

ADB Control 4.8.8

ADBM 2.0.4

ADB 6.26.2.55

ADB Control 4.7.5

ADBM 1.7.3

ADB 6.26.0.53

ADB Control 4.6.4

ADBM 1.6.3

ADB 6.25.2.52

ADB Control 4.5.3

ADBM 1.5.2

ADB 6.25.1.51

ADB Control 4.4.0

ADBM 1.4.0

ADB 6.25.1.49

ADB Control 4.3.3

ADBM 1.3.3

ADB 6.24.3.48

ADB Control 4.3.2

ADBM 1.3.2

ADB 6.24.3.47

ADB Control 4.3.1

ADBM 1.3.1

ADB 6.23.5

ADB 5.29.11

ADB 6.23.3

ADB Control 4.2.1

ADBM 1.2.1

ADB 6.22.1

ADBM 1.1.0

ADB Control 4.1.0

ADB 6.22.0

ADB Control 3.7.0

ADB 6.21.1

ADB Control 3.6.0

ADB 6.21.0

ADB Control 3.5.1

ADB 6.20.1

ADB Control 3.4.0

ADB 6.19.3

ADB Control 3.3.1

ADB 6.18.2

ADB Control 3.2.5

ADB 6.18.0

ADB Control 3.2.4

ADB 6.17.5

ADB Control 3.1.3

ADB 6.17.1

ADB Control 3.1.0

ADB 6.16.2

ADB Control 2.1.1

ADB 6.15.0

ADB Control 2.0.3

ADB 6.14.1

ADB 6.14.0

ADB 6.13.0

ADB 6.12.1