Arenadata Documentation
Our passion is to build efficient, flexible solutions that scale up to dozens of petabytes
Products
Explore our range of solutions in the world of Big Data
Overview
Arenadata Streaming (ADS) is a real-time data streaming platform developed by Arenadata. It is designed to enable businesses to process, analyze, and react to high-volume data streams in real time.
The platform uses Apache Kafka as its core messaging system, which is known for its high throughput and low latency. Arenadata Streaming provides a distributed and fault-tolerant architecture that can handle large volumes of data from various sources, including databases, IoT devices, sensors, and other streaming sources.
Use cases
Real-time data ingestion

Arenadata Streaming can ingest data in real time from various sources, including databases, sensors, and IoT devices.

Data processing

The platform can process and transform data streams in real time using Apache Kafka's stream processing capabilities.

Analytics

Arenadata Streaming provides tools for real-time data analytics, including machine learning, predictive analytics, and anomaly detection.

Integration

The platform offers integration with other data systems, such as Hadoop, Spark, and NoSQL databases.

IoT

Apache MiNiFi, the edge data collection component of ADS, can be integrated with MQTT (Message Queuing Telemetry Transport), a lightweight messaging protocol designed for IoT devices. This integration allows MiNiFi to publish data to and receive data from MQTT brokers, enabling real-time data streaming and processing at the edge.

Enterprise
Community
Cluster management and monitoring
Deploy & upgrade automation
Offline installation
High availability
Advanced security features (encryption, role-based access control)
Technical support 24/7
Corporate training courses
Tailored solutions
Available integrations
ADB
ADB
To read data, Arenadata Database (ADB) provides an extension that implements transactional data loading from ADS. Writing data to ADS is handled by the PXF plugin on the ADB side.
Existing tools for interacting with ADB:
  • Kafka Connect. This is a tool for scalable and reliable data streaming between Kafka and databases in both directions in real time, for processing and analysis.
  • Kafka JDBC Connector. This is an open-source Kafka connector that provides a simple way to connect Kafka with a database using JDBC. The Kafka JDBC Connector can be used to stream data from Kafka topics into a database in real time, or to stream data from a database into Kafka topics.
  • NiFi Database Connection Pooling Service. This is a built-in NiFi service that allows NiFi to connect to a database using JDBC.
  • ExecuteSQL Processor. This is a NiFi processor that can be used to execute SQL statements and queries against a database using JDBC.
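As an illustration of how the Kafka JDBC Connector might be wired up, here is a sketch of a sink configuration streaming a Kafka topic into ADB. The connector class follows the open-source Confluent JDBC connector; the host, database, credentials, and topic name (`orders`) are hypothetical.

```python
import json

# Sketch of a Kafka Connect JDBC sink configuration for streaming a topic
# into ADB (Greenplum-compatible, hence a PostgreSQL JDBC URL).
# Host names, credentials, and the topic name are hypothetical.
connector_config = {
    "name": "ads-to-adb-orders",
    "config": {
        "connector.class": "io.confluent.connect.jdbc.JdbcSinkConnector",
        "connection.url": "jdbc:postgresql://adb-master:5432/warehouse",
        "connection.user": "gpadmin",
        "connection.password": "********",
        "topics": "orders",            # Kafka topic to sink
        "insert.mode": "insert",       # plain INSERTs; "upsert" also needs pk.fields
        "auto.create": "true",         # create the target table if missing
        "tasks.max": "2",
    },
}

# This JSON body would be POSTed to the Connect REST API,
# e.g. http://connect-host:8083/connectors
payload = json.dumps(connector_config, indent=2)
print(payload)
```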
ADH
Arenadata Hadoop
Kafka Connect can be used to move data between Kafka and Arenadata Hadoop (ADH) in both directions, allowing real-time streaming of data into ADH for processing and analysis.
A large set of NiFi connectors for interoperability with ADH services.
ADQM
ADQM
Existing tools for interacting with Arenadata QuickMarts (ADQM):
  • Kafka Connect ClickHouse Sink. This is a Kafka Connect plugin that provides a way to sink data from Kafka to ADQM in near real time. The ClickHouse Sink Connector can be used to stream data from Kafka topics into ADQM tables, either as individual rows or as batches of rows.
  • Kafka JDBC Connector. This is a Kafka Connect plugin that provides a way to connect to a JDBC-compliant database, such as ClickHouse. The JDBC Connector can be used to stream data from Kafka topics into ClickHouse tables, enabling data to be analyzed and processed in real time.
  • NiFi Database Connection Pooling Service. This is a built-in NiFi service that allows NiFi to connect to a database using JDBC.
  • ExecuteSQL Processor. This is a NiFi processor that can be used to execute SQL statements and queries against a database using JDBC.
Oracle
Oracle
Existing tools for interacting with Oracle:
  • Kafka Connect. This is a tool for scalable and reliable data streaming between Kafka and Oracle in both directions, allowing real-time streaming of data into a database for processing and analysis.
  • Kafka JDBC Connector. This is an open-source Kafka connector that provides a simple way to connect Kafka with a database using JDBC. The Kafka JDBC Connector can be used to stream data from Kafka topics into a database in real time, or to stream data from a database into Kafka topics.
  • NiFi Database Connection Pooling Service. This is a built-in NiFi service that allows NiFi to connect to a database using JDBC.
  • ExecuteSQL Processor. This is a NiFi processor that can be used to execute SQL statements and queries against a database using JDBC.
MS SQL
MS SQL
Existing tools for interacting with MS SQL:
  • Kafka Connect. This is a tool for scalable and reliable data streaming between Kafka and MS SQL in both directions, allowing real-time streaming of data into a database for processing and analysis.
  • Kafka JDBC Connector. This is an open-source Kafka connector that provides a simple way to connect Kafka with MS SQL using JDBC. The Kafka JDBC Connector can be used to stream data from Kafka topics into a database in real time, or to stream data from a database into Kafka topics.
  • NiFi Database Connection Pooling Service. This is a built-in NiFi service that allows NiFi to connect to a database using JDBC.
  • ExecuteSQL Processor. This is a NiFi processor that can be used to execute SQL statements and queries against a database using JDBC.
S3
S3
Existing tools for interacting with S3:
  • Kafka Connect S3 Sink. This is a Kafka Connect plugin that provides a way to sink data from Kafka to S3 in near real time. The S3 Sink Connector can be used to stream data from Kafka topics into S3 buckets, either as individual objects or as batches of objects. This integration is particularly useful for long-term storage and archiving of data from Kafka.
  • Kafka Connect S3 Source. This is a Kafka Connect plugin that provides a way to source data from S3 to Kafka. The S3 Source Connector can be used to stream data from S3 objects into Kafka topics, enabling data to be analyzed and processed in real time.
  • S3 Object Processor. This is a NiFi processor that can be used to perform CRUD (create, read, update, delete) operations on S3 objects. It can be configured to interact with S3 using access keys or roles, and can be used to transfer data between NiFi and S3 in real time.
  • Amazon S3 Put/Get Object processors. The PutS3Object processor can be used to write data from NiFi to S3, while the GetS3Object processor can be used to read data from S3 into NiFi.
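A sink configuration for the archiving case might look as follows. Property names follow the Confluent S3 sink connector; the bucket, region, and topic names are hypothetical.

```python
import json

# Sketch of a Kafka Connect S3 sink configuration that archives a topic
# into an S3 bucket as batched JSON objects. Bucket, region, and topic
# names are hypothetical.
s3_sink = {
    "name": "ads-to-s3-archive",
    "config": {
        "connector.class": "io.confluent.connect.s3.S3SinkConnector",
        "topics": "clickstream",
        "s3.bucket.name": "ads-archive",
        "s3.region": "us-east-1",
        "storage.class": "io.confluent.connect.s3.storage.S3Storage",
        "format.class": "io.confluent.connect.s3.format.json.JsonFormat",
        "flush.size": "1000",   # records per S3 object (controls batching)
        "tasks.max": "1",
    },
}
print(json.dumps(s3_sink, indent=2))
```

Raising `flush.size` yields fewer, larger objects, which generally suits long-term archiving better than one object per record.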
MongoDB
MongoDB
Existing tools for interacting with MongoDB:
  • Kafka Connect MongoDB Sink. This is a Kafka Connect plugin that provides a way to sink data from Kafka to MongoDB in near real time. The MongoDB Sink Connector can be used to stream data from Kafka topics into MongoDB collections, either as individual documents or as batches of documents.
  • Kafka MongoDB Source Connector. This is a Kafka Connect plugin that provides a way to source data from a MongoDB replica set into Kafka topics in near real time.
  • PutMongoRecord Processor. This is a built-in NiFi processor that can be used to write data from NiFi to MongoDB in near real time. It can be configured to connect to MongoDB using a MongoDB client and credentials, and can be used to insert data into MongoDB collections from NiFi.
  • GetMongo Processor. This is a built-in NiFi processor that reads data from MongoDB and brings it into NiFi. It can be configured to connect to MongoDB using a MongoDB client and credentials, and can be used to retrieve data from MongoDB collections for further processing in NiFi.
AVRO
AVRO
Avro is a binary data serialization format designed to be compact and fast. It supports schema evolution, which allows data schemas to change over time without requiring existing data to be rewritten or reloaded.
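Avro's compactness comes in part from its integer encoding: values are zigzag-mapped and written as base-128 varints, and field names never appear in the payload (the schema travels separately, e.g. via Schema Registry). A minimal sketch of that encoding:

```python
def avro_encode_long(n: int) -> bytes:
    """Encode an integer the way Avro does: zigzag mapping, then a
    base-128 varint (7 data bits per byte, high bit = continuation)."""
    z = (n << 1) ^ (n >> 63)          # zigzag: small magnitudes -> small codes
    out = bytearray()
    while True:
        byte = z & 0x7F
        z >>= 7
        if z:
            out.append(byte | 0x80)   # more bytes follow
        else:
            out.append(byte)
            return bytes(out)

# Small values fit in a single byte, positive or negative.
print(avro_encode_long(1).hex())    # zigzag(1) = 2  -> "02"
print(avro_encode_long(-1).hex())   # zigzag(-1) = 1 -> "01"
print(avro_encode_long(64).hex())   # zigzag(64) = 128 -> two bytes "8001"
```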
JSON
JSON
JSON (JavaScript Object Notation) is a lightweight data format that is commonly used for exchanging data between applications.
Operating systems
AltLinux 8.4 SP
Supported
CentOS 7
Supported
RedHat 7
Supported
AstraLinux
Currently in development
Cluster management and monitoring
Deploy & upgrade automation
Offline installation
High availability
Advanced security features (encryption, role-based access control)
Technical support 24/7
Corporate training courses
Tailored solutions
Available integrations
ADB
Available only for Enterprise
ADH, ADQM, Oracle, MS SQL, S3, MongoDB, AVRO, JSON
The same integration tools as in the Enterprise edition are available (see the lists above).
Operating systems
AltLinux 8.4 SP
Available only for Enterprise
CentOS 7
Supported
RedHat 7
Supported
AstraLinux
Currently in development
Components
Apache ZooKeeper

Apache ZooKeeper is a distributed coordination service used by Arenadata Streaming to manage the configuration and coordination of its clusters. It is a crucial component of the system as it helps to ensure high availability and fault tolerance in Arenadata Streaming clusters.

ZooKeeper provides a hierarchical namespace that allows Arenadata Streaming to store configuration data, manage distributed locks, and coordinate distributed processes. It provides a consistent view of the system state across all nodes in the cluster, which helps to prevent data inconsistencies and ensure data integrity.

For example, Arenadata Streaming uses ZooKeeper to manage its Kafka brokers, topics, and partitions. When a new broker is added to the cluster, ZooKeeper is used to assign it a unique identifier and to coordinate the distribution of data across the cluster.

Apache Kafka

Apache Kafka is a distributed streaming platform used by Arenadata Streaming to manage the ingestion, processing, and analysis of real-time data streams. It provides a scalable, fault-tolerant, and highly available infrastructure for processing and storing real-time data.

Arenadata Streaming leverages Kafka's capabilities to handle large volumes of data and support multiple data sources. It provides a real-time data processing platform that enables businesses to analyze data as it flows through the system, providing near-instant insights into business operations.
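Kafka's per-key ordering guarantee follows from how producers assign records to partitions: the record key is hashed modulo the partition count, so every record with a given key lands in the same partition. The Java client uses murmur2 for this; the sketch below substitutes CRC-32 to stay dependency-free, so the partition numbers differ from real Kafka, but the behavior is the same.

```python
import zlib

def pick_partition(key: bytes, num_partitions: int) -> int:
    """Simplified key-based partitioner. Real Kafka clients use murmur2;
    CRC-32 stands in here so the sketch needs only the standard library."""
    return zlib.crc32(key) % num_partitions

# All records with the same key map to the same partition, which is
# what gives Kafka its per-key ordering guarantee.
p1 = pick_partition(b"sensor-42", 6)
p2 = pick_partition(b"sensor-42", 6)
assert p1 == p2
print(f"sensor-42 -> partition {p1}")
```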

Schema Registry

Schema Registry is a centralized repository used by Arenadata Streaming to store and manage schemas for data produced and consumed by Apache Kafka. It allows users to define, evolve, and share schemas across different applications and systems that use Kafka.

In Arenadata Streaming, Schema Registry enables users to ensure data compatibility across different versions of their applications and systems. It provides a way to enforce data validation and to ensure that all data produced and consumed by Kafka conforms to a predefined schema.
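One rule behind such compatibility checks: a field added to a record schema must carry a default, or consumers using the new schema cannot decode data written with the old one. A simplified sketch of just that rule (Schema Registry's real checker covers more cases, such as type changes and field removal):

```python
def is_backward_compatible(old_schema: dict, new_schema: dict) -> bool:
    """Check one Avro record-evolution rule: every field that exists only
    in the new schema must declare a default, or readers on the new
    schema cannot decode data written with the old one."""
    old_fields = {f["name"] for f in old_schema["fields"]}
    return all(
        f["name"] in old_fields or "default" in f
        for f in new_schema["fields"]
    )

v1 = {"type": "record", "name": "Order",
      "fields": [{"name": "id", "type": "long"}]}
v2_ok = {"type": "record", "name": "Order",
         "fields": [{"name": "id", "type": "long"},
                    {"name": "note", "type": "string", "default": ""}]}
v2_bad = {"type": "record", "name": "Order",
          "fields": [{"name": "id", "type": "long"},
                     {"name": "note", "type": "string"}]}

print(is_backward_compatible(v1, v2_ok))   # True: new field has a default
print(is_backward_compatible(v1, v2_bad))  # False: new field lacks a default
```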

KSQL

KSQL is a streaming SQL engine used by Arenadata Streaming to process real-time data streams. It allows users to write SQL queries to transform, aggregate, and analyze data in real time, making it easy to create real-time data processing pipelines without the need for complex programming.

In Arenadata Streaming, KSQL provides a simple yet powerful way to interact with data streams, enabling users to query, join, and filter data as it flows through the system. It supports a wide range of SQL operations, including windowing, aggregations, and joins, allowing users to create complex processing logic without the need for custom code.
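As an example of the windowed aggregations mentioned above, the statement below (with hypothetical stream and column names) counts clicks per page over one-minute tumbling windows. In practice it would be submitted through the ksql CLI or the server's REST endpoint; it is held in a string here only for illustration.

```python
# A ksqlDB-style windowed aggregation; stream and column names are
# hypothetical. "clickstream" is assumed to be a stream already declared
# over a Kafka topic.
ksql_stmt = """
CREATE TABLE clicks_per_minute AS
  SELECT page_id,
         COUNT(*) AS clicks
  FROM clickstream
  WINDOW TUMBLING (SIZE 1 MINUTE)
  GROUP BY page_id
  EMIT CHANGES;
""".strip()

print(ksql_stmt)
```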

Kafka Connect

Kafka Connect is a data integration framework used by Arenadata Streaming to move data between Apache Kafka and other systems. It provides a scalable and fault-tolerant infrastructure for ingesting and exporting data to and from Kafka, making it easy to integrate different systems and technologies with Kafka.

In Arenadata Streaming, Kafka Connect enables users to integrate data from various sources such as databases, file systems, and messaging systems with Kafka. It provides connectors that can be configured to extract data from different systems and write it to Kafka topics, or to read data from Kafka topics and write it to external systems.
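Connectors are typically registered by POSTing a JSON configuration to a Connect worker's REST API. A sketch of building that request (the host and connector name are hypothetical, and the request is only constructed, not sent):

```python
import json
import urllib.request

def register_connector(connect_url: str, config: dict) -> urllib.request.Request:
    """Build the POST request that registers a connector with the Kafka
    Connect REST API. Sending it would be urllib.request.urlopen(req)
    against a live Connect worker."""
    return urllib.request.Request(
        url=f"{connect_url}/connectors",
        data=json.dumps(config).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Hypothetical worker host and connector definition.
req = register_connector(
    "http://connect-host:8083",
    {"name": "demo-sink", "config": {"connector.class": "...", "topics": "demo"}},
)
print(req.full_url, req.get_method())
```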

Kafka Connect is also the basis for MirrorMaker 2, a tool used by Arenadata Streaming to replicate data between Apache Kafka clusters. MirrorMaker 2 replaces the original MirrorMaker tool and provides several new features and improvements over its predecessor.

Kafka REST Proxy

Kafka REST Proxy is a tool used by Arenadata Streaming to expose Apache Kafka functionality as a RESTful API. It provides a simple and scalable way to integrate Kafka with other systems and technologies that support RESTful APIs.
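For example, the REST Proxy's v2 produce API accepts a JSON body with a `records` array, POSTed to `/topics/<name>` with a versioned content type. A sketch of building such a payload (the host, topic, and record contents are hypothetical):

```python
import json

# Payload shape for the Kafka REST Proxy v2 produce endpoint:
#   POST http://rest-proxy:8082/topics/<topic>
#   Content-Type: application/vnd.kafka.json.v2+json
# Host, topic, and record contents are hypothetical.
payload = {
    "records": [
        {"key": "sensor-42", "value": {"temp": 21.5}},
        {"key": "sensor-43", "value": {"temp": 19.0}},
    ]
}
body = json.dumps(payload)
print(body)
```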

Apache NiFi

Apache NiFi is an open-source data integration tool used by Arenadata Streaming to automate the flow of data between different systems and technologies. It provides a visual drag-and-drop interface for designing and configuring data flows, making it easy for users to build complex data pipelines without writing any code.

In Arenadata Streaming, Apache NiFi enables users to build and manage data flows across different systems and technologies. It provides a wide range of processors and connectors that can be used to integrate with various data sources and destinations, including databases, message queues, and cloud platforms.

Apache MiNiFi

Apache MiNiFi is a lightweight data collection tool used by Arenadata Streaming to collect and preprocess data at the edge of the network. It is designed to run on resource-constrained devices, such as sensors and IoT devices, and enables users to collect and process data in real time, without relying on a central server.

In Arenadata Streaming, Apache MiNiFi enables users to collect and preprocess data at the edge of the network, before sending it to a central server for further processing and analysis. It provides a wide range of processors and connectors that can be used to collect data from various sources, including sensors, cameras, and other IoT devices.

Apache NiFi Registry

Apache NiFi Registry is a version control and management system used by Arenadata Streaming to manage and version data flows and other assets created using Apache NiFi. It provides a central repository for storing and managing NiFi flows, templates, and other artifacts, enabling users to easily version, deploy, and reuse them across different environments.

Kafka Manager

Kafka Manager (also known as CMAK) is a web-based management tool used to manage Apache Kafka clusters. It is designed to simplify the administration of Kafka clusters, providing a user-friendly interface for managing and monitoring Kafka topics, partitions, and brokers.

In Arenadata Streaming, Kafka Manager enables users to easily manage and monitor their Kafka clusters. It provides a web-based interface for performing administrative tasks, such as creating and deleting topics, reassigning partitions, and managing broker configurations. It also provides real-time metrics and monitoring of Kafka clusters, allowing users to easily identify and troubleshoot issues.

Features
Faster deployment
Arenadata Streaming streamlines the installation and configuration process, reducing the time required for setup compared to manual methods
User-friendly
Users can easily deploy and configure their data streaming infrastructure, even without extensive technical knowledge
Consistent installation
Arenadata Streaming ensures standardized deployment across multiple systems, minimizing errors and discrepancies
Improved performance
By optimizing the data streaming setup process, Arenadata Streaming enhances system performance, minimizing downtime and improving efficiency
Community-driven enhancements
Our team evaluates enhancements from the wider data streaming community, ensuring their incorporation into the product for seamless performance
Arenadata Platform Security
Enterprise edition
Arenadata Platform Security (ADPS) is a combination of two security components:
Apache Ranger
Apache Ranger is an open-source security framework that provides centralized policy management for Hadoop and other big data ecosystems. Arenadata Platform integrates with Apache Ranger to provide policy-based access control and fine-grained authorization for data and analytics applications.
Apache Knox
Apache Knox is an open-source gateway that provides secure access to Hadoop clusters and other big data systems. Arenadata Platform integrates with Apache Knox to provide secure access to the platform and its services.
ADPS provides a comprehensive security framework that includes policy-based access control, fine-grained authorization, and secure access to the platform and its services. This helps organizations protect sensitive data and ensure compliance with regulations.
ADS Control
Arenadata Streaming Control is a web-based graphical user interface (GUI) for managing and monitoring Arenadata Streaming clusters. It provides a user-friendly way to manage Kafka Connect instances.
ADS Control allows administrators to manage all aspects of their Kafka Connect clusters, including stream processing and cluster configuration. It also provides monitoring capabilities that enable administrators to view the status of their clusters.
Roadmap
2023
ADS 1.8.1
  • Changed versions:
    • ZooKeeper up to 3.5.10
    • Kafka Connect up to 2.8.1
  • Added actions for Schema Registry service for expand and shrink operations
  • Added installation of the MiNiFi Toolkit
  • Added the ability to use configuration groups for MiNiFi service
  • Reworked log4j templates for the ksqlDB service
ADS 1.8.0
  • Changed versions:
    • NiFi Server and NiFi Registry components up to Apache NiFi 1.18
    • MiNiFi service up to Apache MiNiFi 1.18
  • Added the ability to delete a service from a cluster in the ADCM interface
  • Implemented support for Alt 8 SP in minifi.sh for NiFi service version 1.18
ADS 1.7.2
  • For the NiFi Registry (a component of the NiFi service), Ranger authorization is implemented, providing access protection when storing and managing shared resources in one or more NiFi instances
  • Added the ability to manage all parameters using the ADCM user interface in all configuration files
  • The basic authentication is now available for the following ADS services:
    • Schema Registry
    • Kafka REST Proxy
    • KSQL
ADS 1.7.1
  • Added support of AltLinux 8.4 operating system for ADS
  • For the NiFi service, the Ranger authorization plugin has been added, and the ability to add or remove permissions for processing messages in NiFi has been implemented
  • Added support for Kafka Connect service and Mirror Maker 2 mechanism for ADS
ADS 1.7.0
  • Updating package versions:
    • Kafka 2.8.1
    • Nifi 1.15.0
    • Nifi-Registry 1.15.0
    • Schema-Registry 6.2.1
    • Kafka REST Proxy 6.2.1
    • KSQL 6.2.1
    • MiNiFi 1.15.0
  • Added LDAP/AD authentication for NiFi service
  • For the NiFi service, the ability to work with the "_routing" option using the NiFi Elasticsearch processor has been added
  • Switching of the logging level for ADS services is implemented
  • For ADS in ADCM, the ability to configure channel protection via the SSL protocol has been implemented
ADS 1.6.2
  • The authentication protocol Kerberos is implemented for ADS
  • Added the ability to use Active Directory as a Kerberos store for ADS
ADS 1.6.0
  • Implemented assembly of components with a dependency on ZooKeeper 3.5.8:
    • MiNiFi 0.7.0
    • NiFi-Registry 0.7.0
    • NiFi 1.12.0
  • Updating package versions:
    • Kafka 2.6.0
    • Zookeeper 3.5.8
    • Nifi 1.12.0
    • Nifi-Registry 0.7.0
    • Schema-Registry 6.0.0
    • Kafka REST Proxy 6.0.0
    • KSQL 6.0.0
    • Kafka Manager 3.0.0.5
  • Implemented SASL/PLAIN support for Kafka, KSQL, Schema-Registry, Kafka-Rest, Kafka-Manager services
  • Implemented the ability to add/update users for SASL/PLAIN
  • Implemented ADS integration with ADPS (Arenadata Platform Security)
  • Implemented support for Ranger Kafka Plugin
  • Enterprise version released
ADS 1.5.0
  • Updating package versions:
    • Kafka 2.4.0
    • Zookeeper 3.5.6
    • Nifi 1.10.0
    • Nifi-Registry 0.5.0
    • Schema-Registry 5.4.0
    • Kafka REST Proxy 5.4.0
    • KSQL 5.4.0
    • Kafka Manager 1.3.3.23
  • Implementation of the MiNiFi 0.5.0 service
  • For the MiNiFi service, the following actions have been implemented:
    • Install
    • Start/Stop/Restart
    • Check
    • Expand
    • Shrink
  • Monitoring implemented for MiNiFi service
  • ALT Linux operating system support
  • Added support for Kafka version 2.4.0 in Kafka-Manager
  • Added Analytics Framework support for NiFi service
ADS 1.4.11
  • Added cluster update operation
  • Added operations for adding/removing a host from a running Kafka, Nifi, Zookeeper cluster
  • Added the ability to export/import the connection string to the Zookeeper service for sharing one service instance in different clusters
  • Added the ability to install offline
  • Added the Restart and Check operations, which verify the operability of the Nifi service
  • Added integration of Nifi and Nifi-Registry services
  • Implemented collection, visualization, and automatic sending of Nifi metrics to the Monitoring cluster