Arenadata Hyperwave

Arenadata Hyperwave (ADH) is a universal hybrid platform based on open-source components and proprietary developments, designed for storing, processing, and analyzing data of any structure and volume.

TOP-10 popular articles

The article shows how to create and run your first DAG to process CSV files.

An overview of Apache Iceberg architecture, benefits, and use cases. Iceberg is an open-source table format for data lakes that enables ACID transactions, time travel, schema evolution, partition evolution, and more.

A cheatsheet that describes the most common HDFS commands with examples.

An overview of HDFS (Hadoop Distributed File System) — a highly fault-tolerant distributed file system designed for deployment on low-cost hardware.

An overview of working with sensors in Airflow: available sensor types and parameters. Examples of sensor use, as well as a description of the process of creating a custom sensor.

The tables with Arenadata Hyperwave network requirements: ADH service ports, JMX ports, ports redefined by Kerberos, client ports.

An article about the Airflow concepts (DAG, task, operator) and architectural components. Airflow is a platform that allows you to develop, plan, run, and monitor complex workflows.

Apache Iceberg is an open, high-performance format for large analytic tables. The ADH Spark3 service adopts this format allowing you to work with Iceberg tables through Spark.

The article describes ways of connecting to HiveServer2 using the JDBC interface which is the recommended way for client interaction with HiveServer2.

The section provides reference information on configuration parameters that can be used to configure ADH services via ADCM.

Found a mistake? Seleсt text and press Ctrl+Enter to report it