Arenadata Hyperwave
Arenadata Hyperwave (ADH) is a universal hybrid platform based on open-source components and proprietary developments, designed for storing, processing, and analyzing data of any structure and volume.
TOP-10 popular articles
An overview of Apache Iceberg — an open-source table format for data lakes that enables ACID transactions, time travel, schema evolution, partition evolution, and other features.
An overview of HDFS (Hadoop Distributed File System) — a highly fault-tolerant distributed file system designed for deployment on low-cost hardware.
Apache Iceberg is an open, high-performance format for large analytic tables. The ADH Spark3 service adopts this format allowing you to work with Iceberg tables through Spark.
A cheatsheet that describes the most common HDFS commands with examples.
A list of software requirements for ADH operation.
An overview of Apache Ozone — a distributed key/value object storage optimized for working with both Hadoop services and S3 storages. Major components and concepts, read and write operations flow description.
Hive execution plan analysis using the EXPLAIN and ANALYZE commands.
An overview of the Trino service, which is an SQL query engine used for processing data in parallel, distributed over multiple storages, such as object storages, databases, and file systems.
A description of the built-in Trino catalog for working with Iceberg tables. This catalog uses the Iceberg connector and is ready to work with Iceberg tables stored in your ADH cluster.
An overview of the Kyuubi service — a distributed and multi-tenant JDBC interface for large-scale data processing and analytics.