Arenadata Hadoop

Arenadata Hadoop (ADH) is an enterprise distribution based on Apache Hadoop, designed for storing and processing semi-structured and unstructured data.

Top 10 popular articles

ADH release notes. Learn about new features, improvements, bug fixes, etc.

This section provides reference information on the parameters used to configure ADH services via ADCM.

The tutorial guides you through installing an Arenadata Hadoop (ADH) cluster using either the online or the offline installation type.

A cheatsheet that describes the most common HDFS commands with examples.

A guide on using DBeaver to connect to Hive with Kerberos authentication enabled.

Working with Airflow logs in Arenadata Hadoop. The location of log files and their contents.

ADB Spark 3 Connector enables high-speed, parallel data exchange between Spark 3 and Arenadata DB. The article describes the connector in full.

The article shows how to create and run your first DAG to process CSV files.

The article provides detailed information on version compatibility for ADH, ADPS, and ADCM products.

Hive provides several ways to work with tables. You can use data manipulation language (DML) queries to import or add data to a table. You can also ingest data directly into a Hive table using HDFS commands.
