Arenadata Orchestrator

Arenadata Orchestrator (ADO) is a platform for setting up and operating data pipelines in a production environment. The central component of the platform is Apache Airflow, an open source tool used to programmatically create, schedule, and monitor process and task sequences (DAGs).

TOP-10 popular articles

An overview of Airflow concepts (DAG, task, operator) and Airflow architectural components (Web server, Metadata database, Scheduler, Executor, Worker).

Arenadata Orchestrator software requirements for proper cluster installation.

The article describes the DBT service — SQL-first transformation tool for building ELT pipelines.

Overview of the GitSync service, which is used for synchronizing Airflow DAGs with remote Git repositories.

Overview of existing approaches to delivering the necessary clients to Airflow workers with examples.

The guide that explains how to configure integration with HashiCorp Vault to store secrets.

The table with ADO service ports required for successful Arenadata Orchestrator installation.

This article presents an overview of the Airflow REST API.

The article describes the Airflow web interfaces.

An overview of available performance tuning approaches for Airflow optimization in ADO with examples.

Found a mistake? Seleсt text and press Ctrl+Enter to report it