• Products
      Arenadata DB Arenadata Hadoop Arenadata QuickMarts Arenadata Streaming Arenadata Postgres Arenadata Cluster Manager
  • Download
  • Documentation releases
  • Technical blog
  • Career
    We are hiring!
ADB
ADH
ADQM
ADS
ADPG
ADCM
ADPS
ADSCONTROL
Arenadata Hadoop
  • Select version
    3.3.6.2
    3.3.6.1
    3.2.4.3
    3.2.4.2
    3.2.4.1
    3.1.2.1
    2.1.10
    2.1.8
    2.1.7

    • Get started
      • Online installation
        • Install ADCM
        • Prepare hosts
        • Install ADH cluster
          • Create a cluster
          • Add services
          • Add hosts to a cluster
          • Add components
          • Configure services
          • Configure a cluster
          • Install a cluster
        • Install monitoring
          • Create a cluster
          • Add services
          • Add hosts to a cluster
          • Add components
          • Configure services
          • Install a cluster
          • Integrate with ADH cluster
      • Offline installation
        • Install ADCM
        • Prepare hosts
        • Install Enterprise Tools cluster
          • Create a cluster
          • Add services
          • Add hosts to a cluster
          • Add components
          • Install a cluster
        • Install ADH cluster
          • Create a cluster
          • Add services
          • Add hosts to a cluster
          • Add components
          • Configure services
          • Configure a cluster
          • Import ET settings
          • Install a cluster
        • Install monitoring
          • Configure Enterprise Tools cluster
          • Integrate with ADH cluster
    • How to
      • Airflow
        • Connect to Airflow
          • CLI
          • REST API
        • UI overview
        • Work with DAGs
          • Create a simple DAG
          • Work with TaskFlow
          • Use sensors
          • Customize DAG scheduling
          • Add custom operators and hooks
          • Generate DAGs dynamically
        • Logging
        • Service management via ADCM
      • Flink
        • UI overview
        • Connect to Flink
          • CLI
        • Flink SQL Gateway
        • Flink on YARN
        • Service management via ADCM
      • HBase
        • Connect to HBase
          • Use native API
          • Use external API
        • UI overview
        • Data querying
          • Quick start
          • Use filters
          • Use coprocessors
          • Scan over snapshot
          • Use medium-sized objects
        • Bulk loading
          • Built-in MapReduce jobs
        • Use snapshots for backup and restore
        • Replication
        • Logging
        • Service management via ADCM
      • HDFS
        • Connect to HDFS
        • UI overview
        • Basic file operations
          • Protect files
          • Copy data to HDFS
          • Add HDFS data directories
          • DataNode hot swapping
          • Remove files & directories
          • Enable WebHDFS
          • Check files via fsck
          • Local read
          • Use distcp
          • Use HttpFS
        • Storage management
          • Storage policies
          • Use quotas
          • Cache management
        • Replication
          • Replication factor
          • Rack awareness
          • Erasure coding
          • NameNode safemode
        • Backup and restore
          • Snapshots
          • NameNode recovery
        • Configure
        • Checkpointing
        • Integration with S3
        • Logging
        • Service management via ADCM
      • Hive
        • Connect to Hive
          • Beeline shell
          • JDBC
        • UI overview
        • Data querying
          • Work with tables
          • Computation example
          • Multi-table inserts
          • Analyze queries
          • Query types
            • Subqueries
            • Aggregate functions
            • Windowing functions
            • Use materialized views
        • Integrations
          • Hive on Spark
          • Hive with HBase
          • TEZ
            • View a Hive job
        • Performance tuning
        • Backup and restore
        • Replication
        • High availability
        • Logging
        • Use ACID
        • Iceberg tables
        • Service management via ADCM
      • HUE
        • UI overview
        • Add and use an interpreter
        • Iceberg tables
        • SPNEGO authentication
        • Service management via ADCM
      • Impala
        • Connect to Impala
          • impala-shell
          • JDBC
        • Web UIs overview
        • Performance tuning
        • Integrations
          • Impala with Hive
          • Impala with HBase
        • LDAP authentication
        • Iceberg tables
        • Service management via ADCM
      • Kyuubi
        • UI overview
        • LDAP authentication
        • Service management via ADCM
      • MapReduce
        • Quick start
        • Connect to MapReduce
          • CLI
          • REST API
        • Logging
        • Pluggable shuffle and sort
        • Encrypted shuffle
      • MariaDB
        • Service management via ADCM
      • Ozone
        • Connect to Ozone
          • CLI
        • UI overview
        • Ozone integration with cluster services
        • Service management via ADCM
      • Phoenix
        • Connect to Phoenix
        • Data querying
          • Use SQLLine
        • Map with HBase tables
        • Bulk loading
      • Solr
        • Connect to Solr
          • CLI
        • Indexing
          • Overview
          • Index nested documents
          • Partial document updates
          • Reindex data
          • Deduplication
        • UI overview
        • Solr collections
        • Logging in Solr
        • Performance tuning
        • Service management via ADCM
      • Spark
        • Connect to Spark
          • spark-submit & spark-shell
        • Examples of launching jobs
          • In-memory batch processing
            • Dataset
            • RDD
            • DataFrame
        • Integrations
          • Spark and Hive
          • ADB Spark 3 Connector
            • Overview
            • Prerequisites
            • Usage examples
            • Options
          • ADQM Spark 3 Connector
            • Overview
            • Prerequisites
            • Usage example
            • Options
          • PySpark
          • Spark Connect
          • Work with Livy
        • Scheduler jobs
        • Logging
        • Performance tuning
        • Iceberg tables
        • Service management via ADCM
          • Spark
          • Spark3
      • Sqoop
        • Connect to Sqoop
          • CLI
        • Quick start
        • Logging
        • Import and export examples
          • PostgreSQL and Hive
        • Service management via ADCM
      • SSM
        • UI overview
        • Define rules
        • Define actions
        • Rule usage examples
        • Service management via ADCM
      • Trino
        • Connect to Trino
          • CLI
          • JDBC
        • UI overview
        • Catalog management
        • Integrations
          • Overview of Trino connectors
          • Iceberg catalog
          • Hive catalog
        • Service management via ADCM
      • YARN
        • UI overview
        • Connect to YARN
          • CLI
          • REST API
        • Schedulers
          • FairScheduler
          • CapacityScheduler
        • High availability
          • Overview
          • Examples
        • Logging
        • GPU on YARN
        • Service management via ADCM
      • Zeppelin
        • UI overview
        • Work with interpreters
          • Add a custom interpreter to a group
          • Install a third-party interpreter
          • User-managed interpreters
        • Examples of notes
          • Phoenix
          • Solr
        • Logging
        • Service management via ADCM
      • ZooKeeper
        • Connect to ZooKeeper
          • CLI
          • API
        • Work with znodes
        • Logging
        • Configure
        • Service management via ADCM
    • Tutorials
      • Cluster management via ADCM
        • Cluster actions
          • Check
          • Manage SSL
          • Manage Credential Encryption
          • Manage Kerberos
          • Reinstall status-checker
          • Start
          • Stop
          • Upgrade
        • Hosts actions
          • Add a host to a cluster
          • View host status
          • Remove a host from a cluster
        • Service actions
      • Security
        • ADPS
          • Overview
          • Ranger plugins
            • HBase plugin
            • HDFS plugin
            • Hive plugin
            • Impala plugin
            • Solr plugin
            • Spark3 plugin
            • YARN plugin
        • Kerberos
          • Overview
          • MIT Kerberos
            • Kerberos server settings
            • Configure via ADCM
          • MS Active Directory
            • Install AD Certification Service
            • Create certificate template
            • Issue certificate template
            • Request a new certificate
            • Export CA certificate
            • Configure via ADCM
            • Enable Kerberos with prepared Active Directory server and no admin permissions
            • AD Kerberos common problems
          • FreeIPA
            • Overview
            • Configure via ADCM
            • Enable Kerberos with prepared FreeIPA server and no admin permissions
          • Samba
            • Configure via ADCM
          • Connect to Hive from DBeaver with Kerberos
          • SPNEGO
        • SSL
          • Requirements for SSL certificates
          • Set TLS version
    • References
      • Hadoop Command-line
        • Hadoop CLI
          • User commands
            • archive
            • checknative
            • CLASSNAME
            • classpath
            • conftest
            • credential
            • distch
            • distcp
            • dtutil
            • envvars
            • fs
            • gridmix
            • jar
            • jnipath
            • kerbname
            • kdiag
            • key
            • kms
            • trace
            • version
          • Administration commands
            • daemonlog
        • HDFS CLI
          • User commands
            • classpath
            • dfs
            • envvars
            • fetchdt
            • fsck
            • getconf
            • groups
            • httpfs
            • lsSnapshottableDir
            • jmxget
            • oev
            • oiv
            • oiv_legacy
            • snapshotDiff
            • version
          • Administration commands
            • balancer
            • cacheadmin
            • crypto
            • datanode
            • dfsadmin
            • dfsrouter
            • dfsrouteradmin
            • diskbalancer
            • ec
            • haadmin
            • journalnode
            • mover
            • namenode
            • nfs3
            • portmap
            • secondarynamenode
            • storagepolicies
            • zkfc
          • Debug commands
            • computeMeta
            • recoverLease
            • verifyMeta
        • FileSystem shell
          • appendToFile
          • cat
          • checksum
          • chgrp
          • chmod
          • chown
          • copyFromLocal
          • copyToLocal
          • count
          • cp
          • createSnapshot
          • deleteSnapshot
          • df
          • du
          • dus
          • expunge
          • find
          • get
          • getfacl
          • getfattr
          • getmerge
          • head
          • help
          • ls
          • lsr
          • mkdir
          • moveFromLocal
          • moveToLocal
          • mv
          • put
          • renameSnapshot
          • rm
          • rmdir
          • rmr
          • setfacl
          • setfattr
          • setrep
          • stat
          • tail
          • test
          • text
          • touch
          • touchz
          • truncate
          • usage
        • HDFS cheatsheet
      • HBase
        • HBase shell commands
          • General
            • processlist
            • status
            • table_help
            • version
            • whoami
          • DDL
            • alter
            • alter_async
            • alter_status
            • create
            • describe
            • disable
            • disable_all
            • drop
            • drop_all
            • enable
            • enable_all
            • exists
            • get_table
            • is_enabled
            • is_disabled
            • list
            • list_regions
            • locate_region
            • show_filters
          • Namespace
            • alter_namespace
            • create_namespace
            • describe_namespace
            • drop_namespace
            • list_namespace
            • list_namespace_tables
          • DML
            • append
            • count
            • delete
            • deleteall
            • get
            • get_counter
            • get_splits
            • incr
            • put
            • scan
            • truncate
            • truncate_preserve
          • Tools
            • assign
            • balancer
            • balancer_enabled
            • balance_switch
            • catalogjanitor_enabled
            • catalogjanitor_run
            • catalogjanitor_switch
            • cleaner_chore_enabled
            • cleaner_chore_run
            • cleaner_chore_switch
            • clear_block_cache
            • clear_compaction_queues
            • clear_deadservers
            • close_region
            • compact
            • compaction_state
            • compact_rs
            • flush
            • is_in_maintenance_mode
            • list_deadservers
            • major_compact
            • merge_region
            • move
            • normalize
            • normalizer_enabled
            • normalizer_switch
            • split
            • splitormerge_enabled
            • splitormerge_switch
            • trace
            • unassign
            • wal_roll
            • zk_dump
          • Replication
            • add_peer
            • append_peer_namespaces
            • append_peer_tableCFs
            • disable_peer
            • disable_table_replication
            • enable_peer
            • enable_table_replication
            • get_peer_config
            • list_peers
            • list_peer_configs
            • list_replicated_tables
            • remove_peer
            • remove_peer_namespaces
            • remove_peer_tableCFs
            • set_peer_bandwidth
            • set_peer_exclude_namespaces
            • set_peer_exclude_tableCFs
            • set_peer_namespaces
            • set_peer_replicate_all
            • set_peer_tableCFs
            • show_peer_tableCFs
            • update_peer_config
          • Snapshots
            • clone_snapshot
            • delete_all_snapshot
            • delete_snapshot
            • delete_table_snapshots
            • list_snapshots
            • list_table_snapshots
            • restore_snapshot
            • snapshot
          • Configuration
            • update_all_config
            • update_config
          • Quotas
            • list_quotas
            • list_snapshot_sizes
            • list_quota_snapshots
            • list_quota_table_sizes
            • set_quota
          • Security
            • grant
            • list_security_capabilities
            • revoke
            • user_permission
          • Procedures
            • abort_procedure
            • list_locks
            • list_procedures
          • Visibility labels
            • add_labels
            • clear_auths
            • get_auths
            • list_labels
            • set_auths
            • set_visibility
          • Rsgroup
            • add_rsgroup
            • balance_rsgroup
            • get_rsgroup
            • get_server_rsgroup
            • get_table_rsgroup
            • list_rsgroups
            • move_namespaces_rsgroup
            • move_servers_rsgroup
            • move_servers_namespaces_rsgroup
            • move_servers_tables_rsgroup
            • move_tables_rsgroup
            • remove_rsgroup
            • remove_servers_rsgroup
      • Hive
        • Hive on Spark parameters
        • Beeline command line
      • Impala
        • Impala shell commands
      • MapReduce
        • MapReduce CLI
          • User commands
            • archive
            • archives-logs
            • classpath
            • distcp
            • envvars
            • job
            • pipes
            • queue
            • version
          • Administration commands
            • frameworkuploader
            • historyserver
            • hsadmin
      • Spark
        • spark-shell
        • spark-submit
      • Sqoop
        • Sqoop CLI
          • codegen
          • create-hive-table
          • eval
          • export
          • help
          • import-all-tables
          • import-mainframe
          • import
          • job
          • list-databases
          • list-tables
          • merge
          • metastore
          • version
      • SSM
        • Actions
        • Rules
      • YARN
        • YARN CLI
          • User commands
            • application
            • applicationattempt
            • classpath
            • container
            • envvars
            • jar
            • logs
            • node
            • queue
            • top
            • version
          • Administration commands
            • daemonlog
            • nodemanager
            • proxyserver
            • registrydns
            • resourcemanager
            • rmadmin
            • schedulerconf
            • scmadmin
            • sharedcachemanager
            • timelineserver
      • Configuration parameters
      • Monitoring
        • ADH monitoring metrics
      • Glossary
    • Planning guide
      • Software requirements
      • Hardware requirements
        • Depending on node types
          • NameNode
          • DataNode
          • YARN
          • PostgreSQL nodes used with Hive Metastore
        • Depending on workload patterns
      • Filesystem requirements
      • Network requirements
        • ADH network ports
      • Typical configurations
        • Pilot cluster
      • Custom Java settings
    • Concepts
      • Architecture and data model
        • Airflow
        • Flink
          • Overview
        • HBase
          • Architecture
          • Data model
        • HDFS
          • Architecture
            • Scalability
        • Hive
          • TEZ
        • Iceberg
          • Architecture
        • Impala
          • Architecture
        • Kyuubi
          • Architecture
          • Kyuubi vs HiveServer vs Spark Thrift Server
          • Share levels
        • MapReduce
          • Architecture
          • Overview
        • Ozone
          • Architecture
          • HDFS vs Ozone
        • Solr
          • Overview
        • Spark
          • Spark Connect
        • Sqoop
          • Overview
        • SSM
          • Architecture
        • Trino
          • Architecture
        • YARN
          • Architecture
        • Zeppelin
        • ZooKeeper
      • Security
        • Kerberos
    • Release notes
      • ADH releases
      • Supported services
      • Version compatibility matrix
      • Known issues
            • wal_roll
            • zk_dump
          • Replication
            • add_peer
            • append_peer_namespaces
            • append_peer_tableCFs
            • disable_peer
            • disable_table_replication
            • enable_peer
            • enable_table_replication
            • get_peer_config
            • list_peers
            • list_peer_configs
            • list_replicated_tables
            • remove_peer
            • remove_peer_namespaces
            • remove_peer_tableCFs
            • set_peer_bandwidth
            • set_peer_exclude_namespaces
            • set_peer_exclude_tableCFs
            • set_peer_namespaces
            • set_peer_replicate_all
            • set_peer_tableCFs
            • show_peer_tableCFs
            • update_peer_config
          • Snapshots
            • clone_snapshot
            • delete_all_snapshot
            • delete_snapshot
            • delete_table_snapshots
            • list_snapshots
            • list_table_snapshots
            • restore_snapshot
            • snapshot
          • Configuration
            • update_all_config
            • update_config
          • Quotas
            • list_quotas
            • list_snapshot_sizes
            • list_quota_snapshots
            • list_quota_table_sizes
            • set_quota
          • Security
            • grant
            • list_security_capabilities
            • revoke
            • user_permission
          • Procedures
            • abort_procedure
            • list_locks
            • list_procedures
          • Visibility labels
            • add_labels
            • clear_auths
            • get_auths
            • list_labels
            • set_auths
            • set_visibility
          • Rsgroup
            • add_rsgroup
            • balance_rsgroup
            • get_rsgroup
            • get_server_rsgroup
            • get_table_rsgroup
            • list_rsgroups
            • move_namespaces_rsgroup
            • move_servers_rsgroup
            • move_servers_namespaces_rsgroup
            • move_servers_tables_rsgroup
            • move_tables_rsgroup
            • remove_rsgroup
            • remove_servers_rsgroup
      • Hive
        • Hive on Spark parameters
        • Beeline command line
      • Impala
        • Impala shell commands
      • MapReduce
        • MapReduce CLI
          • User commands
            • archive
            • archives-logs
            • classpath
            • distcp
            • envvars
            • job
            • pipes
            • queue
            • version
          • Administration commands
            • frameworkuploader
            • historyserver
            • hsadmin
      • Spark
        • spark-shell
        • spark-submit
      • Sqoop
        • Sqoop CLI
          • codegen
          • create-hive-table
          • eval
          • export
          • help
          • import-all-tables
          • import-mainframe
          • import
          • job
          • list-databases
          • list-tables
          • merge
          • metastore
          • version
      • SSM
        • Actions
        • Rules
      • YARN
        • YARN CLI
          • User commands
            • application
            • applicationattempt
            • classpath
            • container
            • envvars
            • jar
            • logs
            • node
            • queue
            • top
            • version
          • Administration commands
            • daemonlog
            • nodemanager
            • proxyserver
            • registrydns
            • resourcemanager
            • rmadmin
            • schedulerconf
            • scmadmin
            • sharedcachemanager
            • timelineserver
      • Configuration parameters
      • Monitoring
        • ADH monitoring metrics
      • Glossary
    • Planning guide
      • Software requirements
      • Hardware requirements
        • Depending on node types
          • NameNode
          • DataNode
          • YARN
          • PostgresSQL nodes used with Hive Metastore
        • Depending on workload patterns
      • Filesystem requirements
      • Network requirements
        • ADH network ports
      • Typical configurations
        • Pilot cluster
      • Custom Java settings
    • Concepts
      • Architecture and data model
        • Airflow
        • Flink
          • Overview
        • HBase
          • Architecture
          • Data model
        • HDFS
          • Architecture
            • Scalability
        • Hive
          • TEZ
        • Iceberg
          • Architecture
        • Impala
          • Architecture
        • Kyuubi
          • Architecture
          • Kyuubi vs HiveServer vs Spark Thrift Server
          • Share levels
        • MapReduce
          • Architecture
          • Overview
        • Ozone
          • Architecture
          • HDFS vs Ozone
        • Solr
          • Overview
        • Spark
          • Spark Connect
        • Sqoop
          • Overview
        • SSM
          • Architecture
        • Trino
          • Architecture
        • YARN
          • Architecture
        • Zeppelin
        • ZooKeeper
      • Security
        • Kerberos
    • Release notes
      • ADH releases
      • Supported services
      • Version compatibility matrix
      • Known issues
How to

Airflow
Flink
HBase
HDFS
Hive
HUE
Impala
Kyuubi
MapReduce
MariaDB
Ozone
Phoenix
Solr
Spark
Sqoop
SSM
Trino
YARN
Zeppelin
ZooKeeper
© Arenadata Software LLC,