Hadoop CLI


This article includes the reference documentation for the Hadoop shell command-line tool.

The hadoop dfs command is deprecated. Use hdfs dfs instead.

All Hadoop commands and subprojects follow the same basic structure. The usage is as follows:

    shellcommand [SHELL_OPTIONS] [COMMAND] [GENERIC_OPTIONS] [COMMAND_OPTIONS]

Hadoop shell basic structure

shellcommand

The command of the project being invoked. For example, Hadoop Common uses hadoop, HDFS uses hdfs, and YARN uses yarn

SHELL_OPTIONS

Options that the shell processes before executing Java

COMMAND

Action to perform

GENERIC_OPTIONS

The common set of options supported by multiple commands

COMMAND_OPTIONS

Various command options for the Hadoop common subprojects
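To show how the parts of a command line fit together, here is a minimal sketch that assembles one invocation piece by piece. The configuration directory, the property value, and the file names are assumptions chosen for illustration. RUN is a hypothetical dry-run switch, not a Hadoop feature: by default the command is only printed.

```shell
# RUN is a hypothetical dry-run switch (not part of Hadoop).
# Unset: the command is printed. Set to empty (RUN= sh script.sh): the command runs.
RUN=${RUN-echo}

SHELL_COMMAND=hadoop                        # command of the project being invoked
SHELL_OPTIONS="--config /etc/hadoop/conf"   # processed by the shell before Java starts (assumed path)
COMMAND=fs                                  # action to perform
GENERIC_OPTIONS="-D dfs.replication=2"      # option shared by many commands
COMMAND_OPTIONS="-put report.csv /tmp/"     # options specific to fs (hypothetical file)

# Left unquoted on purpose so that each variable splits into separate arguments.
$RUN $SHELL_COMMAND $SHELL_OPTIONS $COMMAND $GENERIC_OPTIONS $COMMAND_OPTIONS
```

Note the order: shell options come before the command, and generic options come before the command-specific ones.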

All the shell commands accept a common set of options. For some commands, these options are ignored. For example, passing --hostnames on a command that only executes on a single host will be ignored.

Shell options


--buildpaths

Enables developer versions of JARs

--config confdir

Overrides the default configuration directory. The default directory is $HADOOP_HOME/etc/hadoop

--daemon mode

If the command supports daemonization (e.g., hdfs namenode), executes in the appropriate mode.

Supported modes are start (starts the process in daemon mode), stop (stops the process), and status (reports whether the process is active).

The status will return an LSB-compliant result code.

If no option is provided, commands that support daemonization will run in the foreground.

For commands that don’t support daemonization, this option is ignored
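The following sketch walks through the three daemon modes using hdfs namenode as the example. The lsb_status_text helper is hypothetical and only translates the most common LSB status codes into text. RUN is likewise a hypothetical dry-run switch that prints the commands unless it is set to an empty value.

```shell
RUN=${RUN-echo}   # hypothetical dry-run switch: prints commands unless RUN is set to empty

# Hypothetical helper: describe an LSB status code.
lsb_status_text() {
  case "$1" in
    0) echo "running" ;;
    1) echo "dead, but a PID file exists" ;;
    3) echo "not running" ;;
    *) echo "unknown (code $1)" ;;
  esac
}

$RUN hdfs --daemon start namenode     # start the NameNode in the background
$RUN hdfs --daemon status namenode    # query its status
rc=$?                                 # in a dry run this is echo's exit code, not the daemon's
echo "namenode status: $(lsb_status_text "$rc")"
$RUN hdfs --daemon stop namenode      # stop the NameNode
```

Without --daemon, the same hdfs namenode command stays attached to the terminal and runs in the foreground.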


--debug

Enables shell-level configuration debugging information


--help

Displays shell script usage information


--hostnames

When --workers is used, overrides the workers file with a whitespace-delimited list of hostnames on which to execute a multi-host subcommand.

If --workers isn’t used, this option is ignored


--hosts

When --workers is used, overrides the workers file with another file that contains a list of hostnames on which to execute a multi-host subcommand.

If --workers isn’t used, this option is ignored

--loglevel loglevel

Overrides the log level.

Valid log levels are FATAL, ERROR, WARN, INFO, DEBUG, and TRACE. Default is INFO


--workers

If possible, executes this command on all hosts in the workers file
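A few of these shell options in combination might look as follows. This is a sketch under stated assumptions: the alternate configuration directory and the host names are invented for the example, and RUN is a hypothetical dry-run switch that prints each command unless it is set to an empty value.

```shell
RUN=${RUN-echo}   # hypothetical dry-run switch, not part of Hadoop

# Use an alternate configuration directory and quieter logging for one command.
$RUN hadoop --config /etc/hadoop/conf.test --loglevel WARN fs -ls /

# Start a DataNode on every host listed in the workers file.
$RUN hdfs --workers --daemon start datanode

# Same, but only on two named hosts. Quote the list: it is whitespace-delimited.
$RUN hdfs --workers --hostnames "dn1.example.com dn2.example.com" --daemon start datanode
```

All of these options appear before the subcommand, because the shell script processes them before Java starts.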

Many subcommands share a common set of configuration options to alter their behavior.

Generic options

-archives <comma separated list of archives>

Specifies comma-separated archives to be extracted onto the compute machines. Applies only to a job

-conf <configuration file>

Specifies an application configuration file

-D <property>=<value>

Sets a value for a given property

-files <comma separated list of files>

Specifies comma-separated files to be copied to the MapReduce cluster. Applies only to a job

-fs <file:///> or <hdfs://namenode:port>

Specifies the default file system URL to use.

Overrides the fs.defaultFS property from the configuration

-jt <local> or <resourcemanager:port>

Specifies a ResourceManager. Applies only to a job

-libjars <comma separated list of jars>

Specifies comma-separated JAR files to include in the classpath. Applies only to a job
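The generic options can be sketched with a few typical invocations. The host name, port, file names, JAR, and class name below are assumptions for illustration only. For hadoop jar, the generic options are honored only if the application parses them (for example, by running through ToolRunner). RUN is a hypothetical dry-run switch that prints each command unless it is set to an empty value.

```shell
RUN=${RUN-echo}   # hypothetical dry-run switch, not part of Hadoop

# Set one property for a single command.
$RUN hadoop fs -D dfs.replication=1 -put data.csv /user/alice/

# Point the command at a specific file system instead of fs.defaultFS.
$RUN hadoop fs -fs hdfs://namenode.example.com:8020 -ls /

# Ship a side file and an extra JAR with a job (hypothetical application).
$RUN hadoop jar wordcount.jar com.example.WordCount \
  -files stopwords.txt -libjars text-utils.jar /input /output
```

Generic options go after the command name and before the command-specific options.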

User commands

These commands are helpful for Hadoop cluster users.

Command Description


archive

Creates a Hadoop archive

checknative

Checks the availability of the Hadoop native code

CLASSNAME

Runs an arbitrary Java class

classpath

Prints the classpath

credential

Manages credentials, passwords, and secrets

distch

Changes the ownership and permissions on files

distcp

Copies files or directories recursively

dtutil

Utility to fetch and manage Hadoop tokens

envvars

Displays computed Hadoop environment variables

fs

Synonym for hdfs dfs when HDFS is in use

gridmix

Benchmark tool for a Hadoop cluster

jar

Runs a JAR file

jnipath

Prints the computed java.library.path

kerbname

Converts the named principal via the auth_to_local rules to the Hadoop username

kdiag

Diagnoses Kerberos problems

key

Manages keys via the KeyProvider

kms

Runs the Key Management Server

trace

Views and modifies Hadoop tracing settings

version

Prints the version
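As a quick orientation, several of the user commands can be tried as shown in this sketch. The NameNode addresses and paths in the distcp line are assumptions. RUN is a hypothetical dry-run switch that prints each command unless it is set to an empty value.

```shell
RUN=${RUN-echo}   # hypothetical dry-run switch, not part of Hadoop

$RUN hadoop version          # print the Hadoop version
$RUN hadoop classpath        # print the classpath used by the Hadoop scripts
$RUN hadoop checknative -a   # check that all native libraries are available
$RUN hadoop envvars          # display the computed Hadoop environment variables

# Copy a directory recursively between two clusters (hypothetical addresses).
$RUN hadoop distcp hdfs://nn1.example.com:8020/data hdfs://nn2.example.com:8020/backup
```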

Administration commands

These commands are useful for administrators of a Hadoop cluster.

Command Description


daemonlog

Gets/sets the log level
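Reading and changing a log level with the daemonlog command might look like the sketch below. The daemon address is an assumption (a NameNode web endpoint on port 9870), and the logger name is only an example. RUN is a hypothetical dry-run switch that prints each command unless it is set to an empty value.

```shell
RUN=${RUN-echo}   # hypothetical dry-run switch, not part of Hadoop

DAEMON=nn.example.com:9870                                # assumed host:port of the daemon's HTTP endpoint
LOGGER=org.apache.hadoop.hdfs.server.namenode.NameNode    # example logger (qualified class name)

# Read the current level, then raise verbosity to DEBUG.
$RUN hadoop daemonlog -getlevel "$DAEMON" "$LOGGER"
$RUN hadoop daemonlog -setlevel "$DAEMON" "$LOGGER" DEBUG
```

A level set this way applies to the running daemon and is not written to its configuration files, so it does not survive a restart.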
