Monitoring service
The Monitoring service deploys a Prometheus server inside ADQM to collect and store ADQM cluster monitoring metrics, and lets you use the Grafana web application to visualize and analyze them. This article describes the steps required to add this service to an ADQM cluster.
Overview
When you add the Monitoring service to an ADQM cluster, Node exporter is installed on all hosts. It is a monitoring agent that reads host system metrics, which Prometheus then collects. Prometheus also collects metrics from ADQM services (ClickHouse, ZooKeeper, ClickHouse Keeper, Chproxy). These metrics are available in the Prometheus format on the ports and endpoints specified in the service configuration settings. You can also use the Prometheus and Grafana web interfaces to view and analyze the data that the Monitoring service collects.
Step 1. Add the service
-
In the ADCM interface, open the Clusters page and click your ADQM cluster name. On the cluster page that opens, switch to the Services tab and click Add services.
-
In the dialog that opens, select the Monitoring service and click Add.
As a result, the added service is displayed on the Services tab.
Step 2. Add components
-
On the cluster page, open the Mapping tab to proceed to mapping service components to cluster hosts.
All components of the Monitoring service are mandatory (highlighted in red). Each component should be installed on one host of the cluster.
Monitoring service components:
-
Prometheus Server: deploys a Prometheus server that serves as a proxy channel for all collectors of metrics on a host, a storage for all metrics of a cluster, and a generator of alerts based on collected metrics.
-
Grafana: visualizes ADQM metrics as graphs and charts organized into dashboards.
-
Pushgateway: receives static metrics and sends them to Prometheus. In ADQM, it is used to pass the cluster structure to Prometheus.
-
Assign a host to each component of the Monitoring service — click Add hosts and select the desired host in the pop-up window.
CAUTION
It is not recommended to install the Prometheus Server and Pushgateway components on hosts with ADQM. Use separate hosts for them. Otherwise, if an ADQM host fails or the load on it is critically high, information about the corresponding problems will not be saved.
-
After the distribution of components is completed, click Save.
Step 3. Configure the service
-
Open the Services tab on the cluster page and click the Monitoring service name in the Name column.
-
On the page that opens, fill in the service’s configuration parameters. For parameter descriptions, see the Monitoring section in the Configuration parameters article. Fields highlighted in red are required.
After specifying all necessary parameters, click Save.
Step 4. Install the service
-
On the Services tab, click the icon for the Monitoring service in the Actions column and run the Install action.
-
Confirm the action in the window that opens.
-
Wait until the installation is completed. Then check that the service state has changed from created to installed.
To view the service installation process and analyze errors if they occur, select Jobs in the left navigation menu and click the Install job name in the Jobs list.
Step 5. View results
The Monitoring service starts automatically after installation. To ensure that monitoring works correctly, check the following:
-
Metrics are collected from all hosts of your cluster, not only from the hosts where components of the monitoring service are deployed.
-
Both system and ADQM service metrics are collected from the cluster hosts — see ADQM monitoring metrics.
To check both these points, you can view metrics in the Prometheus format in the browser, and also use the Prometheus or Grafana web interface.
Metrics in the Prometheus format
-
In the address bar of your browser, enter the address of an ADQM host followed by the port and endpoint on which service or system metrics are exposed. Port numbers and endpoints are defined in the corresponding sections on the configuration page of the Monitoring service:
-
ADQM’s services metric settings — settings for access to monitoring metrics of ADQM services;
-
Node exporter settings — settings for access to system metrics of the ADQM cluster hosts.
For example, http://10.92.40.107:9363/metrics is the address for viewing ClickHouse server metrics on the host with IP 10.92.40.107.
-
The page that opens will display monitoring metrics from the specified host of the ADQM cluster in the Prometheus format.
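The same check can be scripted. The following is a minimal Python sketch, not part of ADQM: it builds a metrics address from a host, port, and endpoint, and extracts metric names from output in the Prometheus text format. The helper names (metrics_url, parse_metric_names, fetch_metric_names) are hypothetical, and the sketch assumes the endpoint is reachable over plain HTTP without authentication.

```python
import urllib.request


def metrics_url(host: str, port: int, endpoint: str = "/metrics") -> str:
    """Build the address on which a host exposes metrics."""
    return f"http://{host}:{port}/{endpoint.lstrip('/')}"


def parse_metric_names(text: str) -> set[str]:
    """Return the metric names found in Prometheus text-format output."""
    names = set()
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        # A sample line looks like `name{label="value"} 1.0` or `name 1.0`.
        name = line.split("{", 1)[0].split(None, 1)[0]
        names.add(name)
    return names


def fetch_metric_names(host: str, port: int, endpoint: str = "/metrics") -> set[str]:
    """Download metrics from one host and return their names (needs network access)."""
    with urllib.request.urlopen(metrics_url(host, port, endpoint), timeout=10) as resp:
        return parse_metric_names(resp.read().decode("utf-8"))
```

For instance, calling fetch_metric_names("10.92.40.107", 9363) would list the ClickHouse server metric names for the host from the example above, provided that host is reachable from where the script runs.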
Prometheus web interface
-
In the address bar of your browser, enter the IP address of the host where the Prometheus Server component is installed, followed by the port number from the listen_address parameter in the Prometheus settings section of the Monitoring service configuration (the default port is 9092). For example: http://10.92.40.107:9092. To log in to the Prometheus interface, use the username and password specified in the Prometheus users to login/logout to Prometheus setting of the same section.
-
In the Expression field, you can enter a metric and click Execute — values of this metric on all hosts of the ADQM cluster will be shown in the interface.
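To check from a script that a metric is reported for every host, you can send the same expression to the Prometheus HTTP API (the /api/v1/query endpoint). The sketch below is illustrative only. It assumes that the Prometheus server accepts HTTP basic authentication with the credentials from the service configuration, and that the instance label of each sample has the form host:port. The function names are hypothetical.

```python
import base64
import json
import urllib.parse
import urllib.request


def query_url(base_url: str, expression: str) -> str:
    """Build an instant-query URL for the Prometheus HTTP API."""
    params = urllib.parse.urlencode({"query": expression})
    return f"{base_url.rstrip('/')}/api/v1/query?{params}"


def hosts_in_result(response: dict) -> set[str]:
    """Collect host names from the `instance` labels of an instant-query response."""
    hosts = set()
    for sample in response["data"]["result"]:
        instance = sample["metric"].get("instance", "")
        if instance:
            # Assumption: the label looks like "host:port"; drop the port.
            hosts.add(instance.rsplit(":", 1)[0])
    return hosts


def missing_hosts(response: dict, expected_hosts: set[str]) -> set[str]:
    """Return the expected hosts that reported no samples."""
    return expected_hosts - hosts_in_result(response)


def run_query(base_url: str, expression: str, user: str, password: str) -> dict:
    """Execute an instant query with basic authentication (needs network access)."""
    request = urllib.request.Request(query_url(base_url, expression))
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request.add_header("Authorization", f"Basic {token}")
    with urllib.request.urlopen(request, timeout=10) as resp:
        return json.load(resp)
```

With a response obtained from run_query("http://10.92.40.107:9092", "up", user, password), missing_hosts(response, expected) returns the hosts from your own list that Prometheus has no samples for. An empty set suggests that metrics arrive from all hosts of the cluster.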
Grafana web interface
-
In the address bar of your browser, enter the address of the host on which Grafana is deployed, followed by the port number, which is the value of the Grafana listen port parameter in the Grafana settings section of the Monitoring service configuration (the default value is 3000). For example, http://10.92.40.107:3000. To log in, use admin as the username and the value of the Grafana administrator’s password parameter (also found in the Grafana settings section) as the password.
-
In the window that opens, navigate to Home → Dashboards and expand the ADQM Dashboard <ADQM_cluster_name> section. In this section, you can select one of the available dashboards to view service or system metrics coming from your ADQM cluster.
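The list of dashboards can also be retrieved through the Grafana HTTP API (the /api/search endpoint). The following sketch is an assumption-laden illustration rather than an ADQM feature. It assumes that the ADQM Dashboard section is a Grafana folder, so that its name appears in the folderTitle field of search results, and that Grafana accepts basic authentication with the admin credentials described above. The function names are hypothetical.

```python
import base64
import json
import urllib.parse
import urllib.request


def dashboards_in_section(search_results: list[dict], section_prefix: str) -> list[str]:
    """Return sorted titles of dashboards whose folder title starts with section_prefix."""
    return sorted(
        item["title"]
        for item in search_results
        if item.get("type") == "dash-db"
        and item.get("folderTitle", "").startswith(section_prefix)
    )


def search_dashboards(base_url: str, user: str, password: str) -> list[dict]:
    """Fetch all dashboards known to Grafana (needs network access)."""
    params = urllib.parse.urlencode({"type": "dash-db"})
    request = urllib.request.Request(f"{base_url.rstrip('/')}/api/search?{params}")
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request.add_header("Authorization", f"Basic {token}")
    with urllib.request.urlopen(request, timeout=10) as resp:
        return json.load(resp)
```

For example, dashboards_in_section(search_dashboards("http://10.92.40.107:3000", "admin", password), "ADQM Dashboard") would list the dashboards in the ADQM section if the assumptions above hold for your installation.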