Use ADB DDBoost plugin
Overview
ADB DDBoost is a plugin-connector for native usage of the Dell EMC Data Domain storage system with Greenplum utilities for logical data backup (gpbackup) and restore (gprestore).
When the DDBoost plugin is used with gpbackup and gprestore, it connects to the Data Domain Platform server. To integrate with Data Domain Platform, the plugin requires the libDDBoost.so library version 7.7, which is supplied as part of the Dell DDBoostSDK 7.7 package. This library is proprietary and should be obtained through Dell Support or Dell Data Domain representatives in your country. Due to deduplication of data provided by the library, the DDBoost plugin allows you to significantly accelerate backup operations and reduce the physical storage usage.
NOTE
|
Installation and configuration
To install the ADB DDBoost plugin, follow the steps:
-
Ensure that you use Arenadata DB Enterprise 6.22.0.38 or higher. Update your ADB cluster if necessary.
-
Get the adb-ddp-plugin RPM package from the Arenadata support team. The package name contains a specific version number. In the example below, the version number is omitted for simplification. Install the RPM package on all cluster hosts under the user with
sudo
privileges. Below is an example of installation via the YUM package manager:$ sudo yum install -y adb-ddp-plugin.rpm
As a result of the successful command execution, a plugin executable file is added to the following location: $GPHOME/bin/adb_ddp_plugin.
-
Install the libDDBoost.so library on all cluster hosts if you have not done it before. Below is an example of installation via the YUM package manager:
$ sudo yum install -y libddboost.rpm
NOTEIn the current example, the CentOS 7 operating system is used, and the libDDBoost.so file is created in the /lib64 directory. The path may differ in your environment.
-
Connect to the ADB master host under the
gpadmin
user:$ sudo su - gpadmin
-
On the master host, create a YAML configuration file in the /home/gpadmin directory (e.g. adb_ddp_plugin.yaml). A file structure and its description are given below.
executablepath: /usr/lib/gpdb/bin/adb_ddp_plugin options: hostname: "testhost" username: "testuser" password: "XXXXXXXXXXXX" storage_unit: "testunit" directory: "testdir" write_buffer_size: "1048576" read_buffer_size: "1048576" log_level: "DEBUG" log_path: "/home/gpadmin"
Name | Description | Default | Required |
---|---|---|---|
executablepath |
An absolute path to the plugin in the file system of ADB hosts (along with the plugin executable file name). The plugin should be installed on all cluster hosts in the same directory. The default path to the plugin executable file is $GPHOME/bin/adb_ddp_plugin |
— |
Yes |
hostname |
An IP address or name of the host that provides operations with DDBoost. Can contain no more than 30 characters |
— |
Yes |
username |
A name of the user who is granted permissions to work with DDBoost. The user is configured on the DDBoost side. This is neither operating system nor ADB user. Can contain no more than 30 characters |
— |
Yes |
password |
A password of the user who is granted permissions to work with DDBoost |
— |
Yes |
storage_unit |
A name of the storage unit that is configured on the DDBoost side |
— |
Yes |
directory |
A name of the directory in the DDBoost file system. That directory is used to store all backup files created when the gpbackup utility is run with the DDBoost plugin. In the /<storage_unit>/<directory> folder, the plugin automatically creates all subdirectories corresponding to the creation date and time of each backup: /<storage_unit>/<directory>/YYYYMMDD/YYYYMMDDHHmmSS/ |
— |
Yes |
write_buffer_size |
A buffer size for writing data to DDBoost (in bytes). Values from the following range are allowed: 64 <= write_buffer_size <= 1048576 |
— |
Yes |
read_buffer_size |
A buffer size for reading data from DDBoost (in bytes). Values from the following range are allowed: 64 <= read_buffer_size <= 1048576 |
— |
Yes |
log_level |
A log level. Possible values:
|
WARN |
No |
log_path |
An absolute directory path to write plugin logs. The adb_ddp_plugin.log file is created in the specified directory when the plugin is used |
/home/gpadmin/gpAdminLogs |
No |
Usage examples
gpbackup
To use the ADB DDBoost plugin when creating backups via the gpbackup utility, specify the --plugin-config <yaml_path>
command argument, where <yaml_path>
is an absolute path to the plugin configuration file in YAML format.
The following example shows how to backup one table of the adb
database:
-
Connect to the
adb
database on ADB master under thegpadmin
user (for example, via psql). Using the psql command\dt
, ensure the required table (public.test
in the current example) exists in the database:\dt
The result:
List of relations Schema | Name | Type | Owner | Storage --------+-----------------+-------+---------+---------------------- public | spatial_ref_sys | table | gpadmin | heap public | test | table | gpadmin | heap public | test2 | table | gpadmin | append only columnar public | test3 | table | gpadmin | append only columnar (4 rows)
-
Disconnect from
adb
and run gpbackup with the--plugin-config
parameter under thegpadmin
user. Note that the--include-table
parameter allows you to backup one table:$ gpbackup --dbname adb --no-compression --single-data-file --include-table public.test --plugin-config /home/gpadmin/adb_ddp_plugin.yaml
TIPWhen the DDBoost plugin is used along with gpbackup, it is recommended to set the backup mode "all segment tables — one file" (
--single-data-file
). Do not use the parallel backup mode (--jobs
) because of its extremely low write performance.If the data backup succeeds, the command output ends with the following message:
[INFO]:-Backup completed successfully
-
Ensure that the backup files are generated and stored on the selected Data Domain Platform server. To do this, open the /<storage_unit>/<directory>/YYYYMMDD/YYYYMMDDHHmmSS directory on the server, where:
-
<storage_unit>
and<directory>
— values of eponymous fields from the plugin YAML configuration file. -
YYYYMMDD
— backup creation date. -
YYYYMMDDHHmmSS
— backup creation date and time. It is a timestamp that will be used as the--timestamp
value when restoring data from the current backup.
$ ls ./ddtest-dstu/gpbackup/20240130/20240130160839
The directory contents are as follows:
gpbackup_0_20240130160839 gpbackup_3_20240130160839 gpbackup_0_20240130160839_toc.yaml gpbackup_3_20240130160839_toc.yaml gpbackup_1_20240130160839 gpbackup_4_20240130160839 gpbackup_1_20240130160839_toc.yaml gpbackup_4_20240130160839_toc.yaml gpbackup_20240130160839_config.yaml gpbackup_5_20240130160839 gpbackup_20240130160839_metadata.sql gpbackup_5_20240130160839_toc.yaml gpbackup_20240130160839_plugin_config.yaml gpbackup_6_20240130160839 gpbackup_20240130160839_report gpbackup_6_20240130160839_toc.yaml gpbackup_20240130160839_toc.yaml gpbackup_7_20240130160839 gpbackup_2_20240130160839 gpbackup_7_20240130160839_toc.yaml gpbackup_2_20240130160839_toc.yaml
-
gprestore
To use the ADB DDBoost plugin when restoring data via the gprestore utility, specify the --plugin-config <yaml_path>
command argument, where <yaml_path>
is an absolute path to the plugin configuration file in YAML format.
The following example shows how to restore the public.test
table, for which a backup was previously created:
-
Connect to the
adb
database on ADB master under thegpadmin
user (for example, via psql). Drop thepublic.test
table:DROP TABLE test;
-
Ensure the table does not exist via the psql command
\dt
:\dt
The result:
List of relations Schema | Name | Type | Owner | Storage --------+-----------------+-------+---------+---------------------- public | spatial_ref_sys | table | gpadmin | heap public | test2 | table | gpadmin | append only columnar public | test3 | table | gpadmin | append only columnar (3 rows)
-
Disconnect from
adb
and run gprestore with the--plugin-config
parameter under thegpadmin
user. Note that the--timestamp
value should contain the backup creation timestamp in theYYYYMMDDHHmmSS
format:$ gprestore --timestamp 20240130160839 --plugin-config /home/gpadmin/adb_ddp_plugin.yaml --on-error-continue --include-table public.test
If the data restore succeeds, the command output ends with the following message:
[INFO]:-Restore completed successfully
-
Connect to the
adb
database again and check that thepublic.test
table is available:\dt
The result:
List of relations Schema | Name | Type | Owner | Storage --------+-----------------+-------+---------+---------------------- public | spatial_ref_sys | table | gpadmin | heap public | test | table | gpadmin | heap public | test2 | table | gpadmin | append only columnar public | test3 | table | gpadmin | append only columnar (4 rows)