MRS provides backup and restoration for user data and system data. Backup is provided on a per-component basis and covers Manager data (including OMS data and LdapServer data), Hive user data, component metadata saved in DBService, and HDFS metadata.
Backup is used in the following scenarios:

| Backup Type | Backup Content |
|---|---|
| OMS | Database data (excluding alarm data) and configuration data in the cluster management system (by default) |
| LdapServer | User information (usernames, passwords, keys, password policies, and user groups) |
| DBService | Metadata of the components (Hive) managed by DBService |
| NameNode | HDFS metadata |

Task
Before backing up or restoring data, create a backup or restoration task and set its parameters, such as the task name, the backup data source, and the type of path where backup files are saved. Backup and restoration are performed by executing these tasks. While MRS is restoring the data of HDFS, HBase, Hive, or NameNode, the cluster cannot be accessed.
A backup task can back up several data sources and generates an independent backup file for each of them. All the backup files generated by one backup task form a backup file set, which can be used in restoration tasks. Backup data can be stored on Linux local disks, in the HDFS of the local cluster, or in the HDFS of the standby cluster. Backup tasks support full and incremental backup policies: HDFS and Hive backup tasks support incremental backup, whereas OMS, LdapServer, DBService, and NameNode backup tasks support only full backup.
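The policy rules above can be expressed as a small validation step. The following is a minimal sketch, not an MRS API: the function name `validate_backup_task`, the path type labels, and the returned dictionary are all hypothetical, and it assumes that HDFS and Hive also accept the full policy.

```python
# Backup policies per data source, following the text above.
# Assumption: HDFS and Hive accept "full" in addition to "incremental".
SUPPORTED_POLICIES = {
    "HDFS": {"full", "incremental"},
    "Hive": {"full", "incremental"},
    "OMS": {"full"},
    "LdapServer": {"full"},
    "DBService": {"full"},
    "NameNode": {"full"},
}

# Hypothetical labels for the three storage locations named in the text.
PATH_TYPES = {"local_disk", "local_hdfs", "standby_hdfs"}


def validate_backup_task(name, data_source, path_type, policy="full"):
    """Return a task description if the combination is allowed, else raise ValueError."""
    if not name:
        raise ValueError("task name must not be empty")
    if data_source not in SUPPORTED_POLICIES:
        raise ValueError(f"unknown data source: {data_source!r}")
    if path_type not in PATH_TYPES:
        raise ValueError(f"unknown path type: {path_type!r}")
    if policy not in SUPPORTED_POLICIES[data_source]:
        raise ValueError(f"{data_source} does not support the {policy} policy")
    return {
        "name": name,
        "data_source": data_source,
        "path_type": path_type,
        "policy": policy,
    }


# An incremental Hive backup passes; an incremental OMS backup would raise ValueError.
task = validate_backup_task("hive-hourly", "Hive", "local_hdfs", "incremental")
```

Keeping the allowed policies in one lookup table means a new data source only needs one new entry rather than another branch in the checks.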
Task execution rules:

| Item | Specifications |
|---|---|
| Maximum number of backup or restoration tasks | 100 |
| Number of concurrent running tasks | 1 |
| Maximum number of waiting tasks | 199 |
| Maximum size of backup files on a Linux local disk (GB) | 600 |

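To illustrate how the concurrency and waiting limits interact, here is a minimal sketch of a queue that runs one task at a time and holds up to 199 waiting tasks. The `TaskQueue` class and its methods are hypothetical and only model the two limits taken from the specifications; this is not how MRS itself schedules tasks.

```python
from collections import deque

MAX_RUNNING = 1    # number of concurrent running tasks, from the specifications
MAX_WAITING = 199  # maximum number of waiting tasks, from the specifications


class TaskQueue:
    """Hypothetical scheduler that enforces the two limits above."""

    def __init__(self):
        self.running = None     # name of the task currently running, if any
        self.waiting = deque()  # tasks waiting for their turn, in FIFO order

    def submit(self, task_name):
        """Start the task if nothing is running, otherwise queue it."""
        if self.running is None:
            self.running = task_name
            return "running"
        if len(self.waiting) >= MAX_WAITING:
            raise RuntimeError("waiting queue is full")
        self.waiting.append(task_name)
        return "waiting"

    def finish(self):
        """Mark the running task as done and start the next waiting one, if any."""
        self.running = self.waiting.popleft() if self.waiting else None
        return self.running
```

With these limits, a second task submitted while another is running waits in the queue, and a submission made when 199 tasks are already waiting is rejected.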
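
| Item | OMS | LdapServer | DBService | NameNode |
|---|---|---|---|---|
| Backup period | 1 hour | 1 hour | 1 hour | 1 hour |
| Maximum number of copies | 2 | 2 | 2 | 2 |
| Maximum size of a backup file | 10 MB | 20 MB | 100 MB | 1.5 GB |
| Maximum size of disk space used | 20 MB | 40 MB | 200 MB | 3 GB |
| Save path of backup data | Save path/LocalBackup/ of the active and standby management nodes | Save path/LocalBackup/ of the active and standby management nodes | Save path/LocalBackup/ of the active and standby management nodes | Save path/LocalBackup/ of the active and standby management nodes |
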
The backup data of the default task must be periodically transferred and saved outside the cluster based on the enterprise O&M requirements.
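The default task figures are consistent with a simple relation: for each component, the maximum disk space used equals the maximum number of copies multiplied by the maximum size of a backup file. The sketch below checks this arithmetic. The relation is an observation from the listed values rather than a documented rule, the function name is hypothetical, and 1 GB is taken as 1024 MB.

```python
GB_IN_MB = 1024  # assumption: 1 GB = 1024 MB

MAX_COPIES = 2  # maximum number of copies, same for all four components

# Maximum size of a single backup file per component, in MB.
MAX_FILE_MB = {
    "OMS": 10,
    "LdapServer": 20,
    "DBService": 100,
    "NameNode": 1.5 * GB_IN_MB,
}

# Maximum size of disk space used per component, in MB.
MAX_DISK_MB = {
    "OMS": 20,
    "LdapServer": 40,
    "DBService": 200,
    "NameNode": 3 * GB_IN_MB,
}


def worst_case_disk_mb(component):
    """Disk space needed if every retained copy reaches the maximum file size."""
    return MAX_COPIES * MAX_FILE_MB[component]


# Each listed disk limit matches copies x file size.
for component, limit in MAX_DISK_MB.items():
    assert worst_case_disk_mb(component) == limit

# Combined worst case for the four default backups, in MB.
total_mb = sum(MAX_DISK_MB.values())
```

Under these assumptions, the four default backups together need at most 3332 MB of local disk space, which is a useful lower bound when sizing the disk that holds the save path.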