Redistributing Data

Data redistribution moves part of the data on existing nodes to the new nodes after you scale out a cluster, so that data is evenly distributed across all nodes. It is a time-consuming but crucial task, because evenly distributed data accelerates service response.
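The effect of redistribution can be pictured with a toy model. The sketch below assumes rows are placed on nodes by key modulo node count. This placement rule and the names node_for and rows_per_node are assumptions made for illustration only; they do not describe the distribution algorithm that GaussDB(DWS) actually uses.

```python
from collections import Counter


def node_for(key: int, node_count: int) -> int:
    """Toy placement rule (an assumption): key modulo node count."""
    return key % node_count


def rows_per_node(keys, node_count: int) -> list:
    """Count how many rows land on each node under the toy rule."""
    counts = Counter(node_for(k, node_count) for k in keys)
    return [counts.get(n, 0) for n in range(node_count)]


keys = range(12_000)

# Right after scale-out from 3 to 6 nodes: the 3 original nodes still
# hold all rows and the 3 new nodes are empty.
before = rows_per_node(keys, 3) + [0, 0, 0]

# After redistribution: rows are spread over all 6 nodes.
after = rows_per_node(keys, 6)

print("before:", before)
print("after: ", after)
```

In this model each node holds half as many rows after redistribution, which is why queries that scan data in parallel on every node can respond faster.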

By default, redistribution starts automatically after a cluster scale-out. For higher reliability, disable automatic redistribution and manually start a redistribution task after the scale-out succeeds. This way, scale-out and redistribution can each be retried if they fail.
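The benefit of separating the two steps can be sketched as two independently retried phases. The helper run_with_retries and the scale_out and redistribute callables are hypothetical placeholders for whatever triggers each task (for example, an operator action on the console); they are not part of any GaussDB(DWS) API.

```python
import time
from typing import Callable


def run_with_retries(task: Callable[[], None], attempts: int = 3,
                     delay_s: float = 0.0) -> int:
    """Run task until it succeeds; return the number of attempts used.

    Re-raises the last RuntimeError if every attempt fails.
    """
    for attempt in range(1, attempts + 1):
        try:
            task()
            return attempt
        except RuntimeError:
            if attempt == attempts:
                raise
            time.sleep(delay_s)
    raise ValueError("attempts must be at least 1")


def scale_out_then_redistribute(scale_out: Callable[[], None],
                                redistribute: Callable[[], None]) -> None:
    """Hypothetical two-phase flow with independent retries."""
    # Phase 1: a failed scale-out is retried without touching data layout.
    run_with_retries(scale_out)
    # Phase 2: starts only after scale-out succeeded; a failure here is
    # retried without repeating the scale-out.
    run_with_retries(redistribute)
```

Because each phase has its own retry loop, a failure during redistribution does not force the scale-out to be repeated.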

Currently, offline redistribution and online redistribution are supported. The default mode is offline redistribution.

  • Cluster redistribution is supported in cluster version 8.1.1.200 or later.
  • You can manually start redistribution only when the cluster task information displays "To be redistributed" after scale-out.
  • You can also select the redistribution mode when you configure cluster scale-out (see Configure advanced parameters).
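The first two conditions in the notes above can be expressed as a small pre-check. This is a minimal sketch: the function names are hypothetical, and it assumes you have already read the cluster version and task information from the console yourself.

```python
MIN_VERSION = (8, 1, 1, 200)          # minimum version stated in the notes
READY_STATUS = "To be redistributed"  # task information stated in the notes


def parse_version(text: str) -> tuple:
    """Turn a dotted version string such as '8.1.1.200' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def can_start_redistribution(cluster_version: str, task_info: str) -> bool:
    """Return True only if both documented preconditions hold."""
    version_ok = parse_version(cluster_version) >= MIN_VERSION
    status_ok = task_info == READY_STATUS
    return version_ok and status_ok


print(can_start_redistribution("8.1.1.200", "To be redistributed"))
print(can_start_redistribution("8.1.1.100", "To be redistributed"))
```

Comparing versions as integer tuples avoids the pitfall of string comparison, where "8.1.1.99" would sort after "8.1.1.200".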

Offline Redistribution

Precautions

Procedure

  1. Log in to the GaussDB(DWS) management console.
  2. Choose Clusters. All clusters are displayed by default.
  3. In the Operation column of the target cluster, choose More > Redistribute, as shown in the following figure.

  4. On the Redistribute page that is displayed, keep the default offline redistribution mode and click Next: Confirm to submit the task.

Online Redistribution

Precautions

In online redistribution mode, the database supports only some DDL and DCL operations.

Procedure

  1. Log in to the GaussDB(DWS) management console.
  2. Choose Clusters. All clusters are displayed by default.
  3. In the Operation column of the target cluster, choose More > Redistribute, as shown in the following figure.

  4. On the Redistribute page that is displayed, set Advanced to Custom, set the redistribution mode to Online, and click Next: Confirm to submit the task.