doc-exports/docs/dataartsstudio/umn/dataartsstudio_03_0336.html
Xiong, Chen Xiao f3ee29a3dd DataArts UMN 20230329 version
Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com>
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Xiong, Chen Xiao <chenxiaoxiong@huawei.com>
Co-committed-by: Xiong, Chen Xiao <chenxiaoxiong@huawei.com>
2023-05-11 17:09:25 +00:00

5.3 KiB

How Do I Set the Number of Concurrent Extractors for a CDM Migration Job?

The number of concurrent extractors in a CDM migration job is related to the cluster specifications and table size. The value range is 1 to 300. If the value is too large, the extractors are queued.

You are advised to set 4 concurrent extractors for each 1 CU (1 CU = 1 vCPU and 4 GB), as listed in Table 1. You can also adjust the value as needed. If each row of the table contains less than or equal to 1 MB data, you can extract data concurrently. If each row contains more than 1 MB data, you are advised to extract data in a single thread.
  • When data is to be migrated to files, CDM does not support multiple concurrent tasks. In this case, set a single process to extract data.
  • The number of concurrent extractors of a job is affected by Maximum Concurrent Extractors configured on the Settings page. The Maximum Concurrent Extractors parameter specifies the total number of concurrent extractions.
Table 1 Reference configurations of concurrent extractors

CDM Cluster Flavor

vCPUs/Memory

Concurrent Extractors

cdm.large

8 vCPUs, 16 GB

16

cdm.xlarge

16 vCPUs, 32 GB

32

cdm.4xlarge

64 vCPUs, 128 GB

128