Before GaussDB(DWS) can read data from MRS HDFS, you need to create an MRS data source connection, which serves as the channel for transferring data between the data warehouse cluster and the MRS cluster.
Configure parameters as required. For details, see "Cluster Operation Guide > Custom Creation of a Cluster" in the MapReduce Service User Guide.
Cluster Version can also be set to 1.6.x, 1.7.x, 1.8.x, or 2.0.x.
If Kerberos authentication is enabled for the MRS cluster, use MRS Manager to create a user for connecting GaussDB(DWS) to the MRS cluster after the cluster is created. The user type must be Human-Machine, and the user must be bound to the user group hadoop and the role Manager_administrator. After the user is created, change the user password on the MRS Manager page.
If you already have a qualified MRS cluster, skip this step.
Parameter | Description |
---|---|
MRS Data Source | Specifies the MRS cluster to which GaussDB(DWS) can connect. By default, all available analytic MRS clusters that are in the same VPC and subnet as the current data warehouse cluster and in the Available state are displayed. After you select an MRS cluster, the system automatically displays whether Kerberos authentication is enabled for the selected cluster. Click View MRS Cluster to view its detailed information. If the MRS Data Source drop-down list is empty, click Create MRS Cluster to create an MRS cluster. |
MRS Account | Specifies the account used when a data warehouse cluster connects to an MRS cluster. This parameter is available only when Kerberos authentication is selected for the MRS cluster. |
Password | Specifies the password of the connection user. If you change the password, you need to create a connection again. This parameter is valid only for clusters with MRS Kerberos authentication enabled. |
Description | Describes the connection. |
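
The rules in the table can be summarized as a small sketch. It is not a GaussDB(DWS) or MRS API: the `MrsCluster` record, its field names, and both helper functions are hypothetical and only model the console behavior described above. It also assumes that MRS Account and Password are mandatory when Kerberos is enabled, which the table implies but does not state outright.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MrsCluster:
    # Hypothetical record; field names are illustrative, not an MRS API.
    name: str
    vpc_id: str
    subnet_id: str
    cluster_type: str       # for example "analytic"
    state: str              # for example "Available"
    kerberos_enabled: bool


def selectable_clusters(clusters: List[MrsCluster],
                        dws_vpc_id: str,
                        dws_subnet_id: str) -> List[MrsCluster]:
    """Clusters that would be listed under MRS Data Source by default:
    analytic, Available, and in the same VPC and subnet as the
    data warehouse cluster."""
    return [
        c for c in clusters
        if c.cluster_type == "analytic"
        and c.state == "Available"
        and c.vpc_id == dws_vpc_id
        and c.subnet_id == dws_subnet_id
    ]


def validate_connection(cluster: MrsCluster,
                        account: Optional[str] = None,
                        password: Optional[str] = None) -> List[str]:
    """Return a list of problems; an empty list means the inputs are
    consistent with the parameter table."""
    problems = []
    if cluster.kerberos_enabled:
        # Assumption: both fields are required once Kerberos is enabled.
        if not account:
            problems.append("MRS Account is required when Kerberos is enabled")
        if not password:
            problems.append("Password is required when Kerberos is enabled")
    elif account or password:
        problems.append(
            "MRS Account and Password apply only to Kerberos-enabled clusters"
        )
    return problems
```

Calling `validate_connection` before filling in the dialog box is one way to catch a missing account or password early; the console itself remains the authority on which fields are shown.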
Configuration Status changes to Creating. After the connection is created successfully, it appears in the MRS data source list with the connection status Available.