doc-exports/docs/dws/umn/dws_01_0055.html
Lu, Huayi 95132e24fc DWS UMN 830.201_new version
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com>
Co-authored-by: Lu, Huayi <luhuayi@huawei.com>
Co-committed-by: Lu, Huayi <luhuayi@huawei.com>
2024-05-27 11:54:34 +00:00

21 lines
4.3 KiB
HTML

<a name="EN-US_TOPIC_0000001658895358"></a><a name="EN-US_TOPIC_0000001658895358"></a>
<h1 class="topictitle1">MRS Data Source Usage Overview</h1>
<div id="body8662426"><div class="section" id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_section34418118183527"><h4 class="sectiontitle">MRS Cluster Overview</h4><p id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_p8060118">MRS is a big data cluster running based on the open-source Hadoop ecosystem. It provides the industry's latest cutting-edge storage and analysis capabilities of massive volumes of data, satisfying your data storage and processing requirements. For details about MRS services, see the <i><cite id="EN-US_TOPIC_0000001658895358__citeab00060b28cc45ebb2e8ed3670ff8289165022">MapReduce Service User Guide</cite></i>.</p>
<p id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_p56178650111411">You can use Hive/Spark (analysis cluster of MRS) to store massive volumes of service data. Hive/Spark data files are stored in HDFS. On GaussDB(DWS), you can connect a data warehouse cluster to MRS clusters, read data from HDFS files, and write the data to GaussDB(DWS) when the clusters are on the same network.</p>
</div>
<div class="section" id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_section4774472184623"><h4 class="sectiontitle">Operation Process</h4><p id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_p098116386811">Perform the following operations to import data from MRS to a data warehouse cluster:</p>
<ol id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_ol2946194915157"><li id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li17564117115920">Prerequisites<ol type="a" id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_ol11524636165910"><li id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li121622255419">Create an MRS cluster in a GaussDB(DWS) cluster. For details, see "Buying a Custom Cluster" in <i><cite id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_cite1615715251411">MapReduce User Guide</cite></i>.</li><li id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li2152139185915">Create an HDFS foreign table for querying data from the MRS cluster over APIs of a foreign server.<p id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_p1290418513596"><a name="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li2152139185915"></a><a name="en-us_topic_0000001372999230_li2152139185915"></a>For details, see <span class="filepath" id="EN-US_TOPIC_0000001658895358__filepath16852657142216"><b>Data Import &gt; <span id="EN-US_TOPIC_0000001658895358__text5852115715224">Importing</span> Data from MRS to a Cluster</b></span> in the <i><cite id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_cite159041519595">Data Warehouse Service Database Development Guide</cite></i>.</p>
<div class="note" id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_note8151839135919"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_ul315113955917"><li id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li115143912593">Multiple MRS data sources can exist on the same network, but one GaussDB(DWS) cluster can connect to only one MRS cluster at a time.</li></ul>
</div></div>
</li></ol>
</li><li id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li15447351271">In the data warehouse cluster, create an MRS data source connection according to <a href="dws_01_0059.html">Creating an MRS Data Source Connection</a>.</li><li id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li1153072012292">Import data from an MRS data source to the cluster. For details, see "Data Import &gt; Importing Data from MRS to a Data Warehouse Cluster".</li><li id="EN-US_TOPIC_0000001658895358__en-us_topic_0000001372999230_li796932415269">(Optional) When the HDFS configuration of the MRS cluster changes, update the MRS data source configuration on GaussDB(DWS). For details, see <a href="dws_01_0156.html">Updating the MRS Data Source Configuration</a>.</li></ol>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="dws_01_0057.html">MRS Data Sources</a></div>
</div>
</div>