Yang, Tong 3f5759eed2 MRS comp-lts 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2023-01-19 17:08:45 +00:00

29 lines
11 KiB
HTML

<a name="mrs_01_1736"></a><a name="mrs_01_1736"></a>
<h1 class="topictitle1">Managing a HetuEngine Compute Instance</h1>
<div id="body32001227"><div class="section" id="mrs_01_1736__en-us_topic_0000001173789730_section16689919163913"><h4 class="sectiontitle">Scenario</h4><p id="mrs_01_1736__en-us_topic_0000001173789730_p1290164533117">On the <span id="mrs_01_1736__en-us_topic_0000001173789730_text49428541632">HetuEngine</span> web UI, you can start, stop, delete, and roll-restart a single compute instance or compute instances in batches.</p>
<div class="notice" id="mrs_01_1736__en-us_topic_0000001173789730_note128625461938"><span class="noticetitle"><img src="public_sys-resources/notice_3.0-en-us.png"> </span><div class="noticebody"><ul id="mrs_01_1736__en-us_topic_0000001173789730_ul164841652185118"><li id="mrs_01_1736__en-us_topic_0000001173789730_li137105835614">Restarting <span id="mrs_01_1736__en-us_topic_0000001173789730_text1834014165611">HetuEngine</span><p id="mrs_01_1736__en-us_topic_0000001173789730_p723041925615">During the restart or rolling restart of <span id="mrs_01_1736__en-us_topic_0000001173789730_text214177122165422">HetuEngine</span>, do not create, start, stop, or delete <span id="mrs_01_1736__en-us_topic_0000001173789730_text201882477465422">HetuEngine</span> compute instances on HSConsole.</p>
</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li1711611219568">Restarting <span id="mrs_01_1736__en-us_topic_0000001173789730_text159134219665422">HetuEngine</span> compute instances<ul id="mrs_01_1736__en-us_topic_0000001173789730_ul111964405711"><li id="mrs_01_1736__en-us_topic_0000001173789730_li9117727578">During the restart or rolling restart of <span id="mrs_01_1736__en-us_topic_0000001173789730_text134125411521">HetuEngine</span> compute instances, do not perform any change operations on the data sources on the <span id="mrs_01_1736__en-us_topic_0000001173789730_text118518385317">HetuEngine</span> and <span id="mrs_01_1736__en-us_topic_0000001173789730_text23851236183014">HetuEngine</span> web UI, including restarting <span id="mrs_01_1736__en-us_topic_0000001173789730_text1129325412591">HetuEngine</span> and changing its configurations.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li111892165712">If a compute instance has only one coordinator or worker, do not roll-restart the instance.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li6119132155712">If the number of workers is greater than 10, the rolling restart may take more than 200 minutes. During this period, do not perform other O&amp;M operations.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li51204211571">During the rolling restart of compute instances, HetuEngine releases Yarn resources and applies for them again. Ensure that the CPU and memory of Yarn are sufficient for starting 20% workers and Yarn resources are not preempted by other jobs. Otherwise, the rolling restart will fail.<p id="mrs_01_1736__en-us_topic_0000001173789730_p07328064010"><a name="mrs_01_1736__en-us_topic_0000001173789730_li51204211571"></a><a name="en-us_topic_0000001173789730_li51204211571"></a>Viewing Yarn resources: Log in to FusionInsight Manager and choose <strong id="mrs_01_1736__en-us_topic_0000001173789730_b13307817269">Tenant Resources</strong>. On the navigation pane on the left, choose <strong id="mrs_01_1736__en-us_topic_0000001173789730_b19894142411260">Tenant Resources Management</strong> to view the available queue resources of Yarn in the <strong id="mrs_01_1736__en-us_topic_0000001173789730_b16895825321">Resource Quota</strong> area.</p>
<p id="mrs_01_1736__en-us_topic_0000001173789730_p154512459484">Viewing the CPU and memory of a worker container: Log in to FusionInsight Manager as a user who can access the <span id="mrs_01_1736__en-us_topic_0000001173789730_text936619109254">HetuEngine</span> WebUI and choose <strong id="mrs_01_1736__en-us_topic_0000001173789730_b15880189123415">Cluster</strong> &gt; <strong id="mrs_01_1736__en-us_topic_0000001173789730_b1272161253412">Services</strong> &gt; <strong id="mrs_01_1736__en-us_topic_0000001173789730_b454432563414"><span id="mrs_01_1736__en-us_topic_0000001173789730_text11182231486">HetuEngine</span></strong>. In the <strong id="mrs_01_1736__en-us_topic_0000001173789730_b13791145118379">Basic Information</strong> area, click the link next to <strong id="mrs_01_1736__en-us_topic_0000001173789730_b132786511386">HSConsole WebUI</strong> to go to the HSConsole page. Click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b823611438385">Operation</strong> in the row where the target instance is located and click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b7238612103916">Configure</strong>.</p>
</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li44194818543">During the rolling restart, ensure that Application Manager of coordinators or workers in the Yarn queue runs stably.</li></ul>
<p id="mrs_01_1736__en-us_topic_0000001173789730_p767335295417">Troubleshooting</p>
<ul id="mrs_01_1736__en-us_topic_0000001173789730_ul10687115295411"><li id="mrs_01_1736__en-us_topic_0000001173789730_li568745265416">If Application Manager of coordinators or workers in the Yarn queues is restarted during the rolling restart, the compute instances may be abnormal. In this case, you need to stop the compute instances and then start the compute instance for recovery.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li156878523544">Compute instances are in the subhealthy state if they fail to be roll-restarted, which may lead to inconsistent configuration or number of coordinators or workers. In this case, the subhealth state of the instances will not be automatically restored. You need to manually check the instance status or restore the instance to healthy by performing the rolling restart again or stopping the compute instances.</li></ul>
</li></ul>
</div></div>
</div>
<div class="section" id="mrs_01_1736__en-us_topic_0000001173789730_section4159158922"><h4 class="sectiontitle">Prerequisites</h4><p id="mrs_01_1736__en-us_topic_0000001173789730_p19688725262">You have created an HetuEngine administrator for accessing the <span id="mrs_01_1736__en-us_topic_0000001173789730_text1160410941018">HetuEngine</span> web UI. For details, see <a href="mrs_01_1714.html">Creating a HetuEngine User</a>.</p>
<div class="note" id="mrs_01_1736__en-us_topic_0000001173789730_note196657493100"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="mrs_01_1736__en-us_topic_0000001173789730_ul2492050101017"><li id="mrs_01_1736__en-us_topic_0000001173789730_li13492350121013">Users in the <strong id="mrs_01_1736__en-us_topic_0000001173789730_b13222524104814">hetuadmin</strong> user group are HetuEngine administrators. Administrators have the permission to start, stop, and delete instances, and common users have only the permission to query instances.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li24921850161010">To modify the configuration of the current compute instance, you need to delete the instance on the HSConsole page.</li></ul>
</div></div>
</div>
<div class="section" id="mrs_01_1736__en-us_topic_0000001173789730_section11334163112411"><h4 class="sectiontitle">Procedure</h4><ol id="mrs_01_1736__en-us_topic_0000001173789730_ol135619383518"><li id="mrs_01_1736__en-us_topic_0000001173789730_li435613381253"><span>Log in to FusionInsight Manager as an administrator who can access the <span id="mrs_01_1736__en-us_topic_0000001173789730_text16483010151011">HetuEngine</span> web UI and choose <strong id="mrs_01_1736__en-us_topic_0000001173789730_b14856161974915">Cluster</strong> &gt; <strong id="mrs_01_1736__en-us_topic_0000001173789730_b4857819104910">Services</strong> &gt; <strong id="mrs_01_1736__en-us_topic_0000001173789730_b9511112355015"><span id="mrs_01_1736__en-us_topic_0000001173789730_text12685131191011">HetuEngine</span></strong>. The <strong id="mrs_01_1736__en-us_topic_0000001173789730_b631113355015"><span id="mrs_01_1736__en-us_topic_0000001173789730_text1646321271011">HetuEngine</span></strong> service page is displayed.</span></li><li id="mrs_01_1736__en-us_topic_0000001173789730_li93564385513"><span>In the <strong id="mrs_01_1736__en-us_topic_0000001173789730_b2629835155020">Basic Information</strong> area on the <strong id="mrs_01_1736__en-us_topic_0000001173789730_b1563412355506">Dashboard</strong> page, click the link next to <strong id="mrs_01_1736__en-us_topic_0000001173789730_b2635035165014">HSConsole WebUI</strong>. The HSConsole page is displayed.</span></li><li id="mrs_01_1736__en-us_topic_0000001173789730_li4356193810514"><span>In the <strong id="mrs_01_1736__en-us_topic_0000001173789730_b9263152710510">Operation</strong> column of the instance, you can perform the following operations on a single job:</span><p><ul id="mrs_01_1736__en-us_topic_0000001173789730_ul1750544121115"><li id="mrs_01_1736__en-us_topic_0000001173789730_li8750164461115">To start an instance, click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b1944773135110">Start</strong>.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li177501244191112">To stop an instance, click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b16869721185213">Stop</strong>.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li14750544161114">To delete an instance that is no longer used, click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b1136320844312">Delete</strong>. The configuration information of the instance is also deleted.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li278013763814">To roll-restart an instance, click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b17216113243">Rolling Restart</strong>.</li></ul>
</p></li><li id="mrs_01_1736__en-us_topic_0000001173789730_li61591246194614"><span>In the upper part of the instance list, you can perform the following operations on jobs:</span><p><ul id="mrs_01_1736__en-us_topic_0000001173789730_ul1116714624617"><li id="mrs_01_1736__en-us_topic_0000001173789730_li816704618466">To start instances in batches, select the target instances in the instance list and click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b2282513114018">Start</strong>.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li1216714634612">To stop instances in batches, select the target instances in the instance list and click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b1798205214542">Stop</strong>.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li111678462462">To delete instances in batches, select the target instances in the instance list and click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b15939151195517">Delete</strong>.</li><li id="mrs_01_1736__en-us_topic_0000001173789730_li13981163073915">To roll-restart instances in batches, select the target instances in the instance list and click <strong id="mrs_01_1736__en-us_topic_0000001173789730_b50298050765422">Rolling Restart</strong>.</li></ul>
</p></li></ol>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1729.html">Managing Compute Instances</a></div>
</div>
</div>