forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="ALM-17007"></a>
<h1 class="topictitle1">ALM-17007 Garbage Collection (GC) Time of the Oozie Process Exceeds the Threshold</h1>
<div id="body8642448"><div class="section" id="ALM-17007__s275e19e7593e451dab5fef4e98e1fdcd"><h4 class="sectiontitle">Description</h4><p id="ALM-17007__en-us_topic_0070543680_p27840804">The system checks the GC time of the Oozie process every 60 seconds. The alarm is generated when the GC time exceeds the threshold (default value: <strong id="ALM-17007__en-us_topic_0070543680_b49240651">12 seconds</strong>). The alarm is cleared when the GC time is less than the threshold.</p>
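<p>The generate-and-clear rule in this description can be sketched as follows. This is a minimal illustration under stated assumptions, not FusionInsight Manager code: the <strong>GcAlarm</strong> class and its <strong>check</strong> method are hypothetical names, and leaving the alarm state unchanged when the GC time exactly equals the threshold is an assumption, because the description only covers values above and below the threshold.</p>

```python
from dataclasses import dataclass

CHECK_INTERVAL_S = 60      # check period stated in the description
DEFAULT_THRESHOLD_S = 12   # default threshold stated in the description


@dataclass
class GcAlarm:
    """Hypothetical model of the generate/clear rule; not product code."""
    threshold_s: float = DEFAULT_THRESHOLD_S
    active: bool = False

    def check(self, gc_time_s: float) -> bool:
        """Apply one periodic check and return whether the alarm is active."""
        if gc_time_s > self.threshold_s:
            self.active = True    # GC time exceeds the threshold: generate
        elif gc_time_s < self.threshold_s:
            self.active = False   # GC time is below the threshold: clear
        # Assumption: a value equal to the threshold leaves the state unchanged.
        return self.active


# Four consecutive checks, one per CHECK_INTERVAL_S, with made-up GC times.
alarm = GcAlarm()
states = [alarm.check(t) for t in (3.0, 15.0, 12.0, 4.5)]
```

<p>In this sketch the alarm becomes active on the second sample and is cleared on the fourth, which mirrors the automatic clearing behavior of this alarm.</p>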
</div>
<div class="section" id="ALM-17007__s31d538b728f24c6399d03eda20f41cf0"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-17007__en-us_topic_0070543680_table29069770" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-17007__en-us_topic_0070543680_row4676306"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-17007__en-us_topic_0070543680_p43236531">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-17007__en-us_topic_0070543680_p12498161">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-17007__en-us_topic_0070543680_p5718131">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-17007__en-us_topic_0070543680_row60515490"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-17007__en-us_topic_0070543680_p2807653">17007</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-17007__en-us_topic_0070543680_p26093349">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-17007__en-us_topic_0070543680_p33186505">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-17007__sa21d8d65b6464af2b8610cacab11ca39"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-17007__en-us_topic_0070543680_table3752378" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-17007__en-us_topic_0070543680_row47812263"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-17007__en-us_topic_0070543680_p47588127">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-17007__en-us_topic_0070543680_p29433063">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-17007__row12644152972310"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17007__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17007__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-17007__en-us_topic_0070543680_row35267879"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17007__en-us_topic_0070543680_p38125960">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17007__en-us_topic_0070543680_p1195057">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-17007__en-us_topic_0070543680_row10755521"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17007__en-us_topic_0070543680_p65890858">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17007__en-us_topic_0070543680_p35559280">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-17007__en-us_topic_0070543680_row51598065"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17007__en-us_topic_0070543680_p18693708">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17007__en-us_topic_0070543680_p37795384">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-17007__en-us_topic_0070543680_row4614140"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17007__en-us_topic_0070543680_p38201027">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17007__en-us_topic_0070543680_p7275503">Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-17007__s5917a677208643748689c02700431b8c"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-17007__en-us_topic_0070543680_p52444904">Oozie responds slowly when it is used to submit tasks.</p>
</div>
<div class="section" id="ALM-17007__sed6d0e4c37684e3baf99229232355e79"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-17007__en-us_topic_0070543680_p20178837">The heap memory of the Oozie instance is overused or inappropriately allocated.</p>
</div>
<div class="section" id="ALM-17007__sb4fe72b776134ac5bed2cee758676ede"><h4 class="sectiontitle">Procedure</h4><p id="ALM-17007__en-us_topic_0070543680_p23873119"><strong id="ALM-17007__b62462687174517">Check GC time.</strong></p>
<ol id="ALM-17007__en-us_topic_0070543680_ol13531482"><li id="ALM-17007__en-us_topic_0070543680_li54674477"><span>On the FusionInsight Manager portal, choose <strong id="ALM-17007__b1150777143118">O&amp;M</strong> &gt; <strong id="ALM-17007__b137099363111">Alarm</strong> &gt; <strong id="ALM-17007__en-us_topic_0070543680_b22308249">Alarms</strong> &gt; <strong id="ALM-17007__en-us_topic_0070543680_b66556520">Garbage Collection (GC) Time of the Oozie Process Exceeds the Threshold</strong> &gt; <strong id="ALM-17007__en-us_topic_0070543680_b62137769">Location</strong>. Check the IP address of the instance involved in this alarm.</span></li><li id="ALM-17007__en-us_topic_0070543680_li22369015"><span>On the FusionInsight Manager portal, choose <strong id="ALM-17007__b13994214381">Cluster</strong> &gt; <em id="ALM-17007__i140092183814">Name of the desired cluster</em> &gt; <strong id="ALM-17007__b4692965417821">Services</strong> &gt; <strong id="ALM-17007__b1971370417821">Oozie</strong> &gt; <strong id="ALM-17007__b4320561417821">Instance</strong>. Click the instance for which the alarm is generated to go to the page for the instance. Click the drop-down menu in the chart<strong id="ALM-17007__b16773929106"> </strong>area and choose <strong id="ALM-17007__b9418154451518">Customize</strong> &gt; <strong id="ALM-17007__b18771101565912">GC</strong> &gt; <strong id="ALM-17007__b616515208018">Garbage Collection (GC) Time of Oozie</strong>. 
Click <strong id="ALM-17007__b8983203712228">OK</strong>.</span></li><li id="ALM-17007__en-us_topic_0070543680_li54369966"><span>Check whether the GC time of the Oozie process collected in each 60-second check period exceeds the threshold (default value: <strong id="ALM-17007__en-us_topic_0070543680_b19567648">12 seconds</strong>).</span><p><ul id="ALM-17007__en-us_topic_0070543680_ul41891112"><li id="ALM-17007__en-us_topic_0070543680_li41475694">If yes, go to <a href="#ALM-17007__l054b8501eb0f42fc8d5e96d6a497ec94">4</a>.</li><li id="ALM-17007__en-us_topic_0070543680_li4088064">If no, go to <a href="#ALM-17007__en-us_topic_0070543680_d0e32615">6</a>.</li></ul>
</p></li><li id="ALM-17007__l054b8501eb0f42fc8d5e96d6a497ec94"><a name="ALM-17007__l054b8501eb0f42fc8d5e96d6a497ec94"></a><a name="l054b8501eb0f42fc8d5e96d6a497ec94"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-17007__b75655328387">Cluster</strong> &gt; <em id="ALM-17007__i1756723216384">Name of the desired cluster</em><strong id="ALM-17007__b2056653218387"> </strong>&gt; <strong id="ALM-17007__b2209710125416">Services</strong> &gt; <strong id="ALM-17007__en-us_topic_0070543680_b45356306">Oozie</strong> &gt; <strong id="ALM-17007__en-us_topic_0070543680_b5553575">Configurations</strong>. Click <strong id="ALM-17007__en-us_topic_0070543680_b47186420">All Configurations</strong>. Search for <strong id="ALM-17007__en-us_topic_0070543680_b22024598">GC_OPTS</strong> in the search box. Increase the value of <strong id="ALM-17007__en-us_topic_0070543680_b64003657">-Xmx</strong> as required, click <strong id="ALM-17007__en-us_topic_0070543680_b39162007">Save</strong>, and then click <strong id="ALM-17007__en-us_topic_0070543680_b18005964">OK</strong>.</span><p><div class="note" id="ALM-17007__note17850123124113"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-17007__p855517297466">Suggestions on GC parameter settings for Oozie:</p>
<p id="ALM-17007__p109315361419">You are advised to set <strong id="ALM-17007__b757555955819">-Xms</strong> and <strong id="ALM-17007__b257555935818">-Xmx</strong> to the same value to avoid the performance impact that occurs when the JVM dynamically adjusts the heap memory size.</p>
</div></div>
</p></li><li id="ALM-17007__en-us_topic_0070543680_li27835949"><span>Restart the affected services or instances and check whether the alarm is cleared.</span><p><ul id="ALM-17007__en-us_topic_0070543680_ul49196957"><li id="ALM-17007__en-us_topic_0070543680_li40119436">If yes, no further action is required.</li><li id="ALM-17007__en-us_topic_0070543680_li25530612">If no, go to <a href="#ALM-17007__en-us_topic_0070543680_d0e32615">6</a>.</li></ul>
</p></li></ol>
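<p>The heap-size adjustment in step 4 can be sketched as a small string helper. This is only an illustration of the advice to keep <strong>-Xms</strong> and <strong>-Xmx</strong> equal: the function name <strong>align_heap_options</strong>, the sample option string, and the chosen size are hypothetical, and the real <strong>GC_OPTS</strong> value must still be edited and saved on FusionInsight Manager.</p>

```python
import re


def align_heap_options(gc_opts: str, heap_size: str) -> str:
    """Return gc_opts with both -Xms and -Xmx set to heap_size, such as "2G"."""
    # Accept a number with an optional k/m/g unit suffix, as JVM heap options do.
    if not re.fullmatch(r"\d+[kKmMgG]?", heap_size):
        raise ValueError(f"unsupported heap size: {heap_size!r}")
    # Drop any existing -Xms/-Xmx tokens and keep every other option in order.
    others = [tok for tok in gc_opts.split() if not re.match(r"-Xm[sx]", tok)]
    return " ".join([f"-Xms{heap_size}", f"-Xmx{heap_size}"] + others)


# Hypothetical GC_OPTS value; the options on a real cluster will differ.
example = "-Xms1G -Xmx1G -XX:+UseConcMarkSweepGC"
updated = align_heap_options(example, "2G")
```

<p>Choose the new size according to the memory available on the node and the observed heap usage of the Oozie instance; the sketch does not suggest a value.</p>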
<p id="ALM-17007__en-us_topic_0070543680_p54713654"><strong id="ALM-17007__b46154631174546">Collect fault information.</strong></p>
<ol start="6" id="ALM-17007__ol63424552174540"><li id="ALM-17007__en-us_topic_0070543680_d0e32615"><a name="ALM-17007__en-us_topic_0070543680_d0e32615"></a><a name="en-us_topic_0070543680_d0e32615"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-17007__b1675650195515">O&amp;M</strong> &gt; <strong id="ALM-17007__b191721958165419">Log</strong> &gt; <strong id="ALM-17007__b11173358165413">Download</strong>.</span></li><li id="ALM-17007__en-us_topic_0070543680_li31665069"><span>Select <strong id="ALM-17007__en-us_topic_0070543680_b16550169">Oozie</strong> in the required cluster from the <strong id="ALM-17007__en-us_topic_0070543680_b14733798">Service</strong> drop-down list.</span></li><li id="ALM-17007__li1145664103113"><span>Click <span><img id="ALM-17007__image1945644173117" src="en-us_image_0269417389.png"></span> in the upper right corner, and set <strong id="ALM-17007__b6456941173117">Start Date</strong> and <strong id="ALM-17007__b11456154113318">End Date</strong> for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-17007__b13456164113319">Download</strong>.</span></li><li id="ALM-17007__en-us_topic_0070543680_li16800086"><span>Contact <span id="ALM-17007__text4614151421417">O&amp;M personnel</span> and send them the collected logs.</span></li></ol>
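<p>The log collection window in step 8 is simple date arithmetic, sketched below. The function name <strong>log_window</strong> and the sample alarm time are hypothetical; the 10-minute margin is the value given in the step.</p>

```python
from datetime import datetime, timedelta


def log_window(alarm_time: datetime, margin_minutes: int = 10):
    """Return (start, end) covering margin_minutes before and after alarm_time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time, used only to show the calculation.
alarm_time = datetime(2024, 1, 15, 9, 30)
start, end = log_window(alarm_time)
```

<p>The two returned values correspond to <strong>Start Date</strong> and <strong>End Date</strong> on the log download page.</p>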
</div>
<div class="section" id="ALM-17007__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-17007__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-17007__s1c5d055ea5c74980a70253d5b8630699"><h4 class="sectiontitle">Related Information</h4><p id="ALM-17007__en-us_topic_0070543676_p62381196">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>