forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
84 lines
12 KiB
HTML
84 lines
12 KiB
HTML
<a name="ALM-43009"></a><a name="ALM-43009"></a>
|
|
|
|
<h1 class="topictitle1">ALM-43009 JobHistory2x Process GC Time Exceeds the Threshold</h1>
|
|
<div id="body8662426"><div class="section" id="ALM-43009__s49648fd7217e4c14bd95923861d4a05e"><h4 class="sectiontitle">Description</h4><p id="ALM-43009__a66fc43001ef94ad6aed1741fcb205e3a">The system checks the garbage collection (GC) time of the JobHistory2x Process every 60 seconds. This alarm is generated when the detected GC time exceeds the threshold (exceeds 5 seconds for three consecutive checks.) To change the threshold, choose <strong id="ALM-43009__b15708164105915">O&M</strong> > <strong id="ALM-43009__b19724155785913">Alarm</strong> > <strong id="ALM-43009__a2fe16df2440a4061b56b521230aa4e41">Thresholds </strong>> <em id="ALM-43009__i968618143612">N</em><em id="ALM-43009__i66862149616">ame of the desired cluster</em> > <strong id="ALM-43009__ad5cf58584bfd4d0aa5826f5a646a3e30">Spark2x</strong> > <strong id="ALM-43009__b181153475114">GC Time</strong> > <strong id="ALM-43009__a3ea3c00f2f814f999badf3924d6b0197">Total GC time in milliseconds (JobHistory2x)</strong>. This alarm is cleared when the JobHistory2x GC time is shorter than or equal to the threshold.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43009__sdb4ad9b409304156a95d43d72df6e9b5"><h4 class="sectiontitle">Attribute</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43009__t39edabfe3c904adf87faa52a90960cc3" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43009__r0394423927c1463d92ddfa66b523f924"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-43009__a0bbcf763906b41669e51aa9997ab8ac2">Alarm ID</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-43009__a8468ad126de04db89f283eb62ae4971c">Alarm Severity</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-43009__af62f3ac0985a4bee90aab9ed0259ccb7">Auto Clear</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-43009__ra94cdfbfa46d40d4ab44dd8b3ca6cbd9"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-43009__ab1613a2c923c4177956293bbd008edcf">43009</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-43009__a760d6242de3145e7aceb6456cb5d0441">Major</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-43009__ac577f1b98b2546b8bcd0a599aa3dc114">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-43009__s20e99abb15784875a1494c2a35c0792f"><h4 class="sectiontitle">Parameters</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43009__t4df588c30dfe4a89bb84ecd3483d31e6" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43009__r0ebdaa92c6ac4e46b90cf043c88536db"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-43009__ab0d66aa125014053903f07004e3da786">Name</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-43009__a1a65eda0eec842ae9ba4b416b5aa57f2">Meaning</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-43009__row6476122085411"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43009__p192431315431">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43009__p692551319435">Specifies the cluster for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43009__r343852930e484ed5bdcdfa17df0971e2"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43009__a3e76c6cdc65b4e38971b0dc0ded0c01e">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43009__a2597838b05f64f5c9d6d36c7f026c3cb">Specifies the service name for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43009__rf9903f012f8f411bb7294b6cb98a35b8"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43009__ae98ab676c73a4ceeaeb964eb66b36953">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43009__a40b173852c9748a3be32f978b48be40a">Specifies the role name for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43009__r44cdb80d774b4d00952e0dba2219e090"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43009__a850bf3a6fd604f84a85c08208eff09b0">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43009__a8d3b495e6f554462b73f7f90645c7680">Specifies the object (host ID) for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43009__r701eed2c43ff46798dbddb4b531720a9"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43009__a9b101ff253c248678730cfac92b1447e">Trigger Condition</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43009__ad4e1cf3ca6414a90aa145885601adbfd">Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-43009__se6ed605d723842afa46c0bb39f61c0ad"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-43009__a23b3647ef693416ab0b51bfd5c666a11">If the GC time exceeds the threshold, JobHistory2x maybe run in low performance.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43009__seabe8d2dd5574b60b0f4ff0a945f74da"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-43009__a7eba4d611cf64e668a7d7377b7fc5299">The memory of JobHistory2x is overused, the heap memory is inappropriately allocated. As a result, GCs occur frequently.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43009__s222adeca8500489a879bb41532abec31"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-43009__a2b3579629d314695ba5feee366b16841"><strong id="ALM-43009__b1765141210320">Check the GC time.</strong></p>
|
|
<ol id="ALM-43009__ol102851621133212"><li id="ALM-43009__li122859216328"><span>On the FusionInsight Manager portal, choose <strong id="ALM-43009__b128331241925">O&M > Alarm<strong id="ALM-43009__b27872374104950"> > Alarm</strong></strong><strong id="ALM-43009__b87871559455">s</strong> and select the alarm whose<strong id="ALM-43009__a4d0701a0e0e84257b2283e69eeaaea29"> ID</strong> is <strong id="ALM-43009__af57edf249e9b405c926b867257b803d5">43009</strong>. Check the <strong id="ALM-43009__b1955573445015">RoleName</strong> in <strong id="ALM-43009__b052583712505">Location</strong> and confirm the IP address of <strong id="ALM-43009__b1241513413507">HostName</strong>.</span></li><li id="ALM-43009__li18285132112323"><span>On the FusionInsight Manager portal, choose <strong id="ALM-43009__b1723994221010">Cluster > </strong><em id="ALM-43009__i192431342121016">N</em><em id="ALM-43009__i6243134214101">ame of the desired cluster</em> <strong id="ALM-43009__b524019421108">> Services</strong> > <strong id="ALM-43009__ab35829542e63402fb65bb1b6b515458f">Spark2x</strong> > <strong id="ALM-43009__adbe01f47f16c45f6b50d6af6eabc44fe">Instance</strong> and click the JobHistory2x for which the alarm is generated to go to the<strong id="ALM-43009__b14303164441516"> Dashboard </strong>page. Click the drop-down menu in the Chart area and choose<strong id="ALM-43009__b6496174113616"> Customize</strong> > <strong id="ALM-43009__b28551248134811">GC Time</strong> > <strong id="ALM-43009__a770cede23dc6452e8071141b025e7b5d">Garbage Collection (GC) Time of JobHistory2x</strong> from the drop-down list box in the upper right corner and click <strong id="ALM-43009__aba84130408f04c31b7a114e01158e7a8">OK</strong> to check whether the GC time is longer than the threshold(default value: 12 seconds).</span><p><ul class="subitemlist" id="ALM-43009__u2456ffb935b44f39958869284d9b9d54"><li id="ALM-43009__l8e6c2a0b142845a3add77a73391e1915">If yes, go to <a href="#ALM-43009__li16285182113329">3</a>.</li><li id="ALM-43009__la39faa5415ce41d383d16c647e7bc501">If no, go to <a href="#ALM-43009__li81551125133212">6</a>.</li></ul>
|
|
</p></li><li id="ALM-43009__li16285182113329"><a name="ALM-43009__li16285182113329"></a><a name="li16285182113329"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-43009__b11516135261016">Cluster > </strong><em id="ALM-43009__i1054415527101">N</em><em id="ALM-43009__i05443528107">ame of the desired cluster</em> <strong id="ALM-43009__b7517205210104">> Services</strong> > <strong id="ALM-43009__a920f66f657bc42e7aaa2b679bd2f5c3b">Spark2x</strong> > <strong id="ALM-43009__a48fcbd6371e7476ab6d5be249dfd48bc">Configurations</strong>, and click <strong id="ALM-43009__a37cc430ffffa4074925ad719af7047fb">All Configurations</strong>. Choose <strong id="ALM-43009__a3aaa7e5d602541b8bea07c48bca68479">JobHistory2x</strong> > <strong id="ALM-43009__aed57d6efea9e4c06a0bdf792ebfbde6c">Default</strong>. The default value of <strong id="ALM-43009__a03db0ccb226d485b9c237c0b8c5d5d70">SPARK_DAEMON_MEMORY</strong> is 4GB. You can change the value according to the following rules: If this alarm is generated occasionally, increase the value by 0.5 times. If the alarm is frequently reported, increase the value by 1 time.</span></li><li id="ALM-43009__li656515289295"><span>Restart all JobHistory2x instances.</span></li><li id="ALM-43009__li62852211323"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-43009__u55f1c54be5b8482c8adceb04752b8330"><li id="ALM-43009__lf1438365c49546998d1834390ba5559b">If yes, no further action is required.</li><li id="ALM-43009__ld434cf83b1d048cc8af8d2df75fb988a">If no, go to <a href="#ALM-43009__li81551125133212">6</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p id="ALM-43009__en-us_topic_0085589652_p77266312050"><strong id="ALM-43009__b848317157321">Collect fault information.</strong></p>
|
|
<ol start="6" id="ALM-43009__ol171551625103215"><li id="ALM-43009__li81551125133212"><a name="ALM-43009__li81551125133212"></a><a name="li81551125133212"></a><span>On the FusionInsight Manager interface of active and standby clusters, choose <strong id="ALM-43009__b1986595213411">O&M</strong> > <strong id="ALM-43009__a1fe5d788b08f48cdb46c44be810b271a">Log > Download</strong>.</span></li><li id="ALM-43009__li4155172518326"><span>Select <strong id="ALM-43009__b19148724254">Spark2x </strong>in the required cluster from the <strong id="ALM-43009__b6680122620515">Service</strong>.</span></li><li id="ALM-43009__li315522573212"><span>Click <span><img id="ALM-43009__image1945644173117" src="en-us_image_0269417538.png"></span> in the upper right corner, and set <strong id="ALM-43009__b6456941173117">Start Date</strong> and <strong id="ALM-43009__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-43009__b13456164113319">Download</strong>.</span></li><li id="ALM-43009__li1415512519324"><span>Contact the <span id="ALM-43009__text4614151421417">O&M personnel</span> and send the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-43009__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-43009__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43009__s6fd53c1a301948979d1c8713912838b2"><h4 class="sectiontitle">Related Information</h4><p id="ALM-43009__a26e1f730806342c0a9af30eb487c1a90">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|