forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
86 lines
14 KiB
HTML
86 lines
14 KiB
HTML
<a name="ALM-43007"></a><a name="ALM-43007"></a>
|
|
|
|
<h1 class="topictitle1">ALM-43007 Non-Heap Memory Usage of the JobHistory2x Process Exceeds the Threshold</h1>
|
|
<div id="body8662426"><div class="section" id="ALM-43007__s7997ffec107042e3884c8eb287e2661d"><h4 class="sectiontitle">Description</h4><p id="ALM-43007__aad3d0ffc3ad74f6798fa377974a4877f">The system checks the JobHistory2x Process status every 30 seconds. The alarm is generated when the non-heap memory usage of a JobHistory2x Process exceeds the threshold (95% of the maximum memory).</p>
|
|
</div>
|
|
<div class="section" id="ALM-43007__sdcda35e6e1d5455386c878d945951eb6"><h4 class="sectiontitle">Attribute</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43007__t011bcbc551824d8e91da470dd8168fe9" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43007__r441aafdbbbaf43a4859c73f0a7ee2e94"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-43007__a6159bce96c0c4fe2a1a8db89aa406b11">Alarm ID</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-43007__a4d14342a61a74d908ae466e6098cbeba">Alarm Severity</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-43007__af5b9634998534c6c97ef69579cb03b5d">Auto Clear</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-43007__rc285d4b71e64474c876bb8289e243aa4"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-43007__aa8245af68d1945fd993294e0eee66cef">43007</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-43007__a46517897bfc34f6daadd551372b0c58b">Major</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-43007__adc374c3c6f6a46f7ab66755a8f345fdb">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-43007__s056abf721d274cd28aa9a9f32ef0ff8d"><h4 class="sectiontitle">Parameters</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43007__t2957540583ff4790bfd80353c4be70d3" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43007__r9045089ad4754ab49a0b8b8419d837d3"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-43007__a9418c3fe6e9f406fbdb3f83e71de08bc">Name</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-43007__a1e060e1843ab4a28bb18bf9720353748">Meaning</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-43007__row12957173235419"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43007__p192431315431">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43007__p692551319435">Specifies the cluster for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43007__rdd463995327a4742822ce8cbbd7e3ce5"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43007__a036d4d08612849878cb94ceda8ed5db5">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43007__a081da440f5054a55b2518f4539604e52">Specifies the service name for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43007__rdadab547e6514abdb9c980da5827f092"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43007__a7cf41c55eb3b4df7963fe27440e1b980">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43007__a57ff263d01aa4579b6e1cf9c2bd8122b">Specifies the role name for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43007__r1f49fd16d6994b0eb218c092690b9edc"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43007__a40ba6bead7534be09a461d9f37671f27">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43007__a61f869154fcd42e7b0256761b0083fff">Specifies the object (host ID) for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43007__r0d209c8bbb1d4c2cb4de8da9fca4c13c"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43007__ac2e36c469b7d4dd58752c958b2a6e7f7">Trigger Condition</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43007__a9c3de911d0a8471f8e1a572c23059611">Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-43007__sb58b3094066c4724aea0378e266942d9"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-43007__ad8f78ab812124ccda21ac38d9461a10b">If the available JobHistory2x Process non-heap memory is insufficient, a memory overflow occurs and the service breaks down.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43007__s5aefff0758384c4c81ea89e60e85e24e"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-43007__a36e0a86cad8d4481a4f6dd9072cc73d7">The non-heap memory of the JobHistory2x Process is overused or the non-heap memory is inappropriately allocated.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43007__s852145bf55934df0a8107a6cfa0a47d0"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-43007__a37b3150578594d6e8aa537675026f4de"><strong id="ALM-43007__b9373182963014">Check non-heap memory usage.</strong></p>
|
|
<ol id="ALM-43007__ol6175131223018"><li id="ALM-43007__li217531216309"><span>On the FusionInsight Manager portal, choose <strong id="ALM-43007__b14700183717537">O&M > Alarm<strong id="ALM-43007__b27872374104950"> > Alarms</strong></strong> and select the alarm whose <strong id="ALM-43007__ad374fdcec6a147469d37c3042a579a35">ID</strong> is <strong id="ALM-43007__a34469015b5794eb7ae6581967cbc99f4">43007</strong>. Check the <strong id="ALM-43007__b1955573445015">RoleName</strong> in <strong id="ALM-43007__b052583712505">Location</strong> and confirm the IP address of <strong id="ALM-43007__b1241513413507">HostName</strong>.</span></li><li id="ALM-43007__li317516128302"><span>On the FusionInsight Manager portal, choose <strong id="ALM-43007__b13769184711816">Cluster > </strong><em id="ALM-43007__i37733478818">N</em><em id="ALM-43007__i1477310474811">ame of the desired cluster</em> <strong id="ALM-43007__b97714471785">> Services</strong> > <strong id="ALM-43007__ab4746e72ba7a49008768f93ae84a43f5">Spark2x</strong> > <strong id="ALM-43007__add24e1ccc8f74cf5b9094fb8fbde4ac1">Instance</strong> and click the JobHistory2x for which the alarm is generated to go to the<strong id="ALM-43007__b14303164441516"> Dashboard </strong>page. Click the drop-down menu in the Chart area and choose<strong id="ALM-43007__b5807223103112"> Customize</strong> > <strong id="ALM-43007__b77425517317">Memory</strong> > <strong id="ALM-43007__ab4185bd9729c4edea508e2b688c63350">JobHistory2x Memory Usage Statistics</strong> from the drop-down list box in the upper right corner and click <strong id="ALM-43007__a0c064f22c705424b868e1c6593105a44">OK</strong>, Check whether the used non-heap memory of the JobHistory2x Process reaches the threshold(default value is 95%) of the maximum non-heap memory specified for JobHistory2x.</span><p><ul class="subitemlist" id="ALM-43007__ubaa130b84a5f453db53ee04525beb46f"><li id="ALM-43007__lcd90a36fbdaf47258f417ca35024c994">If yes, go to <a href="#ALM-43007__li1580615311553">3</a>.</li><li id="ALM-43007__l5b791855f9af4edd98897f92a46116b0">If no, go to <a href="#ALM-43007__li18556111517300">7</a>.</li></ul>
|
|
</p></li><li id="ALM-43007__li1580615311553"><a name="ALM-43007__li1580615311553"></a><a name="li1580615311553"></a><span>On the FusionInsight Manager home page, choose <strong id="ALM-43007__b18071235556">Cluster</strong> > <em id="ALM-43007__i17807533555">Name of the desired cluster</em> > <strong id="ALM-43007__b680719385514">Service</strong><strong id="ALM-43007__b12263133114327">s</strong> > <strong id="ALM-43007__b880712365514">Spark2x</strong> > <strong id="ALM-43007__b48071832556">Instance</strong>. Click<strong id="ALM-43007__b88071536554"> </strong><strong id="ALM-43007__b12807133155513">JobHistory2x </strong>by which the alarm is reported to go to the<strong id="ALM-43007__b11791244142014"> Dashboard </strong>page, click the drop-down list in the upper right corner of the chart area, choose <strong id="ALM-43007__b3807531558">Customize</strong> > <strong id="ALM-43007__b1280716315555"><strong id="ALM-43007__b17807123195519">Memory </strong>> Statistics for the </strong><strong id="ALM-43007__b1480533195515">non-heap</strong> <strong id="ALM-43007__b11806103145514">memory of the JobHistory2x Process</strong>, and click <strong id="ALM-43007__b88991336318">OK</strong>. Based on the alarm generation time, check the values of the used non-heap memory of the JobHistory2x process in the corresponding period and obtain the maximum value.</span></li><li id="ALM-43007__li181751512113012"><span>On the FusionInsight Manager portal, choose <strong id="ALM-43007__b65548571588">Cluster > </strong><em id="ALM-43007__i85571057186">N</em><em id="ALM-43007__i1255712577811">ame of the desired cluster</em><strong id="ALM-43007__b145553571816"> > Services</strong> > <strong id="ALM-43007__ab26c912f57aa47b5abb88826b362c2d8">Spark2x</strong> > <strong id="ALM-43007__a3736f54a9ff144f5aaea881453a55ab2">Configurations</strong>, and click <strong id="ALM-43007__a5330dfe4578f48bcaa3422b4afe34dd2">All Configurations</strong>. Choose <strong id="ALM-43007__a6aee0a1e02294f0d93d6367d218f67fd">JobHistory2x</strong> > <strong id="ALM-43007__aa7614798dbf1414087f74697a8758cc9">Default</strong>. You can change the value of <strong id="ALM-43007__afeb5ec5cf7d442618c938f97ae580c84">-XX:MaxMetaspaceSize</strong> in <strong id="ALM-43007__a25899b461ce244e68bb01feaa64e2969">SPARK_DAEMON_JAVA_OPTS</strong> according to the following rules: Ratio of the JobHistory2x non-heap memory usage to the <strong id="ALM-43007__b12326121855613">Threshold</strong><strong id="ALM-43007__b1632731819565"> </strong>of<strong id="ALM-43007__b13464111125612"> JobHistory2x </strong><strong id="ALM-43007__b1546441175613">Non-Heap</strong><strong id="ALM-43007__b14464181112564"> Memory Usage Statistics (JobHistory2x)</strong> in the alarm period.</span><p><div class="note" id="ALM-43007__note720841116452"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-43007__p44749381536">On the FusionInsight Manager home page, choose <strong id="ALM-43007__b24731738185312">O&M</strong> > <strong id="ALM-43007__b17473193815320">Alarm</strong> > <strong id="ALM-43007__b114733389534">Thresholds </strong>> <em id="ALM-43007__i7473203835313">Name of the desired cluster</em> <strong id="ALM-43007__b184731638125310">> </strong><strong id="ALM-43007__b44746382530">Spark2x</strong> > <strong id="ALM-43007__b8474938105317">Memory </strong>><strong id="ALM-43007__b16474938165316">JobHistory2x </strong><strong id="ALM-43007__b446917385534">Non-Heap</strong><strong id="ALM-43007__b147113381538"> Memory Usage Statistics (JobHistory2x)</strong> to view<strong id="ALM-43007__b952271223218"> Threshold</strong>.</p>
|
|
</div></div>
|
|
</p></li><li id="ALM-43007__li656515289295"><span>Restart all JobHistory2x instances.</span></li><li id="ALM-43007__li917571223020"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-43007__ufedbe076685c4f8b98d17ed0f67b1e49"><li id="ALM-43007__lc0be4a2868f840688371774706ef993a">If yes, no further action is required.</li><li id="ALM-43007__l32be2d2e10ff4781a5a31580ba0fe978">If no, go to <a href="#ALM-43007__li18556111517300">7</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p id="ALM-43007__a3fe669b0fd0e4e6f959e6d7245968a2b"><strong id="ALM-43007__b17819183312300">Collect fault information.</strong></p>
|
|
<ol start="7" id="ALM-43007__ol455681533015"><li id="ALM-43007__li18556111517300"><a name="ALM-43007__li18556111517300"></a><a name="li18556111517300"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-43007__b3160191905511">O&M</strong> > <strong id="ALM-43007__a920abccf89a149b0b7a8739c050c1806">Log > Download</strong>.</span></li><li id="ALM-43007__li165561315123018"><span>Select <strong id="ALM-43007__ad25755565c904be692dd9e1f455aa25b">Spark2x</strong> in the required cluster from the <strong id="ALM-43007__a3ff74c500d76437ca965c2121a26fd5a">Service</strong>.</span></li><li id="ALM-43007__li1955618155305"><span>Click <span><img id="ALM-43007__image1945644173117" src="en-us_image_0269417535.png"></span> in the upper right corner, and set <strong id="ALM-43007__b6456941173117">Start Date</strong> and <strong id="ALM-43007__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-43007__b13456164113319">Download</strong>.</span></li><li id="ALM-43007__li155661553020"><span>Contact the <span id="ALM-43007__text4614151421417">O&M personnel</span> and send the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-43007__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-43007__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43007__s055c3a8456e041409a18c894839fa473"><h4 class="sectiontitle">Related Information</h4><p id="ALM-43007__a12857c632d9e4fa58380af328fad23d6">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|