doc-exports/docs/mrs/umn/ALM-43021.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00


<a name="ALM-43021"></a>
<h1 class="topictitle1">ALM-43021 Direct Memory Usage of the IndexServer2x Process Exceeds the Threshold</h1>
<div id="body1573719169454"><div class="section" id="ALM-43021__sc21fe29c411e41409776a7f363f27627"><h4 class="sectiontitle">Description</h4><p id="ALM-43021__a1a98c1f016af4292bc4448f93c715403">The system checks the IndexServer2x process status every 30 seconds. This alarm is generated when the direct memory usage of an IndexServer2x process exceeds the threshold (95% of the maximum direct memory).</p>
</div>
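<p>The trigger condition described above can be sketched in Python as follows. This is only an illustration of the comparison, not the monitoring code used by the system. The function name <code>exceeds_direct_memory_threshold</code> and its parameters are hypothetical. The 95% ratio comes from this topic, and 512 MB is the default maximum mentioned in the procedure.</p>

```python
def exceeds_direct_memory_threshold(used_mb, max_mb, threshold_ratio=0.95):
    """Return True when used direct memory is above threshold_ratio of the maximum.

    Hypothetical helper for illustration; the real check is done by the system.
    """
    if max_mb <= 0:
        raise ValueError("max_mb must be positive")
    return used_mb / max_mb > threshold_ratio


# With the default maximum of 512 MB, the threshold is 0.95 * 512 = 486.4 MB.
print(exceeds_direct_memory_threshold(490, 512))  # above 486.4 MB
print(exceeds_direct_memory_threshold(480, 512))  # below 486.4 MB
```

<p>With a 512 MB maximum, the first call reports that the threshold is exceeded and the second reports that it is not.</p>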
<div class="section" id="ALM-43021__s4090a0887ac94f6ea0c313fc93558621"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43021__t8f99cc5400c44ffa97ab609a58c35834" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43021__r7c7f9b98ceca4ec0a0800ecf76ccef9e"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-43021__a384abf8fd5d4496eb3f24e3659360208">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-43021__a9fb6fed3911b489d82df1a64bde4b070">Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-43021__a822c4dd1c3a9441a8112bb1f236f3815">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-43021__rfea230e6015b4acf959f0f8c1aee9d3e"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-43021__a8708f4e87bc341ff8f0cd0d2584d3ffa">43021</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-43021__af37be87b74214641ba5b07a6bb7c9eb2">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-43021__a6498abf69ae84417a67ec941de431a88">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-43021__s0b7620cc8d904f908eab3fd94fc26272"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43021__t68d85ce289c84a24ae299d43425ff8ae" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43021__r3e98675c76d14517b23a76d0cb501e87"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-43021__a5d03d695bf0b4c43a5caf3d367a4c0a5">Parameter</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-43021__a90bb9bc2e82a426bb93b0b2f7053fc1f">Description</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-43021__row19850194112121"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43021__p17935380415">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43021__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-43021__r9ded483eb9844736ac68a17c358d7da2"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43021__p41293795">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43021__a56301b53d00544029a8c8ddafa6d31ff">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-43021__r510741ebfe864dd981b589f5f8595102"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43021__p23892775">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43021__ad76b983a401944ca91e141a3a69f8ad1">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-43021__r03c3304824734d9fa9a737747e27b53a"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43021__p14847206">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43021__a57618ecf6f7645a59e09d708b6c95e65">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-43021__red34012f122e4af9a640d3c1332a6922"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43021__a32a03a9c53d54278812ea7a1ed559514">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43021__a7b936a21a6e5475881a69a1c6d9c30a6">Specifies the threshold for triggering the alarm.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-43021__sb4c2a4c4b87c4f90b322a83ec9b6d735"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-43021__ae1c5a656b94040c897bbc44bdf229cbf">If the direct memory available to the IndexServer2x process is insufficient, a memory overflow occurs and the service breaks down.</p>
</div>
<div class="section" id="ALM-43021__sd4ce190e23f7470ea72265183e1c0f77"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-43021__a651784db16b940d381413aea18485975">The direct memory of the IndexServer2x process is overused, or the direct memory is inappropriately allocated.</p>
</div>
<div class="section" id="ALM-43021__section360922201817"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-43021__acff47ad7e2354ab48f7276cf38dac673"><strong id="ALM-43021__b02321844366">Check direct memory usage.</strong></p>
<ol id="ALM-43021__ol11133331103816"><li id="ALM-43021__li7132331113811"><span>On FusionInsight Manager, choose <strong id="ALM-43021__b1078012339334">O&amp;M</strong> &gt; <strong id="ALM-43021__b1688165991316">Alarm</strong><strong id="ALM-43021__b27872374104950"> &gt; Alarms</strong>. In the displayed alarm list, select the alarm whose ID is <strong id="ALM-43021__b187813338338">43021</strong>, check the <strong id="ALM-43021__b1955573445015">RoleName</strong> in <strong id="ALM-43021__b052583712505">Location</strong>, and confirm the IP address of <strong id="ALM-43021__b1241513413507">HostName</strong>.</span></li><li id="ALM-43021__li18132031143816"><span>On FusionInsight Manager, choose <strong id="ALM-43021__b1183785011334">Cluster</strong> &gt; <span id="ALM-43021__text813210311381"><em id="ALM-43021__i1713203123811">Name of the desired cluster</em></span> &gt; <strong id="ALM-43021__b0839175017333">Services</strong> &gt; <strong id="ALM-43021__b10840185073314">Spark2x</strong> &gt; <strong id="ALM-43021__b9840135023315">Instance</strong>. Click the IndexServer2x instance that reported the alarm to go to the <strong id="ALM-43021__b659464922315">Dashboard</strong> page. Click the drop-down list in the upper right corner of the chart area, and choose <strong id="ALM-43021__b2841550123316">Customize</strong> &gt;<strong id="ALM-43021__b445693918189"> Memory</strong> &gt; <strong id="ALM-43021__b128421350133318">IndexServer2x Memory Usage Statistics</strong> &gt; <strong id="ALM-43021__b17842195073313">OK</strong>. Check whether the direct memory used by the IndexServer2x process reaches the maximum direct memory threshold.</span><p><ul class="subitemlist" id="ALM-43021__ul113283117387"><li id="ALM-43021__li19132631183811">If the threshold is reached, go to <a href="#ALM-43021__li141321031113812">3</a>.</li><li id="ALM-43021__li1813220314387">If the threshold is not reached, go to <a href="#ALM-43021__li181301231123812">7</a>.</li></ul>
</p></li><li id="ALM-43021__li141321031113812"><a name="ALM-43021__li141321031113812"></a><a name="li141321031113812"></a><span>On FusionInsight Manager, choose <strong id="ALM-43021__b7431162018354">Cluster</strong> &gt; <span id="ALM-43021__text121321431203815"><em id="ALM-43021__i7132143133818">Name of the desired cluster</em></span> &gt; <strong id="ALM-43021__b643618204359">Services</strong> &gt; <strong id="ALM-43021__b143717207355">Spark2x</strong> &gt; <strong id="ALM-43021__b943912063519">Instance</strong>. Click the IndexServer2x that reported the alarm to go to the <strong id="ALM-43021__b16244163254">Dashboard</strong> page. Click the drop-down list in the upper right corner of the chart area, and choose <strong id="ALM-43021__b13442520113511">Customize</strong> &gt;<strong id="ALM-43021__b21920372191"> Memory</strong> &gt; <strong id="ALM-43021__b144417208354">Direct Memory of IndexServer2x</strong> &gt; <strong id="ALM-43021__b84477201354">OK</strong>. Based on the alarm generation time, check the values of the used direct memory of the IndexServer2x process in the corresponding period and obtain the maximum value.</span></li><li id="ALM-43021__li11331231133812"><span>On FusionInsight Manager, choose <strong id="ALM-43021__b46921459173613">Cluster</strong> &gt; <span id="ALM-43021__text1913215313387"><em id="ALM-43021__i8132113123816">Name of the desired cluster</em></span> &gt; <strong id="ALM-43021__b15693155918362">Services</strong> &gt; <strong id="ALM-43021__b11693185913364">Spark2x</strong> &gt; <strong id="ALM-43021__b569455973614">Configurations</strong> &gt; <strong id="ALM-43021__b15694959123620">All Configurations</strong> &gt; <strong id="ALM-43021__b1769485963617">IndexServer2x</strong>&gt; <strong id="ALM-43021__b269415918368">Tuning</strong>. 
Change the value of <strong id="ALM-43021__b1969425973611">-XX:MaxDirectMemorySize</strong> (the default value is 512 MB) in the <strong id="ALM-43021__b13694959173619">spark.driver.extraJavaOptions</strong> parameter based on the ratio of the maximum direct memory used by the IndexServer2x process to the threshold specified by <strong id="ALM-43021__b2695195973620">IndexServer2x Direct Memory Usage Statistics (IndexServer2x)</strong> in the alarm period. If the alarm persists after the parameter value is changed, increase the value to 1.5 times the current value. If the alarm is generated frequently, double the value.</span><p><div class="note" id="ALM-43021__note111334314386"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-43021__p2133131183816">On FusionInsight Manager, you can choose <strong id="ALM-43021__b36958184118">O&amp;M</strong> &gt; <strong id="ALM-43021__b269681134112">Alarm</strong> &gt; <strong id="ALM-43021__b569717117419">Thresholds</strong> &gt; <em id="ALM-43021__i1669819116416">Name of the desired cluster</em> &gt; <strong id="ALM-43021__b166982164119">Spark2x</strong> &gt; <strong id="ALM-43021__b196999110414">Memory</strong> &gt; <strong id="ALM-43021__b16700161154114">IndexServer2x Direct Memory Usage Statistics (IndexServer2x)</strong> to view the threshold.</p>
</div></div>
</p></li><li id="ALM-43021__li656515289295"><span>Restart all IndexServer2x instances.</span></li><li id="ALM-43021__li11331131153817"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-43021__ul1513303112386"><li id="ALM-43021__li1313323116382">If the alarm is cleared, no further action is required.</li><li id="ALM-43021__li1133123119383">If the alarm is not cleared, go to <a href="#ALM-43021__li181301231123812">7</a>.</li></ul>
</p></li></ol>
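<p>The sizing rule in step 4 can be sketched as simple arithmetic. The Python snippet below is a minimal illustration. It assumes that increasing the value by 0.5 times means multiplying the current value by 1.5, and that the frequent-alarm case means multiplying it by 2. The helper names <code>next_max_direct_memory_mb</code> and <code>java_option</code> are hypothetical and are not part of any MRS or Spark tooling.</p>

```python
import math


def next_max_direct_memory_mb(current_mb, alarm_is_frequent=False):
    """Suggest the next MaxDirectMemorySize in MB (hypothetical helper).

    Assumption: x1.5 when the alarm persists, x2 when it is generated frequently.
    """
    factor = 2.0 if alarm_is_frequent else 1.5
    return math.ceil(current_mb * factor)


def java_option(size_mb):
    """Format the JVM flag as it would appear inside spark.driver.extraJavaOptions."""
    return f"-XX:MaxDirectMemorySize={size_mb}M"


# Starting from the default value of 512 MB:
print(java_option(next_max_direct_memory_mb(512)))        # alarm persists
print(java_option(next_max_direct_memory_mb(512, True)))  # alarm is frequent
```

<p>Starting from 512 MB, the sketch suggests 768 MB when the alarm persists and 1024 MB when the alarm is generated frequently. Confirm that the host has enough free memory before applying a larger value.</p>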
<p id="ALM-43021__aad409ece02ce48f39201eba5d1ac57eb"><strong id="ALM-43021__b12456119135518">Collect fault information.</strong></p>
<ol start="7" id="ALM-43021__ol171301316387"><li id="ALM-43021__li181301231123812"><a name="ALM-43021__li181301231123812"></a><a name="li181301231123812"></a><span>On FusionInsight Manager, choose <strong id="ALM-43021__b1854624844415">O&amp;M</strong> &gt; <strong id="ALM-43021__b1547144819446">Log</strong> &gt; <strong id="ALM-43021__b115474485449">Download</strong>.</span></li><li id="ALM-43021__li1513003153811"><span>Expand the <strong id="ALM-43021__b1899192712389">Service</strong> drop-down list, and select <strong id="ALM-43021__b71306317335">Spark2x</strong> for the target cluster.</span></li><li id="ALM-43021__li1130631173817"><span>Click <span><img id="ALM-43021__image213012312387" src="en-us_image_0269417547.png"></span> in the upper right corner, and set <strong id="ALM-43021__b18831547123618">Start Date</strong> and <strong id="ALM-43021__b1483434743616">End Date</strong> for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-43021__b583564713611">Download</strong>.</span></li><li id="ALM-43021__li191309318381"><span>Contact the <span id="ALM-43021__text4614151421417">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
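<p>The log collection window in step 9 can be computed as shown in the following Python sketch. The function name <code>log_collection_window</code> is hypothetical, and the timestamp is only an example value, not taken from a real alarm.</p>

```python
from datetime import datetime, timedelta


def log_collection_window(alarm_time, margin_minutes=10):
    """Return (start, end) covering margin_minutes before and after the alarm."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Example alarm generation time (illustrative only).
start, end = log_collection_window(datetime(2023, 1, 1, 0, 5))
print("Start Date:", start.isoformat(sep=" "))
print("End Date:  ", end.isoformat(sep=" "))
```

<p>The sketch uses <code>timedelta</code> so that windows crossing midnight or a month boundary are handled correctly.</p>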
</div>
<div class="section" id="ALM-43021__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-43021__p754913417333">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-43021__s7833451ce9564e0d8e771dfc18982cb8"><h4 class="sectiontitle">Reference</h4><p id="ALM-43021__a64546a5522c948c6819fe0ead8f3de08">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>