doc-exports/docs/mrs/umn/ALM-26053.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

93 lines
12 KiB
HTML

<a name="ALM-26053"></a><a name="ALM-26053"></a>
<h1 class="topictitle1">ALM-26053 Storm Slot Usage Exceeds the Threshold</h1>
<div id="body35690847"><div class="section" id="ALM-26053__sae7badd5d25146d289d18f34a04821b7"><h4 class="sectiontitle">Description</h4><p id="ALM-26053__en-us_topic_0070543552_p33373357">The system checks the slot usage every 60 seconds and compares the actual slot usage with the threshold. This alarm is generated when the slot usage is greater than the threshold.</p>
<p id="ALM-26053__en-us_topic_0070543552_p31924758">You can change the threshold in <strong id="ALM-26053__b10802443175011">O&amp;M</strong> &gt; <strong id="ALM-26053__b1387783920502">Alarm </strong>&gt;<strong id="ALM-26053__b19880113913508"> Thresholds</strong>.</p>
<p id="ALM-26053__en-us_topic_0070543552_p53482049">This alarm is cleared when the slot usage is less than or equal to the threshold.</p>
</div>
<div class="section" id="ALM-26053__s5b2d64609ec84186909a7c477b915dca"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-26053__en-us_topic_0070543552_table37078712" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-26053__en-us_topic_0070543552_row39365168"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-26053__en-us_topic_0070543552_p34462009">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-26053__en-us_topic_0070543552_p39959306">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-26053__en-us_topic_0070543552_p15478348">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-26053__en-us_topic_0070543552_row45786662"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-26053__en-us_topic_0070543552_p17732161">26053</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-26053__en-us_topic_0070543552_p27018949">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-26053__en-us_topic_0070543552_p41051224">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-26053__sb740e84935ea4adcbd3bf8ff1a473144"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-26053__en-us_topic_0070543552_table36814885" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-26053__en-us_topic_0070543552_row24734149"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-26053__en-us_topic_0070543552_p57309018">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-26053__en-us_topic_0070543552_p11518859">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-26053__row1458311391837"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26053__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26053__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26053__en-us_topic_0070543552_row60612418"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26053__en-us_topic_0070543552_p10658788">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26053__en-us_topic_0070543552_p58055515">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26053__en-us_topic_0070543552_row52737594"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26053__en-us_topic_0070543552_p43886731">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26053__en-us_topic_0070543552_p65164324">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26053__en-us_topic_0070543552_row49608012"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26053__en-us_topic_0070543552_p58826021">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26053__en-us_topic_0070543552_p178395">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26053__en-us_topic_0070543552_row1605556"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26053__en-us_topic_0070543552_p62941183">Trigger condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26053__en-us_topic_0070543552_p65071067">Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-26053__s96a47d74ddf64ff9956401eacd27ca56"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-26053__en-us_topic_0070543552_p36265071">New Storm tasks cannot be performed.</p>
</div>
<div class="section" id="ALM-26053__sce8470d9439d4799af36bb4e0ae9a453"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-26053__en-us_topic_0070543552_ul51789668"><li id="ALM-26053__en-us_topic_0070543552_li63453834">The status of some Supervisors in the cluster is abnormal.</li><li id="ALM-26053__en-us_topic_0070543552_li34213602">The status of all Supervisors is normal, but the processing capability is insufficient.</li></ul>
</div>
<div class="section" id="ALM-26053__s034ffc790e0b4f9b94541b98421fe2b6"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-26053__en-us_topic_0070543552_p19838352"><strong id="ALM-26053__b5225160320659">Check the Supervisor status.</strong></p>
<ol id="ALM-26053__ol106117932079"><li id="ALM-26053__li2139112920655"><span>Choose <strong id="ALM-26053__b2167132218559">Cluster </strong>&gt; <em id="ALM-26053__i12255195843816">Name of the desired cluster</em> &gt;<strong id="ALM-26053__b4253258193818"> Services</strong> &gt; <strong id="ALM-26053__b1103464620655">Storm</strong> &gt; <strong id="ALM-26053__b3785516112715">Instance </strong>to go to the Storm instance management page.</span></li><li id="ALM-26053__li2109506020655"><span>Check whether any instance whose status is <strong id="ALM-26053__b143721433753">Faulty</strong> or <strong id="ALM-26053__b143529512454">Restoring</strong> exists.</span><p><ul class="subitemlist" id="ALM-26053__ul2471351620655"><li id="ALM-26053__li177328820655">If yes, go to <a href="#ALM-26053__li3410841620655">3</a>.</li><li id="ALM-26053__li1765902720655">If no, go to <a href="#ALM-26053__li4446687120655">5</a>.</li></ul>
</p></li><li id="ALM-26053__li3410841620655"><a name="ALM-26053__li3410841620655"></a><a name="li3410841620655"></a><span>Select Supervisor role instances whose status is <strong id="ALM-26053__b980911551059">Faulty</strong> or <strong id="ALM-26053__b18659191124316">Restoring</strong>, choose <strong id="ALM-26053__b1036952320655">More</strong> &gt; <strong id="ALM-26053__b2621684520655">Restart Instance</strong>, and check whether the instances restart successfully.</span><p><ul class="subitemlist" id="ALM-26053__ul1124636420655"><li id="ALM-26053__li4318971620655">If yes, go to <a href="#ALM-26053__li6572378120655">4</a>.</li><li id="ALM-26053__li870613620655">If no, go to <a href="#ALM-26053__li1692048320655">10</a>.</li></ul>
</p></li><li id="ALM-26053__li6572378120655"><a name="ALM-26053__li6572378120655"></a><a name="li6572378120655"></a><span>Wait several minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-26053__ul6377774620655"><li id="ALM-26053__li3854029320655">If yes, no further action is required.</li><li id="ALM-26053__li4436911820655">If no, go to <a href="#ALM-26053__li4446687120655">5</a>.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-26053__p4464289020711"><strong id="ALM-26053__b6624169420711">Increase the number of slots in each Supervisor.</strong></p>
<ol start="5" id="ALM-26053__ol3941524320729"><li id="ALM-26053__li4446687120655"><a name="ALM-26053__li4446687120655"></a><a name="li4446687120655"></a><span>Log in to the FusionInsight Manager portal, choose <strong id="ALM-26053__b10426355145516">Cluster </strong>&gt;<strong id="ALM-26053__b186781883910"> </strong><em id="ALM-26053__i7680138133910">Name of the desired cluster</em> &gt;<strong id="ALM-26053__b13679383393"> Services</strong> &gt; <strong id="ALM-26053__b2202601620655">Storm</strong> &gt; <strong id="ALM-26053__b3927687620655">Configurations</strong> &gt; <strong id="ALM-26053__b2731038420655">All</strong> <strong id="ALM-26053__b1287926185611">Configurations</strong>.</span></li><li id="ALM-26053__li18816181114815"><span>Increase the number of ports in the <strong id="ALM-26053__b491688144818">supervisor.slots.ports</strong> parameter of each Supervisor role and restart the instance.</span></li><li id="ALM-26053__li3900363020655"><span>Wait several minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-26053__ul4273525520655"><li id="ALM-26053__li2491375620655">If yes, no further action is required.</li><li id="ALM-26053__li474836120655">If no, go to <a href="#ALM-26053__li517745320655">8</a>.</li></ul>
</p></li></ol><ol start="8" id="ALM-26053__ol5726300720750"><li id="ALM-26053__li517745320655"><a name="ALM-26053__li517745320655"></a><a name="li517745320655"></a><span>Perform capacity expansion for Supervisor.</span></li><li id="ALM-26053__li4780030320655"><span>Wait several minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-26053__ul1218919020655"><li id="ALM-26053__li4659708420655">If yes, no further action is required.</li><li id="ALM-26053__li1626743520655">If no, go to <a href="#ALM-26053__li1692048320655">10</a>.<div class="note" id="ALM-26053__note664819163111"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-26053__p55701018141118">Services are interrupted when the Supervisor is being restarted. Then, services are restored after the restarting.</p>
</div></div>
</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-26053__p4259384720655"><strong id="ALM-26053__b5621207520743">Collect fault information.</strong></p>
<ol start="10" id="ALM-26053__ol3294607520757"><li id="ALM-26053__li1692048320655"><a name="ALM-26053__li1692048320655"></a><a name="li1692048320655"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-26053__b94418407566">O&amp;M</strong> &gt; <strong id="ALM-26053__b1644624513567">Log </strong>&gt;<strong id="ALM-26053__b20447145105612"> Download</strong>.</span></li><li id="ALM-26053__li2088152720655"><span>Select <strong id="ALM-26053__b1806662620655">Storm</strong> and <strong id="ALM-26053__b2838190820655">ZooKeeper</strong> in the required cluster from the <strong id="ALM-26053__b5411058820655">Service</strong> drop-down list box.</span></li><li id="ALM-26053__li1145664103113"><span>Click <span><img id="ALM-26053__image1945644173117" src="en-us_image_0269417462.png"></span> in the upper right corner, and set <strong id="ALM-26053__b6456941173117">Start Date</strong> and <strong id="ALM-26053__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-26053__b13456164113319">Download</strong>.</span></li><li id="ALM-26053__li4396313120655"><span>Contact the <span id="ALM-26053__text4614151421417">O&amp;M personnel</span> and send the collected logs.</span></li></ol>
</div>
<div class="section" id="ALM-26053__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-26053__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-26053__s03d18e14bd3746449ab83201cee300b6"><h4 class="sectiontitle">Related Information</h4><p id="ALM-26053__en-us_topic_0070543552_p51751889">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>