forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
84 lines
9.8 KiB
HTML
84 lines
9.8 KiB
HTML
<a name="ALM-13008"></a><a name="ALM-13008"></a>
|
|
|
|
<h1 class="topictitle1">ALM-13008 ZooKeeper Znode Usage Exceeds the Threshold</h1>
|
|
<div id="body1559547426810"><div class="section" id="ALM-13008__section18794533"><h4 class="sectiontitle">Description</h4><p id="ALM-13008__p1225011584337">The system checks the level-2 Znode status in the ZooKeeper data directory every hour. This alarm is generated when the system detects that the level-2 Znode usage exceeds the threshold.</p>
|
|
</div>
|
|
<div class="section" id="ALM-13008__section34933073"><h4 class="sectiontitle">Attribute</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-13008__table52262125" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-13008__row24697033"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-13008__en-us_topic_0070543636_p44032603">Alarm ID</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-13008__en-us_topic_0070543636_p9871120">Alarm Severity</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-13008__en-us_topic_0070543636_p61363278">Automatically Cleared</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-13008__row37919625"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-13008__p1163219417345">13008</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-13008__en-us_topic_0070543638_p9804735">Major</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-13008__en-us_topic_0070543638_p55986102">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-13008__section45962205"><h4 class="sectiontitle">Parameters</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-13008__table51772816" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-13008__row55869420"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-13008__en-us_topic_0070543636_p28093062">Name</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-13008__en-us_topic_0070543636_p60945575">Meaning</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-13008__row1011301918376"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13008__p192431315431">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13008__p692551319435">Specifies the cluster for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-13008__row57640736"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13008__p38388029">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13008__en-us_topic_0070543636_p25480534">Specifies the service name for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-13008__row477048"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13008__p38640893">ServiceDirectory</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13008__p17361818514">Specifies the directory for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-13008__row111316194717"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13008__p39186745">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13008__en-us_topic_0070543636_p60763443">Specifies the role for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-13008__row50597141"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13008__p4727789">Trigger Condition</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13008__p2543134315394">Specifies the cause of the alarm.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-13008__section11006666"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-13008__p14730421">A large amount of data is written to the ZooKeeper data directory. As a result, ZooKeeper cannot provide services properly.</p>
|
|
</div>
|
|
<div class="section" id="ALM-13008__section31951138"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-13008__ul8764021163412"><li id="ALM-13008__li87641421163417">A large amount of data is written to the ZooKeeper data directory.</li><li id="ALM-13008__li17764112116349">The user-defined threshold is inappropriate.</li></ul>
|
|
</div>
|
|
<div class="section" id="ALM-13008__section1651164173016"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-13008__p33897081"><strong id="ALM-13008__b1275916344342">Check whether a large amount of data is written into the directory for which the alarm is generated.</strong></p>
|
|
<ol id="ALM-13008__ol18001226161846"><li id="ALM-13008__li13718172511465"><span>Log in to FusionInsight Manager, choose <strong id="ALM-13008__b141601653133416">Cluster</strong> > <em id="ALM-13008__i61601053203418">Name of the desired cluster</em> > <strong id="ALM-13008__b1116005353413">Services</strong> > <strong id="ALM-13008__b1160253193418">ZooKeeper</strong>, and click <strong id="ALM-13008__b161606537344">Resource</strong>. Click <strong id="ALM-13008__b101605531340">By Znode quantity</strong> in <strong id="ALM-13008__b516015530340">Used Resources (By Second-Level Znode)</strong>, and check whether a large amount of data is written to the top Znode.</span><p><ul class="subitemlist" id="ALM-13008__ul27177282161840"><li id="ALM-13008__li66225583161840">If yes, go to <a href="#ALM-13008__li787172215383">2</a>.</li><li id="ALM-13008__li62672021161840">If no, go to <a href="#ALM-13008__li10279134491613">4</a>.</li></ul>
|
|
</p></li><li id="ALM-13008__li787172215383"><a name="ALM-13008__li787172215383"></a><a name="li787172215383"></a><span>Log in to FusionInsight Manager, choose <strong id="ALM-13008__b158832210389">O&M > Alarm > Alarms</strong>, select <strong id="ALM-13008__b088122218387">Location</strong> from the drop-down list box next to <strong id="ALM-13008__b1088622183818">ALM-13008 ZooKeeper Znode Quantity Usage Exceeds Threshold</strong>, and obtain the Znode path in <strong id="ALM-13008__b5881822123819">ServiceDirectory</strong>.</span></li><li id="ALM-13008__li1314310536368"><span>Log in to the ZooKeeper client as a cluster user and delete unnecessary data from the Znode corresponding to the alarm.</span></li><li id="ALM-13008__li10279134491613"><a name="ALM-13008__li10279134491613"></a><a name="li10279134491613"></a><span>Log in to FusionInsight Manager, choose <strong id="ALM-13008__b68614315364">Cluster</strong> > <em id="ALM-13008__i186117315362">Name of the desired cluster</em> > <strong id="ALM-13008__b1862839367">Services</strong> > <strong id="ALM-13008__b58628311361">ZooKeeper</strong> > <strong id="ALM-13008__b486218319364">Configurations</strong> > <strong id="ALM-13008__b78628317361">All Configurations</strong>, and search for <strong id="ALM-13008__b18862173183615">max.znode.count</strong>, which is the maximum number of ZooKeeper directories. The alarm threshold is 80% of this parameter. Increase the value of this parameter, click <strong id="ALM-13008__b198626313361">Save</strong>, and restart the service for the configuration to take effect.</span></li><li id="ALM-13008__li817635715531"><span>Check whether the alarm is cleared.</span><p><ul id="ALM-13008__ul1786016224361"><li id="ALM-13008__li156235263365">If yes, no further action is required.</li><li id="ALM-13008__li11860122113616">If no, go to <a href="#ALM-13008__li180651333416">6</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p class="tableheading" id="ALM-13008__p4863846161727"><strong id="ALM-13008__b60263252161739">Collect fault information.</strong></p>
|
|
<ol start="6" id="ALM-13008__ol128061313163411"><li id="ALM-13008__li180651333416"><a name="ALM-13008__li180651333416"></a><a name="li180651333416"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-13008__b14806171317349">O&M</strong> > <strong id="ALM-13008__b180661315344">Log > Download</strong>.</span></li><li id="ALM-13008__li88061413103411"><span>Select <strong id="ALM-13008__b20806111313415">ZooKeeper</strong> in the required cluster from the <strong id="ALM-13008__b1280661313417">Service</strong>.</span></li><li id="ALM-13008__li4806813193415"><span>Click <span><img id="ALM-13008__image4806513113414" src="en-us_image_0269383953.png"></span> in the upper right corner, and set <strong id="ALM-13008__b68061813133420">Start Date</strong> and <strong id="ALM-13008__b138067139349">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-13008__b1280619137345">Download</strong>.</span></li><li id="ALM-13008__li98061213103418"><span>Contact the <span id="ALM-13008__text4614151421417">O&M personnel</span> and send the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-13008__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-13008__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
|
|
</div>
|
|
<div class="section" id="ALM-13008__sb2eb8883fb1940d0b05b690215576d2e"><h4 class="sectiontitle">Related Information</h4><p id="ALM-13008__en-us_topic_0070543636_p64481034">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|