Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="ALM-13007"></a><a name="ALM-13007"></a>
<h1 class="topictitle1">ALM-13007 Available ZooKeeper Client Connections Are Insufficient</h1>
<div id="body1547262939874"><div class="section" id="ALM-13007__section18794533"><h4 class="sectiontitle">Description</h4><p id="ALM-13007__p27761636191513">The system checks the number of active connections between ZooKeeper clients and the ZooKeeper server every 60 seconds. This alarm is generated when the number of connections exceeds the threshold.</p>
</div>
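<p>The threshold check described above can be illustrated with a minimal sketch. It assumes a list of client IP addresses, one entry per open connection, has already been collected; the function name <code>clients_over_threshold</code>, the sample addresses, and the threshold value are hypothetical and are not part of the product.</p>

```python
from collections import Counter


def clients_over_threshold(connections, threshold):
    """Return {client_ip: count} for clients whose count exceeds threshold.

    `connections` is an iterable of client IP strings, one entry per open
    connection. How this list is collected is an assumption left out of
    this sketch.
    """
    counts = Counter(connections)
    return {ip: n for ip, n in counts.items() if n > threshold}


# Hypothetical sample: four connections from one client, two from another.
sample = ["10.0.0.5"] * 4 + ["10.0.0.6"] * 2
print(clients_over_threshold(sample, threshold=3))
```

<p>A client is reported only when its count is strictly greater than the threshold, matching the wording "exceeds the threshold".</p>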
<div class="section" id="ALM-13007__section8784133162417"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-13007__table187851833132420" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-13007__row1789173312414"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-13007__en-us_topic_0070543636_p44032603">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-13007__en-us_topic_0070543636_p9871120">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-13007__en-us_topic_0070543636_p61363278">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-13007__row979223319247"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-13007__en-us_topic_0070543636_p18392962">13007</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-13007__en-us_topic_0070543636_p13434953">Minor</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-13007__en-us_topic_0070543636_p14489400">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-13007__section45962205"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-13007__table51772816" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-13007__row55869420"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-13007__en-us_topic_0070543636_p28093062">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-13007__en-us_topic_0070543636_p60945575">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-13007__row2066363033719"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13007__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13007__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13007__row57640736"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13007__en-us_topic_0070543636_p29312231">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13007__en-us_topic_0070543636_p25480534">Specifies the service name for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13007__row1361705619912"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13007__en-us_topic_0070543634_p63355478">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13007__en-us_topic_0070543634_p31520075">Specifies the role name for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13007__row477048"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13007__p126071502164">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13007__p160730181611">Specifies the host name for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13007__row1472745151511"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13007__p1160718071617">ClientIP</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13007__p2607609169">Specifies the client IP address.</p>
</td>
</tr>
<tr id="ALM-13007__row220615551514"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13007__p15607100121611">ServerIP</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13007__p9607403165">Specifies the server IP address.</p>
</td>
</tr>
<tr id="ALM-13007__row50597141"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13007__p4727789">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13007__p47406613">Specifies the cause of the alarm.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-13007__section11006666"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-13007__p14730421">When a large number of clients connect to ZooKeeper, the available connections can be exhausted, and ZooKeeper can no longer provide services properly.</p>
</div>
<div class="section" id="ALM-13007__section31951138"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-13007__p13908419124316">A large number of client processes are connected to ZooKeeper, or the alarm threshold is not set appropriately.</p>
</div>
<div class="section" id="ALM-13007__section433103353311"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-13007__p33897081"><strong id="ALM-13007__b3926356627">Check whether there are a large number of client processes connected to ZooKeeper.</strong></p>
<ol id="ALM-13007__ol167406317264"><li id="ALM-13007__li13925125054018"><span>On FusionInsight Manager, choose <strong id="ALM-13007__b0989163143314">O&M</strong> > <strong id="ALM-13007__b1098973103314">Alarm </strong>> <strong id="ALM-13007__b39898316336">Alarms</strong>. On the displayed page, click the drop-down button of <strong id="ALM-13007__b077716111321">Available ZooKeeper Client Connections Are Insufficient</strong>. In the location information, check the IP address of the node for which the alarm is generated.</span></li><li id="ALM-13007__li137391311268"><span>Open the ZooKeeper service page and click <strong id="ALM-13007__b1783216211454">Resource</strong> to enter the <strong id="ALM-13007__b1783211215518">Resource</strong> page. In <strong id="ALM-13007__b1683292115518">Number of Connections</strong><strong id="ALM-13007__b208321121756"> (By Client IP Address)</strong>, check whether the client with that IP address has a large number of connections.</span><p><ul class="subitemlist" id="ALM-13007__ul473915316264"><li id="ALM-13007__li10738153162617">If it does, go to <a href="#ALM-13007__li9739531132620">3</a>.</li><li id="ALM-13007__li57399316268">If it does not, go to <a href="#ALM-13007__li1373973122619">4</a>.</li></ul>
</p></li><li id="ALM-13007__li9739531132620"><a name="ALM-13007__li9739531132620"></a><a name="li9739531132620"></a><span>Check whether the client process is leaking connections.</span></li><li id="ALM-13007__li1373973122619"><a name="ALM-13007__li1373973122619"></a><a name="li1373973122619"></a><span>Click <span><img id="ALM-13007__image82088421961" src="en-us_image_0269383950.gif"></span> in <strong id="ALM-13007__b142081742268">Number of Connections</strong> <strong id="ALM-13007__b32086421668">(by Client IP Address) </strong>to enter the <strong id="ALM-13007__b22081942169">Thresholds</strong><strong id="ALM-13007__b912333161"> </strong>page, and click <strong id="ALM-13007__b22081842861">Modify</strong> under <strong id="ALM-13007__b22084425616">Operation</strong>. Increase the threshold based on the value of <strong id="ALM-13007__b620844217620">maxClientCnxns</strong>. To view this value, choose <strong id="ALM-13007__b6212173752610">Cluster > </strong><em id="ALM-13007__i421463722619">Name of the desired cluster</em><strong id="ALM-13007__b1421320375260"> > Services</strong> > <strong id="ALM-13007__b12335119161213">ZooKeeper</strong> > <strong id="ALM-13007__b102081427618">Configurations > All Configurations > quorumpeer</strong>.</span></li><li id="ALM-13007__li8740173118265"><span>Check whether the alarm is cleared.</span><p><ul id="ALM-13007__ul874073110261"><li id="ALM-13007__li3739153114263">If it is, no further action is required.</li><li id="ALM-13007__li574063120263">If it is not, go to <a href="#ALM-13007__li27361331112613">6</a>.</li></ul>
</p></li></ol>
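<p>One possible way to approach the connection leakage check in step 3 is to sample the connection count of the suspect client several times and look for growth that never recedes. The sketch below is only a heuristic under that assumption; the function name <code>looks_like_leak</code>, the <code>min_growth</code> parameter, and the sample values are hypothetical, and a positive result still needs to be confirmed in the client application.</p>

```python
def looks_like_leak(samples, min_growth=10):
    """Heuristic leak check over connection counts sampled in time order.

    Returns True when the count never decreases between consecutive samples
    and grows by at least `min_growth` from the first to the last sample.
    Both conditions are assumptions chosen for illustration.
    """
    if len(samples) < 2:
        return False
    never_decreases = all(later >= earlier
                          for earlier, later in zip(samples, samples[1:]))
    return never_decreases and samples[-1] - samples[0] >= min_growth


# Hypothetical samples taken at regular intervals for one client IP address.
print(looks_like_leak([12, 18, 25, 33]))   # steady growth
print(looks_like_leak([12, 30, 14, 13]))   # burst that recovers
```

<p>A count that rises and then falls back usually points to a temporary load peak rather than a leak, which is why the sketch requires the samples to be non-decreasing.</p>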
<p class="tableheading" id="ALM-13007__p4863846161727"><strong id="ALM-13007__b60263252161739">Collect fault information.</strong></p>
<ol start="6" id="ALM-13007__ol1473843117261"><li id="ALM-13007__li27361331112613"><a name="ALM-13007__li27361331112613"></a><a name="li27361331112613"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-13007__b17735193112269">O&M</strong> > <strong id="ALM-13007__b1373518312260">Log > Download</strong>.</span></li><li id="ALM-13007__li107369316261"><span>In <strong id="ALM-13007__b6736331152610">Service</strong>, select <strong id="ALM-13007__b18736183182620">ZooKeeper</strong> for the target cluster.</span></li><li id="ALM-13007__li11736103132617"><span>Click <span><img id="ALM-13007__image157361231102619" src="en-us_image_0269383952.png"></span> in the upper right corner, and set <strong id="ALM-13007__b11736231122619">Start Date</strong> and <strong id="ALM-13007__b1873663112618">End Date</strong> for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-13007__b1573619314263">Download</strong>.</span></li><li id="ALM-13007__li27381331192612"><span>Contact the <span id="ALM-13007__text4614151421417">O&M personnel</span> and send the collected logs.</span></li></ol>
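<p>The log collection window in the steps above is the alarm generation time minus and plus 10 minutes. A minimal sketch of that arithmetic follows; the function name <code>log_window</code> and the example alarm time are hypothetical.</p>

```python
from datetime import datetime, timedelta


def log_window(alarm_time, margin_minutes=10):
    """Return (start, end) for log collection around the alarm time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time.
start, end = log_window(datetime(2024, 1, 15, 9, 30))
print(start, end)  # 2024-01-15 09:20:00 2024-01-15 09:40:00
```

<p>The resulting values correspond to <strong>Start Date</strong> and <strong>End Date</strong> in the log download dialog.</p>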
</div>
<div class="section" id="ALM-13007__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-13007__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-13007__sb2eb8883fb1940d0b05b690215576d2e"><h4 class="sectiontitle">Related Information</h4><p id="ALM-13007__en-us_topic_0070543636_p64481034">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>