forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
118 lines
25 KiB
HTML
118 lines
25 KiB
HTML
<a name="ALM-19000"></a><a name="ALM-19000"></a>
|
|
|
|
<h1 class="topictitle1">ALM-19000 HBase Service Unavailable</h1>
|
|
<div id="body31505145"><div class="section" id="ALM-19000__se4d8370b2cc642859f53aec325cc8680"><h4 class="sectiontitle">Description</h4><p id="ALM-19000__en-us_topic_0070543519_p43536440">This alarm is generated when the HBase service is unavailable. The alarm module checks the HBase service status every 120 seconds.</p>
|
|
<p id="ALM-19000__en-us_topic_0070543519_p56283646">This alarm is cleared when the HBase service recovers.</p>
|
|
<div class="note" id="ALM-19000__en-us_topic_0070543519_note36790769"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-19000__en-us_topic_0070543519_p62681466">If the multi-instance function is enabled in the cluster and multiple HBase service instances are installed, you need to determine the HBase service instance where the alarm is generated based on the value of <strong id="ALM-19000__en-us_topic_0070543519_b27262282">ServiceName</strong> in <strong id="ALM-19000__en-us_topic_0070543519_b44033946">Location</strong>. For example, if the HBase1 service is unavailable, ServiceName=HBase1 is displayed in <strong id="ALM-19000__en-us_topic_0070543519_b9979846">Location</strong>, and the operation object in the procedure needs to be changed from HBase to HBase1.</p>
|
|
</div></div>
|
|
</div>
|
|
<div class="section" id="ALM-19000__se68e92dd1d3c443583eac1bb1e885c84"><h4 class="sectiontitle">Attribute</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-19000__en-us_topic_0070543519_table3061223" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-19000__en-us_topic_0070543519_row6847126"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-19000__en-us_topic_0070543519_p17746329">Alarm ID</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-19000__en-us_topic_0070543519_p28166553">Alarm Severity</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-19000__en-us_topic_0070543519_p66898301">Automatically Cleared</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-19000__en-us_topic_0070543519_row50053294"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-19000__en-us_topic_0070543519_p27784979">19000</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-19000__en-us_topic_0070543519_p35990804">Critical</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-19000__en-us_topic_0070543519_p29574043">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-19000__s22f073f7b7e14ba1a4ba384eb5ea716d"><h4 class="sectiontitle">Parameters</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-19000__en-us_topic_0070543519_table46687245" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-19000__en-us_topic_0070543519_row3037596"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-19000__en-us_topic_0070543519_p44718725">Name</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-19000__en-us_topic_0070543519_p65447007">Meaning</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-19000__row14335750121020"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-19000__p192431315431">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-19000__p692551319435">Specifies the cluster for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-19000__en-us_topic_0070543519_row66716242"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-19000__en-us_topic_0070543519_p35306501">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-19000__en-us_topic_0070543519_p41254304">Specifies the service for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-19000__en-us_topic_0070543519_row35744420"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-19000__en-us_topic_0070543519_p9616944">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-19000__en-us_topic_0070543519_p40774967">Specifies the role for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-19000__en-us_topic_0070543519_row31430389"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-19000__en-us_topic_0070543519_p62833603">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-19000__en-us_topic_0070543519_p56357057">Specifies the host for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-19000__s6dbfda5d08c54e228164940bb9462e6a"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-19000__en-us_topic_0070543519_p1518871">Operations, such as reading or writing data and creating tables, cannot be performed.</p>
|
|
</div>
|
|
<div class="section" id="ALM-19000__sa16a051bac584db5bad49d794b8a8ab5"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-19000__en-us_topic_0070543519_ul55919726"><li id="ALM-19000__en-us_topic_0070543519_li33515489">The ZooKeeper service is abnormal.</li><li id="ALM-19000__en-us_topic_0070543519_li33203949">The HDFS service is abnormal.</li><li id="ALM-19000__en-us_topic_0070543519_li30400092">The HBase service is abnormal.</li><li id="ALM-19000__en-us_topic_0070543519_li5165373">The network is abnormal.</li></ul>
|
|
</div>
|
|
<div class="section" id="ALM-19000__sa7511492888b4e8ca9c8b31ca395266f"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-19000__en-us_topic_0070543519_p15742086"><strong id="ALM-19000__b36635536192615">Check the ZooKeeper service status.</strong></p>
|
|
<ol id="ALM-19000__ol61514467192629"><li id="ALM-19000__li58779041192610"><span>On the FusionInsight Manager, check whether the running status of ZooKeeper is <strong id="ALM-19000__b1953016571206">Normal</strong> on service list.</span><p><ul class="subitemlist" id="ALM-19000__ul51270247192610"><li id="ALM-19000__li39194152192610">If yes, go to <a href="#ALM-19000__li31549687192610">5</a>.</li><li id="ALM-19000__li20609775192610">If no, go to <a href="#ALM-19000__li42710393192610">2</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li42710393192610"><a name="ALM-19000__li42710393192610"></a><a name="li42710393192610"></a><span>In the alarm list, check whether <strong id="ALM-19000__b099920513281">ALM-13000 ZooKeeper Service Unavailable</strong> exists.</span><p><ul class="subitemlist" id="ALM-19000__ul27115220192610"><li id="ALM-19000__li34466159192610">If yes, go to <a href="#ALM-19000__li36989843192610">3</a>.</li><li id="ALM-19000__li40295504192610">If no, go to <a href="#ALM-19000__li31549687192610">5</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li36989843192610"><a name="ALM-19000__li36989843192610"></a><a name="li36989843192610"></a><span>Rectify the fault by following the steps provided in <strong id="ALM-19000__b15593184512811">ALM-13000 ZooKeeper Service Unavailable</strong>.</span></li><li id="ALM-19000__li55607700192610"><span>Wait several minutes, and check whether alarm is cleared.</span><p><ul class="subitemlist" id="ALM-19000__ul24713145192610"><li id="ALM-19000__li64473138192610">If yes, no further action is required.</li><li id="ALM-19000__li54941688192610">If no, go to <a href="#ALM-19000__li31549687192610">5</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p class="tableheading" id="ALM-19000__p21091714192610"><strong id="ALM-19000__b40700349192656">Check the HDFS service status.</strong></p>
|
|
<ol start="5" id="ALM-19000__ol1006540619288"><li id="ALM-19000__li31549687192610"><a name="ALM-19000__li31549687192610"></a><a name="li31549687192610"></a><span>In the alarm list, check whether <strong id="ALM-19000__b137123614296">ALM-14000 HDFS Service Unavailable</strong> exists.</span><p><ul class="subitemlist" id="ALM-19000__ul18418601192610"><li id="ALM-19000__li4259844192610">If yes, go to <a href="#ALM-19000__li5387888192610">6</a>.</li><li id="ALM-19000__li9503051192610">If no, go to <a href="#ALM-19000__li7395880192610">8</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li5387888192610"><a name="ALM-19000__li5387888192610"></a><a name="li5387888192610"></a><span>Rectify the fault by following the steps provided in <strong id="ALM-19000__b1250182417295">ALM-14000 HDFS Service Unavailable</strong>.</span></li><li id="ALM-19000__li53416482192610"><span>Wait several minutes, and check whether alarm is cleared.</span><p><ul class="subitemlist" id="ALM-19000__ul50674407192610"><li id="ALM-19000__li48490997192610">If yes, no further action is required.</li><li id="ALM-19000__li35456651192610">If no, go to <a href="#ALM-19000__li7395880192610">8</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li7395880192610"><a name="ALM-19000__li7395880192610"></a><a name="li7395880192610"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-19000__b1894114682412">Cluster</strong><em id="ALM-19000__i187545519241"> > Name of the desired cluster</em> > <strong id="ALM-19000__b1299170202511">Services</strong> > <strong id="ALM-19000__b1846464172519">HDFS</strong>. Check whether <strong id="ALM-19000__b1464733392714">Safe Mode </strong>is<strong id="ALM-19000__b127611045192720"> ON</strong>.</span><p><ul class="subitemlist" id="ALM-19000__ul30647926192610"><li id="ALM-19000__li30420539192610">If yes, go to <a href="#ALM-19000__li42432199192610">9</a>.</li><li id="ALM-19000__li48144567192610">If no, go to <a href="#ALM-19000__li3109192610">12</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li42432199192610"><a name="ALM-19000__li42432199192610"></a><a name="li42432199192610"></a><span>Log in to the HDFS client as user <strong id="ALM-19000__b66562928192610">root</strong>. <span id="ALM-19000__text1619316594018"></span>Run <strong id="ALM-19000__b62195446192610">cd</strong> to switch to the client installation directory, and run <strong id="ALM-19000__b22888107192610">source bigdata_env</strong>.</span><p><p class="litext" id="ALM-19000__p41997391192610">If the cluster uses the security mode, perform security authentication. Obtain the password of user hdfs from the administrator, run the <strong id="ALM-19000__b4666376192610">kinit hdfs</strong> command and enter the password as prompted.</p>
|
|
</p></li><li id="ALM-19000__li62995951192610"><span>Run the following command to manually exit the safe mode:</span><p><p class="litext" id="ALM-19000__p14456090192610"><strong id="ALM-19000__b46345474192610">hdfs dfsadmin -safemode leave</strong></p>
|
|
</p></li><li id="ALM-19000__li32403735192610"><span>Wait several minutes and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-19000__ul60052369192610"><li id="ALM-19000__li30092650192610">If yes, no further action is required.</li><li id="ALM-19000__li21585566192610">If no, go to <a href="#ALM-19000__li3109192610">12</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p class="tableheading" id="ALM-19000__p3600415192610"><strong id="ALM-19000__b4197330192827">Check the HBase service status.</strong></p>
|
|
<ol start="12" id="ALM-19000__ol3023397219298"><li id="ALM-19000__li3109192610"><a name="ALM-19000__li3109192610"></a><a name="li3109192610"></a><span>On the FusionInsight Manager portal, click <strong id="ALM-19000__b46519202617">Cluster</strong> > <em id="ALM-19000__i58881131263">Name of the desired cluster</em> ><strong id="ALM-19000__b191427255268"> Services</strong> > <strong id="ALM-19000__b1673382820266">HBase</strong>.</span></li><li id="ALM-19000__li42103381192610"><span>Check whether there is one active HMaster and one standby HMaster.</span><p><ul class="subitemlist" id="ALM-19000__ul49417396192610"><li id="ALM-19000__li251900192610">If yes, go to <a href="#ALM-19000__li26121173192610">15</a>.</li><li id="ALM-19000__li20403902192610">If no, go to <a href="#ALM-19000__li51944053192610">14</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li51944053192610"><a name="ALM-19000__li51944053192610"></a><a name="li51944053192610"></a><span>Click <strong id="ALM-19000__b43386116192610">Instances</strong>, select the HMaster whose status is not <strong id="ALM-19000__b54930729192610">Active</strong>, click <strong id="ALM-19000__b24614517192610">More</strong>, and select <strong id="ALM-19000__b20204065192610">Restart Instance</strong> to restart the HMaster. Check whether there is one active HMaster and one standby HMaster again.</span><p><ul class="subitemlist" id="ALM-19000__ul35597723192610"><li id="ALM-19000__li25916582192610">If yes, go to <a href="#ALM-19000__li26121173192610">15</a>.</li><li id="ALM-19000__li18868383192610">If no, go to <a href="#ALM-19000__li23797537192610">21</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li26121173192610"><a name="ALM-19000__li26121173192610"></a><a name="li26121173192610"></a><span>Choose <strong id="ALM-19000__b685816983615">Cluster </strong>><em id="ALM-19000__i4970423193417">Name of the desired cluster</em> ><strong id="ALM-19000__b196752311342"> Services</strong> > <strong id="ALM-19000__b46718735192610">HBase</strong> > <strong id="ALM-19000__b17815433192610">HMaster(Active)</strong> to go to the HMaster WebUI.</span><p><div class="note" id="ALM-19000__note840916461457"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-19000__en-us_topic_0193189480_p91833832915">By default, the <strong id="ALM-19000__en-us_topic_0193189480_b4780151814294">admin</strong> user does not have the permissions to manage other components. If the page cannot be opened or the displayed content is incomplete when you access the native UI of a component due to insufficient permissions, you can manually create a user with the permissions to manage that component.</p>
|
|
</div></div>
|
|
</p></li><li id="ALM-19000__li58869484192610"><span>Check whether at least one RegionServer exists under <strong id="ALM-19000__b337515311010">Region Servers</strong>.</span><p><ul class="subitemlist" id="ALM-19000__ul58736836192610"><li id="ALM-19000__li50527289192610">If yes, go to <a href="#ALM-19000__li52728456192610">17</a>.</li><li id="ALM-19000__li66178638192610">If no, go to <a href="#ALM-19000__li23797537192610">21</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li52728456192610"><a name="ALM-19000__li52728456192610"></a><a name="li52728456192610"></a><span>Check <strong id="ALM-19000__b60063309192610">Tables</strong> > <strong id="ALM-19000__b3698874192610">System Tables</strong>, as shown in <a href="#ALM-19000__fig13078536192610">Figure 1</a>. Check whether <strong id="ALM-19000__b9442152618572">hbase:meta</strong>, <strong id="ALM-19000__b17443162695715">hbase:namespace</strong>, and <strong id="ALM-19000__b17445112665711">hbase:acl</strong> exist in the <strong id="ALM-19000__b12124644192610">Table Name</strong> column.</span><p><ul class="subitemlist" id="ALM-19000__ul30764662192610"><li id="ALM-19000__li42572098192610">If yes, go to <a href="#ALM-19000__li52774331192610">18</a>.</li><li id="ALM-19000__li25787917192610">If no, go to <a href="#ALM-19000__li2123961192610">19</a>.</li></ul>
|
|
<div class="fignone" id="ALM-19000__fig13078536192610"><a name="ALM-19000__fig13078536192610"></a><a name="fig13078536192610"></a><span class="figcap"><b>Figure 1 </b>HBase system table</span><br><span><img id="ALM-19000__image1854312193919" src="en-us_image_0269417415.png"></span></div>
|
|
</p></li><li id="ALM-19000__li52774331192610"><a name="ALM-19000__li52774331192610"></a><a name="li52774331192610"></a><span>As shown in <a href="#ALM-19000__fig13078536192610">Figure 1</a>, click the <strong id="ALM-19000__b2094692675610">hbase:meta</strong>, <strong id="ALM-19000__b1034693111561">hbase:namespace</strong>, and <strong id="ALM-19000__b1125511375565">hbase:acl</strong> hyperlinks and check whether the pages are properly displayed. If the pages are properly displayed, the tables are normal.</span><p><p id="ALM-19000__p1833418375529">If they are, go to <a href="#ALM-19000__li2123961192610">19</a>.</p>
|
|
<p id="ALM-19000__p633423735212">If they are not, go to <a href="#ALM-19000__li52963882192610">23</a>.</p>
|
|
<div class="note" id="ALM-19000__note9144144115521"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-19000__p17681412135315">In normal mode, <strong id="ALM-19000__b78401453165610">ACL</strong> is enabled for HBase by default. The <strong id="ALM-19000__b11457112577">hbase:acl</strong> table is generated only when <strong id="ALM-19000__b78051054579">ACL</strong> is manually enabled. In this case, check this table. In other scenarios, this table does not need to be checked.</p>
|
|
</div></div>
|
|
</p></li><li id="ALM-19000__li2123961192610"><a name="ALM-19000__li2123961192610"></a><a name="li2123961192610"></a><span>View the HMaster startup status.</span><p><p class="litext" id="ALM-19000__p34574471192610">In <a href="#ALM-19000__fig2133867192610">Figure 2</a>, if the <strong id="ALM-19000__b46862405192610">RUNNING</strong> state exists in <strong id="ALM-19000__b19108467192610">Tasks</strong>, HMaster is being started. In the <strong id="ALM-19000__b37758480192610">State</strong> column, you can view the time when HMaster is in the <strong id="ALM-19000__b4282005192610">RUNNING</strong> state. In <a href="#ALM-19000__fig41660353192610">Figure 3</a>, if the state is <strong id="ALM-19000__b11298148192610">COMPLETE</strong>, HMaster is started.</p>
|
|
<p class="litext" id="ALM-19000__p49068804192610">Check whether HMaster is in the <strong id="ALM-19000__b42734791192610">RUNNING</strong> state for a long time.</p>
|
|
<div class="fignone" id="ALM-19000__fig2133867192610"><a name="ALM-19000__fig2133867192610"></a><a name="fig2133867192610"></a><span class="figcap"><b>Figure 2 </b>HMaster is being started</span><br><span><img id="ALM-19000__image15150177192610" src="en-us_image_0269417416.png"></span></div>
|
|
<div class="fignone" id="ALM-19000__fig41660353192610"><a name="ALM-19000__fig41660353192610"></a><a name="fig41660353192610"></a><span class="figcap"><b>Figure 3 </b>HMaster is started</span><br><span><img id="ALM-19000__image12085468192610" src="en-us_image_0269417417.png"></span></div>
|
|
<ul class="subitemlist" id="ALM-19000__ul235995192610"><li id="ALM-19000__li37190969192610">If yes, go to <a href="#ALM-19000__li34107122192610">20</a>.</li><li id="ALM-19000__li59678545192610">If no, go to <a href="#ALM-19000__li23797537192610">21</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li34107122192610"><a name="ALM-19000__li34107122192610"></a><a name="li34107122192610"></a><span>On the HMaster WebUI, check whether any hbase:meta is in the <strong id="ALM-19000__b19115650192610">Region in Transition</strong> state for a long time.</span><p><div class="fignone" id="ALM-19000__fig43774618192610"><span class="figcap"><b>Figure 4 </b>Region in Transition</span><br><span><img id="ALM-19000__image4863846192610" src="en-us_image_0269417418.png"></span></div>
|
|
<ul class="subitemlist" id="ALM-19000__ul3789680192610"><li id="ALM-19000__li34986499192610">If yes, go to <a href="#ALM-19000__li23797537192610">21</a>.</li><li id="ALM-19000__li15334156192610">If no, go to <a href="#ALM-19000__li53096940192610">22</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li23797537192610"><a name="ALM-19000__li23797537192610"></a><a name="li23797537192610"></a><span>In the precondition that services are not affected, log in to the FusionInsight Manager portal and choose <strong id="ALM-19000__b1484633819456">Cluster </strong>> <em id="ALM-19000__i175981233194118">Name of the desired cluster</em> ><strong id="ALM-19000__b19846183844514"> Services</strong> > <strong id="ALM-19000__b11213468192610">HBase</strong> > <strong id="ALM-19000__b33812350192610">More</strong> > <strong id="ALM-19000__b35875698192610">Restart Service</strong>. Enter the administrator password and click <strong id="ALM-19000__b9566182520469">OK</strong>.</span><p><ul class="subitemlist" id="ALM-19000__ul54839953192610"><li id="ALM-19000__li48036430192610">If yes, go to <a href="#ALM-19000__li53096940192610">22</a>.</li><li id="ALM-19000__li65745651192610">If no, go to <a href="#ALM-19000__li52963882192610">23</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li53096940192610"><a name="ALM-19000__li53096940192610"></a><a name="li53096940192610"></a><span>Wait several minutes and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-19000__ul40423733192610"><li id="ALM-19000__li12851242192610">If yes, no further action is required.</li><li id="ALM-19000__li34317687192610">If no, go to <a href="#ALM-19000__li52963882192610">23</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p class="tableheading" id="ALM-19000__p28269281192610"><strong id="ALM-19000__b58906846192934">Check the network connection between HMaster and dependent components.</strong></p>
|
|
<ol start="23" id="ALM-19000__ol24696967192946"><li id="ALM-19000__li52963882192610"><a name="ALM-19000__li52963882192610"></a><a name="li52963882192610"></a><span>On the FusionInsight Manager, choose <strong id="ALM-19000__b141874916461">Cluster </strong>><em id="ALM-19000__i132394433418">Name of the desired cluster</em> ><strong id="ALM-19000__b1321104418343"> Services</strong> > <strong id="ALM-19000__b5884875192610">HBase</strong>.</span></li><li id="ALM-19000__li6333253192610"><a name="ALM-19000__li6333253192610"></a><a name="li6333253192610"></a><span>Click <strong id="ALM-19000__b6912891192610">Instance</strong> and the HMaster instance list is displayed. Record the<strong id="ALM-19000__b297564385019"> management IP Address </strong>in the row of <strong id="ALM-19000__b23073316192610">HMaster(Active)</strong>.</span></li><li id="ALM-19000__li12086129192610"><span>Use the IP address obtained in <a href="#ALM-19000__li6333253192610">24</a> to log in to the host where the active HMaster runs as user <strong id="ALM-19000__b56999277192610">omm</strong> .</span></li><li id="ALM-19000__li8782905192610"><span>Run the <strong id="ALM-19000__b41666304192610">ping</strong> command to check whether communication between the host that runs the active HMaster and the hosts that run the dependent components. (The dependent components include ZooKeeper, HDFS and Yarn. Obtain the IP addresses of the hosts that run these services in the same way as that for obtaining the IP address of the active HMaster.)</span><p><ul class="subitemlist" id="ALM-19000__ul8432418192610"><li id="ALM-19000__li19527450192610">If yes, go to <a href="#ALM-19000__li5658542192610">29</a>.</li><li id="ALM-19000__li38219637192610">If no, go to <a href="#ALM-19000__li11937281192610">27</a>.</li></ul>
|
|
</p></li><li id="ALM-19000__li11937281192610"><a name="ALM-19000__li11937281192610"></a><a name="li11937281192610"></a><span>Contact the administrator to restore the network.</span></li><li id="ALM-19000__li29557755192610"><span>In the alarm list, check whether <strong id="ALM-19000__b40326668192610">HBase Service Unavailable</strong> is cleared.</span><p><ul class="subitemlist" id="ALM-19000__ul40133126192610"><li id="ALM-19000__li27395700192610">If yes, no further action is required.</li><li id="ALM-19000__li4459236192610">If no, go to <a href="#ALM-19000__li5658542192610">29</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p class="tableheading" id="ALM-19000__p25653816192610"><strong id="ALM-19000__b32683177192958">Collect fault information.</strong></p>
|
|
<ol start="29" id="ALM-19000__ol5327190919302"><li id="ALM-19000__li5658542192610"><a name="ALM-19000__li5658542192610"></a><a name="li5658542192610"></a><span>On the FusionInsight Manager, choose <strong id="ALM-19000__b474362412519">O&M</strong> > <strong id="ALM-19000__b17891131105114">Log </strong>><strong id="ALM-19000__b58911431125110"> Download</strong>.</span></li><li id="ALM-19000__li29828798192610"><span>Select the following nodes in the required cluster from the <strong id="ALM-19000__b50926879192610">Service</strong> drop-down list:</span><p><ul class="subitemlist" id="ALM-19000__ul33140472192610"><li id="ALM-19000__li31436574192610">ZooKeeper</li><li id="ALM-19000__li14493718192610">HDFS</li><li id="ALM-19000__li63334598192610">HBase</li></ul>
|
|
</p></li><li id="ALM-19000__li1145664103113"><span>Click <span><img id="ALM-19000__image1945644173117" src="en-us_image_0269417419.png"></span> in the upper right corner, and set <strong id="ALM-19000__b6456941173117">Start Date</strong> and <strong id="ALM-19000__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-19000__b13456164113319">Download</strong>.</span></li><li id="ALM-19000__li58928280192610"><span>Contact the <span id="ALM-19000__text4614151421417">O&M personnel</span> and send the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-19000__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-19000__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
|
|
</div>
|
|
<div class="section" id="ALM-19000__sb2dbdd61f4bb4b8888fba00f82ed87ba"><h4 class="sectiontitle">Related Information</h4><p id="ALM-19000__en-us_topic_0070543519_p39937165">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|