Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="ALM-27003"></a>
<h1 class="topictitle1">ALM-27003 DBService Heartbeat Interruption Between the Active and Standby Nodes</h1>
<div id="body63647056"><div class="section" id="ALM-27003__sc60211fd4f9942908b35c800f0914bea"><h4 class="sectiontitle">Description</h4><p id="ALM-27003__p1151317503372">This alarm is generated when the active or standby DBService node does not receive heartbeat messages from the peer node for 7 seconds.</p>
<p id="ALM-27003__en-us_topic_0070543555_p36719502">This alarm is cleared when the heartbeat recovers.</p>
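<p>The alarm condition above can be sketched as a simple timeout check. This is only an illustration and not DBService's actual implementation: the function name, the timestamp arguments, and the choice of "greater than or equal to" at exactly 7 seconds are assumptions; only the 7-second threshold comes from the description.</p>

```python
HEARTBEAT_TIMEOUT_SECONDS = 7  # threshold stated in the alarm description


def heartbeat_interrupted(last_heartbeat_ts, now_ts, timeout=HEARTBEAT_TIMEOUT_SECONDS):
    """Return True when no heartbeat has been received for `timeout` seconds.

    Both timestamps are seconds on the same clock (hypothetical inputs).
    """
    return (now_ts - last_heartbeat_ts) >= timeout
```

<p>Under these assumptions, a node that last heard from its peer 7 or more seconds ago would report the interruption, and the condition stops holding as soon as a newer heartbeat timestamp is recorded.</p>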
</div>
<div class="section" id="ALM-27003__sdc48d53428c1434bb8d4f5b8208748b3"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-27003__en-us_topic_0070543555_table21489667" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-27003__en-us_topic_0070543555_row38220454"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-27003__en-us_topic_0070543555_p8849102">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-27003__en-us_topic_0070543555_p45688646">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-27003__en-us_topic_0070543555_p9792822">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-27003__en-us_topic_0070543555_row55021127"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-27003__en-us_topic_0070543555_p27526304">27003</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-27003__en-us_topic_0070543555_p15038186">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-27003__en-us_topic_0070543555_p10133581">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-27003__s8148aa065fae4679912a009469c3a6cf"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-27003__en-us_topic_0070543555_table15513712" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-27003__en-us_topic_0070543555_row33194707"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-27003__en-us_topic_0070543555_p4416714">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-27003__en-us_topic_0070543555_p22209574">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-27003__row14649182737"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-27003__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-27003__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-27003__en-us_topic_0070543555_row54145103"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-27003__en-us_topic_0070543555_p23677228">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-27003__en-us_topic_0070543555_p38807334">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-27003__en-us_topic_0070543555_row13721689"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-27003__en-us_topic_0070543555_p37715016">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-27003__en-us_topic_0070543555_p35017480">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-27003__en-us_topic_0070543555_row46721870"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-27003__en-us_topic_0070543555_p26375114">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-27003__en-us_topic_0070543555_p56009450">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-27003__en-us_topic_0070543555_row34323003"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-27003__en-us_topic_0070543555_p28699853">Local DBService HA Name</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-27003__en-us_topic_0070543555_p42986743">Specifies the HA name of the local DBService.</p>
</td>
</tr>
<tr id="ALM-27003__en-us_topic_0070543555_row51336373"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-27003__en-us_topic_0070543555_p64605559">Peer DBService HA Name</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-27003__en-us_topic_0070543555_p65667814">Specifies the HA name of the peer DBService.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-27003__sdd99b6c33f874003991c7a6c14c5404c"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-27003__en-us_topic_0070543555_p17492713">While the DBService heartbeat is interrupted, only one node can provide the service. If that node becomes faulty, no standby node is available for failover and the service becomes unavailable.</p>
</div>
<div class="section" id="ALM-27003__se3e2e46e22654dffbe0e3a0151221382"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-27003__en-us_topic_0070543555_p7623657">The link between the active and standby DBService nodes is abnormal.</p>
</div>
<div class="section" id="ALM-27003__s9f57557a0a614e6aae4f2818aacdaba7"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-27003__en-us_topic_0070543555_p13536461"><strong id="ALM-27003__b62881565195332">Check whether the network between the active DBService server and the standby DBService server is normal.</strong></p>
<ol id="ALM-27003__ol13279871195410"><li id="ALM-27003__li65465089195327"><span>In the real-time alarm list on FusionInsight Manager, click <span><img id="ALM-27003__image168221113135319" src="en-us_image_0269417465.png"></span> in the row containing the alarm and view the standby DBService server address.</span></li><li id="ALM-27003__li9648113195327"><span>Log in to the active DBService server as user <strong id="ALM-27003__b52314895195327">root</strong>. <span id="ALM-27003__text389810521903"></span></span></li></ol><ol start="3" id="ALM-27003__ol19784628195430"><li id="ALM-27003__li15666119195327"><span>Run the <strong id="ALM-27003__b19724160195327">ping </strong><em id="ALM-27003__i43299712195327">standby DBService heartbeat IP address</em> command to check whether the standby DBService server is reachable.</span><p><ul class="subitemlist" id="ALM-27003__ul24110301195327"><li id="ALM-27003__li17615779195327">If yes, go to <a href="#ALM-27003__li45543702195327">6</a>.</li><li id="ALM-27003__li17592003195327">If no, go to <a href="#ALM-27003__li25387710195327">4</a>.</li></ul>
</p></li><li id="ALM-27003__li25387710195327"><a name="ALM-27003__li25387710195327"></a><a name="li25387710195327"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-27003__ul40103558195327"><li id="ALM-27003__li60996120195327">If yes, go to <a href="#ALM-27003__li34675550195327">5</a>.</li><li id="ALM-27003__li41738653195327">If no, go to <a href="#ALM-27003__li45543702195327">6</a>.</li></ul>
</p></li><li id="ALM-27003__li34675550195327"><a name="ALM-27003__li34675550195327"></a><a name="li34675550195327"></a><span>Rectify the network fault and check whether the alarm is cleared from the alarm list.</span><p><ul class="subitemlist" id="ALM-27003__ul4570615195327"><li id="ALM-27003__li27162805195327">If yes, no further action is required.</li><li id="ALM-27003__li52703629195327">If no, go to <a href="#ALM-27003__li45543702195327">6</a>.</li></ul>
</p></li></ol>
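<p>The reachability check in step 3 could be scripted roughly as follows. This is a minimal sketch, assuming a Linux host with the iputils <strong>ping</strong> command, where <strong>-c</strong> sets the packet count and <strong>-W</strong> the reply timeout in seconds. The function names, the default count and timeout, and the address 192.0.2.10 are hypothetical placeholders and are not part of the product.</p>

```python
import subprocess


def build_ping_command(heartbeat_ip, count=3, wait_seconds=2):
    """Build a Linux (iputils) ping command: -c is the packet count, -W the reply timeout."""
    return ["ping", "-c", str(count), "-W", str(wait_seconds), heartbeat_ip]


def standby_reachable(heartbeat_ip, run=subprocess.run):
    """Return True when ping exits with status 0, meaning at least one reply arrived."""
    result = run(
        build_ping_command(heartbeat_ip),
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def next_step(reachable):
    """Map the ping result to the next step number of this procedure."""
    return 6 if reachable else 4
```

<p>For example, <strong>next_step(standby_reachable("192.0.2.10"))</strong> would point to step 6 when the standby heartbeat address answers and to step 4 otherwise. Replace the placeholder address with the standby DBService heartbeat IP address obtained in step 1.</p>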
<p class="tableheading" id="ALM-27003__p41135541195327"><strong id="ALM-27003__b32447151195424">Collect fault information.</strong></p>
<ol start="6" id="ALM-27003__ol19163183195436"><li id="ALM-27003__li45543702195327"><a name="ALM-27003__li45543702195327"></a><a name="li45543702195327"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-27003__b19668195811185">O&amp;M</strong> &gt; <strong id="ALM-27003__b57256194195327">Log &gt; Download</strong>.</span></li><li id="ALM-27003__li19811214195327"><span>From <strong id="ALM-27003__b7240138195327">Service</strong>, select the following items in the required cluster:</span><p><ul class="subitemlist" id="ALM-27003__ul39483948195327"><li id="ALM-27003__li49580319195327">DBService</li><li id="ALM-27003__li43569689195327">Controller</li><li id="ALM-27003__li56582888195327">NodeAgent</li></ul>
</p></li><li id="ALM-27003__li1682465602619"><span>Click <span><img id="ALM-27003__image38241856192612" src="en-us_image_0269417466.png"></span> in the upper right corner, and set <strong id="ALM-27003__b118261256132619">Start Date</strong> and <strong id="ALM-27003__b5826135682618">End Date</strong> for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-27003__b17827135619267">Download</strong>.</span></li><li id="ALM-27003__li50209338195327"><span>Contact the <span id="ALM-27003__text4614151421417">O&amp;M personnel</span> and send the collected logs.</span></li></ol>
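<p>The log collection window from step 8 can be worked out as shown below. This is a small illustrative helper, not a FusionInsight Manager API: the function name and the sample alarm time are assumptions, and only the 10-minute margin comes from the procedure.</p>

```python
from datetime import datetime, timedelta


def log_collection_window(alarm_time, margin_minutes=10):
    """Return (start, end): margin_minutes before and after the alarm generation time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time, used only to illustrate the calculation.
start_date, end_date = log_collection_window(datetime(2024, 1, 1, 12, 0, 0))
```

<p>With the sample alarm time of 12:00:00, <strong>Start Date</strong> would be 11:50:00 and <strong>End Date</strong> would be 12:10:00 on the same day.</p>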
</div>
<div class="section" id="ALM-27003__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-27003__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-27003__sd7aeb315b1fe44f787f1adacc0abc2c0"><h4 class="sectiontitle">Related Information</h4><p id="ALM-27003__en-us_topic_0070543555_p33813517">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>