doc-exports/docs/mrs/umn/ALM-12076.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

103 lines
10 KiB
HTML

<a name="ALM-12076"></a><a name="ALM-12076"></a>
<h1 class="topictitle1">ALM-12076 GaussDB Resource Is Abnormal</h1>
<div id="body1547193420658"><div class="section" id="ALM-12076__section18454155121012"><h4 class="sectiontitle">Description</h4><p id="ALM-12076__p1861865115102">HA checks the Manager database every 10 seconds. This alarm is generated when HA detects that the database is abnormal for 3 consecutive times.</p>
<p id="ALM-12076__p116181151191012">This alarm is cleared when the database is normal.</p>
</div>
<div class="section" id="ALM-12076__section104556519109"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12076__table1745695115106" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12076__row156185518109"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12076__p761815161017">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-12076__p761814513103">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-12076__p56181851141015">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-12076__row1461895131012"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-12076__p8618155113108">12076</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-12076__p8618151141019">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-12076__p661855111105">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-12076__section20462351131010"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12076__table94638515104" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12076__row561995111013"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-12076__p6619145141011">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-12076__p1661975131018">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-12076__row22241547133919"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12076__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12076__p692551319435">Specifies the cluster or system for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12076__row1861905121017"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12076__p961935181016">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12076__p10619105171019">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12076__row16619105191018"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12076__p261913519105">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12076__p13619125191013">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12076__row1761965131017"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12076__p8619185110109">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12076__p1561911512102">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-12076__section2469251131017"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12076__p1061915113105">If databases are abnormal, all core services and related service processes, such as alarms and monitoring functions, are affected.</p>
</div>
<div class="section" id="ALM-12076__section3471175116104"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12076__p164401942121114">An exception occurs in the database.</p>
</div>
<div class="section" id="ALM-12076__section1747318511104"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12076__p76201851101014"><strong id="ALM-12076__b06204519105">Check the database status of the active and standby management nodes.</strong></p>
<ol id="ALM-12076__ol1520153541220"><li id="ALM-12076__li1551973519125"><span>Log in to the active and standby management nodes respectively as user <strong id="ALM-12076__b113633226399">root</strong>. <span id="ALM-12076__text985593916354"></span>Run the <strong id="ALM-12076__b55181135141216">su - ommdba </strong>command to switch to user <strong id="ALM-12076__b6518143551210">ommdba</strong>, and then run the <strong id="ALM-12076__b1551883514127">gs_ctl query</strong> command to check whether the following information is displayed in the command output. <span id="ALM-12076__text208713505396"></span></span><p><p id="ALM-12076__p65187356126">Command output of the active management node:</p>
<pre class="screen" id="ALM-12076__screen1751833519122"> Ha state:
LOCAL_ROLE: Primary
STATIC_CONNECTIONS : 1
DB_STATE : Normal
DETAIL_INFORMATION : user/password invalid
Senders info:
No information
Receiver info:
No information </pre>
<p id="ALM-12076__p851813355122">Command output of the standby management node:</p>
<pre class="screen" id="ALM-12076__screen19518335121212"> Ha state:
LOCAL_ROLE: Standby
STATIC_CONNECTIONS : 1
DB_STATE : Normal
DETAIL_INFORMATION : user/password invalid
Senders info:
No information
Receiver info:
No information</pre>
<ul id="ALM-12076__ul16519163571219"><li id="ALM-12076__li5658142616138">If it is, go to <a href="#ALM-12076__li251973518126">3</a>.</li><li id="ALM-12076__li051920357125">If it is not, go to <a href="#ALM-12076__li1051911355122">2</a>.</li></ul>
</p></li><li id="ALM-12076__li1051911355122"><a name="ALM-12076__li1051911355122"></a><a name="li1051911355122"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul id="ALM-12076__ul1551911353128"><li id="ALM-12076__li15519935141218">If it is, go to <a href="#ALM-12076__li251973518126">3</a>.</li><li id="ALM-12076__li85199356121">If it is not, go to <a href="#ALM-12076__li151723519124">5</a>.</li></ul>
</p></li><li id="ALM-12076__li251973518126"><a name="ALM-12076__li251973518126"></a><a name="li251973518126"></a><span>Five minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12076__ul105190350128"><li id="ALM-12076__li65197352129">If it is, no further action is required.</li><li id="ALM-12076__li175191235161218">If it is not, go to <a href="#ALM-12076__li85203358122">4</a>.</li></ul>
</p></li><li id="ALM-12076__li85203358122"><a name="ALM-12076__li85203358122"></a><a name="li85203358122"></a><span>Log in to the active and standby management nodes, run the <strong id="ALM-12076__b4519163518121">su -omm</strong> command to switch to user <strong id="ALM-12076__b4519935181219">omm</strong>, go to the <strong id="ALM-12076__b13519193510124">${BIGDATA_HOME} /om-server/om/sbin/</strong> directory, and run the <strong id="ALM-12076__b10519835161213">status-oms.sh</strong> script to check whether the floating IP addresses and GaussDB resources of the active and standby FusionInsight Managers are in the status shown in the following figure.</span><p><p id="ALM-12076__p25191635111211"><span><img id="ALM-12076__image1051903591215" src="en-us_image_0269383921.jpg"></span></p>
<ul id="ALM-12076__ul19520203561215"><li id="ALM-12076__li9519133518128">If they are, find the alarm in the alarm list and manually clear the alarm.</li><li id="ALM-12076__li85191435201215">If they are not, go to <a href="#ALM-12076__li151723519124">5</a>.</li></ul>
</p></li></ol>
<p id="ALM-12076__p763032816122"><strong id="ALM-12076__b662011517104">Collect fault information.</strong></p>
<ol start="5" id="ALM-12076__ol14517835201211"><li id="ALM-12076__li151723519124"><a name="ALM-12076__li151723519124"></a><a name="li151723519124"></a><span>On FusionInsight Manager, choose <strong id="ALM-12076__b25156350129">O&amp;M</strong> &gt; <strong id="ALM-12076__b4515183511218">Log</strong> &gt; <strong id="ALM-12076__b45171135131220">Download</strong>.</span></li><li id="ALM-12076__li1951733518129"><span>Select <strong id="ALM-12076__b651714354127">OmmServer</strong> for <strong id="ALM-12076__b155172035181215">Service</strong> and click <strong id="ALM-12076__b3991118545">OK</strong>.</span></li><li id="ALM-12076__li851710358124"><span>Click <span><img id="ALM-12076__image1851713359128" src="en-us_image_0269383922.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12076__b9517153581213">Start Date</strong> and <strong id="ALM-12076__b451703561216">End Date</strong> to 10 minutes before and after the alarm generation time respectively and click <strong id="ALM-12076__b75171735171214">OK</strong>. Then, click <strong id="ALM-12076__b651783541215">Download</strong>.</span></li><li id="ALM-12076__li495644512588"><span>Contact the <span id="ALM-12076__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div>
<div class="section" id="ALM-12076__section2512351141014"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12076__p562218511100">This alarm will be automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-12076__section11513205151015"><h4 class="sectiontitle">Related Information</h4><p id="ALM-12076__p19622951111010">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>