doc-exports/docs/mrs/umn/ALM-24000.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

79 lines
7.2 KiB
HTML

<a name="ALM-24000"></a><a name="ALM-24000"></a>
<h1 class="topictitle1">ALM-24000 Flume Service Unavailable</h1>
<div id="body37238517"><div class="section" id="ALM-24000__section62655484"><h4 class="sectiontitle">Description</h4><p id="ALM-24000__p63529868">The alarm module checks the Flume service status every 180 seconds. This alarm is generated if the Flume service is abnormal.</p>
<p id="ALM-24000__p34897906">This alarm is automatically cleared after the Flume service recovers.</p>
</div>
<div class="section" id="ALM-24000__section27028451"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-24000__table8158160" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-24000__row12642219"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-24000__p17386810">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-24000__p66154394">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-24000__p56905715">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-24000__row45960172"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-24000__p31786413">24000</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-24000__p24562655">Critical</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-24000__p43418025">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-24000__section41929471"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-24000__table27199156" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-24000__row33667339"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-24000__p42699947">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-24000__p36143663">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-24000__row1613632821611"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-24000__p13858113752316">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-24000__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-24000__row41955584"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-24000__p39123317">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-24000__p57135782">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-24000__row44459997"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-24000__p37226997">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-24000__p46923229">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-24000__row19655878"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-24000__p66118565">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-24000__p46093362">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-24000__section41820921"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-24000__p42574855">Flume cannot work and data transmission is interrupted.</p>
</div>
<div class="section" id="ALM-24000__section40843970"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-24000__p119950719618">All Flume instances are faulty.</p>
</div>
<div class="section" id="ALM-24000__section32051411"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-24000__ol21712978111417"><li id="ALM-24000__li17837656154120"><span>Log in to a Flume node as user <strong id="ALM-24000__b30785590235943">omm</strong> and run the <strong id="ALM-24000__b33655745635943">ps -ef|grep "flume.role=server"</strong> command to check whether the Flume process exists on the node.</span><p><ul class="subitemlist" id="ALM-24000__ul1997718810221"><li id="ALM-24000__li19780822210">If yes, go to <a href="#ALM-24000__li22384958105055">3</a>.</li><li class="subitemlist" id="ALM-24000__li497817822219">If no, restart the faulty Flume node or Flume service and go to <a href="#ALM-24000__li62139541105055">2</a>.</li></ul>
</p></li><li id="ALM-24000__li62139541105055"><a name="ALM-24000__li62139541105055"></a><a name="li62139541105055"></a><span>In the alarm list, check whether alarm "Flume Service Unavailable" is cleared.</span><p><ul class="subitemlist" id="ALM-24000__ul44677893105055"><li id="ALM-24000__li8714555105055">If yes, no further action is required.</li><li id="ALM-24000__li34790372105055">If no, go to <a href="#ALM-24000__li22384958105055">3</a>.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-24000__p66556717105055"><strong id="ALM-24000__b53307902105430">Collect the fault information.</strong></p>
<ol start="3" id="ALM-24000__ol31283219105433"><li id="ALM-24000__li22384958105055"><a name="ALM-24000__li22384958105055"></a><a name="li22384958105055"></a><span>On FusionInsight Manager, choose <strong id="ALM-24000__b1456215616541">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-24000__b6568116145418">Log</strong> &gt; <strong id="ALM-24000__b1356856155410">Download</strong>.</span></li><li id="ALM-24000__li138033105055"><span>Expand the <strong id="ALM-24000__b1488943692015">Service</strong> drop-down list, and select <strong id="ALM-24000__b209008361209">Flume</strong> for the target cluster.</span></li><li id="ALM-24000__li1242304105055"><span>Click <span><img id="ALM-24000__image104601319175315" src="en-us_image_0263895532.png"></span> in the upper right corner, and set <strong id="ALM-24000__b14135758735943">Start Date</strong> and <strong id="ALM-24000__b118588374235943">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-24000__b53191441835943">Download</strong>.</span></li><li id="ALM-24000__li33517786105055"><span>Contact <span id="ALM-24000__text107380413218">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div>
<div class="section" id="ALM-24000__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-24000__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-24000__section20027245"><h4 class="sectiontitle">Related Information</h4><p id="ALM-24000__p16257720">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>