1
0
forked from docs/doc-exports

MRS UMN 20231220 version update

Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
This commit is contained in:
Yang, Tong 2024-05-16 09:40:21 +00:00 committed by zuul
parent ccbf63b495
commit 2195db241c
1040 changed files with 5599 additions and 5409 deletions

File diff suppressed because it is too large Load Diff

View File

@ -59,7 +59,7 @@
<div class="section" id="ALM-12001__s32c81d1592824ca0b8dbb3a21428f59d"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12001__en-us_topic_0070543614_ul65599284"><li id="ALM-12001__en-us_topic_0070543614_li53522652">The network connection is abnormal.</li><li id="ALM-12001__en-us_topic_0070543614_li11941827">The username, password, or dump directory of the dump server does not meet the configuration conditions.</li><li id="ALM-12001__en-us_topic_0070543614_li40367587">The disk space of the dump directory is insufficient.</li></ul> <div class="section" id="ALM-12001__s32c81d1592824ca0b8dbb3a21428f59d"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12001__en-us_topic_0070543614_ul65599284"><li id="ALM-12001__en-us_topic_0070543614_li53522652">The network connection is abnormal.</li><li id="ALM-12001__en-us_topic_0070543614_li11941827">The username, password, or dump directory of the dump server does not meet the configuration conditions.</li><li id="ALM-12001__en-us_topic_0070543614_li40367587">The disk space of the dump directory is insufficient.</li></ul>
</div> </div>
<div class="section" id="ALM-12001__s3a2cd89f53084ce98c69427e4cf85a18"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12001__en-us_topic_0070543614_p48549077"><strong id="ALM-12001__b2009504815379">Check whether the network connection is normal.</strong></p> <div class="section" id="ALM-12001__s3a2cd89f53084ce98c69427e4cf85a18"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12001__en-us_topic_0070543614_p48549077"><strong id="ALM-12001__b2009504815379">Check whether the network connection is normal.</strong></p>
<ol id="ALM-12001__ol38120521153659"><li id="ALM-12001__li28892250153659"><span>On the FusionInsight Manager home page, choose <strong id="ALM-12001__b40492952153659">Audit &gt; Configurations</strong>.</span></li><li id="ALM-12001__li58703659153659"><span>Check whether the SFTP IP on the dump configuration page is valid.</span><p><div class="litext" id="ALM-12001__p44686402153726">Log in to the node where Manager is located as user <strong id="ALM-12001__b58570884153659">root</strong> and run the <strong id="ALM-12001__b57375910153659">ping</strong> command to check whether the network connection between the SFTP server and the cluster is normal. <span id="ALM-12001__text187511520308"></span><span id="ALM-12001__text325002212305"></span><ul class="subitemlist" id="ALM-12001__ul66251821153659"><li id="ALM-12001__li16937138153659">If yes, go to <a href="#ALM-12001__li33093593154533">5</a>.</li><li id="ALM-12001__li29730934153659">If no, go to <a href="#ALM-12001__li64797305153659">3</a>.</li></ul> <ol id="ALM-12001__ol38120521153659"><li id="ALM-12001__li28892250153659"><span>On the <span id="ALM-12001__text67509419010">MRS</span> Manager home page, choose <strong id="ALM-12001__b40492952153659">Audit &gt; Configurations</strong>.</span></li><li id="ALM-12001__li58703659153659"><span>Check whether the SFTP IP on the dump configuration page is valid.</span><p><div class="litext" id="ALM-12001__p44686402153726">Log in to the node where Manager is located as user <strong id="ALM-12001__b58570884153659">root</strong> and run the <strong id="ALM-12001__b57375910153659">ping</strong> command to check whether the network connection between the SFTP server and the cluster is normal. <span id="ALM-12001__text187511520308"></span><span id="ALM-12001__text325002212305"></span><ul class="subitemlist" id="ALM-12001__ul66251821153659"><li id="ALM-12001__li16937138153659">If yes, go to <a href="#ALM-12001__li33093593154533">5</a>.</li><li id="ALM-12001__li29730934153659">If no, go to <a href="#ALM-12001__li64797305153659">3</a>.</li></ul>
</div> </div>
</p></li><li id="ALM-12001__li64797305153659"><a name="ALM-12001__li64797305153659"></a><a name="li64797305153659"></a><span>Repair the network connection, reset the SFTP password, and click <strong id="ALM-12001__b59395483153659">OK</strong>.</span></li><li id="ALM-12001__li4235613153659"><span>Wait for 2 minutes and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12001__ul470623153659"><li id="ALM-12001__li46304841153659">If yes, no further action is required.</li><li id="ALM-12001__li59704615153659">If no, go to <a href="#ALM-12001__li33093593154533">5</a>.</li></ul> </p></li><li id="ALM-12001__li64797305153659"><a name="ALM-12001__li64797305153659"></a><a name="li64797305153659"></a><span>Repair the network connection, reset the SFTP password, and click <strong id="ALM-12001__b59395483153659">OK</strong>.</span></li><li id="ALM-12001__li4235613153659"><span>Wait for 2 minutes and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12001__ul470623153659"><li id="ALM-12001__li46304841153659">If yes, no further action is required.</li><li id="ALM-12001__li59704615153659">If no, go to <a href="#ALM-12001__li33093593154533">5</a>.</li></ul>
</p></li></ol> </p></li></ol>
@ -72,10 +72,10 @@
</p></li><li id="ALM-12001__li61877356154547"><a name="ALM-12001__li61877356154547"></a><a name="li61877356154547"></a><span>Expand disk space capacity for the third-party server, Reset the SFTP password and click <strong id="ALM-12001__b36701423154547">OK</strong></span></li><li id="ALM-12001__li53906996154547"><span>Wait for 2 minutes, view real-time alarms and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12001__ul35815828154547"><li id="ALM-12001__li20025293154547">If yes, no further action is required.</li><li id="ALM-12001__li11436076154547">If no, go to <a href="#ALM-12001__li37575023154554">11</a>.</li></ul> </p></li><li id="ALM-12001__li61877356154547"><a name="ALM-12001__li61877356154547"></a><a name="li61877356154547"></a><span>Expand disk space capacity for the third-party server, Reset the SFTP password and click <strong id="ALM-12001__b36701423154547">OK</strong></span></li><li id="ALM-12001__li53906996154547"><span>Wait for 2 minutes, view real-time alarms and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12001__ul35815828154547"><li id="ALM-12001__li20025293154547">If yes, no further action is required.</li><li id="ALM-12001__li11436076154547">If no, go to <a href="#ALM-12001__li37575023154554">11</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12001__en-us_topic_0070543614_p43960067"><strong id="ALM-12001__b4787357154551">Reset the dump rule.</strong></p> <p class="tableheading" id="ALM-12001__en-us_topic_0070543614_p43960067"><strong id="ALM-12001__b4787357154551">Reset the dump rule.</strong></p>
<ol start="11" id="ALM-12001__ol38224750154621"><li id="ALM-12001__li37575023154554"><a name="ALM-12001__li37575023154554"></a><a name="li37575023154554"></a><span>On the FusionInsight Manager home page, choose <strong id="ALM-12001__b41457704154554">Audit &gt; Configurations</strong>.</span></li><li id="ALM-12001__li23678021154554"><span>Reset dump rules, set the parameters properly, and click <strong id="ALM-12001__b2630891154554">OK</strong>.</span></li><li id="ALM-12001__li17396949154554"><span>Wait for 2 minutes, view real-time alarms and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12001__ul61585317154554"><li id="ALM-12001__li11775598154554">If yes, no further action is required.</li><li id="ALM-12001__li14299353154554">If no, go to <a href="#ALM-12001__li5991045915463">14</a>.</li></ul> <ol start="11" id="ALM-12001__ol38224750154621"><li id="ALM-12001__li37575023154554"><a name="ALM-12001__li37575023154554"></a><a name="li37575023154554"></a><span>On the <span id="ALM-12001__text194391642132610">MRS</span> Manager home page, choose <strong id="ALM-12001__b41457704154554">Audit &gt; Configurations</strong>.</span></li><li id="ALM-12001__li23678021154554"><span>Reset dump rules, set the parameters properly, and click <strong id="ALM-12001__b2630891154554">OK</strong>.</span></li><li id="ALM-12001__li17396949154554"><span>Wait for 2 minutes, view real-time alarms and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12001__ul61585317154554"><li id="ALM-12001__li11775598154554">If yes, no further action is required.</li><li id="ALM-12001__li14299353154554">If no, go to <a href="#ALM-12001__li5991045915463">14</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12001__p28445835153631"><strong id="ALM-12001__b57966164154559">Collect fault information.</strong></p> <p id="ALM-12001__p28445835153631"><strong id="ALM-12001__b57966164154559">Collect fault information.</strong></p>
<ol start="14" id="ALM-12001__ol17392131154624"><li id="ALM-12001__li5991045915463"><a name="ALM-12001__li5991045915463"></a><a name="li5991045915463"></a><span>On the FusionInsight Manager, choose <strong id="ALM-12001__b5263123115415">O&amp;M</strong> &gt; <strong id="ALM-12001__b2156979815463">Log &gt; Download</strong>.</span></li><li id="ALM-12001__li5396317115463"><span>Select <strong id="ALM-12001__b20461631242">OmmServer</strong> from the <strong id="ALM-12001__b63941092411">Service</strong> and click <strong id="ALM-12001__b3991118545">OK</strong>.</span></li><li id="ALM-12001__li1145664103113"><span>Click <span><img id="ALM-12001__image1945644173117" src="en-us_image_0000001582807597.png"></span> in the upper right corner, and set <strong id="ALM-12001__b6456941173117">Start Date</strong> and <strong id="ALM-12001__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12001__b13456164113319">Download</strong>.</span></li><li id="ALM-12001__li495644512588"><span>Contact the <span id="ALM-12001__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="14" id="ALM-12001__ol17392131154624"><li id="ALM-12001__li5991045915463"><a name="ALM-12001__li5991045915463"></a><a name="li5991045915463"></a><span>On the <span id="ALM-12001__text514544414261">MRS</span> Manager, choose <strong id="ALM-12001__b5263123115415">O&amp;M</strong> &gt; <strong id="ALM-12001__b2156979815463">Log &gt; Download</strong>.</span></li><li id="ALM-12001__li5396317115463"><span>Select <strong id="ALM-12001__b20461631242">OmmServer</strong> from the <strong id="ALM-12001__b63941092411">Service</strong> and click <strong id="ALM-12001__b3991118545">OK</strong>.</span></li><li id="ALM-12001__li1145664103113"><span>Click <span><img id="ALM-12001__image1945644173117" src="en-us_image_0000001582807597.png"></span> in the upper right corner, and set <strong id="ALM-12001__b6456941173117">Start Date</strong> and <strong id="ALM-12001__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12001__b13456164113319">Download</strong>.</span></li><li id="ALM-12001__li495644512588"><span>Contact the <span id="ALM-12001__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12001__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12001__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12001__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12001__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -60,7 +60,7 @@
<div class="section" id="ALM-12004__s2055fade2c7e40f2a441dbfc0e19bfd1"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12004__a09b163110d984d42bbb5cfbfcf38a2ea">The LdapServer process in the Manager is abnormal.</p> <div class="section" id="ALM-12004__s2055fade2c7e40f2a441dbfc0e19bfd1"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12004__a09b163110d984d42bbb5cfbfcf38a2ea">The LdapServer process in the Manager is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12004__s7f2ec925ce1940fbabe11f029654bc7b"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12004__en-us_topic_0070546150_p58223301"><strong id="ALM-12004__a09a6fcf99d72473783393ce6884de55d">Check whether the LdapServer process in the Manager is normal.</strong></p> <div class="section" id="ALM-12004__s7f2ec925ce1940fbabe11f029654bc7b"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12004__en-us_topic_0070546150_p58223301"><strong id="ALM-12004__a09a6fcf99d72473783393ce6884de55d">Check whether the LdapServer process in the Manager is normal.</strong></p>
<ol id="ALM-12004__oeee7008874234273afbf9f549eb46324"><li id="ALM-12004__l2b9a8ef7e6084db985420f5c44c16922"><span>Log in the Manager node in the cluster as user <strong id="ALM-12004__ada87ef7d3aaa449aa0eed17ccc419567">omm</strong>.</span><p><p class="litext" id="ALM-12004__a0db40b4c1cf54723b74339e6ea159597">Log in to FusionInsight Manager using the floating IP address, and run the <strong id="ALM-12004__a19c8a973f73044ff98879fae4c7a74b8">sh ${BIGDATA_HOME}/om-server/om/sbin/status-oms.sh</strong> command to check the information about the current Manager two-node cluster.</p> <ol id="ALM-12004__oeee7008874234273afbf9f549eb46324"><li id="ALM-12004__l2b9a8ef7e6084db985420f5c44c16922"><span>Log in the Manager node in the cluster as user <strong id="ALM-12004__ada87ef7d3aaa449aa0eed17ccc419567">omm</strong>.</span><p><p class="litext" id="ALM-12004__a0db40b4c1cf54723b74339e6ea159597">Log in to <span id="ALM-12004__text67509419010">MRS</span> Manager using the floating IP address, and run the <strong id="ALM-12004__a19c8a973f73044ff98879fae4c7a74b8">sh ${BIGDATA_HOME}/om-server/om/sbin/status-oms.sh</strong> command to check the information about the current Manager two-node cluster.</p>
</p></li><li id="ALM-12004__lbc17079424494ad69f2dd0257acee2cd"><span>Run <strong id="ALM-12004__aecdb043853624e46b48efdd0222e0784">ps -ef | grep slapd</strong> command to check whether the LdapServer resource process in the <strong id="ALM-12004__accfc52122caf4e08aadc9544e71c1b2b">${BIGDATA_HOME}/om-server/om/</strong> in the process configuration file is running properly.</span><p><div class="note" id="ALM-12004__ndfac406ca6354edebc30d3508da5c0d4"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12004__acd7b61dbeba848bd8fc6e70ca411da8f">You can determine that the resource is normal by checking the following information:</p> </p></li><li id="ALM-12004__lbc17079424494ad69f2dd0257acee2cd"><span>Run <strong id="ALM-12004__aecdb043853624e46b48efdd0222e0784">ps -ef | grep slapd</strong> command to check whether the LdapServer resource process in the <strong id="ALM-12004__accfc52122caf4e08aadc9544e71c1b2b">${BIGDATA_HOME}/om-server/om/</strong> in the process configuration file is running properly.</span><p><div class="note" id="ALM-12004__ndfac406ca6354edebc30d3508da5c0d4"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12004__acd7b61dbeba848bd8fc6e70ca411da8f">You can determine that the resource is normal by checking the following information:</p>
<ol type="a" id="ALM-12004__ol6348204419565"><li id="ALM-12004__li1234914413567">After the <strong id="ALM-12004__b7350134413568">sh ${BIGDATA_HOME}/om-server/om/sbin/status-oms.sh</strong> command runs, <strong id="ALM-12004__b193511144175615">ResHAStatus</strong> of the OLdap is <strong id="ALM-12004__b7351144455613">Normal</strong>.</li><li id="ALM-12004__li1035119448564">After the <strong id="ALM-12004__b10352114418567">ps -ef | grep slapd</strong> command runs, the slapd process of port 21750 can be viewed.<ul id="ALM-12004__ul1735384414561"><li id="ALM-12004__li43551544175617">If yes, go to <a href="#ALM-12004__l6ef892f9c8f749aa9e6871e1a63797b1">3</a>.</li><li id="ALM-12004__li935754420569">If no, go to <a href="#ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde">4</a>.</li></ul> <ol type="a" id="ALM-12004__ol6348204419565"><li id="ALM-12004__li1234914413567">After the <strong id="ALM-12004__b7350134413568">sh ${BIGDATA_HOME}/om-server/om/sbin/status-oms.sh</strong> command runs, <strong id="ALM-12004__b193511144175615">ResHAStatus</strong> of the OLdap is <strong id="ALM-12004__b7351144455613">Normal</strong>.</li><li id="ALM-12004__li1035119448564">After the <strong id="ALM-12004__b10352114418567">ps -ef | grep slapd</strong> command runs, the slapd process of port 21750 can be viewed.<ul id="ALM-12004__ul1735384414561"><li id="ALM-12004__li43551544175617">If yes, go to <a href="#ALM-12004__l6ef892f9c8f749aa9e6871e1a63797b1">3</a>.</li><li id="ALM-12004__li935754420569">If no, go to <a href="#ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde">4</a>.</li></ul>
</li></ol> </li></ol>
@ -68,7 +68,7 @@
</p></li><li id="ALM-12004__l6ef892f9c8f749aa9e6871e1a63797b1"><a name="ALM-12004__l6ef892f9c8f749aa9e6871e1a63797b1"></a><a name="l6ef892f9c8f749aa9e6871e1a63797b1"></a><span>Run the <strong id="ALM-12004__ac81351982ea44a3080848652eb80641f">kill -2</strong> <em id="ALM-12004__adf6ccee5cb6e4773b82ca5f68a8d4218">ldap pid</em> command to restart the LdapServer process and wait for 20 seconds. The HA starts the OLdap process automatically. Check whether the current OLdap resource is in normal state.</span><p><ul id="ALM-12004__u8057658d3505467190171bde28259d37"><li id="ALM-12004__lf80ef17bc2cc40138da0188b47a8b323">If yes, the operation is complete.</li><li id="ALM-12004__l0cc9afd9cc8b4222b41bcf9983d15d1e">If no, go to <a href="#ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde">4</a>.</li></ul> </p></li><li id="ALM-12004__l6ef892f9c8f749aa9e6871e1a63797b1"><a name="ALM-12004__l6ef892f9c8f749aa9e6871e1a63797b1"></a><a name="l6ef892f9c8f749aa9e6871e1a63797b1"></a><span>Run the <strong id="ALM-12004__ac81351982ea44a3080848652eb80641f">kill -2</strong> <em id="ALM-12004__adf6ccee5cb6e4773b82ca5f68a8d4218">ldap pid</em> command to restart the LdapServer process and wait for 20 seconds. The HA starts the OLdap process automatically. Check whether the current OLdap resource is in normal state.</span><p><ul id="ALM-12004__u8057658d3505467190171bde28259d37"><li id="ALM-12004__lf80ef17bc2cc40138da0188b47a8b323">If yes, the operation is complete.</li><li id="ALM-12004__l0cc9afd9cc8b4222b41bcf9983d15d1e">If no, go to <a href="#ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12004__abb5516fb7b8647a3942c4c5b7f74fded"><strong id="ALM-12004__a760add342117469495c4fbe7e3daf04f">Collect fault information.</strong></p> <p id="ALM-12004__abb5516fb7b8647a3942c4c5b7f74fded"><strong id="ALM-12004__a760add342117469495c4fbe7e3daf04f">Collect fault information.</strong></p>
<ol start="4" id="ALM-12004__o9661752a744349fba78569b7f04fcbcf"><li id="ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde"><a name="ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde"></a><a name="l4b1abbc809ee41c28ade2b2c4cfa6fde"></a><span>On the FusionInsight Manager home page, choose <strong id="ALM-12004__b76841116134212">O&amp;M</strong> &gt; <strong id="ALM-12004__abd8fe9ab79df48fdb7b8bfe92c7768bc">Log &gt; Download</strong>.</span></li><li id="ALM-12004__l19f3de8474a147ef88ac2d40f27fe72e"><span>Select <strong id="ALM-12004__a5ee1ffd31e954215a608adc09390aabe">OmsLdapServer</strong> and <strong id="ALM-12004__afed03600c0b1449aa46a036940dae621">OmmServer</strong> from the <strong id="ALM-12004__a6cf5036ea700402980e42d73cf308a63">Service</strong> and click <strong id="ALM-12004__b3991118545">OK</strong>.</span></li><li id="ALM-12004__li1145664103113"><span>Click <span><img id="ALM-12004__image1945644173117" src="en-us_image_0000001532767626.png"></span> in the upper right corner, and set <strong id="ALM-12004__b6456941173117">Start Date</strong> and <strong id="ALM-12004__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12004__b13456164113319">Download</strong>.</span></li><li id="ALM-12004__li495644512588"><span>Contact the <span id="ALM-12004__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="4" id="ALM-12004__o9661752a744349fba78569b7f04fcbcf"><li id="ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde"><a name="ALM-12004__l4b1abbc809ee41c28ade2b2c4cfa6fde"></a><a name="l4b1abbc809ee41c28ade2b2c4cfa6fde"></a><span>On the <span id="ALM-12004__text20467194712264">MRS</span> Manager home page, choose <strong id="ALM-12004__b76841116134212">O&amp;M</strong> &gt; <strong id="ALM-12004__abd8fe9ab79df48fdb7b8bfe92c7768bc">Log &gt; Download</strong>.</span></li><li id="ALM-12004__l19f3de8474a147ef88ac2d40f27fe72e"><span>Select <strong id="ALM-12004__a5ee1ffd31e954215a608adc09390aabe">OmsLdapServer</strong> and <strong id="ALM-12004__afed03600c0b1449aa46a036940dae621">OmmServer</strong> from the <strong id="ALM-12004__a6cf5036ea700402980e42d73cf308a63">Service</strong> and click <strong id="ALM-12004__b3991118545">OK</strong>.</span></li><li id="ALM-12004__li1145664103113"><span>Click <span><img id="ALM-12004__image1945644173117" src="en-us_image_0000001532767626.png"></span> in the upper right corner, and set <strong id="ALM-12004__b6456941173117">Start Date</strong> and <strong id="ALM-12004__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12004__b13456164113319">Download</strong>.</span></li><li id="ALM-12004__li495644512588"><span>Contact the <span id="ALM-12004__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12004__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12004__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12004__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12004__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -55,17 +55,17 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12005__sb62541a2e6e943b684e2619714ec9325"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12005__en-us_topic_0070543646_p13091062">The component WebUI authentication services are unavailable and cannot provide security authentication functions for web upper-layer services. Users may be unable to log in to FusionInsight Manager and the WebUIs of components.</p> <div class="section" id="ALM-12005__sb62541a2e6e943b684e2619714ec9325"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12005__en-us_topic_0070543646_p13091062">The component WebUI authentication services are unavailable and cannot provide security authentication functions for web upper-layer services. Users may be unable to log in to <span id="ALM-12005__text67509419010">MRS</span> Manager and the WebUIs of components.</p>
</div> </div>
<div class="section" id="ALM-12005__s9e07b149b27f429cb5b27b19fec75063"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12005__en-us_topic_0070543646_p53743093">The OLdap resource on which the Okerberos depends is abnormal.</p> <div class="section" id="ALM-12005__s9e07b149b27f429cb5b27b19fec75063"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12005__en-us_topic_0070543646_p53743093">The OLdap resource on which the Okerberos depends is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12005__sd77225bf1fcd431089a828a7a4601dd6"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12005__en-us_topic_0070543646_p58223301"><strong id="ALM-12005__b65926177164951">Check whether the OLdap resource on which the Okerberos depends is abnormal in the Manager.</strong></p> <div class="section" id="ALM-12005__sd77225bf1fcd431089a828a7a4601dd6"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12005__en-us_topic_0070543646_p58223301"><strong id="ALM-12005__b65926177164951">Check whether the OLdap resource on which the Okerberos depends is abnormal in the Manager.</strong></p>
<ol id="ALM-12005__ol2732064816486"><li id="ALM-12005__li2258678016486"><span>Log in the Manager node in the cluster as user <strong id="ALM-12005__b2487926316486">omm</strong>.</span><p><p class="litext" id="ALM-12005__p144011943114415">Log in to FusionInsight Manager using the floating IP address, and run the <strong id="ALM-12005__b195443516486">sh ${BIGDATA_HOME}/om-server/om/sbin/status-oms.sh</strong> command to check the information about the current Manager two-node cluster.</p> <ol id="ALM-12005__ol2732064816486"><li id="ALM-12005__li2258678016486"><span>Log in the Manager node in the cluster as user <strong id="ALM-12005__b2487926316486">omm</strong>.</span><p><p class="litext" id="ALM-12005__p144011943114415">Log in to <span id="ALM-12005__text14788135082615">MRS</span> Manager using the floating IP address, and run the <strong id="ALM-12005__b195443516486">sh ${BIGDATA_HOME}/om-server/om/sbin/status-oms.sh</strong> command to check the information about the current Manager two-node cluster.</p>
</p></li><li id="ALM-12005__li593131416486"><span>Run the <strong id="ALM-12005__b1758991516486">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the OLdap resource status managed by HA is normal. (In single-node mode, the OLdap resource is in the Active_normal state; in the two-node mode, the OLdap resource is in the Active_normal state on the active node and in the Standby_normal state on the standby node.)</span><p><ul class="subitemlist" id="ALM-12005__ul2302865616486"><li id="ALM-12005__li1549700616486">If yes, go to <a href="#ALM-12005__li34421516164820">4</a>.</li><li id="ALM-12005__li4729798216486">If no, go to <a href="#ALM-12005__li4031832916486">3</a>.</li></ul> </p></li><li id="ALM-12005__li593131416486"><span>Run the <strong id="ALM-12005__b1758991516486">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the OLdap resource status managed by HA is normal. (In single-node mode, the OLdap resource is in the Active_normal state; in the two-node mode, the OLdap resource is in the Active_normal state on the active node and in the Standby_normal state on the standby node.)</span><p><ul class="subitemlist" id="ALM-12005__ul2302865616486"><li id="ALM-12005__li1549700616486">If yes, go to <a href="#ALM-12005__li34421516164820">4</a>.</li><li id="ALM-12005__li4729798216486">If no, go to <a href="#ALM-12005__li4031832916486">3</a>.</li></ul>
</p></li><li id="ALM-12005__li4031832916486"><a name="ALM-12005__li4031832916486"></a><a name="li4031832916486"></a><span>See the procedure in <a href="ALM-12004.html">ALM-12004 OLdap Resource Abnormal</a> to resolve the problem. After the OLdap resource status recovers, check whether the OKerberos resource status is normal.</span><p><ul class="subitemlist" id="ALM-12005__ul6413213716486"><li id="ALM-12005__li1067441916486">If yes, the operation is complete.</li><li id="ALM-12005__li5932157616486">If no, go to <a href="#ALM-12005__li34421516164820">4</a>.</li></ul> </p></li><li id="ALM-12005__li4031832916486"><a name="ALM-12005__li4031832916486"></a><a name="li4031832916486"></a><span>See the procedure in <a href="ALM-12004.html">ALM-12004 OLdap Resource Abnormal</a> to resolve the problem. After the OLdap resource status recovers, check whether the OKerberos resource status is normal.</span><p><ul class="subitemlist" id="ALM-12005__ul6413213716486"><li id="ALM-12005__li1067441916486">If yes, the operation is complete.</li><li id="ALM-12005__li5932157616486">If no, go to <a href="#ALM-12005__li34421516164820">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12005__p59418417164755"><strong id="ALM-12005__b21602359164826">Collect fault information.</strong></p> <p id="ALM-12005__p59418417164755"><strong id="ALM-12005__b21602359164826">Collect fault information.</strong></p>
<ol start="4" id="ALM-12005__ol49138498164822"><li id="ALM-12005__li34421516164820"><a name="ALM-12005__li34421516164820"></a><a name="li34421516164820"></a><span>On the FusionInsight Manager home page, choose <strong id="ALM-12005__b87862548435">O&amp;M</strong> &gt; <strong id="ALM-12005__b11281153164820">Log &gt; Download</strong>.</span></li><li id="ALM-12005__li29990712164820"><span>Select <strong id="ALM-12005__b41358196164820">OmsKerberos</strong> and <strong id="ALM-12005__b36679449164820">OmmServer</strong> from the <strong id="ALM-12005__b18615181618813">Service</strong> and click <strong id="ALM-12005__b627792117815">OK</strong>.</span></li><li id="ALM-12005__li1145664103113"><span>Click <span><img id="ALM-12005__image1945644173117" src="en-us_image_0000001532607838.png"></span> in the upper right corner, and set <strong id="ALM-12005__b6456941173117">Start Date</strong> and <strong id="ALM-12005__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12005__b13456164113319">Download</strong>.</span></li><li id="ALM-12005__li495644512588"><span>Contact the <span id="ALM-12005__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="4" id="ALM-12005__ol49138498164822"><li id="ALM-12005__li34421516164820"><a name="ALM-12005__li34421516164820"></a><a name="li34421516164820"></a><span>On the <span id="ALM-12005__text1421225262618">MRS</span> Manager home page, choose <strong id="ALM-12005__b87862548435">O&amp;M</strong> &gt; <strong id="ALM-12005__b11281153164820">Log &gt; Download</strong>.</span></li><li id="ALM-12005__li29990712164820"><span>Select <strong id="ALM-12005__b41358196164820">OmsKerberos</strong> and <strong id="ALM-12005__b36679449164820">OmmServer</strong> from the <strong id="ALM-12005__b18615181618813">Service</strong> and click <strong id="ALM-12005__b627792117815">OK</strong>.</span></li><li id="ALM-12005__li1145664103113"><span>Click <span><img id="ALM-12005__image1945644173117" src="en-us_image_0000001532607838.png"></span> in the upper right corner, and set <strong id="ALM-12005__b6456941173117">Start Date</strong> and <strong id="ALM-12005__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12005__b13456164113319">Download</strong>.</span></li><li id="ALM-12005__li495644512588"><span>Contact the <span id="ALM-12005__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12005__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12005__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12005__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12005__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -60,7 +60,7 @@
<div class="section" id="ALM-12006__section59380201"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12006__ul652118533718"><li id="ALM-12006__li1852116517378">The network is disconnected, the hardware is faulty, or the operating system runs slowly.</li><li id="ALM-12006__li1473614616373">The memory of the NodeAgent process is insufficient.</li></ul> <div class="section" id="ALM-12006__section59380201"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12006__ul652118533718"><li id="ALM-12006__li1852116517378">The network is disconnected, the hardware is faulty, or the operating system runs slowly.</li><li id="ALM-12006__li1473614616373">The memory of the NodeAgent process is insufficient.</li></ul>
</div> </div>
<div class="section" id="ALM-12006__section64659764"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12006__p17236725"><strong id="ALM-12006__b662616519645">Check whether the network is disconnected, whether the hardware is faulty, or whether the operating system runs commands slowly.</strong></p> <div class="section" id="ALM-12006__section64659764"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12006__p17236725"><strong id="ALM-12006__b662616519645">Check whether the network is disconnected, whether the hardware is faulty, or whether the operating system runs commands slowly.</strong></p>
<ol id="ALM-12006__ol25386555165047"><li id="ALM-12006__li14747189165028"><span>On FusionInsight Manager, choose <strong id="ALM-12006__b147455436444647">O&amp;M</strong> &gt; <strong id="ALM-12006__b123126530744647">Alarm</strong> &gt; <strong id="ALM-12006__b61870647444647">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12006__image186131198418" src="en-us_image_0000001583127417.png"></span> in the row containing the alarm, click the host name, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12006__li13283100165028"><span>Log in to the active management node as user <strong id="ALM-12006__b6368294144647">root</strong>. <span id="ALM-12006__text1460138164615"></span> <span id="ALM-12006__text18300131824619"></span></span><p><div class="note" id="ALM-12006__note17203152312217"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12006__p920518235220">If the faulty node is the active management node and fails login, the network of the active management node may be faulty. In this case, go to <a href="#ALM-12006__li61437024165028">4</a>.</p> <ol id="ALM-12006__ol25386555165047"><li id="ALM-12006__li14747189165028"><span>On <span id="ALM-12006__text34789336432">MRS</span> Manager, choose <strong id="ALM-12006__b147455436444647">O&amp;M</strong> &gt; <strong id="ALM-12006__b123126530744647">Alarm</strong> &gt; <strong id="ALM-12006__b61870647444647">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12006__image186131198418" src="en-us_image_0000001583127417.png"></span> in the row containing the alarm, click the host name, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12006__li13283100165028"><span>Log in to the active management node as user <strong id="ALM-12006__b6368294144647">root</strong>. <span id="ALM-12006__text1460138164615"></span> <span id="ALM-12006__text18300131824619"></span></span><p><div class="note" id="ALM-12006__note17203152312217"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12006__p920518235220">If the faulty node is the active management node and fails login, the network of the active management node may be faulty. In this case, go to <a href="#ALM-12006__li61437024165028">4</a>.</p>
</div></div> </div></div>
</p></li><li id="ALM-12006__li59218045165028"><span>Run the <strong id="ALM-12006__b12346761832">ping </strong><em id="ALM-12006__i22903501421">IP address of the faulty host</em> command to check whether the faulty node is reachable.</span><p><ul class="subitemlist" id="ALM-12006__ul28949404165028"><li id="ALM-12006__li52511662165028">If yes, go to <a href="#ALM-12006__li5888111210353">12</a>.</li><li id="ALM-12006__li25586221165028">If no, go to <a href="#ALM-12006__li61437024165028">4</a>.</li></ul> </p></li><li id="ALM-12006__li59218045165028"><span>Run the <strong id="ALM-12006__b12346761832">ping </strong><em id="ALM-12006__i22903501421">IP address of the faulty host</em> command to check whether the faulty node is reachable.</span><p><ul class="subitemlist" id="ALM-12006__ul28949404165028"><li id="ALM-12006__li52511662165028">If yes, go to <a href="#ALM-12006__li5888111210353">12</a>.</li><li id="ALM-12006__li25586221165028">If no, go to <a href="#ALM-12006__li61437024165028">4</a>.</li></ul>
</p></li><li id="ALM-12006__li61437024165028"><a name="ALM-12006__li61437024165028"></a><a name="li61437024165028"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12006__ul59022119165028"><li id="ALM-12006__li31932358165028">If yes, go to <a href="#ALM-12006__li23885090165028">5</a>.</li><li id="ALM-12006__li36384175165028">If no, go to <a href="#ALM-12006__li9040006165028">6</a>.</li></ul> </p></li><li id="ALM-12006__li61437024165028"><a name="ALM-12006__li61437024165028"></a><a name="li61437024165028"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12006__ul59022119165028"><li id="ALM-12006__li31932358165028">If yes, go to <a href="#ALM-12006__li23885090165028">5</a>.</li><li id="ALM-12006__li36384175165028">If no, go to <a href="#ALM-12006__li9040006165028">6</a>.</li></ul>
@ -98,7 +98,7 @@ Feb 11 11:44:44 10-120-205-33 ntpq: nss_ldap: failed to bind to LDAP server ldap
</p></li><li id="ALM-12006__li428520391437"><span>Check whether the log file contains an error indicating that the metaspace size or heap memory size is insufficient.</span><p><ul id="ALM-12006__ul3488183864410"><li id="ALM-12006__li12488173894411">If yes, contact <span id="ALM-12006__text11646204144519">O&amp;M personnel</span> personnel to change the memory size.</li><li id="ALM-12006__li136386250454">If no, go to <a href="#ALM-12006__li6096449165028">14</a>.</li></ul> </p></li><li id="ALM-12006__li428520391437"><span>Check whether the log file contains an error indicating that the metaspace size or heap memory size is insufficient.</span><p><ul id="ALM-12006__ul3488183864410"><li id="ALM-12006__li12488173894411">If yes, contact <span id="ALM-12006__text11646204144519">O&amp;M personnel</span> personnel to change the memory size.</li><li id="ALM-12006__li136386250454">If no, go to <a href="#ALM-12006__li6096449165028">14</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12006__p20324852165055"><strong id="ALM-12006__b38076319165058">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12006__p20324852165055"><strong id="ALM-12006__b38076319165058">Collect fault information.</strong></p>
<ol start="14" id="ALM-12006__ol4338365616513"><li id="ALM-12006__li6096449165028"><a name="ALM-12006__li6096449165028"></a><a name="li6096449165028"></a><span>On FusionInsight Manager, choose <strong id="ALM-12006__b101119310920">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12006__b3161731891">Log</strong> &gt; <strong id="ALM-12006__b1818636918">Download</strong>.</span></li><li id="ALM-12006__li17328746165028"><span>Select the following nodes from <strong id="ALM-12006__b74792121696">Services</strong> and click <strong id="ALM-12006__b24914121699">OK</strong>.</span><p><ul class="subitemlist" id="ALM-12006__ul1925416165028"><li id="ALM-12006__li54868049165028">NodeAgent</li><li id="ALM-12006__li24050400165028">Controller</li><li id="ALM-12006__li15127016165028">OS</li></ul> <ol start="14" id="ALM-12006__ol4338365616513"><li id="ALM-12006__li6096449165028"><a name="ALM-12006__li6096449165028"></a><a name="li6096449165028"></a><span>On <span id="ALM-12006__text13426183816432">MRS</span> Manager, choose <strong id="ALM-12006__b101119310920">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12006__b3161731891">Log</strong> &gt; <strong id="ALM-12006__b1818636918">Download</strong>.</span></li><li id="ALM-12006__li17328746165028"><span>Select the following nodes from <strong id="ALM-12006__b74792121696">Services</strong> and click <strong id="ALM-12006__b24914121699">OK</strong>.</span><p><ul class="subitemlist" id="ALM-12006__ul1925416165028"><li id="ALM-12006__li54868049165028">NodeAgent</li><li id="ALM-12006__li24050400165028">Controller</li><li id="ALM-12006__li15127016165028">OS</li></ul>
</p></li><li id="ALM-12006__li21740992165028"><span>Click <span><img id="ALM-12006__image104601319175315" src="en-us_image_0000001532767474.png"></span> in the upper right corner, and set <strong id="ALM-12006__b54651160144647">Start Date</strong> and <strong id="ALM-12006__b209720056144647">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12006__b190140052244647">Download</strong>.</span></li><li id="ALM-12006__li16189904165028"><span>Contact <span id="ALM-12006__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> </p></li><li id="ALM-12006__li21740992165028"><span>Click <span><img id="ALM-12006__image104601319175315" src="en-us_image_0000001532767474.png"></span> in the upper right corner, and set <strong id="ALM-12006__b54651160144647">Start Date</strong> and <strong id="ALM-12006__b209720056144647">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12006__b190140052244647">Download</strong>.</span></li><li id="ALM-12006__li16189904165028"><span>Contact <span id="ALM-12006__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12006__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12006__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12006__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12006__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>

View File

@ -62,7 +62,7 @@
</div></div> </div></div>
</div> </div>
<div class="section" id="ALM-12007__sad734a42f8ef40529fb21b797d8b41e9"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12007__en-us_topic_0070543667_p65245121"><strong id="ALM-12007__b73856891719">Check whether the instance process is abnormal.</strong></p> <div class="section" id="ALM-12007__sad734a42f8ef40529fb21b797d8b41e9"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12007__en-us_topic_0070543667_p65245121"><strong id="ALM-12007__b73856891719">Check whether the instance process is abnormal.</strong></p>
<ol id="ALM-12007__ol5390063317638"><li id="ALM-12007__li42005517036"><a name="ALM-12007__li42005517036"></a><a name="li42005517036"></a><span>In the FusionInsight Manager portal, click <strong id="ALM-12007__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12007__b27872374104950"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12007__image14626452517" src="en-us_image_0000001582807817.png"></span> in the row where the alarm is located , and click the host name to view the host address for which the alarm is generated</span></li><li id="ALM-12007__li911601917036"><span>On the <strong id="ALM-12007__b378050117036">Alarms</strong> page, check whether the <a href="ALM-12006.html">ALM-12006 Node Fault</a> is generated.</span><p><ul class="subitemlist" id="ALM-12007__ul846943117036"><li id="ALM-12007__li452236417036">If yes, go to <a href="#ALM-12007__li20006517036">3</a>.</li><li id="ALM-12007__li3076720917036">If no, go to <a href="#ALM-12007__li195150317036">4</a>.</li></ul> <ol id="ALM-12007__ol5390063317638"><li id="ALM-12007__li42005517036"><a name="ALM-12007__li42005517036"></a><a name="li42005517036"></a><span>In the <span id="ALM-12007__text34789336432">MRS</span> Manager portal, click <strong id="ALM-12007__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12007__b27872374104950"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12007__image14626452517" src="en-us_image_0000001582807817.png"></span> in the row where the alarm is located , and click the host name to view the host address for which the alarm is generated</span></li><li id="ALM-12007__li911601917036"><span>On the <strong id="ALM-12007__b378050117036">Alarms</strong> page, check whether the <a href="ALM-12006.html">ALM-12006 Node Fault</a> is generated.</span><p><ul class="subitemlist" id="ALM-12007__ul846943117036"><li id="ALM-12007__li452236417036">If yes, go to <a href="#ALM-12007__li20006517036">3</a>.</li><li id="ALM-12007__li3076720917036">If no, go to <a href="#ALM-12007__li195150317036">4</a>.</li></ul>
</p></li><li id="ALM-12007__li20006517036"><a name="ALM-12007__li20006517036"></a><a name="li20006517036"></a><span>Handle the alarm according to <a href="ALM-12006.html">ALM-12006 Node Fault</a>.</span></li><li id="ALM-12007__li195150317036"><a name="ALM-12007__li195150317036"></a><a name="li195150317036"></a><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12007__b8307212154711">root</strong>. <span id="ALM-12007__text43649449460"></span>Check whether the installation directory user, user group, and permission of the alarm role are correct. The user, user group, and the permission must be <strong id="ALM-12007__b180058917036">omm:ficommon 750</strong>.</span><p><p class="subitemlist" id="ALM-12007__p7190141912118">For example, the NameNode installation directory is<strong id="ALM-12007__b16534123110112"> </strong><em id="ALM-12007__i677216419119">${BIGDATA_HOME}</em><strong id="ALM-12007__b177174617112">/FusionInsight_Current/</strong><em id="ALM-12007__i137264460113">1_8_NameNode</em><strong id="ALM-12007__b13731846191113">/etc</strong>.</p> </p></li><li id="ALM-12007__li20006517036"><a name="ALM-12007__li20006517036"></a><a name="li20006517036"></a><span>Handle the alarm according to <a href="ALM-12006.html">ALM-12006 Node Fault</a>.</span></li><li id="ALM-12007__li195150317036"><a name="ALM-12007__li195150317036"></a><a name="li195150317036"></a><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12007__b8307212154711">root</strong>. <span id="ALM-12007__text43649449460"></span>Check whether the installation directory user, user group, and permission of the alarm role are correct. The user, user group, and the permission must be <strong id="ALM-12007__b180058917036">omm:ficommon 750</strong>.</span><p><p class="subitemlist" id="ALM-12007__p7190141912118">For example, the NameNode installation directory is<strong id="ALM-12007__b16534123110112"> </strong><em id="ALM-12007__i677216419119">${BIGDATA_HOME}</em><strong id="ALM-12007__b177174617112">/FusionInsight_Current/</strong><em id="ALM-12007__i137264460113">1_8_NameNode</em><strong id="ALM-12007__b13731846191113">/etc</strong>.</p>
<ul class="subitemlist" id="ALM-12007__ul2258645517036"><li id="ALM-12007__li1163004517036">If yes, go to <a href="#ALM-12007__li3396349817036">6</a>.</li><li id="ALM-12007__li250960617036">If no, go to <a href="#ALM-12007__li3247692317036">5</a>.</li></ul> <ul class="subitemlist" id="ALM-12007__ul2258645517036"><li id="ALM-12007__li1163004517036">If yes, go to <a href="#ALM-12007__li3396349817036">6</a>.</li><li id="ALM-12007__li250960617036">If no, go to <a href="#ALM-12007__li3247692317036">5</a>.</li></ul>
</p></li><li id="ALM-12007__li3247692317036"><a name="ALM-12007__li3247692317036"></a><a name="li3247692317036"></a><span>Run the following command to set the permission to <strong id="ALM-12007__b1756352717036">750</strong> and <strong id="ALM-12007__b2385401617036">User:Group</strong> to <strong id="ALM-12007__b1335955517036">omm:ficommon</strong>:</span><p><p class="litext" id="ALM-12007__p833090817036"><strong id="ALM-12007__b5312713817036">chmod 750 </strong><em id="ALM-12007__i838219617036">&lt;folder_name&gt;</em></p> </p></li><li id="ALM-12007__li3247692317036"><a name="ALM-12007__li3247692317036"></a><a name="li3247692317036"></a><span>Run the following command to set the permission to <strong id="ALM-12007__b1756352717036">750</strong> and <strong id="ALM-12007__b2385401617036">User:Group</strong> to <strong id="ALM-12007__b1335955517036">omm:ficommon</strong>:</span><p><p class="litext" id="ALM-12007__p833090817036"><strong id="ALM-12007__b5312713817036">chmod 750 </strong><em id="ALM-12007__i838219617036">&lt;folder_name&gt;</em></p>
@ -70,12 +70,12 @@
</p></li><li id="ALM-12007__li3396349817036"><a name="ALM-12007__li3396349817036"></a><a name="li3396349817036"></a><span>Wait for 5 minutes. In the alarm list, check whether <strong id="ALM-12007__b2385685117036">ALM-12007 Process Fault</strong> is cleared.</span><p><ul class="subitemlist" id="ALM-12007__ul2693144617036"><li id="ALM-12007__li1338507017036">If yes, no further action is required.</li><li id="ALM-12007__li1044892317036">If no, go to <a href="#ALM-12007__li2657388817036">7</a>.</li></ul> </p></li><li id="ALM-12007__li3396349817036"><a name="ALM-12007__li3396349817036"></a><a name="li3396349817036"></a><span>Wait for 5 minutes. In the alarm list, check whether <strong id="ALM-12007__b2385685117036">ALM-12007 Process Fault</strong> is cleared.</span><p><ul class="subitemlist" id="ALM-12007__ul2693144617036"><li id="ALM-12007__li1338507017036">If yes, no further action is required.</li><li id="ALM-12007__li1044892317036">If no, go to <a href="#ALM-12007__li2657388817036">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12007__p5673574217645"><strong id="ALM-12007__b1353742417650">Check whether disk space is sufficient.</strong></p> <p class="tableheading" id="ALM-12007__p5673574217645"><strong id="ALM-12007__b1353742417650">Check whether disk space is sufficient.</strong></p>
<ol start="7" id="ALM-12007__ol2289926317658"><li id="ALM-12007__li2657388817036"><a name="ALM-12007__li2657388817036"></a><a name="li2657388817036"></a><span>On the FusionInsight Manager, check whether the alarm list contains <strong id="ALM-12007__b3723602917036">ALM-12017 Insufficient Disk Capacity</strong>.</span><p><ul class="subitemlist" id="ALM-12007__ul6260497717036"><li id="ALM-12007__li6332838717036">If yes, go to <a href="#ALM-12007__li500135217036">8</a>.</li><li id="ALM-12007__li2932572917036">If no, go to <a href="#ALM-12007__li1622379717036">11</a>.</li></ul> <ol start="7" id="ALM-12007__ol2289926317658"><li id="ALM-12007__li2657388817036"><a name="ALM-12007__li2657388817036"></a><a name="li2657388817036"></a><span>On the <span id="ALM-12007__text1029234412436">MRS</span> Manager, check whether the alarm list contains <strong id="ALM-12007__b3723602917036">ALM-12017 Insufficient Disk Capacity</strong>.</span><p><ul class="subitemlist" id="ALM-12007__ul6260497717036"><li id="ALM-12007__li6332838717036">If yes, go to <a href="#ALM-12007__li500135217036">8</a>.</li><li id="ALM-12007__li2932572917036">If no, go to <a href="#ALM-12007__li1622379717036">11</a>.</li></ul>
</p></li><li id="ALM-12007__li500135217036"><a name="ALM-12007__li500135217036"></a><a name="li500135217036"></a><span>Rectify the fault by following the steps provided in <a href="ALM-12017.html">ALM-12017 Insufficient Disk Capacity</a>.</span></li><li id="ALM-12007__li2288625317036"><span>Wait for 5 minutes. In the alarm list, check whether <strong id="ALM-12007__b4501217017036">ALM-12017 Insufficient Disk Capacity</strong> is cleared.</span><p><ul class="subitemlist" id="ALM-12007__ul999945717036"><li id="ALM-12007__li2210716917036">If yes, go to <a href="#ALM-12007__li1723673717036">10</a>.</li><li id="ALM-12007__li4585029317036">If no, go to <a href="#ALM-12007__li1622379717036">11</a>.</li></ul> </p></li><li id="ALM-12007__li500135217036"><a name="ALM-12007__li500135217036"></a><a name="li500135217036"></a><span>Rectify the fault by following the steps provided in <a href="ALM-12017.html">ALM-12017 Insufficient Disk Capacity</a>.</span></li><li id="ALM-12007__li2288625317036"><span>Wait for 5 minutes. In the alarm list, check whether <strong id="ALM-12007__b4501217017036">ALM-12017 Insufficient Disk Capacity</strong> is cleared.</span><p><ul class="subitemlist" id="ALM-12007__ul999945717036"><li id="ALM-12007__li2210716917036">If yes, go to <a href="#ALM-12007__li1723673717036">10</a>.</li><li id="ALM-12007__li4585029317036">If no, go to <a href="#ALM-12007__li1622379717036">11</a>.</li></ul>
</p></li><li id="ALM-12007__li1723673717036"><a name="ALM-12007__li1723673717036"></a><a name="li1723673717036"></a><span>Wait for 5 minutes. In the alarm list, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12007__ul3418148317036"><li id="ALM-12007__li464969017036">If yes, no further action is required.</li><li id="ALM-12007__li4108064417036">If no, go to <a href="#ALM-12007__li1622379717036">11</a>.</li></ul> </p></li><li id="ALM-12007__li1723673717036"><a name="ALM-12007__li1723673717036"></a><a name="li1723673717036"></a><span>Wait for 5 minutes. In the alarm list, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12007__ul3418148317036"><li id="ALM-12007__li464969017036">If yes, no further action is required.</li><li id="ALM-12007__li4108064417036">If no, go to <a href="#ALM-12007__li1622379717036">11</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12007__p3392472417052"><strong id="ALM-12007__b2313861417057">Collect fault information.</strong></p> <p id="ALM-12007__p3392472417052"><strong id="ALM-12007__b2313861417057">Collect fault information.</strong></p>
<ol start="11" id="ALM-12007__ol481086251710"><li id="ALM-12007__li1622379717036"><a name="ALM-12007__li1622379717036"></a><a name="li1622379717036"></a><span>On the FusionInsight Manager, choose <strong id="ALM-12007__b2091290617036">O&amp;M</strong> &gt; <strong id="ALM-12007__b5399842717036">Log &gt; Download</strong>.</span></li><li id="ALM-12007__li1598834917036"><span>According to the service name obtained in <a href="#ALM-12007__li42005517036">1</a>, select the component and <strong id="ALM-12007__b68821814172417">NodeAgent</strong> from the <strong id="ALM-12007__b15959191911544">Service</strong> and click <strong id="ALM-12007__b3991118545">OK</strong>.</span></li><li id="ALM-12007__li1145664103113"><span>Click <span><img id="ALM-12007__image1945644173117" src="en-us_image_0000001583127509.png"></span> in the upper right corner, and set <strong id="ALM-12007__b6456941173117">Start Date</strong> and <strong id="ALM-12007__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12007__b13456164113319">Download</strong>.</span></li><li id="ALM-12007__li495644512588"><span>Contact the <span id="ALM-12007__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="11" id="ALM-12007__ol481086251710"><li id="ALM-12007__li1622379717036"><a name="ALM-12007__li1622379717036"></a><a name="li1622379717036"></a><span>On the <span id="ALM-12007__text1770545174318">MRS</span> Manager, choose <strong id="ALM-12007__b2091290617036">O&amp;M</strong> &gt; <strong id="ALM-12007__b5399842717036">Log &gt; Download</strong>.</span></li><li id="ALM-12007__li1598834917036"><span>According to the service name obtained in <a href="#ALM-12007__li42005517036">1</a>, select the component and <strong id="ALM-12007__b68821814172417">NodeAgent</strong> from the <strong id="ALM-12007__b15959191911544">Service</strong> and click <strong id="ALM-12007__b3991118545">OK</strong>.</span></li><li id="ALM-12007__li1145664103113"><span>Click <span><img id="ALM-12007__image1945644173117" src="en-us_image_0000001583127509.png"></span> in the upper right corner, and set <strong id="ALM-12007__b6456941173117">Start Date</strong> and <strong id="ALM-12007__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12007__b13456164113319">Download</strong>.</span></li><li id="ALM-12007__li495644512588"><span>Contact the <span id="ALM-12007__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12007__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12007__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12007__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12007__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -61,7 +61,7 @@
<ul id="ALM-12010__ul11347112011510"><li id="ALM-12010__li17347132014154">The link between the active and standby Manager is abnormal.</li><li id="ALM-12010__li127451022151512">The node name configuration is incorrect.</li><li id="ALM-12010__li15347620181517">The port is disabled by the firewall.</li></ul> <ul id="ALM-12010__ul11347112011510"><li id="ALM-12010__li17347132014154">The link between the active and standby Manager is abnormal.</li><li id="ALM-12010__li127451022151512">The node name configuration is incorrect.</li><li id="ALM-12010__li15347620181517">The port is disabled by the firewall.</li></ul>
</div> </div>
<div class="section" id="ALM-12010__s8af1753e22d647b9b1328244e85fc0a1"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12010__en-us_topic_0070543674_p27190637"><strong id="ALM-12010__b5350194613159">Check whether the network between the active and standby Manager server is normal.</strong></p> <div class="section" id="ALM-12010__s8af1753e22d647b9b1328244e85fc0a1"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12010__en-us_topic_0070543674_p27190637"><strong id="ALM-12010__b5350194613159">Check whether the network between the active and standby Manager server is normal.</strong></p>
<ol id="ALM-12010__ol20655039202014"><li id="ALM-12010__li3649153912014"><span>In the FusionInsight Manager portal, click <strong id="ALM-12010__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12010__b27872374104950"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12010__image4649163910207" src="en-us_image_0000001583127401.png"></span> in the row containing the alarm and view the IP address of the standby Manager (Peer Manager) server in the alarm details.</span></li><li id="ALM-12010__li665018399204"><span>Log in to the active Manager server as user <strong id="ALM-12010__b16650193982017">root</strong>. <span id="ALM-12010__text13862037144910"></span><span id="ALM-12010__text077751144915"></span></span></li><li id="ALM-12010__li86511539112014"><span>Run the <strong id="ALM-12010__b14650439102018">ping</strong> <em id="ALM-12010__i96503394205">standby Manager heartbeat IP address</em> command to check whether the standby Manager server is reachable.</span><p><ul class="subitemlist" id="ALM-12010__ul565043917209"><li id="ALM-12010__li665012399202">If yes, go to <a href="#ALM-12010__li206521339172011">6</a>.</li><li id="ALM-12010__li36504394207">If no, go to <a href="#ALM-12010__li18651103915205">4</a>.</li></ul> <ol id="ALM-12010__ol20655039202014"><li id="ALM-12010__li3649153912014"><span>In the <span id="ALM-12010__text34789336432">MRS</span> Manager portal, click <strong id="ALM-12010__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12010__b27872374104950"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12010__image4649163910207" src="en-us_image_0000001583127401.png"></span> in the row containing the alarm and view the IP address of the standby Manager (Peer Manager) server in the alarm details.</span></li><li id="ALM-12010__li665018399204"><span>Log in to the active Manager server as user <strong id="ALM-12010__b16650193982017">root</strong>. <span id="ALM-12010__text13862037144910"></span><span id="ALM-12010__text077751144915"></span></span></li><li id="ALM-12010__li86511539112014"><span>Run the <strong id="ALM-12010__b14650439102018">ping</strong> <em id="ALM-12010__i96503394205">standby Manager heartbeat IP address</em> command to check whether the standby Manager server is reachable.</span><p><ul class="subitemlist" id="ALM-12010__ul565043917209"><li id="ALM-12010__li665012399202">If yes, go to <a href="#ALM-12010__li206521339172011">6</a>.</li><li id="ALM-12010__li36504394207">If no, go to <a href="#ALM-12010__li18651103915205">4</a>.</li></ul>
</p></li><li id="ALM-12010__li18651103915205"><a name="ALM-12010__li18651103915205"></a><a name="li18651103915205"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12010__ul1465123917207"><li id="ALM-12010__li7651539162019">If yes, go to <a href="#ALM-12010__li166511739102017">5</a>.</li><li id="ALM-12010__li12651153932016">If no, go to <a href="#ALM-12010__li206521339172011">6</a>.</li></ul> </p></li><li id="ALM-12010__li18651103915205"><a name="ALM-12010__li18651103915205"></a><a name="li18651103915205"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12010__ul1465123917207"><li id="ALM-12010__li7651539162019">If yes, go to <a href="#ALM-12010__li166511739102017">5</a>.</li><li id="ALM-12010__li12651153932016">If no, go to <a href="#ALM-12010__li206521339172011">6</a>.</li></ul>
</p></li><li id="ALM-12010__li166511739102017"><a name="ALM-12010__li166511739102017"></a><a name="li166511739102017"></a><span>Rectify the network fault and check whether the alarm is cleared from the alarm list.</span><p><ul class="subitemlist" id="ALM-12010__ul12651143992015"><li id="ALM-12010__li66510391204">If yes, no further action is required.</li><li id="ALM-12010__li165193912202">If no, go to <a href="#ALM-12010__li206521339172011">6</a>.</li></ul> </p></li><li id="ALM-12010__li166511739102017"><a name="ALM-12010__li166511739102017"></a><a name="li166511739102017"></a><span>Rectify the network fault and check whether the alarm is cleared from the alarm list.</span><p><ul class="subitemlist" id="ALM-12010__ul12651143992015"><li id="ALM-12010__li66510391204">If yes, no further action is required.</li><li id="ALM-12010__li165193912202">If no, go to <a href="#ALM-12010__li206521339172011">6</a>.</li></ul>
</p></li><li class="subitemlist" id="ALM-12010__li206521339172011"><a name="ALM-12010__li206521339172011"></a><a name="li206521339172011"></a><span>Run the following command to go to the software installation directory:</span><p><p id="ALM-12010__p1652939182013"><strong id="ALM-12010__b136521139172015">cd /opt</strong></p> </p></li><li class="subitemlist" id="ALM-12010__li206521339172011"><a name="ALM-12010__li206521339172011"></a><a name="li206521339172011"></a><span>Run the following command to go to the software installation directory:</span><p><p id="ALM-12010__p1652939182013"><strong id="ALM-12010__b136521139172015">cd /opt</strong></p>
@ -76,7 +76,7 @@
</p></li><li id="ALM-12010__li5649163982013"><span>Check whether the alarm is cleared from the alarm list.</span><p><ul id="ALM-12010__ul12649143919207"><li id="ALM-12010__li76481939182016">If yes, no further action is required.</li><li id="ALM-12010__li6649839152018">If no, go to <a href="#ALM-12010__li41244883171443">16</a>.</li></ul> </p></li><li id="ALM-12010__li5649163982013"><span>Check whether the alarm is cleared from the alarm list.</span><p><ul id="ALM-12010__ul12649143919207"><li id="ALM-12010__li76481939182016">If yes, no further action is required.</li><li id="ALM-12010__li6649839152018">If no, go to <a href="#ALM-12010__li41244883171443">16</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12010__p66076255171453"><strong id="ALM-12010__b56103124171459">Collect fault information.</strong></p> <p id="ALM-12010__p66076255171453"><strong id="ALM-12010__b56103124171459">Collect fault information.</strong></p>
<ol start="16" id="ALM-12010__ol4742499917152"><li id="ALM-12010__li41244883171443"><a name="ALM-12010__li41244883171443"></a><a name="li41244883171443"></a><span>On the FusionInsight Manager, choose <strong id="ALM-12010__b2091290617036">O&amp;M</strong> &gt; <strong id="ALM-12010__b4582764171443">Log &gt; Download</strong>.</span></li><li id="ALM-12010__li52887856171443"><span>Select the following nodes from the <strong id="ALM-12010__b1114195518811">Service</strong> and click<strong id="ALM-12010__b11411559819"> OK</strong>:</span><p><ul class="subitemlist" id="ALM-12010__ul58072211171443"><li id="ALM-12010__li2749285171443">OmmServer</li><li id="ALM-12010__li24743571171443">Controller</li><li id="ALM-12010__li21365548171443">NodeAgent</li></ul> <ol start="16" id="ALM-12010__ol4742499917152"><li id="ALM-12010__li41244883171443"><a name="ALM-12010__li41244883171443"></a><a name="li41244883171443"></a><span>On the <span id="ALM-12010__text15892848144315">MRS</span> Manager, choose <strong id="ALM-12010__b2091290617036">O&amp;M</strong> &gt; <strong id="ALM-12010__b4582764171443">Log &gt; Download</strong>.</span></li><li id="ALM-12010__li52887856171443"><span>Select the following nodes from the <strong id="ALM-12010__b1114195518811">Service</strong> and click<strong id="ALM-12010__b11411559819"> OK</strong>:</span><p><ul class="subitemlist" id="ALM-12010__ul58072211171443"><li id="ALM-12010__li2749285171443">OmmServer</li><li id="ALM-12010__li24743571171443">Controller</li><li id="ALM-12010__li21365548171443">NodeAgent</li></ul>
</p></li><li id="ALM-12010__li1145664103113"><span>Click <span><img id="ALM-12010__image1945644173117" src="en-us_image_0000001532767502.png"></span> in the upper right corner, and set <strong id="ALM-12010__b6456941173117">Start Date</strong> and <strong id="ALM-12010__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12010__b13456164113319">Download</strong>.</span></li><li id="ALM-12010__li495644512588"><span>Contact the <span id="ALM-12010__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> </p></li><li id="ALM-12010__li1145664103113"><span>Click <span><img id="ALM-12010__image1945644173117" src="en-us_image_0000001532767502.png"></span> in the upper right corner, and set <strong id="ALM-12010__b6456941173117">Start Date</strong> and <strong id="ALM-12010__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12010__b13456164113319">Download</strong>.</span></li><li id="ALM-12010__li495644512588"><span>Contact the <span id="ALM-12010__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12010__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12010__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12010__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12010__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>

View File

@ -60,7 +60,7 @@
<div class="section" id="ALM-12011__s77f5924161444716a130206f2960adf2"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12011__ul14145165617812"><li id="ALM-12011__li414545618819">The link between the active and standby Managers is interrupted or The storage space of the <strong id="ALM-12011__b239214252236">/srv/BigData/LocalBackup</strong> directory is full.</li><li id="ALM-12011__li42413581788">The synchronization file does not exist or the file permission is incorrect.</li></ul> <div class="section" id="ALM-12011__s77f5924161444716a130206f2960adf2"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12011__ul14145165617812"><li id="ALM-12011__li414545618819">The link between the active and standby Managers is interrupted or The storage space of the <strong id="ALM-12011__b239214252236">/srv/BigData/LocalBackup</strong> directory is full.</li><li id="ALM-12011__li42413581788">The synchronization file does not exist or the file permission is incorrect.</li></ul>
</div> </div>
<div class="section" id="ALM-12011__s229a984fc400445ab382e100b6b3e00c"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12011__en-us_topic_0070543504_p8274172"><strong id="ALM-12011__b65333561171814">Check whether the network between the active Manager server and the standby Manager server is normal.</strong></p> <div class="section" id="ALM-12011__s229a984fc400445ab382e100b6b3e00c"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12011__en-us_topic_0070543504_p8274172"><strong id="ALM-12011__b65333561171814">Check whether the network between the active Manager server and the standby Manager server is normal.</strong></p>
<ol id="ALM-12011__ol53693817171754"><li id="ALM-12011__li61677856171750"><span>In the FusionInsight Manager portal, click <strong id="ALM-12011__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12011__b27872374104950"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12011__image168221113135319" src="en-us_image_0000001532607938.png"></span> in the row where the alarm is located and obtain the standby Manager server IP address (Peer Manager IP address) in the alarm details.</span></li><li id="ALM-12011__li218225171750"><span>Log in to the active Manager server as user <strong id="ALM-12011__b18229793171750">root</strong>. <span id="ALM-12011__text43649449460"></span></span></li><li id="ALM-12011__li45826494171750"><span>Run the <strong id="ALM-12011__b1964026171750">ping </strong><em id="ALM-12011__i17676237171750">standby Manager IP address</em> command to check whether the standby Manager server is reachable.</span><p><ul class="subitemlist" id="ALM-12011__ul20004913171750"><li id="ALM-12011__li22489118171750">If yes, go to <a href="#ALM-12011__li983315367129">6</a>.</li><li id="ALM-12011__li9679308171750">If no, go to <a href="#ALM-12011__li3033024171750">4</a>.</li></ul> <ol id="ALM-12011__ol53693817171754"><li id="ALM-12011__li61677856171750"><span>In the <span id="ALM-12011__text34789336432">MRS</span> Manager portal, click <strong id="ALM-12011__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12011__b27872374104950"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12011__image168221113135319" src="en-us_image_0000001532607938.png"></span> in the row where the alarm is located and obtain the standby Manager server IP address (Peer Manager IP address) in the alarm details.</span></li><li id="ALM-12011__li218225171750"><span>Log in to the active Manager server as user <strong id="ALM-12011__b18229793171750">root</strong>. <span id="ALM-12011__text43649449460"></span></span></li><li id="ALM-12011__li45826494171750"><span>Run the <strong id="ALM-12011__b1964026171750">ping </strong><em id="ALM-12011__i17676237171750">standby Manager IP address</em> command to check whether the standby Manager server is reachable.</span><p><ul class="subitemlist" id="ALM-12011__ul20004913171750"><li id="ALM-12011__li22489118171750">If yes, go to <a href="#ALM-12011__li983315367129">6</a>.</li><li id="ALM-12011__li9679308171750">If no, go to <a href="#ALM-12011__li3033024171750">4</a>.</li></ul>
</p></li><li id="ALM-12011__li3033024171750"><a name="ALM-12011__li3033024171750"></a><a name="li3033024171750"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12011__ul45076245171750"><li id="ALM-12011__li20958557171750">If yes, go to <a href="#ALM-12011__li52745930171750">5</a>.</li><li id="ALM-12011__li19921552171750">If no, go to <a href="#ALM-12011__li983315367129">6</a>.</li></ul> </p></li><li id="ALM-12011__li3033024171750"><a name="ALM-12011__li3033024171750"></a><a name="li3033024171750"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12011__ul45076245171750"><li id="ALM-12011__li20958557171750">If yes, go to <a href="#ALM-12011__li52745930171750">5</a>.</li><li id="ALM-12011__li19921552171750">If no, go to <a href="#ALM-12011__li983315367129">6</a>.</li></ul>
</p></li><li id="ALM-12011__li52745930171750"><a name="ALM-12011__li52745930171750"></a><a name="li52745930171750"></a><span>Rectify the network fault and check whether the alarm is cleared from the alarm list.</span><p><ul class="subitemlist" id="ALM-12011__ul35448373171750"><li id="ALM-12011__li27297218171750">If yes, no further action is required.</li><li id="ALM-12011__li63591031171750">If no, go to <a href="#ALM-12011__li983315367129">6</a>.</li></ul> </p></li><li id="ALM-12011__li52745930171750"><a name="ALM-12011__li52745930171750"></a><a name="li52745930171750"></a><span>Rectify the network fault and check whether the alarm is cleared from the alarm list.</span><p><ul class="subitemlist" id="ALM-12011__ul35448373171750"><li id="ALM-12011__li27297218171750">If yes, no further action is required.</li><li id="ALM-12011__li63591031171750">If no, go to <a href="#ALM-12011__li983315367129">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
@ -70,7 +70,7 @@
</p></li><li id="ALM-12011__li11402194014150"><a name="ALM-12011__li11402194014150"></a><a name="li11402194014150"></a><span>Run the following command to clear unnecessary backup files:</span><p><p id="ALM-12011__p59241552111512"><strong id="ALM-12011__b1924152141510">rm -rf</strong> <em id="ALM-12011__i169241552171513">Directory to be cleared</em></p> </p></li><li id="ALM-12011__li11402194014150"><a name="ALM-12011__li11402194014150"></a><a name="li11402194014150"></a><span>Run the following command to clear unnecessary backup files:</span><p><p id="ALM-12011__p59241552111512"><strong id="ALM-12011__b1924152141510">rm -rf</strong> <em id="ALM-12011__i169241552171513">Directory to be cleared</em></p>
<p id="ALM-12011__p1925765620165">Example:</p> <p id="ALM-12011__p1925765620165">Example:</p>
<p id="ALM-12011__p19627153013162"><strong id="ALM-12011__b3427132016256">rm -rf </strong><strong id="ALM-12011__b124271920172511">/srv/BigData/LocalBackup/0/default-oms_20191211143443</strong></p> <p id="ALM-12011__p19627153013162"><strong id="ALM-12011__b3427132016256">rm -rf </strong><strong id="ALM-12011__b124271920172511">/srv/BigData/LocalBackup/0/default-oms_20191211143443</strong></p>
</p></li><li id="ALM-12011__li11225183641710"><span>On FusionInsight Manager, choose <strong id="ALM-12011__b164529671912">O&amp;M</strong> &gt; <strong id="ALM-12011__b154528641917">Backup and Restoration</strong> &gt; <strong id="ALM-12011__b1045266201919">Backup Management</strong>.</span><p><p id="ALM-12011__p15628204013192">In the <strong id="ALM-12011__b16628184018193">Operation</strong> column of the backup task to be performed, click <strong id="ALM-12011__b662824016198">Configure</strong> and change the value of <strong id="ALM-12011__b36284403191">Maximum Number of Backup Copies</strong> to reduce the number of backup file sets.</p> </p></li><li id="ALM-12011__li11225183641710"><span>On <span id="ALM-12011__text1289365116437">MRS</span> Manager, choose <strong id="ALM-12011__b164529671912">O&amp;M</strong> &gt; <strong id="ALM-12011__b154528641917">Backup and Restoration</strong> &gt; <strong id="ALM-12011__b1045266201919">Backup Management</strong>.</span><p><p id="ALM-12011__p15628204013192">In the <strong id="ALM-12011__b16628184018193">Operation</strong> column of the backup task to be performed, click <strong id="ALM-12011__b662824016198">Configure</strong> and change the value of <strong id="ALM-12011__b36284403191">Maximum Number of Backup Copies</strong> to reduce the number of backup file sets.</p>
</p></li><li id="ALM-12011__li10257204812200"><span>Wait about 1 minute and check whether the alarm is cleared.</span><p><ul id="ALM-12011__ul1841101272115"><li id="ALM-12011__li8410127216">If yes, no further action is required.</li><li id="ALM-12011__li035423602114">If no, go to <a href="#ALM-12011__li1826164817917">10</a>.</li></ul> </p></li><li id="ALM-12011__li10257204812200"><span>Wait about 1 minute and check whether the alarm is cleared.</span><p><ul id="ALM-12011__ul1841101272115"><li id="ALM-12011__li8410127216">If yes, no further action is required.</li><li id="ALM-12011__li035423602114">If no, go to <a href="#ALM-12011__li1826164817917">10</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12011__p17556109182017"><strong id="ALM-12011__b11174181719106">Check whether the synchronization file exists and whether the file permission is normal.</strong></p> <p id="ALM-12011__p17556109182017"><strong id="ALM-12011__b11174181719106">Check whether the synchronization file exists and whether the file permission is normal.</strong></p>
@ -93,7 +93,7 @@
</p></li><li id="ALM-12011__li985632952514"><a name="ALM-12011__li985632952514"></a><a name="li985632952514"></a><span>Wait about 10 minute and check whether the alarm is cleared.</span><p><ul id="ALM-12011__ul118561229142514"><li id="ALM-12011__li1085622915256">If yes, no further action is required.</li><li id="ALM-12011__li20856429172515">If no, go to <a href="#ALM-12011__li65512922171750">14</a>.</li></ul> </p></li><li id="ALM-12011__li985632952514"><a name="ALM-12011__li985632952514"></a><a name="li985632952514"></a><span>Wait about 10 minute and check whether the alarm is cleared.</span><p><ul id="ALM-12011__ul118561229142514"><li id="ALM-12011__li1085622915256">If yes, no further action is required.</li><li id="ALM-12011__li20856429172515">If no, go to <a href="#ALM-12011__li65512922171750">14</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12011__p3197719917181"><strong id="ALM-12011__b5988844117185">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12011__p3197719917181"><strong id="ALM-12011__b5988844117185">Collect fault information.</strong></p>
<ol start="14" id="ALM-12011__ol2384130317188"><li id="ALM-12011__li65512922171750"><a name="ALM-12011__li65512922171750"></a><a name="li65512922171750"></a><span>On the FusionInsight Manager, choose <strong id="ALM-12011__b173565410011">O&amp;M</strong> &gt; <strong id="ALM-12011__b44561915171750">Log &gt; Download</strong>.</span></li><li id="ALM-12011__li28001919171750"><span>Select the following nodes from the <strong id="ALM-12011__b18740191794">Service</strong> and click <strong id="ALM-12011__b15740101993">OK</strong>:</span><p><ul class="subitemlist" id="ALM-12011__ul40394026171750"><li id="ALM-12011__li44518484171750">OmmServer</li><li id="ALM-12011__li65122042171750">Controller</li><li id="ALM-12011__li49227467171750">NodeAgent</li></ul> <ol start="14" id="ALM-12011__ol2384130317188"><li id="ALM-12011__li65512922171750"><a name="ALM-12011__li65512922171750"></a><a name="li65512922171750"></a><span>On the <span id="ALM-12011__text17301205344318">MRS</span> Manager, choose <strong id="ALM-12011__b173565410011">O&amp;M</strong> &gt; <strong id="ALM-12011__b44561915171750">Log &gt; Download</strong>.</span></li><li id="ALM-12011__li28001919171750"><span>Select the following nodes from the <strong id="ALM-12011__b18740191794">Service</strong> and click <strong id="ALM-12011__b15740101993">OK</strong>:</span><p><ul class="subitemlist" id="ALM-12011__ul40394026171750"><li id="ALM-12011__li44518484171750">OmmServer</li><li id="ALM-12011__li65122042171750">Controller</li><li id="ALM-12011__li49227467171750">NodeAgent</li></ul>
</p></li><li id="ALM-12011__li1145664103113"><span>Click <span><img id="ALM-12011__image1945644173117" src="en-us_image_0000001582927829.png"></span> in the upper right corner, and set <strong id="ALM-12011__b6456941173117">Start Date</strong> and <strong id="ALM-12011__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12011__b13456164113319">Download</strong>.</span></li><li id="ALM-12011__li495644512588"><span>Contact the <span id="ALM-12011__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> </p></li><li id="ALM-12011__li1145664103113"><span>Click <span><img id="ALM-12011__image1945644173117" src="en-us_image_0000001582927829.png"></span> in the upper right corner, and set <strong id="ALM-12011__b6456941173117">Start Date</strong> and <strong id="ALM-12011__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12011__b13456164113319">Download</strong>.</span></li><li id="ALM-12011__li495644512588"><span>Contact the <span id="ALM-12011__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12011__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12011__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12011__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12011__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>

View File

@ -55,7 +55,7 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12012__section46714221"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12012__p52316961">The time on the node is inconsistent with that on other nodes in the cluster. Therefore, some FusionInsight applications on the node may not run properly.</p> <div class="section" id="ALM-12012__section46714221"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12012__p52316961">The time on the node is inconsistent with that on other nodes in the cluster. Therefore, some <span id="ALM-12012__text156568322235">MRS</span> applications on the node may not run properly.</p>
</div> </div>
<div class="section" id="ALM-12012__section17774812"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12012__ul9815411"><li id="ALM-12012__li21229841">The NTP service on the current node cannot start properly.</li><li id="ALM-12012__li56850842">The current node fails to synchronize time with the NTP service on the active OMS node.</li><li id="ALM-12012__li41895537">The key value authenticated by the NTP service on the current node is inconsistent with that on the active OMS node.</li><li id="ALM-12012__li41515518">The time offset between the node and the NTP service on the active OMS node is large.</li></ul> <div class="section" id="ALM-12012__section17774812"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12012__ul9815411"><li id="ALM-12012__li21229841">The NTP service on the current node cannot start properly.</li><li id="ALM-12012__li56850842">The current node fails to synchronize time with the NTP service on the active OMS node.</li><li id="ALM-12012__li41895537">The key value authenticated by the NTP service on the current node is inconsistent with that on the active OMS node.</li><li id="ALM-12012__li41515518">The time offset between the node and the NTP service on the active OMS node is large.</li></ul>
</div> </div>
@ -66,7 +66,7 @@
</div></div> </div></div>
</p></li></ol> </p></li></ol>
<p id="ALM-12012__p5707131016224"><strong id="ALM-12012__b18791952101610">Check whether the chrony service on the node is started properly.</strong></p> <p id="ALM-12012__p5707131016224"><strong id="ALM-12012__b18791952101610">Check whether the chrony service on the node is started properly.</strong></p>
<ol start="2" id="ALM-12012__ol26524396229"><li id="ALM-12012__li565253915220"><a name="ALM-12012__li565253915220"></a><a name="li565253915220"></a><span>On FusionInsight Manager, choose <strong id="ALM-12012__b1469615331711">O&amp;M</strong> &gt; <strong id="ALM-12012__b1671010381713">Alarm</strong> &gt; <strong id="ALM-12012__b3714193151712">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12012__image15842101152317" src="en-us_image_0000001583087389.png"></span> in the row containing the alarm, and view the name of the host for which the alarm is generated in <strong id="ALM-12012__b164471729181">Location</strong>.</span></li><li id="ALM-12012__li1698314318229"><span>Check whether the chronyd process is running on the node where the alarm is generated. Log in to the node for which the alarm is generated as user <strong id="ALM-12012__b10361337172212">root</strong> and run the <strong id="ALM-12012__b736437172211">ps -ef | grep </strong><strong id="ALM-12012__b5365374225">chronyd<strong id="ALM-12012__b103613712212"> | grep -v grep</strong></strong> command to check whether the command output contains the chronyd process.</span><p><ul id="ALM-12012__ul16645143102316"><li id="ALM-12012__li1364519382310">If yes, go to <a href="#ALM-12012__li128001354104320">6</a>.</li><li id="ALM-12012__li176451636231">If no, go to <a href="#ALM-12012__li112931223172310">4</a>.</li></ul> <ol start="2" id="ALM-12012__ol26524396229"><li id="ALM-12012__li565253915220"><a name="ALM-12012__li565253915220"></a><a name="li565253915220"></a><span>On <span id="ALM-12012__text34789336432">MRS</span> Manager, choose <strong id="ALM-12012__b1469615331711">O&amp;M</strong> &gt; <strong id="ALM-12012__b1671010381713">Alarm</strong> &gt; <strong id="ALM-12012__b3714193151712">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12012__image15842101152317" src="en-us_image_0000001583087389.png"></span> in the row containing the alarm, and view the name of the host for which the alarm is generated in <strong id="ALM-12012__b164471729181">Location</strong>.</span></li><li id="ALM-12012__li1698314318229"><span>Check whether the chronyd process is running on the node where the alarm is generated. Log in to the node for which the alarm is generated as user <strong id="ALM-12012__b10361337172212">root</strong> and run the <strong id="ALM-12012__b736437172211">ps -ef | grep </strong><strong id="ALM-12012__b5365374225">chronyd<strong id="ALM-12012__b103613712212"> | grep -v grep</strong></strong> command to check whether the command output contains the chronyd process.</span><p><ul id="ALM-12012__ul16645143102316"><li id="ALM-12012__li1364519382310">If yes, go to <a href="#ALM-12012__li128001354104320">6</a>.</li><li id="ALM-12012__li176451636231">If no, go to <a href="#ALM-12012__li112931223172310">4</a>.</li></ul>
</p></li><li id="ALM-12012__li112931223172310"><a name="ALM-12012__li112931223172310"></a><a name="li112931223172310"></a><span>Run the <strong id="ALM-12012__b934512092314">systemctl chronyd start</strong> command to start the NTP service. (Currently, only CentOS and Red Hat Enterprise Linux 7.0 or later are supported.)</span></li><li id="ALM-12012__li1370682632319"><span>Check whether the alarm is cleared 10 minutes later.</span><p><ul id="ALM-12012__ul9733140142314"><li id="ALM-12012__li873412401239">If yes, no further action is required.</li><li id="ALM-12012__li273494010236">If no, go to <a href="#ALM-12012__li128001354104320">6</a>.</li></ul> </p></li><li id="ALM-12012__li112931223172310"><a name="ALM-12012__li112931223172310"></a><a name="li112931223172310"></a><span>Run the <strong id="ALM-12012__b934512092314">systemctl chronyd start</strong> command to start the NTP service. (Currently, only CentOS and Red Hat Enterprise Linux 7.0 or later are supported.)</span></li><li id="ALM-12012__li1370682632319"><span>Check whether the alarm is cleared 10 minutes later.</span><p><ul id="ALM-12012__ul9733140142314"><li id="ALM-12012__li873412401239">If yes, no further action is required.</li><li id="ALM-12012__li273494010236">If no, go to <a href="#ALM-12012__li128001354104320">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12012__p622313424434"><strong id="ALM-12012__b164701532264">Check whether the current node can synchronize time properly with the chrony service on the active OMS node.</strong></p> <p id="ALM-12012__p622313424434"><strong id="ALM-12012__b164701532264">Check whether the current node can synchronize time properly with the chrony service on the active OMS node.</strong></p>
@ -109,7 +109,7 @@ host01:~ #</pre>
</p></li><li id="ALM-12012__li10221981492"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul id="ALM-12012__ul1791815174916"><li id="ALM-12012__li77919155495">If yes, no further action is required.</li><li id="ALM-12012__li279221504916">If no, go to <a href="#ALM-12012__li3559109817193">38</a>.</li></ul> </p></li><li id="ALM-12012__li10221981492"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul id="ALM-12012__ul1791815174916"><li id="ALM-12012__li77919155495">If yes, no further action is required.</li><li id="ALM-12012__li279221504916">If no, go to <a href="#ALM-12012__li3559109817193">38</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12012__p7313798"><strong id="ALM-12012__b47285034171917">Check whether the NTP service on the node is started properly.</strong></p> <p class="tableheading" id="ALM-12012__p7313798"><strong id="ALM-12012__b47285034171917">Check whether the NTP service on the node is started properly.</strong></p>
<ol start="20" id="ALM-12012__ol66642017198"><li id="ALM-12012__li18137932125120"><a name="ALM-12012__li18137932125120"></a><a name="li18137932125120"></a><span>On FusionInsight Manager, choose <strong id="ALM-12012__b127562128012">O&amp;M</strong> &gt; <strong id="ALM-12012__b1776581217019">Alarm</strong> &gt; <strong id="ALM-12012__b676620121104">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12012__image168221113135319" src="en-us_image_0000001532448250.png"></span> in the row containing the alarm, and view the name of the host for which the alarm is generated in <strong id="ALM-12012__b77671212506">Location</strong>.</span></li><li id="ALM-12012__li4741980517193"><span>Check whether the ntpd process is running on the node using the following method. Log in to the alarm node as user <strong id="ALM-12012__b3803747417193">root</strong> and run the <strong id="ALM-12012__b6113654817193">ps -ef | grep ntpd | grep -v grep</strong> command to check whether the command output contains the ntpd process. <span id="ALM-12012__text7780723164819"></span></span><p><ul class="subitemlist" id="ALM-12012__ul6492119017193"><li id="ALM-12012__li5311334217193">If yes, go to <a href="#ALM-12012__li3507541817193">24</a>.</li><li id="ALM-12012__li721346517193">If no, go to <a href="#ALM-12012__li797292017193">22</a>.</li></ul> <ol start="20" id="ALM-12012__ol66642017198"><li id="ALM-12012__li18137932125120"><a name="ALM-12012__li18137932125120"></a><a name="li18137932125120"></a><span>On <span id="ALM-12012__text29014016446">MRS</span> Manager, choose <strong id="ALM-12012__b127562128012">O&amp;M</strong> &gt; <strong id="ALM-12012__b1776581217019">Alarm</strong> &gt; <strong id="ALM-12012__b676620121104">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12012__image168221113135319" src="en-us_image_0000001532448250.png"></span> in the row containing the alarm, and view the name of the host for which the alarm is generated in <strong id="ALM-12012__b77671212506">Location</strong>.</span></li><li id="ALM-12012__li4741980517193"><span>Check whether the ntpd process is running on the node using the following method. Log in to the alarm node as user <strong id="ALM-12012__b3803747417193">root</strong> and run the <strong id="ALM-12012__b6113654817193">ps -ef | grep ntpd | grep -v grep</strong> command to check whether the command output contains the ntpd process. <span id="ALM-12012__text7780723164819"></span></span><p><ul class="subitemlist" id="ALM-12012__ul6492119017193"><li id="ALM-12012__li5311334217193">If yes, go to <a href="#ALM-12012__li3507541817193">24</a>.</li><li id="ALM-12012__li721346517193">If no, go to <a href="#ALM-12012__li797292017193">22</a>.</li></ul>
</p></li><li id="ALM-12012__li797292017193"><a name="ALM-12012__li797292017193"></a><a name="li797292017193"></a><span>Run the <strong id="ALM-12012__b2412506117193">service ntp start</strong> command (or the <strong id="ALM-12012__b1579896017193">service ntpd start</strong> command in Red Hat Enterprise Linux) to start the NTP service.</span></li><li id="ALM-12012__li1743906917193"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12012__ul3252697217193"><li id="ALM-12012__li464742117193">If yes, no further action is required.</li><li id="ALM-12012__li4089681017193">If no, go to <a href="#ALM-12012__li3507541817193">24</a>.</li></ul> </p></li><li id="ALM-12012__li797292017193"><a name="ALM-12012__li797292017193"></a><a name="li797292017193"></a><span>Run the <strong id="ALM-12012__b2412506117193">service ntp start</strong> command (or the <strong id="ALM-12012__b1579896017193">service ntpd start</strong> command in Red Hat Enterprise Linux) to start the NTP service.</span></li><li id="ALM-12012__li1743906917193"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12012__ul3252697217193"><li id="ALM-12012__li464742117193">If yes, no further action is required.</li><li id="ALM-12012__li4089681017193">If no, go to <a href="#ALM-12012__li3507541817193">24</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12012__p41015395171913"><strong id="ALM-12012__b35363904171920">Check whether the node can synchronize time properly with the NTP service on the active OMS node.</strong></p> <p class="tableheading" id="ALM-12012__p41015395171913"><strong id="ALM-12012__b35363904171920">Check whether the node can synchronize time properly with the NTP service on the active OMS node.</strong></p>
@ -154,7 +154,7 @@ host01:~ #</pre>
</p></li><li id="ALM-12012__li4052710217193"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12012__ul4938209917193"><li id="ALM-12012__li2575137817193">If yes, no further action is required.</li><li id="ALM-12012__li548689917193">If no, go to <a href="#ALM-12012__li3559109817193">38</a>.</li></ul> </p></li><li id="ALM-12012__li4052710217193"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12012__ul4938209917193"><li id="ALM-12012__li2575137817193">If yes, no further action is required.</li><li id="ALM-12012__li548689917193">If no, go to <a href="#ALM-12012__li3559109817193">38</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12012__p3192256717193"><strong id="ALM-12012__b26235450172135">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12012__p3192256717193"><strong id="ALM-12012__b26235450172135">Collect the fault information.</strong></p>
<ol start="38" id="ALM-12012__ol54944969172137"><li id="ALM-12012__li3559109817193"><a name="ALM-12012__li3559109817193"></a><a name="li3559109817193"></a><span>On FusionInsight Manager, choose <strong id="ALM-12012__b7101141212916">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12012__b1211119121996">Log</strong> &gt; <strong id="ALM-12012__b1411221216915">Download</strong>.</span></li><li id="ALM-12012__li5188443417193"><span>In the <strong id="ALM-12012__b27579580544738">Services</strong> area, select <strong id="ALM-12012__b114113779744738">NodeAgent</strong> and <strong id="ALM-12012__b19147618544738">OmmServer</strong>, and click <strong id="ALM-12012__b50779606044738">OK</strong>. Expand the <strong id="ALM-12012__b14507185512497">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12012__li6430672717193"><span>Click <span><img id="ALM-12012__image104601319175315" src="en-us_image_0000001532767474.png"></span> in the upper right corner, and set <strong id="ALM-12012__b192485503844738">Start Date</strong> and <strong id="ALM-12012__b150132047644738">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12012__b44594281844738">Download</strong>.</span></li><li id="ALM-12012__li4146238817193"><span>Contact <span id="ALM-12012__text693353931014">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="38" id="ALM-12012__ol54944969172137"><li id="ALM-12012__li3559109817193"><a name="ALM-12012__li3559109817193"></a><a name="li3559109817193"></a><span>On <span id="ALM-12012__text8335132174410">MRS</span> Manager, choose <strong id="ALM-12012__b7101141212916">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12012__b1211119121996">Log</strong> &gt; <strong id="ALM-12012__b1411221216915">Download</strong>.</span></li><li id="ALM-12012__li5188443417193"><span>In the <strong id="ALM-12012__b27579580544738">Services</strong> area, select <strong id="ALM-12012__b114113779744738">NodeAgent</strong> and <strong id="ALM-12012__b19147618544738">OmmServer</strong>, and click <strong id="ALM-12012__b50779606044738">OK</strong>. Expand the <strong id="ALM-12012__b14507185512497">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12012__li6430672717193"><span>Click <span><img id="ALM-12012__image104601319175315" src="en-us_image_0000001532767474.png"></span> in the upper right corner, and set <strong id="ALM-12012__b192485503844738">Start Date</strong> and <strong id="ALM-12012__b150132047644738">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12012__b44594281844738">Download</strong>.</span></li><li id="ALM-12012__li4146238817193"><span>Contact <span id="ALM-12012__text693353931014">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12012__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12012__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12012__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12012__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -69,12 +69,12 @@
</div> </div>
<div class="section" id="ALM-12014__s86d97a0503184bfd9d0e267312170d65"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12014__en-us_topic_0070543526_ul45207585"><li id="ALM-12014__en-us_topic_0070543526_li4215088">The hard disk is removed.</li><li id="ALM-12014__en-us_topic_0070543526_li37935797">The hard disk is offline, or a bad sector exists on the hard disk.</li></ul> <div class="section" id="ALM-12014__s86d97a0503184bfd9d0e267312170d65"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12014__en-us_topic_0070543526_ul45207585"><li id="ALM-12014__en-us_topic_0070543526_li4215088">The hard disk is removed.</li><li id="ALM-12014__en-us_topic_0070543526_li37935797">The hard disk is offline, or a bad sector exists on the hard disk.</li></ul>
</div> </div>
<div class="section" id="ALM-12014__sb1a1ee7b7a444d5dbe8388e9c9e8bba9"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-12014__ol43371064173421"><li id="ALM-12014__li30640494173421"><span>On FusionInsight Manager, click <strong id="ALM-12014__b18317580173421">O&amp;M &gt; Alarm &gt; Alarms</strong>, and click <span><img id="ALM-12014__image10408151910137" src="en-us_image_0000001532767638.png"></span> in the row where the alarm is located.</span></li><li id="ALM-12014__li51941841173421"><span>Obtain <strong id="ALM-12014__b65960965173421">HostName</strong>, <strong id="ALM-12014__b56777780173421">PartitionName</strong> and <strong id="ALM-12014__b41237977173421">DirName</strong> from <strong id="ALM-12014__b645062473115">Location</strong>.</span></li><li id="ALM-12014__li15983295173421"><span>Check whether the disk of <strong id="ALM-12014__b64823390173421">PartitionName</strong> on <strong id="ALM-12014__b46539606173421">HostName</strong> is inserted to the correct server slot.</span><p><ul class="subitemlist" id="ALM-12014__ul9232462173421"><li id="ALM-12014__li11611727173421">If yes, go to <a href="#ALM-12014__li9631929173421">4</a>.</li><li id="ALM-12014__li1025829173421">If no, go to <a href="#ALM-12014__li18162941173421">5</a>.</li></ul> <div class="section" id="ALM-12014__sb1a1ee7b7a444d5dbe8388e9c9e8bba9"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-12014__ol43371064173421"><li id="ALM-12014__li30640494173421"><span>On <span id="ALM-12014__text34789336432">MRS</span> Manager, click <strong id="ALM-12014__b18317580173421">O&amp;M &gt; Alarm &gt; Alarms</strong>, and click <span><img id="ALM-12014__image10408151910137" src="en-us_image_0000001532767638.png"></span> in the row where the alarm is located.</span></li><li id="ALM-12014__li51941841173421"><span>Obtain <strong id="ALM-12014__b65960965173421">HostName</strong>, <strong id="ALM-12014__b56777780173421">PartitionName</strong> and <strong id="ALM-12014__b41237977173421">DirName</strong> from <strong id="ALM-12014__b645062473115">Location</strong>.</span></li><li id="ALM-12014__li15983295173421"><span>Check whether the disk of <strong id="ALM-12014__b64823390173421">PartitionName</strong> on <strong id="ALM-12014__b46539606173421">HostName</strong> is inserted to the correct server slot.</span><p><ul class="subitemlist" id="ALM-12014__ul9232462173421"><li id="ALM-12014__li11611727173421">If yes, go to <a href="#ALM-12014__li9631929173421">4</a>.</li><li id="ALM-12014__li1025829173421">If no, go to <a href="#ALM-12014__li18162941173421">5</a>.</li></ul>
</p></li><li id="ALM-12014__li9631929173421"><a name="ALM-12014__li9631929173421"></a><a name="li9631929173421"></a><span>Contact hardware engineers to remove the faulty disk.</span></li><li id="ALM-12014__li18162941173421"><a name="ALM-12014__li18162941173421"></a><a name="li18162941173421"></a><span>Log in to the <strong id="ALM-12014__b19578501173421">HostName</strong> node where an alarm is reported and check whether there is a line containing <strong id="ALM-12014__b41988789173421">DirName</strong> in the <strong id="ALM-12014__b42354785173421">/etc/fstab</strong> file as user <strong id="ALM-12014__b37365710490">root</strong>. <span id="ALM-12014__text43649449460"></span></span><p><ul class="subitemlist" id="ALM-12014__ul61670428173421"><li id="ALM-12014__li8185528173421">If yes, go to <a href="#ALM-12014__li20338192173421">6</a>.</li><li id="ALM-12014__li59048052173421">If no, go to <a href="#ALM-12014__li48826004173421">7</a>.</li></ul> </p></li><li id="ALM-12014__li9631929173421"><a name="ALM-12014__li9631929173421"></a><a name="li9631929173421"></a><span>Contact hardware engineers to remove the faulty disk.</span></li><li id="ALM-12014__li18162941173421"><a name="ALM-12014__li18162941173421"></a><a name="li18162941173421"></a><span>Log in to the <strong id="ALM-12014__b19578501173421">HostName</strong> node where an alarm is reported and check whether there is a line containing <strong id="ALM-12014__b41988789173421">DirName</strong> in the <strong id="ALM-12014__b42354785173421">/etc/fstab</strong> file as user <strong id="ALM-12014__b37365710490">root</strong>. <span id="ALM-12014__text43649449460"></span></span><p><ul class="subitemlist" id="ALM-12014__ul61670428173421"><li id="ALM-12014__li8185528173421">If yes, go to <a href="#ALM-12014__li20338192173421">6</a>.</li><li id="ALM-12014__li59048052173421">If no, go to <a href="#ALM-12014__li48826004173421">7</a>.</li></ul>
</p></li><li id="ALM-12014__li20338192173421"><a name="ALM-12014__li20338192173421"></a><a name="li20338192173421"></a><span>Run the <strong id="ALM-12014__b29248746173421">vi /etc/fstab</strong> command to edit the file and delete the line containing <strong id="ALM-12014__b61912122173421">DirName</strong>.</span></li><li id="ALM-12014__li48826004173421"><a name="ALM-12014__li48826004173421"></a><a name="li48826004173421"></a><span>Contact hardware engineers to insert a new disk. For details, see the hardware product document of the relevant model. If the faulty disk is in a RAID group, configure the RAID group. For details, see the configuration methods of the relevant RAID controller card.</span></li><li id="ALM-12014__li55753407173421"><span>Wait 20 to 30 minutes (The disk size determines the waiting time), and run the <strong id="ALM-12014__b36780855173421">mount</strong> command to check whether the disk has been mounted to the <strong id="ALM-12014__b62592242173421">DirName</strong> directory.</span><p><ul class="subitemlist" id="ALM-12014__ul28564444173421"><li id="ALM-12014__li26459270173421">If yes, manually clear the alarm. No further operation is required.</li><li id="ALM-12014__li62826150173421">If no, go to <a href="#ALM-12014__li1607193817587">9</a>.</li></ul> </p></li><li id="ALM-12014__li20338192173421"><a name="ALM-12014__li20338192173421"></a><a name="li20338192173421"></a><span>Run the <strong id="ALM-12014__b29248746173421">vi /etc/fstab</strong> command to edit the file and delete the line containing <strong id="ALM-12014__b61912122173421">DirName</strong>.</span></li><li id="ALM-12014__li48826004173421"><a name="ALM-12014__li48826004173421"></a><a name="li48826004173421"></a><span>Contact hardware engineers to insert a new disk. For details, see the hardware product document of the relevant model. If the faulty disk is in a RAID group, configure the RAID group. For details, see the configuration methods of the relevant RAID controller card.</span></li><li id="ALM-12014__li55753407173421"><span>Wait 20 to 30 minutes (The disk size determines the waiting time), and run the <strong id="ALM-12014__b36780855173421">mount</strong> command to check whether the disk has been mounted to the <strong id="ALM-12014__b62592242173421">DirName</strong> directory.</span><p><ul class="subitemlist" id="ALM-12014__ul28564444173421"><li id="ALM-12014__li26459270173421">If yes, manually clear the alarm. No further operation is required.</li><li id="ALM-12014__li62826150173421">If no, go to <a href="#ALM-12014__li1607193817587">9</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12014__p0392542185819"><strong id="ALM-12014__b59246063204559">Collect fault information.</strong></p> <p id="ALM-12014__p0392542185819"><strong id="ALM-12014__b59246063204559">Collect fault information.</strong></p>
<ol start="9" id="ALM-12014__ol36071038115815"><li id="ALM-12014__li1607193817587"><a name="ALM-12014__li1607193817587"></a><a name="li1607193817587"></a><span>On the FusionInsight Manager, choose <strong id="ALM-12014__b87862548435">O&amp;M</strong> &gt; <strong id="ALM-12014__b11281153164820">Log &gt; Download</strong>.</span></li><li id="ALM-12014__li1560793895812"><span>Select the <strong id="ALM-12014__b486612581809">OmmServer</strong> from the Services drop-down list and click <strong id="ALM-12014__b20607238175815">OK</strong>.</span></li><li id="ALM-12014__li660723815584"><span>Set Start Date for log collection to 10 minutes ahead of the alarm generation time and End Date to 10 minutes behind the alarm generation time and click <strong id="ALM-12014__b15452018112">Download</strong>.</span></li><li id="ALM-12014__li495644512588"><span>Contact the <span id="ALM-12014__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="9" id="ALM-12014__ol36071038115815"><li id="ALM-12014__li1607193817587"><a name="ALM-12014__li1607193817587"></a><a name="li1607193817587"></a><span>On the <span id="ALM-12014__text314615114416">MRS</span> Manager, choose <strong id="ALM-12014__b87862548435">O&amp;M</strong> &gt; <strong id="ALM-12014__b11281153164820">Log &gt; Download</strong>.</span></li><li id="ALM-12014__li1560793895812"><span>Select the <strong id="ALM-12014__b486612581809">OmmServer</strong> from the Services drop-down list and click <strong id="ALM-12014__b20607238175815">OK</strong>.</span></li><li id="ALM-12014__li660723815584"><span>Set Start Date for log collection to 10 minutes ahead of the alarm generation time and End Date to 10 minutes behind the alarm generation time and click <strong id="ALM-12014__b15452018112">Download</strong>.</span></li><li id="ALM-12014__li495644512588"><span>Contact the <span id="ALM-12014__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12014__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12014__p697913319401">After the fault is rectified, the system does not automatically clear this alarm, and you need to manually clear the alarm.</p> <div class="section" id="ALM-12014__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12014__p697913319401">After the fault is rectified, the system does not automatically clear this alarm, and you need to manually clear the alarm.</p>
</div> </div>

View File

@ -69,7 +69,7 @@
</div> </div>
<div class="section" id="ALM-12015__sc5211f0c333e491987141617bb9cc5d2"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12015__en-us_topic_0070543537_p5850279">The hard disk is faulty, for example, a bad sector exists.</p> <div class="section" id="ALM-12015__sc5211f0c333e491987141617bb9cc5d2"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12015__en-us_topic_0070543537_p5850279">The hard disk is faulty, for example, a bad sector exists.</p>
</div> </div>
<div class="section" id="ALM-12015__s2082e61748a44109ae22b65edd6caf4f"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-12015__en-us_topic_0070543537_ol4110613"><li id="ALM-12015__en-us_topic_0070543537_li36995518"><span>On FusionInsight Manager, choose <strong id="ALM-12015__b87862548435">O&amp;M</strong> &gt; <strong id="ALM-12015__b10296131615319">Alarm &gt; Alarms</strong>, click<strong id="ALM-12015__b142969161035"> </strong><span><img id="ALM-12015__image10408151910137" src="en-us_image_0000001582807717.png"></span> in the row where the alarm is located.</span></li><li id="ALM-12015__en-us_topic_0070543537_li64524211"><span>Obtain <strong id="ALM-12015__en-us_topic_0070543537_b59078569">HostName</strong> and <strong id="ALM-12015__en-us_topic_0070543537_b61945077">PartitionName</strong> from <strong id="ALM-12015__b196121357184515">Location</strong>. <strong id="ALM-12015__en-us_topic_0070543537_b51495331">HostName</strong> is the node where the alarm is reported, and <strong id="ALM-12015__en-us_topic_0070543537_b60804799">PartitionName</strong> is the partition of the faulty disk.</span></li><li id="ALM-12015__en-us_topic_0070543537_li10372286"><span>Contact hardware engineers to check whether the disk is faulty. If the disk is faulty, remove it from the server.</span></li><li id="ALM-12015__en-us_topic_0070543537_li26241711"><span>After the disk is removed, alarm <strong id="ALM-12015__en-us_topic_0070543537_b34848813">ALM-12014 Partition Lost</strong> is reported. Handle the alarm. For details, see <a href="ALM-12014.html">ALM-12014 Partition Lost</a>. After the alarm <strong id="ALM-12015__en-us_topic_0070543537_b4181593">ALM-12014 Partition Lost</strong> is cleared, alarm <strong id="ALM-12015__en-us_topic_0070543537_b37634337">ALM-12015 Partition Filesystem Readonly</strong> is automatically cleared.</span></li></ol> <div class="section" id="ALM-12015__s2082e61748a44109ae22b65edd6caf4f"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-12015__en-us_topic_0070543537_ol4110613"><li id="ALM-12015__en-us_topic_0070543537_li36995518"><span>On <span id="ALM-12015__text34789336432">MRS</span> Manager, choose <strong id="ALM-12015__b87862548435">O&amp;M</strong> &gt; <strong id="ALM-12015__b10296131615319">Alarm &gt; Alarms</strong>, click<strong id="ALM-12015__b142969161035"> </strong><span><img id="ALM-12015__image10408151910137" src="en-us_image_0000001582807717.png"></span> in the row where the alarm is located.</span></li><li id="ALM-12015__en-us_topic_0070543537_li64524211"><span>Obtain <strong id="ALM-12015__en-us_topic_0070543537_b59078569">HostName</strong> and <strong id="ALM-12015__en-us_topic_0070543537_b61945077">PartitionName</strong> from <strong id="ALM-12015__b196121357184515">Location</strong>. <strong id="ALM-12015__en-us_topic_0070543537_b51495331">HostName</strong> is the node where the alarm is reported, and <strong id="ALM-12015__en-us_topic_0070543537_b60804799">PartitionName</strong> is the partition of the faulty disk.</span></li><li id="ALM-12015__en-us_topic_0070543537_li10372286"><span>Contact hardware engineers to check whether the disk is faulty. If the disk is faulty, remove it from the server.</span></li><li id="ALM-12015__en-us_topic_0070543537_li26241711"><span>After the disk is removed, alarm <strong id="ALM-12015__en-us_topic_0070543537_b34848813">ALM-12014 Partition Lost</strong> is reported. Handle the alarm. For details, see <a href="ALM-12014.html">ALM-12014 Partition Lost</a>. After the alarm <strong id="ALM-12015__en-us_topic_0070543537_b4181593">ALM-12014 Partition Lost</strong> is cleared, alarm <strong id="ALM-12015__en-us_topic_0070543537_b37634337">ALM-12015 Partition Filesystem Readonly</strong> is automatically cleared.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12015__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12015__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12015__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12015__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -65,7 +65,7 @@
<div class="section" id="ALM-12016__s23c7881992f44efb95893912e391c0c0"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12016__en-us_topic_0070543548_ul1202807"><li id="ALM-12016__en-us_topic_0070543548_li10825264">The alarm threshold or alarm smoothing times are incorrect.</li><li id="ALM-12016__en-us_topic_0070543548_li30318520">CPU configuration cannot meet service requirements. The CPU usage reaches the upper limit.</li></ul> <div class="section" id="ALM-12016__s23c7881992f44efb95893912e391c0c0"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12016__en-us_topic_0070543548_ul1202807"><li id="ALM-12016__en-us_topic_0070543548_li10825264">The alarm threshold or alarm smoothing times are incorrect.</li><li id="ALM-12016__en-us_topic_0070543548_li30318520">CPU configuration cannot meet service requirements. The CPU usage reaches the upper limit.</li></ul>
</div> </div>
<div class="section" id="ALM-12016__s43e4003b37294857a410ff23763ad2ef"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12016__en-us_topic_0070543548_p39881087"><strong id="ALM-12016__b58386659173930">Check whether the alarm threshold or alarm <strong id="ALM-12016__b18142175243719">Trigger Count</strong> are correct.</strong></p> <div class="section" id="ALM-12016__s43e4003b37294857a410ff23763ad2ef"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12016__en-us_topic_0070543548_p39881087"><strong id="ALM-12016__b58386659173930">Check whether the alarm threshold or alarm <strong id="ALM-12016__b18142175243719">Trigger Count</strong> are correct.</strong></p>
<ol id="ALM-12016__ol1362745417400"><li id="ALM-12016__li24816170173938"><span>Change the alarm threshold and alarm <strong id="ALM-12016__b13281711203813">Trigger Count</strong> based on CPU usage.</span><p><p class="litext" id="ALM-12016__p6523306173938">On FusionInsight Manager, choose <strong id="ALM-12016__b73164535166">O&amp;M</strong> &gt; <strong id="ALM-12016__b1366935516171">Alarm</strong> &gt; <strong id="ALM-12016__b14318131145112">Thresholds &gt; </strong><em id="ALM-12016__i193217112515">Name of the desired cluster</em> &gt; <strong id="ALM-12016__b16357675173938">Host</strong> &gt; <strong id="ALM-12016__b13001354173938">CPU</strong> &gt; <strong id="ALM-12016__b49903330173938">Host CPU Usage</strong> and change the alarm smoothing times based on CPU usage, as shown in <a href="#ALM-12016__fig42676420173938">Figure 1</a>.</p> <ol id="ALM-12016__ol1362745417400"><li id="ALM-12016__li24816170173938"><span>Change the alarm threshold and alarm <strong id="ALM-12016__b13281711203813">Trigger Count</strong> based on CPU usage.</span><p><p class="litext" id="ALM-12016__p6523306173938">On <span id="ALM-12016__text34789336432">MRS</span> Manager, choose <strong id="ALM-12016__b73164535166">O&amp;M</strong> &gt; <strong id="ALM-12016__b1366935516171">Alarm</strong> &gt; <strong id="ALM-12016__b14318131145112">Thresholds &gt; </strong><em id="ALM-12016__i193217112515">Name of the desired cluster</em> &gt; <strong id="ALM-12016__b16357675173938">Host</strong> &gt; <strong id="ALM-12016__b13001354173938">CPU</strong> &gt; <strong id="ALM-12016__b49903330173938">Host CPU Usage</strong> and change the alarm smoothing times based on CPU usage, as shown in <a href="#ALM-12016__fig42676420173938">Figure 1</a>.</p>
<div class="note" id="ALM-12016__note57869743173938"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-12016__p58625754173938">This option defines the alarm check phase. <strong id="ALM-12016__b74612137375">Trigger Count</strong> indicates the alarm check threshold. An alarm is generated when the number of check times exceeds the threshold.</p> <div class="note" id="ALM-12016__note57869743173938"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-12016__p58625754173938">This option defines the alarm check phase. <strong id="ALM-12016__b74612137375">Trigger Count</strong> indicates the alarm check threshold. An alarm is generated when the number of check times exceeds the threshold.</p>
</div></div> </div></div>
<div class="fignone" id="ALM-12016__fig42676420173938"><a name="ALM-12016__fig42676420173938"></a><a name="fig42676420173938"></a><span class="figcap"><b>Figure 1 </b>Setting alarm smoothing times</span><br><span><img id="ALM-12016__image122911304588" src="en-us_image_0000001583087533.png"></span></div> <div class="fignone" id="ALM-12016__fig42676420173938"><a name="ALM-12016__fig42676420173938"></a><a name="fig42676420173938"></a><span class="figcap"><b>Figure 1 </b>Setting alarm smoothing times</span><br><span><img id="ALM-12016__image122911304588" src="en-us_image_0000001583087533.png"></span></div>
@ -74,10 +74,10 @@
</p></li><li id="ALM-12016__li29621482173938"><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12016__ul12793264173938"><li id="ALM-12016__li22018946173938">If yes, no further action is required.</li><li id="ALM-12016__li38704176173938">If no, go to <a href="#ALM-12016__li65266749173938">3</a>.</li></ul> </p></li><li id="ALM-12016__li29621482173938"><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12016__ul12793264173938"><li id="ALM-12016__li22018946173938">If yes, no further action is required.</li><li id="ALM-12016__li38704176173938">If no, go to <a href="#ALM-12016__li65266749173938">3</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12016__p48030518173938"><strong id="ALM-12016__b1326250617406">Check whether the CPU usage reaches the upper limit.</strong></p> <p class="tableheading" id="ALM-12016__p48030518173938"><strong id="ALM-12016__b1326250617406">Check whether the CPU usage reaches the upper limit.</strong></p>
<ol start="3" id="ALM-12016__ol44225396174015"><li id="ALM-12016__li65266749173938"><a name="ALM-12016__li65266749173938"></a><a name="li65266749173938"></a><span>In the alarm list on FusionInsight Manager, click <span><img id="ALM-12016__image168221113135319" src="en-us_image_0000001582927773.png"></span> in the row where the alarm is located to view the alarm host address in the alarm details.</span></li><li id="ALM-12016__li52115308173938"><span>On the <strong id="ALM-12016__b51685932101729">Hosts</strong> page, click the node on which the alarm is reported.</span></li><li id="ALM-12016__li60590444173938"><span>View the CPU usage for 5 minutes. If the CPU usage exceeds the threshold for multiple times, contact the system administrator to add more CPUs.</span></li><li id="ALM-12016__li38620506173938"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12016__ul30302958173938"><li id="ALM-12016__li8878949173938">If yes, no further action is required.</li><li id="ALM-12016__li48106238173938">If no, go to <a href="#ALM-12016__li35735451173938">7</a>.</li></ul> <ol start="3" id="ALM-12016__ol44225396174015"><li id="ALM-12016__li65266749173938"><a name="ALM-12016__li65266749173938"></a><a name="li65266749173938"></a><span>In the alarm list on <span id="ALM-12016__text1055517904410">MRS</span> Manager, click <span><img id="ALM-12016__image168221113135319" src="en-us_image_0000001582927773.png"></span> in the row where the alarm is located to view the alarm host address in the alarm details.</span></li><li id="ALM-12016__li52115308173938"><span>On the <strong id="ALM-12016__b51685932101729">Hosts</strong> page, click the node on which the alarm is reported.</span></li><li id="ALM-12016__li60590444173938"><span>View the CPU usage for 5 minutes. If the CPU usage exceeds the threshold for multiple times, contact the system administrator to add more CPUs.</span></li><li id="ALM-12016__li38620506173938"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12016__ul30302958173938"><li id="ALM-12016__li8878949173938">If yes, no further action is required.</li><li id="ALM-12016__li48106238173938">If no, go to <a href="#ALM-12016__li35735451173938">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12016__p51491657174016"><strong id="ALM-12016__b42921091174020">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12016__p51491657174016"><strong id="ALM-12016__b42921091174020">Collect fault information.</strong></p>
<ol start="7" id="ALM-12016__ol57964469174025"><li id="ALM-12016__li35735451173938"><a name="ALM-12016__li35735451173938"></a><a name="li35735451173938"></a><span>On the FusionInsight Manager in the active cluster, choose <strong id="ALM-12016__b12040241173938">O&amp;M</strong> &gt; <strong id="ALM-12016__b41253307173938">Log &gt; Download</strong>.</span></li><li id="ALM-12016__li49036890173938"><span>Select <strong id="ALM-12016__b53183609173938">OmmServer</strong> from the <strong id="ALM-12016__b477010478910">Service</strong> and click <strong id="ALM-12016__b1577112471895">OK</strong>.</span></li><li id="ALM-12016__li11141594173938"><span>Set <strong id="ALM-12016__b38678826173938">Start Date</strong> for log collection to 10 minutes ahead of the alarm generation time and <strong id="ALM-12016__b12565117173938">End Date</strong> to 10 minutes behind the alarm generation time in <strong id="ALM-12016__b20155417195615">Time Range</strong> and click <strong id="ALM-12016__b45977197173938">Download</strong>.</span></li><li id="ALM-12016__li495644512588"><span>Contact the <span id="ALM-12016__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="7" id="ALM-12016__ol57964469174025"><li id="ALM-12016__li35735451173938"><a name="ALM-12016__li35735451173938"></a><a name="li35735451173938"></a><span>On the <span id="ALM-12016__text3873171014445">MRS</span> Manager in the active cluster, choose <strong id="ALM-12016__b12040241173938">O&amp;M</strong> &gt; <strong id="ALM-12016__b41253307173938">Log &gt; Download</strong>.</span></li><li id="ALM-12016__li49036890173938"><span>Select <strong id="ALM-12016__b53183609173938">OmmServer</strong> from the <strong id="ALM-12016__b477010478910">Service</strong> and click <strong id="ALM-12016__b1577112471895">OK</strong>.</span></li><li id="ALM-12016__li11141594173938"><span>Set <strong id="ALM-12016__b38678826173938">Start Date</strong> for log collection to 10 minutes ahead of the alarm generation time and <strong id="ALM-12016__b12565117173938">End Date</strong> to 10 minutes behind the alarm generation time in <strong id="ALM-12016__b20155417195615">Time Range</strong> and click <strong id="ALM-12016__b45977197173938">Download</strong>.</span></li><li id="ALM-12016__li495644512588"><span>Contact the <span id="ALM-12016__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12016__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12016__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12016__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12016__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -70,12 +70,12 @@
<div class="section" id="ALM-12017__sd53668685806495fb8d456ba9e2c2c11"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12017__en-us_topic_0070543559_ul27395440"><li id="ALM-12017__en-us_topic_0070543559_li45232374">The alarm threshold is incorrect.</li><li id="ALM-12017__en-us_topic_0070543559_li4438190">Disk configuration of the server cannot meet service requirements.</li></ul> <div class="section" id="ALM-12017__sd53668685806495fb8d456ba9e2c2c11"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12017__en-us_topic_0070543559_ul27395440"><li id="ALM-12017__en-us_topic_0070543559_li45232374">The alarm threshold is incorrect.</li><li id="ALM-12017__en-us_topic_0070543559_li4438190">Disk configuration of the server cannot meet service requirements.</li></ul>
</div> </div>
<div class="section" id="ALM-12017__s6fd2395d167c4db4814624ea702a37ac"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12017__en-us_topic_0070543559_p23949084"><strong id="ALM-12017__b457009885739">Check whether the alarm threshold is appropriate.</strong></p> <div class="section" id="ALM-12017__s6fd2395d167c4db4814624ea702a37ac"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12017__en-us_topic_0070543559_p23949084"><strong id="ALM-12017__b457009885739">Check whether the alarm threshold is appropriate.</strong></p>
<ol id="ALM-12017__ol229057318582"><li id="ALM-12017__li3269990385745"><span>Log in to FusionInsight Manager, choose <strong id="ALM-12017__b126241333219">O&amp;M</strong> &gt; <strong id="ALM-12017__b156241435323">Alarm &gt;</strong> <strong id="ALM-12017__b1562412314328">Thresholds</strong><strong id="ALM-12017__b1962413373216"> &gt; </strong><em id="ALM-12017__i1162415315324">Name of the desired cluster</em> &gt; <strong id="ALM-12017__b962416314323">Host</strong> &gt; <strong id="ALM-12017__b106241931323">Disk</strong> &gt; <strong id="ALM-12017__b4624163203210">Disk Usage</strong> and check whether the threshold (configurable, 90% by default) is appropriate.</span><p><ul class="subitemlist" id="ALM-12017__ul1854640385745"><li id="ALM-12017__li1687169885745">If yes, go to <a href="#ALM-12017__li1280611085745">2</a>.</li><li id="ALM-12017__li2443033285745">If no, go to <a href="#ALM-12017__li2782670585745">4</a>.</li></ul> <ol id="ALM-12017__ol229057318582"><li id="ALM-12017__li3269990385745"><span>Log in to <span id="ALM-12017__text34789336432">MRS</span> Manager, choose <strong id="ALM-12017__b126241333219">O&amp;M</strong> &gt; <strong id="ALM-12017__b156241435323">Alarm &gt;</strong> <strong id="ALM-12017__b1562412314328">Thresholds</strong><strong id="ALM-12017__b1962413373216"> &gt; </strong><em id="ALM-12017__i1162415315324">Name of the desired cluster</em> &gt; <strong id="ALM-12017__b962416314323">Host</strong> &gt; <strong id="ALM-12017__b106241931323">Disk</strong> &gt; <strong id="ALM-12017__b4624163203210">Disk Usage</strong> and check whether the threshold (configurable, 90% by default) is appropriate.</span><p><ul class="subitemlist" id="ALM-12017__ul1854640385745"><li id="ALM-12017__li1687169885745">If yes, go to <a href="#ALM-12017__li1280611085745">2</a>.</li><li id="ALM-12017__li2443033285745">If no, go to <a href="#ALM-12017__li2782670585745">4</a>.</li></ul>
</p></li><li id="ALM-12017__li1280611085745"><a name="ALM-12017__li1280611085745"></a><a name="li1280611085745"></a><span>Choose <strong id="ALM-12017__b2586367385745">O&amp;M</strong> &gt; <strong id="ALM-12017__b1379910713499">Alarm &gt;</strong> <strong id="ALM-12017__b2887114614242">Thresholds</strong><strong id="ALM-12017__b29831221166"> &gt; </strong><em id="ALM-12017__i9983102101619">Name of the desired cluster</em> &gt; <strong id="ALM-12017__b6413578985745">Host</strong> &gt; <strong id="ALM-12017__b4035119385745">Disk</strong> &gt; <strong id="ALM-12017__b2761642585745">Disk Usage</strong> and click <strong id="ALM-12017__b6659180133310">Modify</strong> in the <strong id="ALM-12017__b1374719315332">Operation</strong> column to change the alarm threshold based on site requirements. As shown in <a href="#ALM-12017__fig6063892885745">Figure 1</a>:</span><p><div class="fignone" id="ALM-12017__fig6063892885745"><a name="ALM-12017__fig6063892885745"></a><a name="fig6063892885745"></a><span class="figcap"><b>Figure 1 </b>Setting an alarm threshold</span><br><span><img id="ALM-12017__image1615410501365" src="en-us_image_0000001582927861.png"></span></div> </p></li><li id="ALM-12017__li1280611085745"><a name="ALM-12017__li1280611085745"></a><a name="li1280611085745"></a><span>Choose <strong id="ALM-12017__b2586367385745">O&amp;M</strong> &gt; <strong id="ALM-12017__b1379910713499">Alarm &gt;</strong> <strong id="ALM-12017__b2887114614242">Thresholds</strong><strong id="ALM-12017__b29831221166"> &gt; </strong><em id="ALM-12017__i9983102101619">Name of the desired cluster</em> &gt; <strong id="ALM-12017__b6413578985745">Host</strong> &gt; <strong id="ALM-12017__b4035119385745">Disk</strong> &gt; <strong id="ALM-12017__b2761642585745">Disk Usage</strong> and click <strong id="ALM-12017__b6659180133310">Modify</strong> in the <strong id="ALM-12017__b1374719315332">Operation</strong> column to change the alarm threshold based on site requirements. As shown in <a href="#ALM-12017__fig6063892885745">Figure 1</a>:</span><p><div class="fignone" id="ALM-12017__fig6063892885745"><a name="ALM-12017__fig6063892885745"></a><a name="fig6063892885745"></a><span class="figcap"><b>Figure 1 </b>Setting an alarm threshold</span><br><span><img id="ALM-12017__image1615410501365" src="en-us_image_0000001582927861.png"></span></div>
</p></li><li id="ALM-12017__li4783109885745"><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12017__ul59050785745"><li id="ALM-12017__li4814612685745">If yes, no further action is required.</li><li id="ALM-12017__li752215285745">If no, go to <a href="#ALM-12017__li2782670585745">4</a>.</li></ul> </p></li><li id="ALM-12017__li4783109885745"><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12017__ul59050785745"><li id="ALM-12017__li4814612685745">If yes, no further action is required.</li><li id="ALM-12017__li752215285745">If no, go to <a href="#ALM-12017__li2782670585745">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12017__p531456685745"><strong id="ALM-12017__b98862278588">Check whether the disk usage reaches the upper limit.</strong></p> <p class="tableheading" id="ALM-12017__p531456685745"><strong id="ALM-12017__b98862278588">Check whether the disk usage reaches the upper limit.</strong></p>
<ol start="4" id="ALM-12017__ol1005390085829"><li id="ALM-12017__li2782670585745"><a name="ALM-12017__li2782670585745"></a><a name="li2782670585745"></a><span>In the alarm list on FusionInsight Manager, click <span><img id="ALM-12017__image168221113135319" src="en-us_image_0000001582807909.png"></span> in the row where the alarm is located to view the alarm host name and disk partition information in the alarm details.</span></li><li id="ALM-12017__li3937060885745"><span>Log in to the node where the alarm is generated as user <strong id="ALM-12017__b4911375485745">root</strong>. <span id="ALM-12017__text43649449460"></span></span></li><li id="ALM-12017__li1529764085745"><span>Run the <strong id="ALM-12017__b5391142133919">df -lmPT | awk '$2 != "iso9660"' | grep '^/dev/' | awk '{"readlink -m "$1 | getline real }{$1=real; print $0}' | sort -u -k 1,1</strong> command to check the system disk partition usage. Check whether the disk is mounted to the following directories based on the disk partition name obtained in <a href="#ALM-12017__li2782670585745">4</a>: <strong id="ALM-12017__b4568855685745">/</strong>, <strong id="ALM-12017__b2096079285745">/opt</strong>, <strong id="ALM-12017__b5442940785745">/tmp</strong>, <strong id="ALM-12017__b2010261785745">/var</strong>, <strong id="ALM-12017__b4670583385745">/var/log</strong>, and <strong id="ALM-12017__b2507614885745">/srv/BigData</strong>(can be customized).</span><p><ul class="subitemlist" id="ALM-12017__ul3152589985745"><li id="ALM-12017__li1790212085745">If yes, the disk is a system disk. Then go to <a href="#ALM-12017__li6170195385745">10</a>.</li><li id="ALM-12017__li4078557985745">If no, the disk is not a system disk. Then go to <a href="#ALM-12017__li1190839985745">7</a>.</li></ul> <ol start="4" id="ALM-12017__ol1005390085829"><li id="ALM-12017__li2782670585745"><a name="ALM-12017__li2782670585745"></a><a name="li2782670585745"></a><span>In the alarm list on <span id="ALM-12017__text1747921311443">MRS</span> Manager, click <span><img id="ALM-12017__image168221113135319" src="en-us_image_0000001582807909.png"></span> in the row where the alarm is located to view the alarm host name and disk partition information in the alarm details.</span></li><li id="ALM-12017__li3937060885745"><span>Log in to the node where the alarm is generated as user <strong id="ALM-12017__b4911375485745">root</strong>. <span id="ALM-12017__text43649449460"></span></span></li><li id="ALM-12017__li1529764085745"><span>Run the <strong id="ALM-12017__b5391142133919">df -lmPT | awk '$2 != "iso9660"' | grep '^/dev/' | awk '{"readlink -m "$1 | getline real }{$1=real; print $0}' | sort -u -k 1,1</strong> command to check the system disk partition usage. Check whether the disk is mounted to the following directories based on the disk partition name obtained in <a href="#ALM-12017__li2782670585745">4</a>: <strong id="ALM-12017__b4568855685745">/</strong>, <strong id="ALM-12017__b2096079285745">/opt</strong>, <strong id="ALM-12017__b5442940785745">/tmp</strong>, <strong id="ALM-12017__b2010261785745">/var</strong>, <strong id="ALM-12017__b4670583385745">/var/log</strong>, and <strong id="ALM-12017__b2507614885745">/srv/BigData</strong>(can be customized).</span><p><ul class="subitemlist" id="ALM-12017__ul3152589985745"><li id="ALM-12017__li1790212085745">If yes, the disk is a system disk. Then go to <a href="#ALM-12017__li6170195385745">10</a>.</li><li id="ALM-12017__li4078557985745">If no, the disk is not a system disk. Then go to <a href="#ALM-12017__li1190839985745">7</a>.</li></ul>
</p></li><li id="ALM-12017__li1190839985745"><a name="ALM-12017__li1190839985745"></a><a name="li1190839985745"></a><span>Run the <strong id="ALM-12017__b10661194925219">df -lmPT | awk '$2 != "iso9660"' | grep '^/dev/' | awk '{"readlink -m "$1 | getline real }{$1=real; print $0}' | sort -u -k 1,1</strong> command to check the system disk partition usage. Determine the role of the disk based on the disk partition name obtained in <a href="#ALM-12017__li2782670585745">4</a>.</span></li><li id="ALM-12017__li11884059152614"><span>Check the disk service.</span><p><div class="p" id="ALM-12017__p0769162644910">In <span id="ALM-12017__text13624174411515">MRS</span>, check whether the disk service is HDFS, Yarn, Kafka, Supervisor.<ul id="ALM-12017__ul148852372297"><li id="ALM-12017__li10740174317299">If yes, adjust the capacity. Then go to <a href="#ALM-12017__li1354951085745">9</a>.</li><li id="ALM-12017__li1159152152914">If no, go to <a href="#ALM-12017__li1359113885745">12</a>.</li></ul> </p></li><li id="ALM-12017__li1190839985745"><a name="ALM-12017__li1190839985745"></a><a name="li1190839985745"></a><span>Run the <strong id="ALM-12017__b10661194925219">df -lmPT | awk '$2 != "iso9660"' | grep '^/dev/' | awk '{"readlink -m "$1 | getline real }{$1=real; print $0}' | sort -u -k 1,1</strong> command to check the system disk partition usage. Determine the role of the disk based on the disk partition name obtained in <a href="#ALM-12017__li2782670585745">4</a>.</span></li><li id="ALM-12017__li11884059152614"><span>Check the disk service.</span><p><div class="p" id="ALM-12017__p0769162644910">In <span id="ALM-12017__text13624174411515">MRS</span>, check whether the disk service is HDFS, Yarn, Kafka, Supervisor.<ul id="ALM-12017__ul148852372297"><li id="ALM-12017__li10740174317299">If yes, adjust the capacity. Then go to <a href="#ALM-12017__li1354951085745">9</a>.</li><li id="ALM-12017__li1159152152914">If no, go to <a href="#ALM-12017__li1359113885745">12</a>.</li></ul>
</div> </div>
</p></li><li id="ALM-12017__li1354951085745"><a name="ALM-12017__li1354951085745"></a><a name="li1354951085745"></a><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12017__ul150550185745"><li id="ALM-12017__li4676654185745">If yes, no further action is required.</li><li id="ALM-12017__li2999343985745">If no, go to <a href="#ALM-12017__li1359113885745">12</a>.</li></ul> </p></li><li id="ALM-12017__li1354951085745"><a name="ALM-12017__li1354951085745"></a><a name="li1354951085745"></a><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12017__ul150550185745"><li id="ALM-12017__li4676654185745">If yes, no further action is required.</li><li id="ALM-12017__li2999343985745">If no, go to <a href="#ALM-12017__li1359113885745">12</a>.</li></ul>
@ -84,7 +84,7 @@
</p></li><li id="ALM-12017__li1359113885745"><a name="ALM-12017__li1359113885745"></a><a name="li1359113885745"></a><span>Contact the system administrator to expand the disk capacity.</span></li><li id="ALM-12017__li2833807185745"><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12017__ul5088862685745"><li id="ALM-12017__li5521138285745">If yes, no further action is required.</li><li id="ALM-12017__li4293699485745">If no, go to <a href="#ALM-12017__li5603307085745">14</a>.</li></ul> </p></li><li id="ALM-12017__li1359113885745"><a name="ALM-12017__li1359113885745"></a><a name="li1359113885745"></a><span>Contact the system administrator to expand the disk capacity.</span></li><li id="ALM-12017__li2833807185745"><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12017__ul5088862685745"><li id="ALM-12017__li5521138285745">If yes, no further action is required.</li><li id="ALM-12017__li4293699485745">If no, go to <a href="#ALM-12017__li5603307085745">14</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12017__p5534445785745"><strong id="ALM-12017__b657764185839">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12017__p5534445785745"><strong id="ALM-12017__b657764185839">Collect fault information.</strong></p>
<ol start="14" id="ALM-12017__ol4750985985842"><li id="ALM-12017__li5603307085745"><a name="ALM-12017__li5603307085745"></a><a name="li5603307085745"></a><span>On FusionInsight Manager, choose <strong id="ALM-12017__b13819155015320">O&amp;M</strong> &gt; <strong id="ALM-12017__b1368243785745">Log &gt; Download</strong>.</span></li><li id="ALM-12017__li1061898185745"><span>Select <strong id="ALM-12017__b1352831932712">OMS</strong> from the <strong id="ALM-12017__b13893145519916">Service</strong> and click <strong id="ALM-12017__b20893115513911">OK</strong>.</span></li><li id="ALM-12017__li1145664103113"><span>Click <span><img id="ALM-12017__image1945644173117" src="en-us_image_0000001583127613.png"></span> in the upper right corner, and set <strong id="ALM-12017__b6456941173117">Start Date</strong> and <strong id="ALM-12017__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12017__b13456164113319">Download</strong>.</span></li><li id="ALM-12017__li495644512588"><span>Contact the <span id="ALM-12017__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="14" id="ALM-12017__ol4750985985842"><li id="ALM-12017__li5603307085745"><a name="ALM-12017__li5603307085745"></a><a name="li5603307085745"></a><span>On <span id="ALM-12017__text1478971412440">MRS</span> Manager, choose <strong id="ALM-12017__b13819155015320">O&amp;M</strong> &gt; <strong id="ALM-12017__b1368243785745">Log &gt; Download</strong>.</span></li><li id="ALM-12017__li1061898185745"><span>Select <strong id="ALM-12017__b1352831932712">OMS</strong> from the <strong id="ALM-12017__b13893145519916">Service</strong> and click <strong id="ALM-12017__b20893115513911">OK</strong>.</span></li><li id="ALM-12017__li1145664103113"><span>Click <span><img id="ALM-12017__image1945644173117" src="en-us_image_0000001583127613.png"></span> in the upper right corner, and set <strong id="ALM-12017__b6456941173117">Start Date</strong> and <strong id="ALM-12017__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12017__b13456164113319">Download</strong>.</span></li><li id="ALM-12017__li495644512588"><span>Contact the <span id="ALM-12017__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12017__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12017__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12017__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12017__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -72,10 +72,10 @@ MemAvailable: 227641452 kB</pre>
</p></li><li id="ALM-12018__li448043669252"><span>Calculate the real-world memory usage: Memory usage = 1 - (Memory available/Memory total)</span><p><ul class="subitemlist" id="ALM-12018__ul568914459252"><li id="ALM-12018__li42205629252">If the memory usage is lower than 90%, manually disable transferring from monitoring indicators to alarms.</li><li id="ALM-12018__li63212719252">If the memory usage is higher than 90%, go to <a href="#ALM-12018__li5861159252">4</a>.</li></ul> </p></li><li id="ALM-12018__li448043669252"><span>Calculate the real-world memory usage: Memory usage = 1 - (Memory available/Memory total)</span><p><ul class="subitemlist" id="ALM-12018__ul568914459252"><li id="ALM-12018__li42205629252">If the memory usage is lower than 90%, manually disable transferring from monitoring indicators to alarms.</li><li id="ALM-12018__li63212719252">If the memory usage is higher than 90%, go to <a href="#ALM-12018__li5861159252">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12018__p422609659252"><strong id="ALM-12018__b40915938935">Expand the system.</strong></p> <p class="tableheading" id="ALM-12018__p422609659252"><strong id="ALM-12018__b40915938935">Expand the system.</strong></p>
<ol start="4" id="ALM-12018__ol28552339317"><li id="ALM-12018__li5861159252"><a name="ALM-12018__li5861159252"></a><a name="li5861159252"></a><span>In the alarm list on FusionInsight Manager, click <span><img id="ALM-12018__image168221113135319" src="en-us_image_0000001582927669.png"></span> in the row where the alarm is located to view the alarm host address in the alarm details.</span></li><li id="ALM-12018__li474753219252"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12018__b52750359252">root</strong>. <span id="ALM-12018__text5966104516217"></span></span></li><li id="ALM-12018__li242002745617"><span>If the memory usage exceeds the threshold, perform memory capacity expansion.</span></li><li id="ALM-12018__li202957929252"><span>Run the command <strong id="ALM-12018__b246247099252">free -m | grep Mem\: | awk '{printf("%s,", $3 * 100 / $2)}'</strong> to check the system memory usage.</span></li><li id="ALM-12018__li305215859252"><span>Wait for 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12018__ul111473689252"><li id="ALM-12018__li316825749252">If yes, no further action is required.</li><li id="ALM-12018__li161516779252">If no, go to <a href="#ALM-12018__li372014939252">9</a>.</li></ul> <ol start="4" id="ALM-12018__ol28552339317"><li id="ALM-12018__li5861159252"><a name="ALM-12018__li5861159252"></a><a name="li5861159252"></a><span>In the alarm list on <span id="ALM-12018__text34789336432">MRS</span> Manager, click <span><img id="ALM-12018__image168221113135319" src="en-us_image_0000001582927669.png"></span> in the row where the alarm is located to view the alarm host address in the alarm details.</span></li><li id="ALM-12018__li474753219252"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12018__b52750359252">root</strong>. <span id="ALM-12018__text5966104516217"></span></span></li><li id="ALM-12018__li242002745617"><span>If the memory usage exceeds the threshold, perform memory capacity expansion.</span></li><li id="ALM-12018__li202957929252"><span>Run the command <strong id="ALM-12018__b246247099252">free -m | grep Mem\: | awk '{printf("%s,", $3 * 100 / $2)}'</strong> to check the system memory usage.</span></li><li id="ALM-12018__li305215859252"><span>Wait for 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12018__ul111473689252"><li id="ALM-12018__li316825749252">If yes, no further action is required.</li><li id="ALM-12018__li161516779252">If no, go to <a href="#ALM-12018__li372014939252">9</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12018__p332174499252"><strong id="ALM-12018__b165019069321">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12018__p332174499252"><strong id="ALM-12018__b165019069321">Collect fault information.</strong></p>
<ol start="9" id="ALM-12018__ol300682989324"><li id="ALM-12018__li372014939252"><a name="ALM-12018__li372014939252"></a><a name="li372014939252"></a><span>On the FusionInsight Manager in the active cluster, choose <strong id="ALM-12018__b57841710145614">O&amp;M</strong> &gt; <strong id="ALM-12018__b563292829252">Log &gt; Download</strong>.</span></li><li id="ALM-12018__li40625489252"><span>Select <strong id="ALM-12018__b663779889252">OmmServer</strong> from the <strong id="ALM-12018__b1099120531019">Servic</strong>e and click <strong id="ALM-12018__b999117511012">OK</strong>.</span></li><li id="ALM-12018__li1145664103113"><span>Click <span><img id="ALM-12018__image1945644173117" src="en-us_image_0000001583127413.png"></span> in the upper right corner, and set <strong id="ALM-12018__b6456941173117">Start Date</strong> and <strong id="ALM-12018__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12018__b13456164113319">Download</strong>.</span></li><li id="ALM-12018__li495644512588"><span>Contact the <span id="ALM-12018__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="9" id="ALM-12018__ol300682989324"><li id="ALM-12018__li372014939252"><a name="ALM-12018__li372014939252"></a><a name="li372014939252"></a><span>On the <span id="ALM-12018__text12605171711442">MRS</span> Manager in the active cluster, choose <strong id="ALM-12018__b57841710145614">O&amp;M</strong> &gt; <strong id="ALM-12018__b563292829252">Log &gt; Download</strong>.</span></li><li id="ALM-12018__li40625489252"><span>Select <strong id="ALM-12018__b663779889252">OmmServer</strong> from the <strong id="ALM-12018__b1099120531019">Servic</strong>e and click <strong id="ALM-12018__b999117511012">OK</strong>.</span></li><li id="ALM-12018__li1145664103113"><span>Click <span><img id="ALM-12018__image1945644173117" src="en-us_image_0000001583127413.png"></span> in the upper right corner, and set <strong id="ALM-12018__b6456941173117">Start Date</strong> and <strong id="ALM-12018__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12018__b13456164113319">Download</strong>.</span></li><li id="ALM-12018__li495644512588"><span>Contact the <span id="ALM-12018__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12018__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12018__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12018__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12018__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -65,14 +65,14 @@
<div class="section" id="ALM-12027__s3ddd6cfc758a404a82adc3dfe898bd66"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12027__p681753145417">Too many processes are running on the node. You need to increase the value of <strong id="ALM-12027__en-us_topic_0070543581_b61845569">pid_max</strong>.</p> <div class="section" id="ALM-12027__s3ddd6cfc758a404a82adc3dfe898bd66"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12027__p681753145417">Too many processes are running on the node. You need to increase the value of <strong id="ALM-12027__en-us_topic_0070543581_b61845569">pid_max</strong>.</p>
</div> </div>
<div class="section" id="ALM-12027__s9445b6fc399a470295ea751769713fde"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12027__en-us_topic_0070543581_p55372696"><strong id="ALM-12027__b360029529747">Increase the value of pid_max.</strong></p> <div class="section" id="ALM-12027__s9445b6fc399a470295ea751769713fde"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12027__en-us_topic_0070543581_p55372696"><strong id="ALM-12027__b360029529747">Increase the value of pid_max.</strong></p>
<ol id="ALM-12027__ol240915109757"><li id="ALM-12027__li639798269750"><span>In the alarm list on FusionInsight Manager, click <span><img id="ALM-12027__image168221113135319" src="en-us_image_0000001532607906.png"></span> in the row where the alarm is located to view the alarm host address in the alarm details.</span></li><li id="ALM-12027__li149834549750"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12027__b389475309750">root</strong>. <span id="ALM-12027__text43649449460"></span></span></li><li id="ALM-12027__li513020679750"><span>Run the <strong id="ALM-12027__b6333589750">cat /proc/sys/kernel/pid_max</strong>command to check the value of <strong id="ALM-12027__b57002299750">pid_max</strong>.</span></li><li id="ALM-12027__li205272659750"><span>If the PID usage exceeds the threshold, run the command <strong id="ALM-12027__b590654259750">echo </strong><em id="ALM-12027__i618267859750">new value </em><strong id="ALM-12027__b195701549750">&gt; /proc/sys/kernel/pid_max</strong> to enlarge the value of <strong id="ALM-12027__b419136639750">pid_max</strong>.</span><p><p class="litext" id="ALM-12027__p395635099750">Example: <strong id="ALM-12027__b416786479750">echo 65536 &gt; /proc/sys/kernel/pid_max</strong></p> <ol id="ALM-12027__ol240915109757"><li id="ALM-12027__li639798269750"><span>In the alarm list on <span id="ALM-12027__text34789336432">MRS</span> Manager, click <span><img id="ALM-12027__image168221113135319" src="en-us_image_0000001532607906.png"></span> in the row where the alarm is located to view the alarm host address in the alarm details.</span></li><li id="ALM-12027__li149834549750"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12027__b389475309750">root</strong>. <span id="ALM-12027__text43649449460"></span></span></li><li id="ALM-12027__li513020679750"><span>Run the <strong id="ALM-12027__b6333589750">cat /proc/sys/kernel/pid_max</strong>command to check the value of <strong id="ALM-12027__b57002299750">pid_max</strong>.</span></li><li id="ALM-12027__li205272659750"><span>If the PID usage exceeds the threshold, run the command <strong id="ALM-12027__b590654259750">echo </strong><em id="ALM-12027__i618267859750">new value </em><strong id="ALM-12027__b195701549750">&gt; /proc/sys/kernel/pid_max</strong> to enlarge the value of <strong id="ALM-12027__b419136639750">pid_max</strong>.</span><p><p class="litext" id="ALM-12027__p395635099750">Example: <strong id="ALM-12027__b416786479750">echo 65536 &gt; /proc/sys/kernel/pid_max</strong></p>
<div class="note" id="ALM-12027__note163571615102916"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12027__p10664145203015">The maximum value of <span class="parmname" id="ALM-12027__parmname1566455103015"><b>pid_max</b></span> is as follows:</p> <div class="note" id="ALM-12027__note163571615102916"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12027__p10664145203015">The maximum value of <span class="parmname" id="ALM-12027__parmname1566455103015"><b>pid_max</b></span> is as follows:</p>
<ul id="ALM-12027__ul13990143413014"><li id="ALM-12027__li7990034173015">On 32-bit systems: 32768</li><li id="ALM-12027__li799018345307">On 64-bit systems: 4194304 (2^22)</li></ul> <ul id="ALM-12027__ul13990143413014"><li id="ALM-12027__li7990034173015">On 32-bit systems: 32768</li><li id="ALM-12027__li799018345307">On 64-bit systems: 4194304 (2^22)</li></ul>
</div></div> </div></div>
</p></li><li id="ALM-12027__li148339459750"><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12027__ul590069549750"><li id="ALM-12027__li505276609750">If yes, no further action is required.</li><li id="ALM-12027__li662086519750">If no, go to <a href="#ALM-12027__li377225729750">6</a>.</li></ul> </p></li><li id="ALM-12027__li148339459750"><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12027__ul590069549750"><li id="ALM-12027__li505276609750">If yes, no further action is required.</li><li id="ALM-12027__li662086519750">If no, go to <a href="#ALM-12027__li377225729750">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12027__p61837339750"><strong id="ALM-12027__b361001479817">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12027__p61837339750"><strong id="ALM-12027__b361001479817">Collect fault information.</strong></p>
<ol start="6" id="ALM-12027__ol116595289821"><li id="ALM-12027__li377225729750"><a name="ALM-12027__li377225729750"></a><a name="li377225729750"></a><span>On the FusionInsight Manager home page of the active cluster, choose <strong id="ALM-12027__b311203779750">O&amp;M</strong> &gt; <strong id="ALM-12027__b116479379750">Log &gt; Download</strong>.</span></li><li id="ALM-12027__li3107269750"><span>Select all services from the <strong id="ALM-12027__b356295299750">Service</strong> and click <strong id="ALM-12027__b3991118545">OK</strong>.</span></li><li id="ALM-12027__li1145664103113"><span>Click <span><img id="ALM-12027__image1945644173117" src="en-us_image_0000001582927797.png"></span> in the upper right corner, and set <strong id="ALM-12027__b6456941173117">Start Date</strong> and <strong id="ALM-12027__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12027__b13456164113319">Download</strong>.</span></li><li id="ALM-12027__li495644512588"><span>Contact the <span id="ALM-12027__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="6" id="ALM-12027__ol116595289821"><li id="ALM-12027__li377225729750"><a name="ALM-12027__li377225729750"></a><a name="li377225729750"></a><span>On the <span id="ALM-12027__text83311120144410">MRS</span> Manager home page of the active cluster, choose <strong id="ALM-12027__b311203779750">O&amp;M</strong> &gt; <strong id="ALM-12027__b116479379750">Log &gt; Download</strong>.</span></li><li id="ALM-12027__li3107269750"><span>Select all services from the <strong id="ALM-12027__b356295299750">Service</strong> and click <strong id="ALM-12027__b3991118545">OK</strong>.</span></li><li id="ALM-12027__li1145664103113"><span>Click <span><img id="ALM-12027__image1945644173117" src="en-us_image_0000001582927797.png"></span> in the upper right corner, and set <strong id="ALM-12027__b6456941173117">Start Date</strong> and <strong id="ALM-12027__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12027__b13456164113319">Download</strong>.</span></li><li id="ALM-12027__li495644512588"><span>Contact the <span id="ALM-12027__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12027__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12027__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12027__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12027__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -1,6 +1,6 @@
<a name="ALM-12028"></a><a name="ALM-12028"></a> <a name="ALM-12028"></a><a name="ALM-12028"></a>
<h1 class="topictitle1">ALM-12028 Number of Processes in the D State and Z State on a Host Exceeds the Threshold</h1> <h1 class="topictitle1">and Z StateALM-12028 Number of Processes in the D State and Z State on a Host Exceeds the Threshold</h1>
<div id="body14709652"><div class="section" id="ALM-12028__section23718688"><h4 class="sectiontitle">Description</h4><p id="ALM-12028__p50631172">The system checks the number of processes in the D stateand Z state of user <strong id="ALM-12028__b16253141134213">omm</strong> on the host every 30 seconds and compares the actual number with the threshold. The number of processes in the D state and Z state on the host has a default threshold range. This alarm is generated when the number of processes exceeds the threshold.</p> <div id="body14709652"><div class="section" id="ALM-12028__section23718688"><h4 class="sectiontitle">Description</h4><p id="ALM-12028__p50631172">The system checks the number of processes in the D stateand Z state of user <strong id="ALM-12028__b16253141134213">omm</strong> on the host every 30 seconds and compares the actual number with the threshold. The number of processes in the D state and Z state on the host has a default threshold range. This alarm is generated when the number of processes exceeds the threshold.</p>
<p id="ALM-12028__p53027366">This alarm is cleared when the <strong id="ALM-12028__b1896274320598">Trigger Count</strong> is <strong id="ALM-12028__b15669123210464">1</strong> and the total number of processes in the D state and Z state of user <strong id="ALM-12028__b19867204318485">omm</strong> on the host does not exceed the threshold. This alarm is cleared when the <strong id="ALM-12028__b134171188010">Trigger Count</strong> is greater than <strong id="ALM-12028__b466017588499">1</strong> and the total number of processes in the D state and Z state of user <strong id="ALM-12028__b1986717812518">omm</strong> on the host is less than or equal to 90% of the threshold.</p> <p id="ALM-12028__p53027366">This alarm is cleared when the <strong id="ALM-12028__b1896274320598">Trigger Count</strong> is <strong id="ALM-12028__b15669123210464">1</strong> and the total number of processes in the D state and Z state of user <strong id="ALM-12028__b19867204318485">omm</strong> on the host does not exceed the threshold. This alarm is cleared when the <strong id="ALM-12028__b134171188010">Trigger Count</strong> is greater than <strong id="ALM-12028__b466017588499">1</strong> and the total number of processes in the D state and Z state of user <strong id="ALM-12028__b1986717812518">omm</strong> on the host is less than or equal to 90% of the threshold.</p>
<div class="note" id="ALM-12028__note13991618131016"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12028__p84028186101">The function of checking the number of processes in the Z state on the host applies to MRS 3.2.0<span id="ALM-12028__ph174355293719">-LTS.2</span> or later.</p> <div class="note" id="ALM-12028__note13991618131016"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12028__p84028186101">The function of checking the number of processes in the Z state on the host applies to MRS 3.2.0<span id="ALM-12028__ph174355293719">-LTS.2</span> or later.</p>
@ -67,12 +67,12 @@
<div class="section" id="ALM-12028__section59967381"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12028__p66388367">The host responds slowly to I/O (disk I/O and network I/O) requests and some processes are in the D state and Z state.</p> <div class="section" id="ALM-12028__section59967381"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12028__p66388367">The host responds slowly to I/O (disk I/O and network I/O) requests and some processes are in the D state and Z state.</p>
</div> </div>
<div class="section" id="ALM-12028__section2835522"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12028__p8748685"><strong id="ALM-12028__b168151162515">Check the processes in the D state</strong><strong id="ALM-12028__b1581161112520"> and Z state</strong><strong id="ALM-12028__b19812114253">.</strong></p> <div class="section" id="ALM-12028__section2835522"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12028__p8748685"><strong id="ALM-12028__b168151162515">Check the processes in the D state</strong><strong id="ALM-12028__b1581161112520"> and Z state</strong><strong id="ALM-12028__b19812114253">.</strong></p>
<ol id="ALM-12028__ol5802802991057"><li id="ALM-12028__li6390942091049"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and click <span><img id="ALM-12028__image168221113135319" src="en-us_image_0000001532448262.png"></span> to view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12028__li1641579391049"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12028__b1426064751813">root</strong>. (<span id="ALM-12028__text995114020554"></span>) Then run the <strong id="ALM-12028__b3831387091049">su - omm</strong> command to switch to user <strong id="ALM-12028__b1288412448360">omm</strong>.</span></li><li id="ALM-12028__li12129135691210"><span>Run the following command as user <strong id="ALM-12028__b1667813740112956">omm</strong> to view the PID of the process that is in the D state and Z state:</span><p><p class="litext" id="ALM-12028__p91301556161211"><strong id="ALM-12028__b613095661213">ps -elf | grep -v "\[thread_checkio\]" | awk 'NR!=1 {print $2, $3, $4}' | grep omm | awk -F' ' '{print $1, $3}' | grep -E "Z|D" | awk '{print $2}'</strong></p> <ol id="ALM-12028__ol5802802991057"><li id="ALM-12028__li6390942091049"><span>In the alarm list on <span id="ALM-12028__text34789336432">MRS</span> Manager, locate the row that contains the alarm, and click <span><img id="ALM-12028__image168221113135319" src="en-us_image_0000001532448262.png"></span> to view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12028__li1641579391049"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12028__b1426064751813">root</strong>. (<span id="ALM-12028__text995114020554"></span>) Then run the <strong id="ALM-12028__b3831387091049">su - omm</strong> command to switch to user <strong id="ALM-12028__b1288412448360">omm</strong>.</span></li><li id="ALM-12028__li12129135691210"><span>Run the following command as user <strong id="ALM-12028__b1667813740112956">omm</strong> to view the PID of the process that is in the D state and Z state:</span><p><p class="litext" id="ALM-12028__p91301556161211"><strong id="ALM-12028__b613095661213">ps -elf | grep -v "\[thread_checkio\]" | awk 'NR!=1 {print $2, $3, $4}' | grep omm | awk -F' ' '{print $1, $3}' | grep -E "Z|D" | awk '{print $2}'</strong></p>
</p></li><li id="ALM-12028__li2799290091049"><span>Check whether the command output is empty.</span><p><ul class="subitemlist" id="ALM-12028__ul1056686291049"><li id="ALM-12028__li747103591049">If yes, the service process is running properly. Then go to <a href="#ALM-12028__li2701143291049">6</a>.</li><li id="ALM-12028__li117409591049">If no, go to <a href="#ALM-12028__li573000391049">5</a>.</li></ul> </p></li><li id="ALM-12028__li2799290091049"><span>Check whether the command output is empty.</span><p><ul class="subitemlist" id="ALM-12028__ul1056686291049"><li id="ALM-12028__li747103591049">If yes, the service process is running properly. Then go to <a href="#ALM-12028__li2701143291049">6</a>.</li><li id="ALM-12028__li117409591049">If no, go to <a href="#ALM-12028__li573000391049">5</a>.</li></ul>
</p></li><li id="ALM-12028__li573000391049"><a name="ALM-12028__li573000391049"></a><a name="li573000391049"></a><span>Switch to user <strong id="ALM-12028__b1281511314404">root</strong> and run the <strong id="ALM-12028__b8712438134020">reboot</strong> command to restart the host for which the alarm is generated. (Restarting a host is risky. Ensure that the service process is normal after the restart.)</span></li><li id="ALM-12028__li2701143291049"><a name="ALM-12028__li2701143291049"></a><a name="li2701143291049"></a><span>Check whether the alarm is cleared 5 minutes later.</span><p><ul class="subitemlist" id="ALM-12028__ul1358954691049"><li id="ALM-12028__li5157003291049">If yes, no further action is required.</li><li id="ALM-12028__li1642303091049">If no, go to <a href="#ALM-12028__li4177630091049">7</a>.</li></ul> </p></li><li id="ALM-12028__li573000391049"><a name="ALM-12028__li573000391049"></a><a name="li573000391049"></a><span>Switch to user <strong id="ALM-12028__b1281511314404">root</strong> and run the <strong id="ALM-12028__b8712438134020">reboot</strong> command to restart the host for which the alarm is generated. (Restarting a host is risky. Ensure that the service process is normal after the restart.)</span></li><li id="ALM-12028__li2701143291049"><a name="ALM-12028__li2701143291049"></a><a name="li2701143291049"></a><span>Check whether the alarm is cleared 5 minutes later.</span><p><ul class="subitemlist" id="ALM-12028__ul1358954691049"><li id="ALM-12028__li5157003291049">If yes, no further action is required.</li><li id="ALM-12028__li1642303091049">If no, go to <a href="#ALM-12028__li4177630091049">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12028__p5519705391049"><strong id="ALM-12028__b89637239112">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12028__p5519705391049"><strong id="ALM-12028__b89637239112">Collect the fault information.</strong></p>
<ol start="7" id="ALM-12028__ol128225129115"><li id="ALM-12028__li4177630091049"><a name="ALM-12028__li4177630091049"></a><a name="li4177630091049"></a><span>On FusionInsight Manager, choose <strong id="ALM-12028__b750820372495">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12028__b550820377491">Log</strong> &gt; <strong id="ALM-12028__b185081037134914">Download</strong>.</span></li><li id="ALM-12028__li4044238791049"><span>Select <strong id="ALM-12028__b884279457112956">OMS</strong> for <strong id="ALM-12028__b811590828112956">Service</strong> and click <strong id="ALM-12028__b1060353008112956">OK</strong>.</span></li><li id="ALM-12028__li2843716491049"><span>Click <span><img id="ALM-12028__image104601319175315" src="en-us_image_0000001583087581.png"></span> in the upper right corner, and set <strong id="ALM-12028__b522882672112956">Start Date</strong> and <strong id="ALM-12028__b2029904650112956">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12028__b449569331112956">Download</strong>.</span></li><li id="ALM-12028__li2170896591049"><span>Contact <span id="ALM-12028__text02161454416">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="7" id="ALM-12028__ol128225129115"><li id="ALM-12028__li4177630091049"><a name="ALM-12028__li4177630091049"></a><a name="li4177630091049"></a><span>On <span id="ALM-12028__text5916112219441">MRS</span> Manager, choose <strong id="ALM-12028__b750820372495">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12028__b550820377491">Log</strong> &gt; <strong id="ALM-12028__b185081037134914">Download</strong>.</span></li><li id="ALM-12028__li4044238791049"><span>Select <strong id="ALM-12028__b884279457112956">OMS</strong> for <strong id="ALM-12028__b811590828112956">Service</strong> and click <strong id="ALM-12028__b1060353008112956">OK</strong>.</span></li><li id="ALM-12028__li2843716491049"><span>Click <span><img id="ALM-12028__image104601319175315" src="en-us_image_0000001583087581.png"></span> in the upper right corner, and set <strong id="ALM-12028__b522882672112956">Start Date</strong> and <strong id="ALM-12028__b2029904650112956">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12028__b449569331112956">Download</strong>.</span></li><li id="ALM-12028__li2170896591049"><span>Contact <span id="ALM-12028__text02161454416">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12028__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12028__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12028__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12028__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -82,7 +82,7 @@
<div class="section" id="ALM-12033__section31508455"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12033__p66974322">The disk is aged or has bad sectors.</p> <div class="section" id="ALM-12033__section31508455"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12033__p66974322">The disk is aged or has bad sectors.</p>
</div> </div>
<div class="section" id="ALM-12033__section15140644"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12033__p1856018391458"><strong id="ALM-12033__b3282392191458">Check the disk status.</strong></p> <div class="section" id="ALM-12033__section15140644"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12033__p1856018391458"><strong id="ALM-12033__b3282392191458">Check the disk status.</strong></p>
<ol id="ALM-12033__ol4468641992138"><li id="ALM-12033__li4149191591458"><span>On FusionInsight Manager, choose <strong id="ALM-12033__b54488846544512">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12033__b20884598144512">Alarm</strong> &gt; <strong id="ALM-12033__b27619653444512">Alarms</strong>.</span></li><li id="ALM-12033__li3788291791458"><a name="ALM-12033__li3788291791458"></a><a name="li3788291791458"></a><span>View the detailed information about the alarm. Check the values of <strong id="ALM-12033__b1216047996">HostName</strong> and <strong id="ALM-12033__b43072495916">DiskName</strong> in the location information to obtain the information about the faulty disk for which the alarm is generated.</span></li><li id="ALM-12033__li540193791458"><span>Check whether the node for which the alarm is generated is in a virtualization environment. </span><p><ul id="ALM-12033__ul4861744191458"><li id="ALM-12033__li3490378991458">If yes, go to <a href="#ALM-12033__li2831628891458">4</a>.</li><li id="ALM-12033__li863462891458">If no, go to <a href="#ALM-12033__li2583597491458">7</a>.</li></ul> <ol id="ALM-12033__ol4468641992138"><li id="ALM-12033__li4149191591458"><span>On <span id="ALM-12033__text34789336432">MRS</span> Manager, choose <strong id="ALM-12033__b54488846544512">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12033__b20884598144512">Alarm</strong> &gt; <strong id="ALM-12033__b27619653444512">Alarms</strong>.</span></li><li id="ALM-12033__li3788291791458"><a name="ALM-12033__li3788291791458"></a><a name="li3788291791458"></a><span>View the detailed information about the alarm. Check the values of <strong id="ALM-12033__b1216047996">HostName</strong> and <strong id="ALM-12033__b43072495916">DiskName</strong> in the location information to obtain the information about the faulty disk for which the alarm is generated.</span></li><li id="ALM-12033__li540193791458"><span>Check whether the node for which the alarm is generated is in a virtualization environment.</span><p><ul id="ALM-12033__ul4861744191458"><li id="ALM-12033__li3490378991458">If yes, go to <a href="#ALM-12033__li2831628891458">4</a>.</li><li id="ALM-12033__li863462891458">If no, go to <a href="#ALM-12033__li2583597491458">7</a>.</li></ul>
</p></li><li id="ALM-12033__li2831628891458"><a name="ALM-12033__li2831628891458"></a><a name="li2831628891458"></a><span>Check whether the storage performance provided by the virtualization environment meets the hardware requirements. Then, go to <a href="#ALM-12033__li1205527419227">5</a>.</span></li><li id="ALM-12033__li1205527419227"><a name="ALM-12033__li1205527419227"></a><a name="li1205527419227"></a><span>Log in to the alarm node as user <strong id="ALM-12033__b19653192618269">root</strong>, run the <strong id="ALM-12033__b13449155511259">df -h</strong> command, and check whether the command output contains the value of the <strong id="ALM-12033__b1074473672410">DiskName</strong> field. <span id="ALM-12033__text23715444267"></span></span><p><ul id="ALM-12033__ul12100362193111"><li id="ALM-12033__li56917037201355">If yes, go to <a href="#ALM-12033__li2583597491458">7</a>.</li><li id="ALM-12033__li8577348201455">If no, go to <a href="#ALM-12033__li2325719119312">6</a>.</li></ul> </p></li><li id="ALM-12033__li2831628891458"><a name="ALM-12033__li2831628891458"></a><a name="li2831628891458"></a><span>Check whether the storage performance provided by the virtualization environment meets the hardware requirements. Then, go to <a href="#ALM-12033__li1205527419227">5</a>.</span></li><li id="ALM-12033__li1205527419227"><a name="ALM-12033__li1205527419227"></a><a name="li1205527419227"></a><span>Log in to the alarm node as user <strong id="ALM-12033__b19653192618269">root</strong>, run the <strong id="ALM-12033__b13449155511259">df -h</strong> command, and check whether the command output contains the value of the <strong id="ALM-12033__b1074473672410">DiskName</strong> field. <span id="ALM-12033__text23715444267"></span></span><p><ul id="ALM-12033__ul12100362193111"><li id="ALM-12033__li56917037201355">If yes, go to <a href="#ALM-12033__li2583597491458">7</a>.</li><li id="ALM-12033__li8577348201455">If no, go to <a href="#ALM-12033__li2325719119312">6</a>.</li></ul>
</p></li><li id="ALM-12033__li2325719119312"><a name="ALM-12033__li2325719119312"></a><a name="li2325719119312"></a><span>Run the <strong id="ALM-12033__b1673412214263">lsblk</strong> command to check whether the mapping between the value of <strong id="ALM-12033__b1388164762511">DiskName</strong> and the disk has been created.</span><p><div class="p" id="ALM-12033__p55380970201120"><span><img id="ALM-12033__image94412418324" src="en-us_image_0000001583127305.jpg"></span><ul id="ALM-12033__ul245583919286"><li id="ALM-12033__li40945773201617">If yes, go to <a href="#ALM-12033__li2583597491458">7</a>. .</li><li id="ALM-12033__li4547636219286">If no, go to <a href="#ALM-12033__li4518231891458">22</a>.</li></ul> </p></li><li id="ALM-12033__li2325719119312"><a name="ALM-12033__li2325719119312"></a><a name="li2325719119312"></a><span>Run the <strong id="ALM-12033__b1673412214263">lsblk</strong> command to check whether the mapping between the value of <strong id="ALM-12033__b1388164762511">DiskName</strong> and the disk has been created.</span><p><div class="p" id="ALM-12033__p55380970201120"><span><img id="ALM-12033__image94412418324" src="en-us_image_0000001583127305.jpg"></span><ul id="ALM-12033__ul245583919286"><li id="ALM-12033__li40945773201617">If yes, go to <a href="#ALM-12033__li2583597491458">7</a>. .</li><li id="ALM-12033__li4547636219286">If no, go to <a href="#ALM-12033__li4518231891458">22</a>.</li></ul>
</div> </div>
@ -90,7 +90,7 @@
</div></div> </div></div>
<p id="ALM-12033__p1461704291458">Example:</p> <p id="ALM-12033__p1461704291458">Example:</p>
<p id="ALM-12033__p6444452091458"><strong id="ALM-12033__b4312977191458">lsscsi | grep "/dev/sda"</strong></p> <p id="ALM-12033__p6444452091458"><strong id="ALM-12033__b4312977191458">lsscsi | grep "/dev/sda"</strong></p>
<p id="ALM-12033__p5262362091458">In the command output, if <strong id="ALM-12033__b1140415713347">ATA</strong>, <strong id="ALM-12033__b8367139163411">SATA</strong>, or <strong id="ALM-12033__b15300181111349">SAS</strong> is displayed in the third line, the disk has not been organized into a RAID group. If other information is displayed, RAID has been set up. </p> <p id="ALM-12033__p5262362091458">In the command output, if <strong id="ALM-12033__b1140415713347">ATA</strong>, <strong id="ALM-12033__b8367139163411">SATA</strong>, or <strong id="ALM-12033__b15300181111349">SAS</strong> is displayed in the third line, the disk has not been organized into a RAID group. If other information is displayed, RAID has been set up.</p>
<ul id="ALM-12033__ul385053291458"><li id="ALM-12033__li3465478891458">If yes, go to <a href="#ALM-12033__li1471607091458">12</a>.</li><li id="ALM-12033__li5557441691458">If no, go to <a href="#ALM-12033__li523387391458">8</a>.</li></ul> <ul id="ALM-12033__ul385053291458"><li id="ALM-12033__li3465478891458">If yes, go to <a href="#ALM-12033__li1471607091458">12</a>.</li><li id="ALM-12033__li5557441691458">If no, go to <a href="#ALM-12033__li523387391458">8</a>.</li></ul>
</p></li><li id="ALM-12033__li523387391458"><a name="ALM-12033__li523387391458"></a><a name="li523387391458"></a><span>Run the <strong id="ALM-12033__b4710486591458">smartctl -i /dev/sd[x]</strong> command to check whether the hardware supports the SMART tool.</span><p><p id="ALM-12033__p2129060791458">Example:</p> </p></li><li id="ALM-12033__li523387391458"><a name="ALM-12033__li523387391458"></a><a name="li523387391458"></a><span>Run the <strong id="ALM-12033__b4710486591458">smartctl -i /dev/sd[x]</strong> command to check whether the hardware supports the SMART tool.</span><p><p id="ALM-12033__p2129060791458">Example:</p>
<p id="ALM-12033__p5739774091458"><strong id="ALM-12033__b4681761791458">smartctl -i /dev/sda</strong></p> <p id="ALM-12033__p5739774091458"><strong id="ALM-12033__b4681761791458">smartctl -i /dev/sda</strong></p>
@ -105,7 +105,7 @@
<p id="ALM-12033__p1134168691458">Check the <strong id="ALM-12033__b1453654720449">Command/Feature_name</strong> column in the command output. If <strong id="ALM-12033__b23561756164415">READ SECTOR(S)</strong> or <strong id="ALM-12033__b8635195920442">WRITE SECTOR(S)</strong> is displayed, the disk has bad sectors. If other errors occur, the disk circuit board is faulty. Both errors indicate that the disk is abnormal and needs to be replaced.</p> <p id="ALM-12033__p1134168691458">Check the <strong id="ALM-12033__b1453654720449">Command/Feature_name</strong> column in the command output. If <strong id="ALM-12033__b23561756164415">READ SECTOR(S)</strong> or <strong id="ALM-12033__b8635195920442">WRITE SECTOR(S)</strong> is displayed, the disk has bad sectors. If other errors occur, the disk circuit board is faulty. Both errors indicate that the disk is abnormal and needs to be replaced.</p>
<p id="ALM-12033__p3496631491458">If "No Errors Logged" is displayed, no error log exists. You can trigger the disk SMART self-check.</p> <p id="ALM-12033__p3496631491458">If "No Errors Logged" is displayed, no error log exists. You can trigger the disk SMART self-check.</p>
<ul id="ALM-12033__ul4626137591458"><li id="ALM-12033__li1369919991458">If yes, go to <a href="#ALM-12033__li2167780691458">11</a>.</li><li id="ALM-12033__li3589332091458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul> <ul id="ALM-12033__ul4626137591458"><li id="ALM-12033__li1369919991458">If yes, go to <a href="#ALM-12033__li2167780691458">11</a>.</li><li id="ALM-12033__li3589332091458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul>
</p></li><li id="ALM-12033__li2167780691458"><a name="ALM-12033__li2167780691458"></a><a name="li2167780691458"></a><span>Run the <strong id="ALM-12033__b6088252791458">smartctl -t long /dev/sd[x]</strong> command to trigger the disk SMART self-check. After the command is executed, the time when the self-check is to be completed is displayed. After the self-check is completed, repeat <a href="#ALM-12033__li3483730991458">9</a> and <a href="#ALM-12033__li1145378391458">10</a> to check whether the disk is working properly. </span><p><p id="ALM-12033__p2440318291458">Example:</p> </p></li><li id="ALM-12033__li2167780691458"><a name="ALM-12033__li2167780691458"></a><a name="li2167780691458"></a><span>Run the <strong id="ALM-12033__b6088252791458">smartctl -t long /dev/sd[x]</strong> command to trigger the disk SMART self-check. After the command is executed, the time when the self-check is to be completed is displayed. After the self-check is completed, repeat <a href="#ALM-12033__li3483730991458">9</a> and <a href="#ALM-12033__li1145378391458">10</a> to check whether the disk is working properly.</span><p><p id="ALM-12033__p2440318291458">Example:</p>
<p id="ALM-12033__p1830205491458"><strong id="ALM-12033__b3050076291458">smartctl -t long /dev/sda</strong></p> <p id="ALM-12033__p1830205491458"><strong id="ALM-12033__b3050076291458">smartctl -t long /dev/sda</strong></p>
<ul id="ALM-12033__ul607140291458"><li id="ALM-12033__li5464262591458">If yes, go to <a href="#ALM-12033__li3381567991458">17</a>.</li><li id="ALM-12033__li6397652591458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul> <ul id="ALM-12033__ul607140291458"><li id="ALM-12033__li5464262591458">If yes, go to <a href="#ALM-12033__li3381567991458">17</a>.</li><li id="ALM-12033__li6397652591458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul>
</p></li><li id="ALM-12033__li1471607091458"><a name="ALM-12033__li1471607091458"></a><a name="li1471607091458"></a><span>Run the <strong id="ALM-12033__b6533577191458">smartctl -d [sat|scsi]+megaraid,[DID] -H --all /dev/sd[x]</strong> command to check whether the hardware supports SMART.</span><p><div class="note" id="ALM-12033__note5115102791458"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-12033__ul5770606591458"><li id="ALM-12033__li4959254191458">In the command, <strong id="ALM-12033__b1427513108113">[sat|scsi]</strong> indicates the disk type. Both types need to be used.</li><li id="ALM-12033__li4367968891458"><strong id="ALM-12033__b119411427211">[DID]</strong> indicates the slot information. Slots 0 to 15 need to be used.</li></ul> </p></li><li id="ALM-12033__li1471607091458"><a name="ALM-12033__li1471607091458"></a><a name="li1471607091458"></a><span>Run the <strong id="ALM-12033__b6533577191458">smartctl -d [sat|scsi]+megaraid,[DID] -H --all /dev/sd[x]</strong> command to check whether the hardware supports SMART.</span><p><div class="note" id="ALM-12033__note5115102791458"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-12033__ul5770606591458"><li id="ALM-12033__li4959254191458">In the command, <strong id="ALM-12033__b1427513108113">[sat|scsi]</strong> indicates the disk type. Both types need to be used.</li><li id="ALM-12033__li4367968891458"><strong id="ALM-12033__b119411427211">[DID]</strong> indicates the slot information. Slots 0 to 15 need to be used.</li></ul>
@ -126,18 +126,18 @@
<p id="ALM-12033__p1804927491458">Check the <strong id="ALM-12033__b112875461773">Command/Feature_name</strong> column in the command output. If <strong id="ALM-12033__b228717461573">READ SECTOR(S)</strong> or <strong id="ALM-12033__b228734613715">WRITE SECTOR(S)</strong> is displayed, the disk has bad sectors. If other errors occur, the disk circuit board is faulty. Both errors indicate that the disk is abnormal and needs to be replaced.</p> <p id="ALM-12033__p1804927491458">Check the <strong id="ALM-12033__b112875461773">Command/Feature_name</strong> column in the command output. If <strong id="ALM-12033__b228717461573">READ SECTOR(S)</strong> or <strong id="ALM-12033__b228734613715">WRITE SECTOR(S)</strong> is displayed, the disk has bad sectors. If other errors occur, the disk circuit board is faulty. Both errors indicate that the disk is abnormal and needs to be replaced.</p>
<p id="ALM-12033__p2822574691458">If "No Errors Logged" is displayed, no error log exists. You can trigger the disk SMART self-check.</p> <p id="ALM-12033__p2822574691458">If "No Errors Logged" is displayed, no error log exists. You can trigger the disk SMART self-check.</p>
<ul id="ALM-12033__ul5270512291458"><li id="ALM-12033__li458405291458">If yes, go to <a href="#ALM-12033__li1119862391458">15</a>.</li><li id="ALM-12033__li3576394791458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul> <ul id="ALM-12033__ul5270512291458"><li id="ALM-12033__li458405291458">If yes, go to <a href="#ALM-12033__li1119862391458">15</a>.</li><li id="ALM-12033__li3576394791458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul>
</p></li><li id="ALM-12033__li1119862391458"><a name="ALM-12033__li1119862391458"></a><a name="li1119862391458"></a><span>Run the <strong id="ALM-12033__b3367874791458">smartctl -d [sat|scsi]+megaraid,[DID] -t long /dev/sd[x]</strong> command to trigger the disk SMART self-check. After the command is executed, the time when the self-check is to be completed is displayed. After the self-check is completed, repeat <a href="#ALM-12033__li4568369291458">13</a> and <a href="#ALM-12033__li5027541391458">14</a> to check whether the disk is working properly. </span><p><p id="ALM-12033__p5707158091458">Example:</p> </p></li><li id="ALM-12033__li1119862391458"><a name="ALM-12033__li1119862391458"></a><a name="li1119862391458"></a><span>Run the <strong id="ALM-12033__b3367874791458">smartctl -d [sat|scsi]+megaraid,[DID] -t long /dev/sd[x]</strong> command to trigger the disk SMART self-check. After the command is executed, the time when the self-check is to be completed is displayed. After the self-check is completed, repeat <a href="#ALM-12033__li4568369291458">13</a> and <a href="#ALM-12033__li5027541391458">14</a> to check whether the disk is working properly.</span><p><p id="ALM-12033__p5707158091458">Example:</p>
<p id="ALM-12033__p4388217491458"><strong id="ALM-12033__b5939525391458">smartctl -d sat+megaraid,2 -t long /dev/sda</strong></p> <p id="ALM-12033__p4388217491458"><strong id="ALM-12033__b5939525391458">smartctl -d sat+megaraid,2 -t long /dev/sda</strong></p>
<ul id="ALM-12033__ul6479523391458"><li id="ALM-12033__li4628618791458">If yes, go to <a href="#ALM-12033__li3381567991458">17</a>.</li><li id="ALM-12033__li5819363791458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul> <ul id="ALM-12033__ul6479523391458"><li id="ALM-12033__li4628618791458">If yes, go to <a href="#ALM-12033__li3381567991458">17</a>.</li><li id="ALM-12033__li5819363791458">If no, go to <a href="#ALM-12033__li6235920691458">18</a>.</li></ul>
</p></li><li id="ALM-12033__li1606413991458"><a name="ALM-12033__li1606413991458"></a><a name="li1606413991458"></a><span>If the configured RAID controller card does not support SMART, the disk does not support SMART. In this case, use the check tool provided by the corresponding RAID controller card vendor to rectify the fault. Then go to <a href="#ALM-12033__li3381567991458">17</a>. </span><p><p id="ALM-12033__p2612691991458">For example, LSI is a MegaCLI tool.</p> </p></li><li id="ALM-12033__li1606413991458"><a name="ALM-12033__li1606413991458"></a><a name="li1606413991458"></a><span>If the configured RAID controller card does not support SMART, the disk does not support SMART. In this case, use the check tool provided by the corresponding RAID controller card vendor to rectify the fault. Then go to <a href="#ALM-12033__li3381567991458">17</a>.</span><p><p id="ALM-12033__p2612691991458">For example, LSI is a MegaCLI tool.</p>
</p></li><li id="ALM-12033__li3381567991458"><a name="ALM-12033__li3381567991458"></a><a name="li3381567991458"></a><span>On FusionInsight Manager, choose <strong id="ALM-12033__b1433073281718">O&amp;M</strong> &gt; <strong id="ALM-12033__b105611435111713">Alarm</strong> &gt; <strong id="ALM-12033__b43125395172">Alarms</strong>, click <strong id="ALM-12033__b144519611816">Clear</strong> in the <strong id="ALM-12033__b28015914186">Operation</strong> column of the alarm, and check whether the alarm is reported on the same disk again.</span><p><p id="ALM-12033__p3590566291458">If the alarm is reported for three times, replace the disk.</p> </p></li><li id="ALM-12033__li3381567991458"><a name="ALM-12033__li3381567991458"></a><a name="li3381567991458"></a><span>On <span id="ALM-12033__text8965630124419">MRS</span> Manager, choose <strong id="ALM-12033__b1433073281718">O&amp;M</strong> &gt; <strong id="ALM-12033__b105611435111713">Alarm</strong> &gt; <strong id="ALM-12033__b43125395172">Alarms</strong>, click <strong id="ALM-12033__b144519611816">Clear</strong> in the <strong id="ALM-12033__b28015914186">Operation</strong> column of the alarm, and check whether the alarm is reported on the same disk again.</span><p><p id="ALM-12033__p3590566291458">If the alarm is reported for three times, replace the disk.</p>
<ul id="ALM-12033__ul5471550891458"><li id="ALM-12033__li2267753091458">If yes, go to <a href="#ALM-12033__li6235920691458">18</a>.</li><li id="ALM-12033__li2494067591458">If no, no further action is required.</li></ul> <ul id="ALM-12033__ul5471550891458"><li id="ALM-12033__li2267753091458">If yes, go to <a href="#ALM-12033__li6235920691458">18</a>.</li><li id="ALM-12033__li2494067591458">If no, no further action is required.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12033__p1160151292144"><strong id="ALM-12033__b3730474492144">Replace the disk.</strong></p> <p id="ALM-12033__p1160151292144"><strong id="ALM-12033__b3730474492144">Replace the disk.</strong></p>
<ol start="18" id="ALM-12033__ol5110722692159"><li id="ALM-12033__li6235920691458"><a name="ALM-12033__li6235920691458"></a><a name="li6235920691458"></a><span>On FusionInsight Manager, choose <strong id="ALM-12033__b10218142171210">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12033__b19228521101217">Alarm</strong> &gt; <strong id="ALM-12033__b1922813217128">Alarms</strong>.</span></li><li id="ALM-12033__li2436194691458"><span>View the detailed information about the alarm. Check the values of <strong id="ALM-12033__b8430753201817">HostName</strong> and <strong id="ALM-12033__b19444145316187">DiskName</strong> in the location information to obtain the information about the faulty disk for which the alarm is reported.</span></li><li id="ALM-12033__li1793092591458"><span>Replace the disk.</span></li><li id="ALM-12033__li2716060091458"><span>Check whether the alarm is cleared.</span><p><ul id="ALM-12033__ul4311881291458"><li id="ALM-12033__li5252499591458">If yes, no further action is required.</li><li id="ALM-12033__li296290891458">If no, go to <a href="#ALM-12033__li4518231891458">22</a>.</li></ul> <ol start="18" id="ALM-12033__ol5110722692159"><li id="ALM-12033__li6235920691458"><a name="ALM-12033__li6235920691458"></a><a name="li6235920691458"></a><span>On <span id="ALM-12033__text172221432124410">MRS</span> Manager, choose <strong id="ALM-12033__b10218142171210">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12033__b19228521101217">Alarm</strong> &gt; <strong id="ALM-12033__b1922813217128">Alarms</strong>.</span></li><li id="ALM-12033__li2436194691458"><span>View the detailed information about the alarm. Check the values of <strong id="ALM-12033__b8430753201817">HostName</strong> and <strong id="ALM-12033__b19444145316187">DiskName</strong> in the location information to obtain the information about the faulty disk for which the alarm is reported.</span></li><li id="ALM-12033__li1793092591458"><span>Replace the disk.</span></li><li id="ALM-12033__li2716060091458"><span>Check whether the alarm is cleared.</span><p><ul id="ALM-12033__ul4311881291458"><li id="ALM-12033__li5252499591458">If yes, no further action is required.</li><li id="ALM-12033__li296290891458">If no, go to <a href="#ALM-12033__li4518231891458">22</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12033__p98841749221"><strong id="ALM-12033__b218487059221">Collect the fault information.</strong></p> <p id="ALM-12033__p98841749221"><strong id="ALM-12033__b218487059221">Collect the fault information.</strong></p>
<ol start="22" id="ALM-12033__ol24355139224"><li id="ALM-12033__li4518231891458"><a name="ALM-12033__li4518231891458"></a><a name="li4518231891458"></a><span>On FusionInsight Manager, choose <strong id="ALM-12033__b1657416563128">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12033__b657418563122">Log</strong> &gt; <strong id="ALM-12033__b657435661218">Download</strong>.</span></li><li id="ALM-12033__li398767891458"><span>Select <strong id="ALM-12033__b01051101315">OMS</strong> for <strong id="ALM-12033__b192116161319">Service</strong> and click <strong id="ALM-12033__b16219111132">OK</strong>.</span></li><li id="ALM-12033__li3588910391458"><span>Click <span><img id="ALM-12033__image104601319175315" src="en-us_image_0000001532927338.png"></span> in the upper right corner, and set <strong id="ALM-12033__b209140148644512">Start Date</strong> and <strong id="ALM-12033__b31843827044512">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12033__b148916401344512">Download</strong>.</span></li><li id="ALM-12033__li5456647491458"><span>Contact <span id="ALM-12033__text10816191314136">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="22" id="ALM-12033__ol24355139224"><li id="ALM-12033__li4518231891458"><a name="ALM-12033__li4518231891458"></a><a name="li4518231891458"></a><span>On <span id="ALM-12033__text1845783354419">MRS</span> Manager, choose <strong id="ALM-12033__b1657416563128">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12033__b657418563122">Log</strong> &gt; <strong id="ALM-12033__b657435661218">Download</strong>.</span></li><li id="ALM-12033__li398767891458"><span>Select <strong id="ALM-12033__b01051101315">OMS</strong> for <strong id="ALM-12033__b192116161319">Service</strong> and click <strong id="ALM-12033__b16219111132">OK</strong>.</span></li><li id="ALM-12033__li3588910391458"><span>Click <span><img id="ALM-12033__image104601319175315" src="en-us_image_0000001532927338.png"></span> in the upper right corner, and set <strong id="ALM-12033__b209140148644512">Start Date</strong> and <strong id="ALM-12033__b31843827044512">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12033__b148916401344512">Download</strong>.</span></li><li id="ALM-12033__li5456647491458"><span>Contact <span id="ALM-12033__text10816191314136">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12033__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12033__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12033__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12033__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -64,7 +64,7 @@
<div class="section" id="ALM-12034__s263b5f2875944e7b9df856ae80d2a053"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12034__en-us_topic_0070543608_p16651572">The alarm cause depends on the task details. Handle the alarm according to the logs and alarm details.</p> <div class="section" id="ALM-12034__s263b5f2875944e7b9df856ae80d2a053"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12034__en-us_topic_0070543608_p16651572">The alarm cause depends on the task details. Handle the alarm according to the logs and alarm details.</p>
</div> </div>
<div class="section" id="ALM-12034__s1ca44cb0f88942d591bb071c656d4ccc"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12034__en-us_topic_0070543608_p6600119"><strong id="ALM-12034__b11327931485">Check whether the disk space is sufficient.</strong></p> <div class="section" id="ALM-12034__s1ca44cb0f88942d591bb071c656d4ccc"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12034__en-us_topic_0070543608_p6600119"><strong id="ALM-12034__b11327931485">Check whether the disk space is sufficient.</strong></p>
<ol id="ALM-12034__ol947516194522"><li id="ALM-12034__li739591494522"><span>In the FusionInsight Manager portal, click <strong id="ALM-12034__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12034__b27872374104950"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12034__li488781094522"><span>In the alarm list, click <span><img id="ALM-12034__image168221113135319" src="en-us_image_0000001582807645.png"></span> in the row where the alarm is located and obtain <strong id="ALM-12034__b6656323294522">TaskName</strong> from <strong id="ALM-12034__b9723191310467">Location</strong>.</span></li><li id="ALM-12034__li644408694522"><span>Choose <strong id="ALM-12034__b4399029494522">O&amp;M</strong> &gt; <strong id="ALM-12034__b6036833394522">Backup and Restoration &gt; Backup Management</strong>.</span></li><li id="ALM-12034__li5220897594522"><span>Search for the backup task based on <strong id="ALM-12034__b0347912913">TaskName</strong> and click <strong id="ALM-12034__b20551318102819">More</strong><strong id="ALM-12034__b185711882811"> </strong>in the <strong id="ALM-12034__b43471515919">Operation</strong> column. In the displayed dialog box, click <strong id="ALM-12034__b63471511997">View History</strong> and view the task details.</span></li><li id="ALM-12034__li20896327494"><span>In the displayed dialog box and click <span><img id="ALM-12034__image5943924184912" src="en-us_image_0000001532927370.png"></span> to check whether the following message is displayed: Failed to backup xx due to insufficient disk space, move the data in the xx directory to other directories.</span><p><ul class="subitemlist" id="ALM-12034__ul450817218102"><li id="ALM-12034__li75085211107">If yes, go to <a href="#ALM-12034__li8265923133114">6</a>.</li><li id="ALM-12034__li2510182101010">If no, go to <a href="#ALM-12034__li115006411351">13</a>.</li></ul> <ol id="ALM-12034__ol947516194522"><li id="ALM-12034__li739591494522"><span>In the <span id="ALM-12034__text34789336432">MRS</span> Manager portal, click <strong id="ALM-12034__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12034__b27872374104950"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12034__li488781094522"><span>In the alarm list, click <span><img id="ALM-12034__image168221113135319" src="en-us_image_0000001582807645.png"></span> in the row where the alarm is located and obtain <strong id="ALM-12034__b6656323294522">TaskName</strong> from <strong id="ALM-12034__b9723191310467">Location</strong>.</span></li><li id="ALM-12034__li644408694522"><span>Choose <strong id="ALM-12034__b4399029494522">O&amp;M</strong> &gt; <strong id="ALM-12034__b6036833394522">Backup and Restoration &gt; Backup Management</strong>.</span></li><li id="ALM-12034__li5220897594522"><span>Search for the backup task based on <strong id="ALM-12034__b0347912913">TaskName</strong> and click <strong id="ALM-12034__b20551318102819">More</strong><strong id="ALM-12034__b185711882811"> </strong>in the <strong id="ALM-12034__b43471515919">Operation</strong> column. In the displayed dialog box, click <strong id="ALM-12034__b63471511997">View History</strong> and view the task details.</span></li><li id="ALM-12034__li20896327494"><span>In the displayed dialog box and click <span><img id="ALM-12034__image5943924184912" src="en-us_image_0000001532927370.png"></span> to check whether the following message is displayed: Failed to backup xx due to insufficient disk space, move the data in the xx directory to other directories.</span><p><ul class="subitemlist" id="ALM-12034__ul450817218102"><li id="ALM-12034__li75085211107">If yes, go to <a href="#ALM-12034__li8265923133114">6</a>.</li><li id="ALM-12034__li2510182101010">If no, go to <a href="#ALM-12034__li115006411351">13</a>.</li></ul>
</p></li><li id="ALM-12034__li8265923133114"><a name="ALM-12034__li8265923133114"></a><a name="li8265923133114"></a><span>Choose <strong id="ALM-12034__b10266192333118">Backup Path</strong> &gt; <strong id="ALM-12034__b1226611237319">View </strong>and obtain the <strong id="ALM-12034__b1626622314312">Backup Path</strong>.</span></li><li id="ALM-12034__li11760165519910"><span>Log in to the node as user <strong id="ALM-12034__b142279511119">root</strong> and run the following command to check the node mounting details:</span><p><p id="ALM-12034__p177811011105319"><span id="ALM-12034__text16214101716530"></span></p> </p></li><li id="ALM-12034__li8265923133114"><a name="ALM-12034__li8265923133114"></a><a name="li8265923133114"></a><span>Choose <strong id="ALM-12034__b10266192333118">Backup Path</strong> &gt; <strong id="ALM-12034__b1226611237319">View </strong>and obtain the <strong id="ALM-12034__b1626622314312">Backup Path</strong>.</span></li><li id="ALM-12034__li11760165519910"><span>Log in to the node as user <strong id="ALM-12034__b142279511119">root</strong> and run the following command to check the node mounting details:</span><p><p id="ALM-12034__p177811011105319"><span id="ALM-12034__text16214101716530"></span></p>
<p id="ALM-12034__p1730510253112"><strong id="ALM-12034__b13233131210719">df -h</strong></p> <p id="ALM-12034__p1730510253112"><strong id="ALM-12034__b13233131210719">df -h</strong></p>
</p></li><li id="ALM-12034__li75106309133"><span>Check whether the available space of the node to which the backup path is mounted is less than 20 GB.</span><p><ul class="subitemlist" id="ALM-12034__ul93921250101319"><li id="ALM-12034__li16393950171317">If yes, go to <a href="#ALM-12034__li181154133220">9</a>.</li><li id="ALM-12034__li1439665061320">If no, go to <a href="#ALM-12034__li115006411351">13</a>.</li></ul> </p></li><li id="ALM-12034__li75106309133"><span>Check whether the available space of the node to which the backup path is mounted is less than 20 GB.</span><p><ul class="subitemlist" id="ALM-12034__ul93921250101319"><li id="ALM-12034__li16393950171317">If yes, go to <a href="#ALM-12034__li181154133220">9</a>.</li><li id="ALM-12034__li1439665061320">If no, go to <a href="#ALM-12034__li115006411351">13</a>.</li></ul>
@ -73,7 +73,7 @@
</p></li><li id="ALM-12034__li5916521794522"><a name="ALM-12034__li5916521794522"></a><a name="li5916521794522"></a><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12034__ul4385661594522"><li id="ALM-12034__li5372883994522">If yes, no further action is required.</li><li id="ALM-12034__li5706874094522">If no, go to <a href="#ALM-12034__li115006411351">13</a>.</li></ul> </p></li><li id="ALM-12034__li5916521794522"><a name="ALM-12034__li5916521794522"></a><a name="li5916521794522"></a><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12034__ul4385661594522"><li id="ALM-12034__li5372883994522">If yes, no further action is required.</li><li id="ALM-12034__li5706874094522">If no, go to <a href="#ALM-12034__li115006411351">13</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="subitemlist" id="ALM-12034__p4445114113355"><strong id="ALM-12034__b1570250993141">Collect fault information.</strong></p> <p class="subitemlist" id="ALM-12034__p4445114113355"><strong id="ALM-12034__b1570250993141">Collect fault information.</strong></p>
<ol start="13" id="ALM-12034__ol135018418359"><li id="ALM-12034__li115006411351"><a name="ALM-12034__li115006411351"></a><a name="li115006411351"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-12034__b8500174113511">O&amp;M</strong> &gt; <strong id="ALM-12034__b4500941173512">Log &gt; Download</strong>.</span></li><li id="ALM-12034__li13500174119354"><span>Select <strong id="ALM-12034__b450034113518">Controller</strong> from the <strong id="ALM-12034__b150044112358">Service</strong> and click <strong id="ALM-12034__b3991118545">OK</strong>.</span></li><li id="ALM-12034__li2501144119351"><span>Click <span><img id="ALM-12034__image13500184111355" src="en-us_image_0000001532448214.png"></span> in the upper right corner, and set <strong id="ALM-12034__b450010417354">Start Date</strong> and <strong id="ALM-12034__b1250124110357">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12034__b1950164118356">Download</strong>.</span></li><li id="ALM-12034__li495644512588"><span>Contact the <span id="ALM-12034__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="13" id="ALM-12034__ol135018418359"><li id="ALM-12034__li115006411351"><a name="ALM-12034__li115006411351"></a><a name="li115006411351"></a><span>On the <span id="ALM-12034__text230783684413">MRS</span> Manager portal, choose <strong id="ALM-12034__b8500174113511">O&amp;M</strong> &gt; <strong id="ALM-12034__b4500941173512">Log &gt; Download</strong>.</span></li><li id="ALM-12034__li13500174119354"><span>Select <strong id="ALM-12034__b450034113518">Controller</strong> from the <strong id="ALM-12034__b150044112358">Service</strong> and click <strong id="ALM-12034__b3991118545">OK</strong>.</span></li><li id="ALM-12034__li2501144119351"><span>Click <span><img id="ALM-12034__image13500184111355" src="en-us_image_0000001532448214.png"></span> in the upper right corner, and set <strong id="ALM-12034__b450010417354">Start Date</strong> and <strong id="ALM-12034__b1250124110357">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12034__b1950164118356">Download</strong>.</span></li><li id="ALM-12034__li495644512588"><span>Contact the <span id="ALM-12034__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12034__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12034__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12034__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12034__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -64,12 +64,12 @@
<div class="section" id="ALM-12035__sacfd7ee8334740b0ba21d1763037c632"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12035__en-us_topic_0070543609_p26263014">The alarm cause depends on the task details. Handle the alarm according to the logs and alarm details.</p> <div class="section" id="ALM-12035__sacfd7ee8334740b0ba21d1763037c632"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12035__en-us_topic_0070543609_p26263014">The alarm cause depends on the task details. Handle the alarm according to the logs and alarm details.</p>
</div> </div>
<div class="section" id="ALM-12035__s9bf9cfe815d64aefa40fafcd22fe46e5"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12035__en-us_topic_0070543609_p46929400"><strong id="ALM-12035__b3404416894635">Collect fault information.</strong></p> <div class="section" id="ALM-12035__s9bf9cfe815d64aefa40fafcd22fe46e5"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12035__en-us_topic_0070543609_p46929400"><strong id="ALM-12035__b3404416894635">Collect fault information.</strong></p>
<ol id="ALM-12035__ol8912728101615"><li id="ALM-12035__li1191262861614"><span>In the FusionInsight Manager, choose <strong id="ALM-12035__b14623152119812">Cluster &gt; </strong><em id="ALM-12035__i56519211481">Name of the desired cluster</em><strong id="ALM-12035__b1162417211489"> &gt; Services</strong>, and check whether the running status of the component meets the requirements. (The OMS and DBService must be in the normal state, and other components must be stopped.)</span><p><ul id="ALM-12035__ul18912142801616"><li id="ALM-12035__li39121528141617">If yes, go to <a href="#ALM-12035__li18912172820165">9</a>.</li><li id="ALM-12035__li69121828141614">If no, go to <a href="#ALM-12035__li16912228111613">2</a>.</li></ul> <ol id="ALM-12035__ol8912728101615"><li id="ALM-12035__li1191262861614"><span>In the <span id="ALM-12035__text34789336432">MRS</span> Manager, choose <strong id="ALM-12035__b14623152119812">Cluster &gt; </strong><em id="ALM-12035__i56519211481">Name of the desired cluster</em><strong id="ALM-12035__b1162417211489"> &gt; Services</strong>, and check whether the running status of the component meets the requirements. (The OMS and DBService must be in the normal state, and other components must be stopped.)</span><p><ul id="ALM-12035__ul18912142801616"><li id="ALM-12035__li39121528141617">If yes, go to <a href="#ALM-12035__li18912172820165">9</a>.</li><li id="ALM-12035__li69121828141614">If no, go to <a href="#ALM-12035__li16912228111613">2</a>.</li></ul>
</p></li><li id="ALM-12035__li16912228111613"><a name="ALM-12035__li16912228111613"></a><a name="li16912228111613"></a><span>Restore the component status as required and start the recovery task again.</span></li><li id="ALM-12035__li49121828171617"><span>Log in to the FusionInsight Manager portal and click <strong id="ALM-12035__b0912162814167">O&amp;M &gt; Alarm<strong id="ALM-12035__b19912728151615"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12035__li591222818167"><span>In the alarm list, click <span><img id="ALM-12035__image159128280169" src="en-us_image_0000001532448366.png"></span> in the row where the alarm is located to obtain <strong id="ALM-12035__b59121128141611">TaskName</strong> from <strong id="ALM-12035__b2912162815161">Location</strong>.</span></li><li id="ALM-12035__li18912152891616"><span>Choose <strong id="ALM-12035__b12912162812167">O&amp;M</strong> &gt; <strong id="ALM-12035__b79123288163"><strong id="ALM-12035__b2912132812163">Backup and Restoration &gt; </strong>Restoration Management</strong>.</span></li><li id="ALM-12035__li1912142813165"><span>Find the restoration task by <strong id="ALM-12035__b15912528101611">Task Name</strong> and view the task details.</span></li><li id="ALM-12035__li1991218288166"><span>Perform the recovery task again and check whether the recovery task execution is successful.</span><p><ul class="subitemlist" id="ALM-12035__ul491292812164"><li id="ALM-12035__li10912122819168">If yes, go to <a href="#ALM-12035__li691272812168">8</a>.</li><li id="ALM-12035__li10912192811612">If no, go to <a href="#ALM-12035__li18912172820165">9</a>.</li></ul> </p></li><li id="ALM-12035__li16912228111613"><a name="ALM-12035__li16912228111613"></a><a name="li16912228111613"></a><span>Restore the component status as required and start the recovery task again.</span></li><li id="ALM-12035__li49121828171617"><span>Log in to the <span id="ALM-12035__text098317388449">MRS</span> Manager portal and click <strong id="ALM-12035__b0912162814167">O&amp;M &gt; Alarm<strong id="ALM-12035__b19912728151615"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12035__li591222818167"><span>In the alarm list, click <span><img id="ALM-12035__image159128280169" src="en-us_image_0000001532448366.png"></span> in the row where the alarm is located to obtain <strong id="ALM-12035__b59121128141611">TaskName</strong> from <strong id="ALM-12035__b2912162815161">Location</strong>.</span></li><li id="ALM-12035__li18912152891616"><span>Choose <strong id="ALM-12035__b12912162812167">O&amp;M</strong> &gt; <strong id="ALM-12035__b79123288163"><strong id="ALM-12035__b2912132812163">Backup and Restoration &gt; </strong>Restoration Management</strong>.</span></li><li id="ALM-12035__li1912142813165"><span>Find the restoration task by <strong id="ALM-12035__b15912528101611">Task Name</strong> and view the task details.</span></li><li id="ALM-12035__li1991218288166"><span>Perform the recovery task again and check whether the recovery task execution is successful.</span><p><ul class="subitemlist" id="ALM-12035__ul491292812164"><li id="ALM-12035__li10912122819168">If yes, go to <a href="#ALM-12035__li691272812168">8</a>.</li><li id="ALM-12035__li10912192811612">If no, go to <a href="#ALM-12035__li18912172820165">9</a>.</li></ul>
</p></li><li id="ALM-12035__li691272812168"><a name="ALM-12035__li691272812168"></a><a name="li691272812168"></a><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12035__ul191292851612"><li id="ALM-12035__li18912628181613">If yes, no further action is required.</li><li id="ALM-12035__li1991218285168">If no, go to <a href="#ALM-12035__li18912172820165">9</a>.</li></ul> </p></li><li id="ALM-12035__li691272812168"><a name="ALM-12035__li691272812168"></a><a name="li691272812168"></a><span>After 2 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12035__ul191292851612"><li id="ALM-12035__li18912628181613">If yes, no further action is required.</li><li id="ALM-12035__li1991218285168">If no, go to <a href="#ALM-12035__li18912172820165">9</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12035__en-us_topic_0070543610_p36865955"><strong id="ALM-12035__b5671597695034">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12035__en-us_topic_0070543610_p36865955"><strong id="ALM-12035__b5671597695034">Collect fault information.</strong></p>
<ol start="9" id="ALM-12035__ol17912928131615"><li id="ALM-12035__li18912172820165"><a name="ALM-12035__li18912172820165"></a><a name="li18912172820165"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-12035__b11912192811618">O&amp;M</strong> &gt; <strong id="ALM-12035__b4912112871618">Log &gt; Download</strong>.</span></li><li id="ALM-12035__li29127284164"><span>Select <strong id="ALM-12035__b1491242841616">Controller</strong> from the <strong id="ALM-12035__b9912928131617">Service</strong> and click <strong id="ALM-12035__b3991118545">OK</strong>.</span></li><li id="ALM-12035__li16912132810167"><span>Click <span><img id="ALM-12035__image119122281161" src="en-us_image_0000001583127489.png"></span> in the upper right corner, and set <strong id="ALM-12035__b4912228191616">Start Date</strong> and <strong id="ALM-12035__b19122028151619">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12035__b891272814169">Download</strong>.</span></li><li id="ALM-12035__li495644512588"><span>Contact the <span id="ALM-12035__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="9" id="ALM-12035__ol17912928131615"><li id="ALM-12035__li18912172820165"><a name="ALM-12035__li18912172820165"></a><a name="li18912172820165"></a><span>On the <span id="ALM-12035__text1319234010447">MRS</span> Manager portal, choose <strong id="ALM-12035__b11912192811618">O&amp;M</strong> &gt; <strong id="ALM-12035__b4912112871618">Log &gt; Download</strong>.</span></li><li id="ALM-12035__li29127284164"><span>Select <strong id="ALM-12035__b1491242841616">Controller</strong> from the <strong id="ALM-12035__b9912928131617">Service</strong> and click <strong id="ALM-12035__b3991118545">OK</strong>.</span></li><li id="ALM-12035__li16912132810167"><span>Click <span><img id="ALM-12035__image119122281161" src="en-us_image_0000001583127489.png"></span> in the upper right corner, and set <strong id="ALM-12035__b4912228191616">Start Date</strong> and <strong id="ALM-12035__b19122028151619">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12035__b891272814169">Download</strong>.</span></li><li id="ALM-12035__li495644512588"><span>Contact the <span id="ALM-12035__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12035__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12035__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12035__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12035__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -60,7 +60,7 @@
<div class="section" id="ALM-12037__s0c12362544fb484a832aad2e1306c715"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12037__en-us_topic_0070543611_ul59582212"><li id="ALM-12037__en-us_topic_0070543611_li66477866">The NTP server network is abnormal.</li><li id="ALM-12037__en-us_topic_0070543611_li61429885">The NTP server authentication fails.</li><li id="ALM-12037__en-us_topic_0070543611_li15998054">The NTP server time cannot be obtained.</li><li id="ALM-12037__en-us_topic_0070543611_li9764758">The time obtained from the NTP server is not continuously updated.</li></ul> <div class="section" id="ALM-12037__s0c12362544fb484a832aad2e1306c715"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12037__en-us_topic_0070543611_ul59582212"><li id="ALM-12037__en-us_topic_0070543611_li66477866">The NTP server network is abnormal.</li><li id="ALM-12037__en-us_topic_0070543611_li61429885">The NTP server authentication fails.</li><li id="ALM-12037__en-us_topic_0070543611_li15998054">The NTP server time cannot be obtained.</li><li id="ALM-12037__en-us_topic_0070543611_li9764758">The time obtained from the NTP server is not continuously updated.</li></ul>
</div> </div>
<div class="section" id="ALM-12037__s10907d0f1dcf40acb84507bb13294ade"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12037__en-us_topic_0070543611_p52747895"><strong id="ALM-12037__b154553399520">Check the NTP server network.</strong></p> <div class="section" id="ALM-12037__s10907d0f1dcf40acb84507bb13294ade"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12037__en-us_topic_0070543611_p52747895"><strong id="ALM-12037__b154553399520">Check the NTP server network.</strong></p>
<ol id="ALM-12037__ol954788095212"><li id="ALM-12037__li472667049523"><span>On the FusionInsight Manager portal, click <strong id="ALM-12037__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12037__b27872374104950"> &gt; Alarms</strong></strong> and click <span><img id="ALM-12037__image168221113135319" src="en-us_image_0000001532607750.png"></span> in the row where the alarm is located.</span></li><li id="ALM-12037__li477866769523"><span>View the alarm additional information to check whether the NTP server fails to be pinged.</span><p><ul class="subitemlist" id="ALM-12037__ul127661719523"><li id="ALM-12037__li305801229523">If yes, go to <a href="#ALM-12037__li601372919523">3</a>.</li><li id="ALM-12037__li610707879523">If no, go to <a href="#ALM-12037__li392824159523">4</a>.</li></ul> <ol id="ALM-12037__ol954788095212"><li id="ALM-12037__li472667049523"><span>On the <span id="ALM-12037__text34789336432">MRS</span> Manager portal, click <strong id="ALM-12037__b3064793094522">O&amp;M &gt; Alarm<strong id="ALM-12037__b27872374104950"> &gt; Alarms</strong></strong> and click <span><img id="ALM-12037__image168221113135319" src="en-us_image_0000001532607750.png"></span> in the row where the alarm is located.</span></li><li id="ALM-12037__li477866769523"><span>View the alarm additional information to check whether the NTP server fails to be pinged.</span><p><ul class="subitemlist" id="ALM-12037__ul127661719523"><li id="ALM-12037__li305801229523">If yes, go to <a href="#ALM-12037__li601372919523">3</a>.</li><li id="ALM-12037__li610707879523">If no, go to <a href="#ALM-12037__li392824159523">4</a>.</li></ul>
</p></li><li id="ALM-12037__li601372919523"><a name="ALM-12037__li601372919523"></a><a name="li601372919523"></a><span>Contact the network administrator to check the network configuration and ensure that the network between the NTP server and the active OMS node is normal. Then, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12037__ul628802729523"><li id="ALM-12037__li274269039523">If yes, no further action is required.</li><li id="ALM-12037__li69866969523">If no, go to <a href="#ALM-12037__li392824159523">4</a>.</li></ul> </p></li><li id="ALM-12037__li601372919523"><a name="ALM-12037__li601372919523"></a><a name="li601372919523"></a><span>Contact the network administrator to check the network configuration and ensure that the network between the NTP server and the active OMS node is normal. Then, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12037__ul628802729523"><li id="ALM-12037__li274269039523">If yes, no further action is required.</li><li id="ALM-12037__li69866969523">If no, go to <a href="#ALM-12037__li392824159523">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12037__p290515429523"><strong id="ALM-12037__b4031175895218">Check whether the NTP server authentication fails.</strong></p> <p class="tableheading" id="ALM-12037__p290515429523"><strong id="ALM-12037__b4031175895218">Check whether the NTP server authentication fails.</strong></p>
@ -83,7 +83,7 @@
</p></li><li id="ALM-12037__li251290419523"><a name="ALM-12037__li251290419523"></a><a name="li251290419523"></a><span>Contact the provider of the NTP server to rectify the NTP server fault. After the NTP server is normal, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12037__ul185373339523"><li id="ALM-12037__li28791669523">If yes, no further action is required.</li><li id="ALM-12037__li318858659523">If no, go to <a href="#ALM-12037__li654599509523">12</a>.</li></ul> </p></li><li id="ALM-12037__li251290419523"><a name="ALM-12037__li251290419523"></a><a name="li251290419523"></a><span>Contact the provider of the NTP server to rectify the NTP server fault. After the NTP server is normal, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12037__ul185373339523"><li id="ALM-12037__li28791669523">If yes, no further action is required.</li><li id="ALM-12037__li318858659523">If no, go to <a href="#ALM-12037__li654599509523">12</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12037__p326182779523"><strong id="ALM-12037__b4063632295325">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12037__p326182779523"><strong id="ALM-12037__b4063632295325">Collect fault information.</strong></p>
<ol start="12" id="ALM-12037__ol2743408695328"><li id="ALM-12037__li654599509523"><a name="ALM-12037__li654599509523"></a><a name="li654599509523"></a><span>On the FusionInsight Manager, choose <strong id="ALM-12037__b248347779523">O&amp;M</strong> &gt; <strong id="ALM-12037__b221864089523">Log &gt; Download</strong>.</span></li><li id="ALM-12037__li82842029523"><span>Select <strong id="ALM-12037__b522686449523">NodeAgent</strong> and <strong id="ALM-12037__b6557569523">OmmServer</strong> from the <strong id="ALM-12037__b59018059523">Service</strong> and click <strong id="ALM-12037__b3991118545">OK</strong>.</span></li><li id="ALM-12037__li1145664103113"><span>Click <span><img id="ALM-12037__image1945644173117" src="en-us_image_0000001582927645.png"></span> in the upper right corner, and set <strong id="ALM-12037__b6456941173117">Start Date</strong> and <strong id="ALM-12037__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12037__b13456164113319">Download</strong>.</span></li><li id="ALM-12037__li495644512588"><span>Contact the <span id="ALM-12037__text106761111141812">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="12" id="ALM-12037__ol2743408695328"><li id="ALM-12037__li654599509523"><a name="ALM-12037__li654599509523"></a><a name="li654599509523"></a><span>On the <span id="ALM-12037__text48182045204410">MRS</span> Manager, choose <strong id="ALM-12037__b248347779523">O&amp;M</strong> &gt; <strong id="ALM-12037__b221864089523">Log &gt; Download</strong>.</span></li><li id="ALM-12037__li82842029523"><span>Select <strong id="ALM-12037__b522686449523">NodeAgent</strong> and <strong id="ALM-12037__b6557569523">OmmServer</strong> from the <strong id="ALM-12037__b59018059523">Service</strong> and click <strong id="ALM-12037__b3991118545">OK</strong>.</span></li><li id="ALM-12037__li1145664103113"><span>Click <span><img id="ALM-12037__image1945644173117" src="en-us_image_0000001582927645.png"></span> in the upper right corner, and set <strong id="ALM-12037__b6456941173117">Start Date</strong> and <strong id="ALM-12037__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12037__b13456164113319">Download</strong>.</span></li><li id="ALM-12037__li495644512588"><span>Contact the <span id="ALM-12037__text106761111141812">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12037__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12037__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12037__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12037__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -1,7 +1,7 @@
<a name="ALM-12038"></a><a name="ALM-12038"></a> <a name="ALM-12038"></a><a name="ALM-12038"></a>
<h1 class="topictitle1">ALM-12038 Monitoring Indicator Dumping Failure</h1> <h1 class="topictitle1">ALM-12038 Monitoring Indicator Dumping Failure</h1>
<div id="body63636060"><div class="section" id="ALM-12038__s8e9121c2c414434483ea97a53f56b6a3"><h4 class="sectiontitle">Description</h4><p id="ALM-12038__en-us_topic_0070543612_p1601797">After monitoring indicator dumping is configured on FusionInsight Manager, the system checks the monitoring indicator dumping result at the dumping interval (60 seconds by default). This alarm is generated when the dumping fails.</p> <div id="body63636060"><div class="section" id="ALM-12038__s8e9121c2c414434483ea97a53f56b6a3"><h4 class="sectiontitle">Description</h4><p id="ALM-12038__en-us_topic_0070543612_p1601797">After monitoring indicator dumping is configured on <span id="ALM-12038__text34789336432">MRS</span> Manager, the system checks the monitoring indicator dumping result at the dumping interval (60 seconds by default). This alarm is generated when the dumping fails.</p>
<p id="ALM-12038__en-us_topic_0070543612_p14416173">This alarm is cleared when dumping is successful.</p> <p id="ALM-12038__en-us_topic_0070543612_p14416173">This alarm is cleared when dumping is successful.</p>
</div> </div>
<div class="section" id="ALM-12038__s31d8d809c03f4781ab23dff587f0e76c"><h4 class="sectiontitle">Attribute</h4> <div class="section" id="ALM-12038__s31d8d809c03f4781ab23dff587f0e76c"><h4 class="sectiontitle">Attribute</h4>
@ -55,12 +55,12 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12038__s00b9cb8c5c10409681288b82523f4a66"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12038__en-us_topic_0070543612_p14566108">The upper-layer management system cannot obtain monitoring indicators from the FusionInsight Manager system.</p> <div class="section" id="ALM-12038__s00b9cb8c5c10409681288b82523f4a66"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12038__en-us_topic_0070543612_p14566108">The upper-layer management system cannot obtain monitoring indicators from the <span id="ALM-12038__text5481134815444">MRS</span> Manager system.</p>
</div> </div>
<div class="section" id="ALM-12038__s4e59ea22202b4f69831fdaa7a30f2974"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12038__en-us_topic_0070543612_ul39004066"><li id="ALM-12038__en-us_topic_0070543612_li15492278">The server cannot be connected.</li><li id="ALM-12038__en-us_topic_0070543612_li5212777">The save path on the server cannot be accessed.</li><li id="ALM-12038__en-us_topic_0070543612_li46914996">The monitoring indicator file fails to be uploaded.</li></ul> <div class="section" id="ALM-12038__s4e59ea22202b4f69831fdaa7a30f2974"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12038__en-us_topic_0070543612_ul39004066"><li id="ALM-12038__en-us_topic_0070543612_li15492278">The server cannot be connected.</li><li id="ALM-12038__en-us_topic_0070543612_li5212777">The save path on the server cannot be accessed.</li><li id="ALM-12038__en-us_topic_0070543612_li46914996">The monitoring indicator file fails to be uploaded.</li></ul>
</div> </div>
<div class="section" id="ALM-12038__s78696e4fd8994f578e59819068f88bd9"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12038__en-us_topic_0070543612_p42018335"><strong id="ALM-12038__b39938613103615">Check whether the server connection is normal.</strong></p> <div class="section" id="ALM-12038__s78696e4fd8994f578e59819068f88bd9"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12038__en-us_topic_0070543612_p42018335"><strong id="ALM-12038__b39938613103615">Check whether the server connection is normal.</strong></p>
<ol id="ALM-12038__ol614335103629"><li id="ALM-12038__li55118711103617"><span>Check whether the network between the FusionInsight Manager system and the server is normal.</span><p><ul class="subitemlist" id="ALM-12038__ul50863543103617"><li id="ALM-12038__li35971633103617">If yes, go to <a href="#ALM-12038__li44378490103617">3</a>.</li><li id="ALM-12038__li28021126103617">If no, go to <a href="#ALM-12038__li59131350103617">2</a>.</li></ul> <ol id="ALM-12038__ol614335103629"><li id="ALM-12038__li55118711103617"><span>Check whether the network between the <span id="ALM-12038__text5743154924417">MRS</span> Manager system and the server is normal.</span><p><ul class="subitemlist" id="ALM-12038__ul50863543103617"><li id="ALM-12038__li35971633103617">If yes, go to <a href="#ALM-12038__li44378490103617">3</a>.</li><li id="ALM-12038__li28021126103617">If no, go to <a href="#ALM-12038__li59131350103617">2</a>.</li></ul>
</p></li><li id="ALM-12038__li59131350103617"><a name="ALM-12038__li59131350103617"></a><a name="li59131350103617"></a><span>Contact the network administrator to recover the network and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12038__ul51309392103617"><li id="ALM-12038__li26306358103617">If yes, no further action is required.</li><li id="ALM-12038__li50440286103617">If no, go to <a href="#ALM-12038__li44378490103617">3</a>.</li></ul> </p></li><li id="ALM-12038__li59131350103617"><a name="ALM-12038__li59131350103617"></a><a name="li59131350103617"></a><span>Contact the network administrator to recover the network and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12038__ul51309392103617"><li id="ALM-12038__li26306358103617">If yes, no further action is required.</li><li id="ALM-12038__li50440286103617">If no, go to <a href="#ALM-12038__li44378490103617">3</a>.</li></ul>
</p></li><li id="ALM-12038__li44378490103617"><a name="ALM-12038__li44378490103617"></a><a name="li44378490103617"></a><span>Choose <strong id="ALM-12038__b62420103103617">System</strong> &gt; <strong id="ALM-12038__b24910022103617"><strong id="ALM-12038__b1861155518585">Interconnection</strong> &gt; Upload Performance Data</strong> and check whether the FTP username, password, port, dump mode, and public key configured on the upload performance data page are consistent with the configuration on the server.</span><p><ul class="subitemlist" id="ALM-12038__ul19844024103617"><li id="ALM-12038__li4445911103617">If yes, go to <a href="#ALM-12038__li31439394103617">5</a>.</li><li id="ALM-12038__li24574512103617">If no, go to <a href="#ALM-12038__li38260071103617">4</a>.</li></ul> </p></li><li id="ALM-12038__li44378490103617"><a name="ALM-12038__li44378490103617"></a><a name="li44378490103617"></a><span>Choose <strong id="ALM-12038__b62420103103617">System</strong> &gt; <strong id="ALM-12038__b24910022103617"><strong id="ALM-12038__b1861155518585">Interconnection</strong> &gt; Upload Performance Data</strong> and check whether the FTP username, password, port, dump mode, and public key configured on the upload performance data page are consistent with the configuration on the server.</span><p><ul class="subitemlist" id="ALM-12038__ul19844024103617"><li id="ALM-12038__li4445911103617">If yes, go to <a href="#ALM-12038__li31439394103617">5</a>.</li><li id="ALM-12038__li24574512103617">If no, go to <a href="#ALM-12038__li38260071103617">4</a>.</li></ul>
</p></li><li id="ALM-12038__li38260071103617"><a name="ALM-12038__li38260071103617"></a><a name="li38260071103617"></a><span>Enter the correct configuration information, click <strong id="ALM-12038__b63862097103617">OK</strong>, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12038__ul38583553103617"><li id="ALM-12038__li37887965103617">If yes, no further action is required.</li><li id="ALM-12038__li49026304103617">If no, go to <a href="#ALM-12038__li31439394103617">5</a>.</li></ul> </p></li><li id="ALM-12038__li38260071103617"><a name="ALM-12038__li38260071103617"></a><a name="li38260071103617"></a><span>Enter the correct configuration information, click <strong id="ALM-12038__b63862097103617">OK</strong>, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12038__ul38583553103617"><li id="ALM-12038__li37887965103617">If yes, no further action is required.</li><li id="ALM-12038__li49026304103617">If no, go to <a href="#ALM-12038__li31439394103617">5</a>.</li></ul>
@ -76,7 +76,7 @@
</p></li><li id="ALM-12038__li53095195103617"><a name="ALM-12038__li53095195103617"></a><a name="li53095195103617"></a><span>Delete unnecessary files or go to the monitoring indicator dumping configuration page to change the save path. Then, check whether the save path has sufficient disk space.</span><p><ul class="subitemlist" id="ALM-12038__ul35452684103617"><li id="ALM-12038__li50587406103617">If yes, no further action is required.</li><li id="ALM-12038__li3939187103617">If no, go to <a href="#ALM-12038__li51692141103617">11</a>.</li></ul> </p></li><li id="ALM-12038__li53095195103617"><a name="ALM-12038__li53095195103617"></a><a name="li53095195103617"></a><span>Delete unnecessary files or go to the monitoring indicator dumping configuration page to change the save path. Then, check whether the save path has sufficient disk space.</span><p><ul class="subitemlist" id="ALM-12038__ul35452684103617"><li id="ALM-12038__li50587406103617">If yes, no further action is required.</li><li id="ALM-12038__li3939187103617">If no, go to <a href="#ALM-12038__li51692141103617">11</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12038__p50638708103617"><strong id="ALM-12038__b6018239103721">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12038__p50638708103617"><strong id="ALM-12038__b6018239103721">Collect fault information.</strong></p>
<ol start="11" id="ALM-12038__ol22765086103724"><li id="ALM-12038__li51692141103617"><a name="ALM-12038__li51692141103617"></a><a name="li51692141103617"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-12038__b2056231918912">O&amp;M</strong> &gt; <strong id="ALM-12038__b5743571103617">Log &gt; Download</strong>.</span></li><li id="ALM-12038__li51051832103617"><span>Select <strong id="ALM-12038__b1352831932712">OMS</strong> from the <strong id="ALM-12038__b26313908103617">Service</strong> and click <strong id="ALM-12038__b3991118545">OK</strong>.</span></li><li id="ALM-12038__li1145664103113"><span>Click <span><img id="ALM-12038__image1945644173117" src="en-us_image_0000001532927442.png"></span> in the upper right corner, and set <strong id="ALM-12038__b6456941173117">Start Date</strong> and <strong id="ALM-12038__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12038__b13456164113319">Download</strong>.</span></li><li id="ALM-12038__li495644512588"><span>Contact the <span id="ALM-12038__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="11" id="ALM-12038__ol22765086103724"><li id="ALM-12038__li51692141103617"><a name="ALM-12038__li51692141103617"></a><a name="li51692141103617"></a><span>On the <span id="ALM-12038__text59921550134418">MRS</span> Manager portal, choose <strong id="ALM-12038__b2056231918912">O&amp;M</strong> &gt; <strong id="ALM-12038__b5743571103617">Log &gt; Download</strong>.</span></li><li id="ALM-12038__li51051832103617"><span>Select <strong id="ALM-12038__b1352831932712">OMS</strong> from the <strong id="ALM-12038__b26313908103617">Service</strong> and click <strong id="ALM-12038__b3991118545">OK</strong>.</span></li><li id="ALM-12038__li1145664103113"><span>Click <span><img id="ALM-12038__image1945644173117" src="en-us_image_0000001532927442.png"></span> in the upper right corner, and set <strong id="ALM-12038__b6456941173117">Start Date</strong> and <strong id="ALM-12038__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12038__b13456164113319">Download</strong>.</span></li><li id="ALM-12038__li495644512588"><span>Contact the <span id="ALM-12038__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12038__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12038__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12038__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12038__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -75,7 +75,7 @@
<div class="section" id="ALM-12039__s2aec9ff7cd804900af8457f84b365f70"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12039__en-us_topic_0070543613_ul18926225"><li id="ALM-12039__en-us_topic_0070543613_li36118300">The network between the active and standby nodes is unstable.</li><li id="ALM-12039__en-us_topic_0070543613_li56629250">The standby OMS Database is abnormal.</li><li id="ALM-12039__en-us_topic_0070543613_li39901202">The standby node disk space is full.</li></ul> <div class="section" id="ALM-12039__s2aec9ff7cd804900af8457f84b365f70"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12039__en-us_topic_0070543613_ul18926225"><li id="ALM-12039__en-us_topic_0070543613_li36118300">The network between the active and standby nodes is unstable.</li><li id="ALM-12039__en-us_topic_0070543613_li56629250">The standby OMS Database is abnormal.</li><li id="ALM-12039__en-us_topic_0070543613_li39901202">The standby node disk space is full.</li></ul>
</div> </div>
<div class="section" id="ALM-12039__s3db2913b091445d59edc8bff2fa84546"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12039__en-us_topic_0070543613_p10771936"><strong id="ALM-12039__b4973408104948">Check whether the network between the active and standby nodes is normal.</strong></p> <div class="section" id="ALM-12039__s3db2913b091445d59edc8bff2fa84546"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12039__en-us_topic_0070543613_p10771936"><strong id="ALM-12039__b4973408104948">Check whether the network between the active and standby nodes is normal.</strong></p>
<ol id="ALM-12039__ol3742838310508"><li id="ALM-12039__li49524776104950"><span>Log in to FusionInsight Manager, click <strong id="ALM-12039__b27872374104950">O&amp;M &gt; Alarm<strong id="ALM-12039__b2084142661316"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12039__image168221113135319" src="en-us_image_0000001582927757.png"></span> in the row where the alarm is located, and query the standby OMS Database IP address.</span></li><li id="ALM-12039__li52083901104950"><span>Log in to the active OMS Database node as user <strong id="ALM-12039__b43069802104950">root</strong>. <span id="ALM-12039__text43649449460"></span></span></li><li id="ALM-12039__li5718024104950"><span>Run the <strong id="ALM-12039__b66101931104950">ping </strong><em id="ALM-12039__i58046467104950">Standby OMS Database heartbeat IP address</em> command to check whether the standby OMS Database node is reachable.</span><p><ul class="subitemlist" id="ALM-12039__ul635336104950"><li id="ALM-12039__li4143393104950">If yes, go to <a href="#ALM-12039__li19362442104950">6</a>.</li><li id="ALM-12039__li70592104950">If no, go to <a href="#ALM-12039__li36080609104950">4</a>.</li></ul> <ol id="ALM-12039__ol3742838310508"><li id="ALM-12039__li49524776104950"><span>Log in to <span id="ALM-12039__text34789336432">MRS</span> Manager, click <strong id="ALM-12039__b27872374104950">O&amp;M &gt; Alarm<strong id="ALM-12039__b2084142661316"> &gt; Alarms</strong></strong>, click <span><img id="ALM-12039__image168221113135319" src="en-us_image_0000001582927757.png"></span> in the row where the alarm is located, and query the standby OMS Database IP address.</span></li><li id="ALM-12039__li52083901104950"><span>Log in to the active OMS Database node as user <strong id="ALM-12039__b43069802104950">root</strong>. <span id="ALM-12039__text43649449460"></span></span></li><li id="ALM-12039__li5718024104950"><span>Run the <strong id="ALM-12039__b66101931104950">ping </strong><em id="ALM-12039__i58046467104950">Standby OMS Database heartbeat IP address</em> command to check whether the standby OMS Database node is reachable.</span><p><ul class="subitemlist" id="ALM-12039__ul635336104950"><li id="ALM-12039__li4143393104950">If yes, go to <a href="#ALM-12039__li19362442104950">6</a>.</li><li id="ALM-12039__li70592104950">If no, go to <a href="#ALM-12039__li36080609104950">4</a>.</li></ul>
</p></li><li id="ALM-12039__li36080609104950"><a name="ALM-12039__li36080609104950"></a><a name="li36080609104950"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12039__ul18922037104950"><li id="ALM-12039__li60506784104950">If yes, go to <a href="#ALM-12039__li35036231104950">5</a>.</li><li id="ALM-12039__li2102448104950">If no, go to <a href="#ALM-12039__li19362442104950">6</a>.</li></ul> </p></li><li id="ALM-12039__li36080609104950"><a name="ALM-12039__li36080609104950"></a><a name="li36080609104950"></a><span>Contact the network administrator to check whether the network is faulty.</span><p><ul class="subitemlist" id="ALM-12039__ul18922037104950"><li id="ALM-12039__li60506784104950">If yes, go to <a href="#ALM-12039__li35036231104950">5</a>.</li><li id="ALM-12039__li2102448104950">If no, go to <a href="#ALM-12039__li19362442104950">6</a>.</li></ul>
</p></li><li id="ALM-12039__li35036231104950"><a name="ALM-12039__li35036231104950"></a><a name="li35036231104950"></a><span>Rectify the network fault and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12039__ul31915716104950"><li id="ALM-12039__li56290029104950">If yes, no further action is required.</li><li id="ALM-12039__li63198514104950">If no, go to <a href="#ALM-12039__li19362442104950">6</a>.</li></ul> </p></li><li id="ALM-12039__li35036231104950"><a name="ALM-12039__li35036231104950"></a><a name="li35036231104950"></a><span>Rectify the network fault and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12039__ul31915716104950"><li id="ALM-12039__li56290029104950">If yes, no further action is required.</li><li id="ALM-12039__li63198514104950">If no, go to <a href="#ALM-12039__li19362442104950">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
@ -89,7 +89,7 @@
</p></li><li id="ALM-12039__li27597409104950"><a name="ALM-12039__li27597409104950"></a><a name="li27597409104950"></a><span>Expand the disk capacity.</span></li><li id="ALM-12039__li21260851104950"><span>After the disk capacity is expanded, wait 2 minutes and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12039__ul6890515104950"><li id="ALM-12039__li47050096104950">If yes, no further action is required.</li><li id="ALM-12039__li52961395104950">If no, go to <a href="#ALM-12039__li64121842104950">16</a>.</li></ul> </p></li><li id="ALM-12039__li27597409104950"><a name="ALM-12039__li27597409104950"></a><a name="li27597409104950"></a><span>Expand the disk capacity.</span></li><li id="ALM-12039__li21260851104950"><span>After the disk capacity is expanded, wait 2 minutes and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12039__ul6890515104950"><li id="ALM-12039__li47050096104950">If yes, no further action is required.</li><li id="ALM-12039__li52961395104950">If no, go to <a href="#ALM-12039__li64121842104950">16</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12039__p62014640104950"><strong id="ALM-12039__b2323110410512">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12039__p62014640104950"><strong id="ALM-12039__b2323110410512">Collect fault information.</strong></p>
<ol start="16" id="ALM-12039__ol6204213010516"><li id="ALM-12039__li64121842104950"><a name="ALM-12039__li64121842104950"></a><a name="li64121842104950"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-12039__b57129933104950">O&amp;M</strong> &gt; <strong id="ALM-12039__b44407351104950">Log &gt; Download</strong>.</span></li><li id="ALM-12039__li65050230104950"><span>Select <strong id="ALM-12039__b40225672104950">OMMServer</strong> from the <strong id="ALM-12039__b26486728104950">Service</strong> and click <strong id="ALM-12039__b3991118545">OK</strong>.</span></li><li id="ALM-12039__li1145664103113"><span>Click <span><img id="ALM-12039__image1945644173117" src="en-us_image_0000001532448378.png"></span> in the upper right corner, and set <strong id="ALM-12039__b6456941173117">Start Date</strong> and <strong id="ALM-12039__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12039__b13456164113319">Download</strong>.</span></li><li id="ALM-12039__li495644512588"><span>Contact the <span id="ALM-12039__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="16" id="ALM-12039__ol6204213010516"><li id="ALM-12039__li64121842104950"><a name="ALM-12039__li64121842104950"></a><a name="li64121842104950"></a><span>On the <span id="ALM-12039__text38361453104413">MRS</span> Manager portal, choose <strong id="ALM-12039__b57129933104950">O&amp;M</strong> &gt; <strong id="ALM-12039__b44407351104950">Log &gt; Download</strong>.</span></li><li id="ALM-12039__li65050230104950"><span>Select <strong id="ALM-12039__b40225672104950">OMMServer</strong> from the <strong id="ALM-12039__b26486728104950">Service</strong> and click <strong id="ALM-12039__b3991118545">OK</strong>.</span></li><li id="ALM-12039__li1145664103113"><span>Click <span><img id="ALM-12039__image1945644173117" src="en-us_image_0000001532448378.png"></span> in the upper right corner, and set <strong id="ALM-12039__b6456941173117">Start Date</strong> and <strong id="ALM-12039__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12039__b13456164113319">Download</strong>.</span></li><li id="ALM-12039__li495644512588"><span>Contact the <span id="ALM-12039__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12039__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12039__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12039__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12039__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -64,7 +64,7 @@
<div class="section" id="ALM-12040__section35878843"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12040__ul82194524379"><li id="ALM-12040__li16219652113719">rng-tools or haveged has not been installed or started.</li><li id="ALM-12040__li22191352123720">The entropy of the OS is smaller than 100 for multiple consecutive times.</li></ul> <div class="section" id="ALM-12040__section35878843"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12040__ul82194524379"><li id="ALM-12040__li16219652113719">rng-tools or haveged has not been installed or started.</li><li id="ALM-12040__li22191352123720">The entropy of the OS is smaller than 100 for multiple consecutive times.</li></ul>
</div> </div>
<div class="section" id="ALM-12040__section54474133"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12040__p41910029"><strong id="ALM-12040__b47474410105652">Check whether haveged or rng-tools has been installed or started.</strong></p> <div class="section" id="ALM-12040__section54474133"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12040__p41910029"><strong id="ALM-12040__b47474410105652">Check whether haveged or rng-tools has been installed or started.</strong></p>
<ol id="ALM-12040__ol4071338310572"><li id="ALM-12040__li30761529105655"><span>Log in to FusionInsight Manager and choose <strong id="ALM-12040__b1144914633011">O&amp;M</strong> &gt; <strong id="ALM-12040__b14501663300">Alarm</strong> &gt; <strong id="ALM-12040__b84514673013">Alarms</strong>.</span></li><li id="ALM-12040__li8418311105655"><span>Check the value of <strong id="ALM-12040__b078362362511">HostName</strong> in the <strong id="ALM-12040__b15793142362519">Location</strong> area to obtain the name of the host for which the alarm is generated.</span></li><li id="ALM-12040__li10794601105655"><span>Log in to the node for which the alarm is generated as user <strong id="ALM-12040__b8655940105655">root</strong>. <span id="ALM-12040__text23715444267"></span></span></li><li id="ALM-12040__li1325705155017"><span>Run the <strong id="ALM-12040__b925735155015">/bin/rpm -qa | grep -w "haveged"</strong> command to check the haveged installation status and check whether the command output is empty.</span><p><ul id="ALM-12040__ul13139362503"><li id="ALM-12040__li08387016232">If yes, go to <a href="#ALM-12040__li978924652119">6</a>.</li><li id="ALM-12040__li38381500230">If no, go to <a href="#ALM-12040__li35057727105655">5</a>.</li></ul> <ol id="ALM-12040__ol4071338310572"><li id="ALM-12040__li30761529105655"><span>Log in to <span id="ALM-12040__text34789336432">MRS</span> Manager and choose <strong id="ALM-12040__b1144914633011">O&amp;M</strong> &gt; <strong id="ALM-12040__b14501663300">Alarm</strong> &gt; <strong id="ALM-12040__b84514673013">Alarms</strong>.</span></li><li id="ALM-12040__li8418311105655"><span>Check the value of <strong id="ALM-12040__b078362362511">HostName</strong> in the <strong id="ALM-12040__b15793142362519">Location</strong> area to obtain the name of the host for which the alarm is generated.</span></li><li id="ALM-12040__li10794601105655"><span>Log in to the node for which the alarm is generated as user <strong id="ALM-12040__b8655940105655">root</strong>. <span id="ALM-12040__text23715444267"></span></span></li><li id="ALM-12040__li1325705155017"><span>Run the <strong id="ALM-12040__b925735155015">/bin/rpm -qa | grep -w "haveged"</strong> command to check the haveged installation status and check whether the command output is empty.</span><p><ul id="ALM-12040__ul13139362503"><li id="ALM-12040__li08387016232">If yes, go to <a href="#ALM-12040__li978924652119">6</a>.</li><li id="ALM-12040__li38381500230">If no, go to <a href="#ALM-12040__li35057727105655">5</a>.</li></ul>
</p></li><li id="ALM-12040__li35057727105655"><a name="ALM-12040__li35057727105655"></a><a name="li35057727105655"></a><span>Run the <strong id="ALM-12040__b1947512105655">/sbin/service haveged status |grep "running"</strong> command and check the command output.</span><p><ul class="subitemlist" id="ALM-12040__ul41178005105655"><li id="ALM-12040__li23530779105655">If the command is executed successfully, haveged has been installed and configured correctly and is running properly. Go to <a href="#ALM-12040__li22912175218">8</a>.</li><li id="ALM-12040__li26944955105655">If the command fails to execute, haveged is not running properly. Run the following command to manually restart haveged and go to <a href="#ALM-12040__li20231214524">9</a>:<p class="subitemlist" id="ALM-12040__p17261031175416"><strong id="ALM-12040__b1692510321938">systemctl restart haveged.service</strong></p> </p></li><li id="ALM-12040__li35057727105655"><a name="ALM-12040__li35057727105655"></a><a name="li35057727105655"></a><span>Run the <strong id="ALM-12040__b1947512105655">/sbin/service haveged status |grep "running"</strong> command and check the command output.</span><p><ul class="subitemlist" id="ALM-12040__ul41178005105655"><li id="ALM-12040__li23530779105655">If the command is executed successfully, haveged has been installed and configured correctly and is running properly. Go to <a href="#ALM-12040__li22912175218">8</a>.</li><li id="ALM-12040__li26944955105655">If the command fails to execute, haveged is not running properly. Run the following command to manually restart haveged and go to <a href="#ALM-12040__li20231214524">9</a>:<p class="subitemlist" id="ALM-12040__p17261031175416"><strong id="ALM-12040__b1692510321938">systemctl restart haveged.service</strong></p>
</li></ul> </li></ul>
</p></li><li id="ALM-12040__li978924652119"><a name="ALM-12040__li978924652119"></a><a name="li978924652119"></a><span>Run the <strong id="ALM-12040__b47084090105655">/bin/rpm -qa | grep -w "rng-tools"</strong> command to check the rng-tools installation and check whether the command output is empty.</span><p><ul id="ALM-12040__ul16856143695418"><li id="ALM-12040__li185643665416">If yes, contact the OS vendor to install and start haveged or rng-tools. Then go to <a href="#ALM-12040__li20231214524">9</a>.</li><li id="ALM-12040__li4856103665412">If no, go to <a href="#ALM-12040__li34867421105655">7</a>.</li></ul> </p></li><li id="ALM-12040__li978924652119"><a name="ALM-12040__li978924652119"></a><a name="li978924652119"></a><span>Run the <strong id="ALM-12040__b47084090105655">/bin/rpm -qa | grep -w "rng-tools"</strong> command to check the rng-tools installation and check whether the command output is empty.</span><p><ul id="ALM-12040__ul16856143695418"><li id="ALM-12040__li185643665416">If yes, contact the OS vendor to install and start haveged or rng-tools. Then go to <a href="#ALM-12040__li20231214524">9</a>.</li><li id="ALM-12040__li4856103665412">If no, go to <a href="#ALM-12040__li34867421105655">7</a>.</li></ul>
@ -94,7 +94,7 @@ Restart=always</pre>
</p></li><li id="ALM-12040__li20231214524"><a name="ALM-12040__li20231214524"></a><a name="li20231214524"></a><span>Wait until the system to check the entropy at 00:00 on the following day and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12040__ul17214121526"><li id="ALM-12040__li172812165218">If yes, no further action is required.</li><li id="ALM-12040__li10211245210">If no, go to <a href="#ALM-12040__li5962839105655">10</a>.</li></ul> </p></li><li id="ALM-12040__li20231214524"><a name="ALM-12040__li20231214524"></a><a name="li20231214524"></a><span>Wait until the system to check the entropy at 00:00 on the following day and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12040__ul17214121526"><li id="ALM-12040__li172812165218">If yes, no further action is required.</li><li id="ALM-12040__li10211245210">If no, go to <a href="#ALM-12040__li5962839105655">10</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12040__p39013326105655"><strong id="ALM-12040__b15098459105711">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12040__p39013326105655"><strong id="ALM-12040__b15098459105711">Collect fault information.</strong></p>
<ol start="10" id="ALM-12040__ol3438675910577"><li id="ALM-12040__li5962839105655"><a name="ALM-12040__li5962839105655"></a><a name="li5962839105655"></a><span>On FusionInsight Manager, choose <strong id="ALM-12040__b15129118135012">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12040__b913828115012">Log</strong> &gt; <strong id="ALM-12040__b131389811500">Download</strong>.</span></li><li id="ALM-12040__li53665559105655"><span>Select <strong id="ALM-12040__b168670067183456">NodeAgent</strong> for <strong id="ALM-12040__b77671734683456">Service</strong> and click <strong id="ALM-12040__b26186472983456">OK</strong>.</span></li><li id="ALM-12040__li13227985105655"><span>Click <span><img id="ALM-12040__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12040__b357114351501">Start Date</strong> and <strong id="ALM-12040__b1572183555014">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12040__b18573163555012">Download</strong>.</span></li><li id="ALM-12040__li64833892105655"><span>Contact <span id="ALM-12040__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="10" id="ALM-12040__ol3438675910577"><li id="ALM-12040__li5962839105655"><a name="ALM-12040__li5962839105655"></a><a name="li5962839105655"></a><span>On <span id="ALM-12040__text2054175616445">MRS</span> Manager, choose <strong id="ALM-12040__b15129118135012">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12040__b913828115012">Log</strong> &gt; <strong id="ALM-12040__b131389811500">Download</strong>.</span></li><li id="ALM-12040__li53665559105655"><span>Select <strong id="ALM-12040__b168670067183456">NodeAgent</strong> for <strong id="ALM-12040__b77671734683456">Service</strong> and click <strong id="ALM-12040__b26186472983456">OK</strong>.</span></li><li id="ALM-12040__li13227985105655"><span>Click <span><img id="ALM-12040__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12040__b357114351501">Start Date</strong> and <strong id="ALM-12040__b1572183555014">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12040__b18573163555012">Download</strong>.</span></li><li id="ALM-12040__li64833892105655"><span>Contact <span id="ALM-12040__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12040__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12040__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12040__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12040__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -65,7 +65,7 @@
<div class="section" id="ALM-12041__s40c63dc25cc84bfc9e3241365ab0f0bd"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12041__p1177141835411">The file permission is abnormal or the file is lost due to a user manually modified information such as the file permission, user, and user group, or the system is powered off unexpectedly.</p> <div class="section" id="ALM-12041__s40c63dc25cc84bfc9e3241365ab0f0bd"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12041__p1177141835411">The file permission is abnormal or the file is lost due to a user manually modified information such as the file permission, user, and user group, or the system is powered off unexpectedly.</p>
</div> </div>
<div class="section" id="ALM-12041__s97497fe5175042fc8f02531ea6f82aa1"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12041__en-us_topic_0070543616_p53128798"><strong id="ALM-12041__b5076943711011">Check whether the abnormal file exists and whether the permission on the abnormal file is correct.</strong></p> <div class="section" id="ALM-12041__s97497fe5175042fc8f02531ea6f82aa1"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12041__en-us_topic_0070543616_p53128798"><strong id="ALM-12041__b5076943711011">Check whether the abnormal file exists and whether the permission on the abnormal file is correct.</strong></p>
<ol id="ALM-12041__ol5738633911023"><li id="ALM-12041__li4564192111014"><span>On the FusionInsight Manager portal, choose <strong id="ALM-12041__b2744094511014">O&amp;M &gt; Alarm<strong id="ALM-12041__b27872374104950"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12041__li5407306011014"><span>Check the value of <strong id="ALM-12041__b812410911014">HostName</strong> to obtain the host name involved in this alarm. Check the value of <strong id="ALM-12041__b600811711014">PathName</strong> to obtain the path or name of the abnormal file.</span></li><li id="ALM-12041__li1784176011014"><span>Log in to the node for which the alarm is generated as user <strong id="ALM-12041__b1689549811014">root</strong>. <span id="ALM-12041__text43649449460"></span></span></li><li id="ALM-12041__li2193450211014"><span>Run the <strong id="ALM-12041__b2635812011014">ll </strong><em id="ALM-12041__i3589648911014">pathName</em> command, where <em id="ALM-12041__i5463295011014">pathName</em> indicates the name of the abnormal file to obtain the user, permission, and user group information about the file or directory.</span></li><li id="ALM-12041__li1834285111014"><a name="ALM-12041__li1834285111014"></a><a name="li1834285111014"></a><span>Go to <strong id="ALM-12041__b6319279611014">${BIGDATA_HOME}/om-agent/nodeagent/etc/agent/autocheck</strong> directory. Then run the <strong id="ALM-12041__b3186425611014">vi keyfile</strong> command and search for the name of the abnormal file and check the due permission of the file.</span><p><div class="note" id="ALM-12041__note21303849111810"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12041__p32947381111823">To ensure proper configuration synchronization between the active and standby OMS servers, files, directories, and files and sub-directories in the directories configured in <strong id="ALM-12041__b28090979111823">$OMS_RUN_PATH/workspace/ha/module/hasync/plugin/conf/filesync.xml </strong>will also be monitored except files and directories in <strong id="ALM-12041__b51492227111823">keyfile</strong>. User <strong id="ALM-12041__b60776860111823">omm </strong>must have read and write permissions of files and read and execute permissions of directories.</p> <ol id="ALM-12041__ol5738633911023"><li id="ALM-12041__li4564192111014"><span>On the <span id="ALM-12041__text34789336432">MRS</span> Manager portal, choose <strong id="ALM-12041__b2744094511014">O&amp;M &gt; Alarm<strong id="ALM-12041__b27872374104950"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12041__li5407306011014"><span>Check the value of <strong id="ALM-12041__b812410911014">HostName</strong> to obtain the host name involved in this alarm. Check the value of <strong id="ALM-12041__b600811711014">PathName</strong> to obtain the path or name of the abnormal file.</span></li><li id="ALM-12041__li1784176011014"><span>Log in to the node for which the alarm is generated as user <strong id="ALM-12041__b1689549811014">root</strong>. <span id="ALM-12041__text43649449460"></span></span></li><li id="ALM-12041__li2193450211014"><span>Run the <strong id="ALM-12041__b2635812011014">ll </strong><em id="ALM-12041__i3589648911014">pathName</em> command, where <em id="ALM-12041__i5463295011014">pathName</em> indicates the name of the abnormal file to obtain the user, permission, and user group information about the file or directory.</span></li><li id="ALM-12041__li1834285111014"><a name="ALM-12041__li1834285111014"></a><a name="li1834285111014"></a><span>Go to <strong id="ALM-12041__b6319279611014">${BIGDATA_HOME}/om-agent/nodeagent/etc/agent/autocheck</strong> directory. Then run the <strong id="ALM-12041__b3186425611014">vi keyfile</strong> command and search for the name of the abnormal file and check the due permission of the file.</span><p><div class="note" id="ALM-12041__note21303849111810"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12041__p32947381111823">To ensure proper configuration synchronization between the active and standby OMS servers, files, directories, and files and sub-directories in the directories configured in <strong id="ALM-12041__b28090979111823">$OMS_RUN_PATH/workspace/ha/module/hasync/plugin/conf/filesync.xml </strong>will also be monitored except files and directories in <strong id="ALM-12041__b51492227111823">keyfile</strong>. User <strong id="ALM-12041__b60776860111823">omm </strong>must have read and write permissions of files and read and execute permissions of directories.</p>
</div></div> </div></div>
</p></li><li id="ALM-12041__li937595411014"><span>Compare the real-world permission of the file with the due permission obtained in <a href="#ALM-12041__li1834285111014">5</a> and correct the permission, user, and user group information for the file.</span></li><li id="ALM-12041__li75110811014"><span>Wait a hour and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12041__ul4392001111014"><li id="ALM-12041__li1727472911014">If yes, no further action is required.</li><li id="ALM-12041__li5707578411014">If no, go to <a href="#ALM-12041__li1068683211014">8</a>.</li></ul> </p></li><li id="ALM-12041__li937595411014"><span>Compare the real-world permission of the file with the due permission obtained in <a href="#ALM-12041__li1834285111014">5</a> and correct the permission, user, and user group information for the file.</span></li><li id="ALM-12041__li75110811014"><span>Wait a hour and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12041__ul4392001111014"><li id="ALM-12041__li1727472911014">If yes, no further action is required.</li><li id="ALM-12041__li5707578411014">If no, go to <a href="#ALM-12041__li1068683211014">8</a>.</li></ul>
<div class="note" id="ALM-12041__note50974664111832"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12041__p22323065111841">If the disk partition where the cluster installation directory resides is used up, some temporary files will be generated in the program installation directory when running the <strong id="ALM-12041__b66689858111841">sed</strong> command fails. Users do not have the read, write, and execute permissions of these temporary files. The system reports an alarm indicating that permissions of temporary files are abnormal if these files are within the monitoring range of the alarm. Perform the preceding alarm handling processes to clear the alarm. Alternatively, you can directly delete the temporary files after confirming that files with abnormal permissions are temporary. The temporary file generated after a <strong id="ALM-12041__b63337813111841">sed</strong> command execution failure is similar to the following.</p> <div class="note" id="ALM-12041__note50974664111832"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12041__p22323065111841">If the disk partition where the cluster installation directory resides is used up, some temporary files will be generated in the program installation directory when running the <strong id="ALM-12041__b66689858111841">sed</strong> command fails. Users do not have the read, write, and execute permissions of these temporary files. The system reports an alarm indicating that permissions of temporary files are abnormal if these files are within the monitoring range of the alarm. Perform the preceding alarm handling processes to clear the alarm. Alternatively, you can directly delete the temporary files after confirming that files with abnormal permissions are temporary. The temporary file generated after a <strong id="ALM-12041__b63337813111841">sed</strong> command execution failure is similar to the following.</p>
@ -73,7 +73,7 @@
<p class="subitemlist" id="ALM-12041__p132194544418"><span><img id="ALM-12041__image13221252114113" src="en-us_image_0000001532927558.jpg"></span></p> <p class="subitemlist" id="ALM-12041__p132194544418"><span><img id="ALM-12041__image13221252114113" src="en-us_image_0000001532927558.jpg"></span></p>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12041__p5973578011014"><strong id="ALM-12041__b120539411028">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12041__p5973578011014"><strong id="ALM-12041__b120539411028">Collect fault information.</strong></p>
<ol start="8" id="ALM-12041__ol6667694311030"><li id="ALM-12041__li1068683211014"><a name="ALM-12041__li1068683211014"></a><a name="li1068683211014"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-12041__b675997211014">O&amp;M</strong> &gt; <strong id="ALM-12041__b6083974911014">Log &gt; Download</strong>.</span></li><li id="ALM-12041__li5465607911014"><span>Select <strong id="ALM-12041__b2907263111014">NodeAgent</strong> from the <strong id="ALM-12041__b6032708911014">Service</strong> and click <strong id="ALM-12041__b3991118545">OK</strong>.</span></li><li id="ALM-12041__li1145664103113"><span>Click <span><img id="ALM-12041__image1945644173117" src="en-us_image_0000001532607890.png"></span> in the upper right corner, and set <strong id="ALM-12041__b6456941173117">Start Date</strong> and <strong id="ALM-12041__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12041__b13456164113319">Download</strong>.</span></li><li id="ALM-12041__li495644512588"><span>Contact the <span id="ALM-12041__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="8" id="ALM-12041__ol6667694311030"><li id="ALM-12041__li1068683211014"><a name="ALM-12041__li1068683211014"></a><a name="li1068683211014"></a><span>On the <span id="ALM-12041__text1629912591447">MRS</span> Manager portal, choose <strong id="ALM-12041__b675997211014">O&amp;M</strong> &gt; <strong id="ALM-12041__b6083974911014">Log &gt; Download</strong>.</span></li><li id="ALM-12041__li5465607911014"><span>Select <strong id="ALM-12041__b2907263111014">NodeAgent</strong> from the <strong id="ALM-12041__b6032708911014">Service</strong> and click <strong id="ALM-12041__b3991118545">OK</strong>.</span></li><li id="ALM-12041__li1145664103113"><span>Click <span><img id="ALM-12041__image1945644173117" src="en-us_image_0000001532607890.png"></span> in the upper right corner, and set <strong id="ALM-12041__b6456941173117">Start Date</strong> and <strong id="ALM-12041__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12041__b13456164113319">Download</strong>.</span></li><li id="ALM-12041__li495644512588"><span>Contact the <span id="ALM-12041__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12041__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12041__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12041__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12041__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -65,14 +65,14 @@
<div class="section" id="ALM-12042__s0ed39bd436594cd2af2414af2dd189c3"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12042__en-us_topic_0070543617_p48750623">The file configuration is modified manually or the system is powered off unexpectedly.</p> <div class="section" id="ALM-12042__s0ed39bd436594cd2af2414af2dd189c3"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12042__en-us_topic_0070543617_p48750623">The file configuration is modified manually or the system is powered off unexpectedly.</p>
</div> </div>
<div class="section" id="ALM-12042__sa72cb081ce9546069c49fa0a37a80746"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12042__en-us_topic_0070543617_p56486420"><strong id="ALM-12042__b364765131137">Check abnormal file configuration.</strong></p> <div class="section" id="ALM-12042__sa72cb081ce9546069c49fa0a37a80746"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12042__en-us_topic_0070543617_p56486420"><strong id="ALM-12042__b364765131137">Check abnormal file configuration.</strong></p>
<ol id="ALM-12042__ol5061051711317"><li id="ALM-12042__li181336611310"><span>On the FusionInsight Manager portal, choose <strong id="ALM-12042__b5239726811310">O&amp;M &gt; Alarm<strong id="ALM-12042__b27872374104950"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12042__li4687569511310"><span>Check the value of <strong id="ALM-12042__b1632029711310">HostName</strong> to obtain the host name involved in this alarm. Check the value of <strong id="ALM-12042__b1266495111310">PathName</strong> to obtain the path or name of the abnormal file.</span></li><li id="ALM-12042__li3883495711310"><span>Log in to the node for which the alarm is generated as user <strong id="ALM-12042__b1922807611310">root</strong>. <span id="ALM-12042__text43649449460"></span></span></li><li id="ALM-12042__li5862385211310"><span>View the $BIGDATA_LOG_HOME/nodeagent/scriptlog/checkfileconfig.log file and analyze the cause based on the error log. Locate the check standards of the file in the <a href="#ALM-12042__en-us_topic_0070543617_cab">Related Information</a> and manually check and modify the file based on the standards.</span><p><p id="ALM-12042__p181921481219">Run the <strong id="ALM-12042__b18909133892210">vi </strong><em id="ALM-12042__i14910133813227">file name</em> command to enter the editing mode, and then press <strong id="ALM-12042__b25755449228">Insert</strong> to start editing.</p> <ol id="ALM-12042__ol5061051711317"><li id="ALM-12042__li181336611310"><span>On the <span id="ALM-12042__text153911714512">MRS</span> Manager portal, choose <strong id="ALM-12042__b5239726811310">O&amp;M &gt; Alarm<strong id="ALM-12042__b27872374104950"> &gt; Alarms</strong></strong>.</span></li><li id="ALM-12042__li4687569511310"><span>Check the value of <strong id="ALM-12042__b1632029711310">HostName</strong> to obtain the host name involved in this alarm. Check the value of <strong id="ALM-12042__b1266495111310">PathName</strong> to obtain the path or name of the abnormal file.</span></li><li id="ALM-12042__li3883495711310"><span>Log in to the node for which the alarm is generated as user <strong id="ALM-12042__b1922807611310">root</strong>. <span id="ALM-12042__text43649449460"></span></span></li><li id="ALM-12042__li5862385211310"><span>View the $BIGDATA_LOG_HOME/nodeagent/scriptlog/checkfileconfig.log file and analyze the cause based on the error log. Locate the check standards of the file in the <a href="#ALM-12042__en-us_topic_0070543617_cab">Related Information</a> and manually check and modify the file based on the standards.</span><p><p id="ALM-12042__p181921481219">Run the <strong id="ALM-12042__b18909133892210">vi </strong><em id="ALM-12042__i14910133813227">file name</em> command to enter the editing mode, and then press <strong id="ALM-12042__b25755449228">Insert</strong> to start editing.</p>
<p id="ALM-12042__p16192108152119">After the modification is complete, press <strong id="ALM-12042__b976905012226">Esc</strong> to exit the editing mode and enter<strong id="ALM-12042__b161181354142215"> :wq</strong> to save the settings and exit.</p> <p id="ALM-12042__p16192108152119">After the modification is complete, press <strong id="ALM-12042__b976905012226">Esc</strong> to exit the editing mode and enter<strong id="ALM-12042__b161181354142215"> :wq</strong> to save the settings and exit.</p>
<p id="ALM-12042__p830792372214">For example:</p> <p id="ALM-12042__p830792372214">For example:</p>
<p id="ALM-12042__p1819218813219"><strong id="ALM-12042__b0943142682220">vi /etc/ssh/sshd_config</strong></p> <p id="ALM-12042__p1819218813219"><strong id="ALM-12042__b0943142682220">vi /etc/ssh/sshd_config</strong></p>
</p></li><li id="ALM-12042__li3021967611310"><span>Wait a hour and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12042__ul3019924411310"><li id="ALM-12042__li5785262811310">If yes, no further action is required.</li><li id="ALM-12042__li5555125411310">If no, go to <a href="#ALM-12042__li1843685711310">6</a>.</li></ul> </p></li><li id="ALM-12042__li3021967611310"><span>Wait a hour and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12042__ul3019924411310"><li id="ALM-12042__li5785262811310">If yes, no further action is required.</li><li id="ALM-12042__li5555125411310">If no, go to <a href="#ALM-12042__li1843685711310">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12042__p335774111310"><strong id="ALM-12042__b2285498211323">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12042__p335774111310"><strong id="ALM-12042__b2285498211323">Collect fault information.</strong></p>
<ol start="6" id="ALM-12042__ol3443027411326"><li id="ALM-12042__li1843685711310"><a name="ALM-12042__li1843685711310"></a><a name="li1843685711310"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-12042__b354163311310">O&amp;M</strong> &gt; <strong id="ALM-12042__b3187470111310">Log &gt; Download</strong>.</span></li><li id="ALM-12042__li3405016711310"><span>Select <strong id="ALM-12042__b3171399011310">NodeAgent</strong> from the <strong id="ALM-12042__b1699046211310">Service</strong> and click <strong id="ALM-12042__b3991118545">OK</strong>.</span></li><li id="ALM-12042__li1145664103113"><span>Click <span><img id="ALM-12042__image1945644173117" src="en-us_image_0000001532927502.png"></span> in the upper right corner, and set <strong id="ALM-12042__b6456941173117">Start Date</strong> and <strong id="ALM-12042__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12042__b13456164113319">Download</strong>.</span></li><li id="ALM-12042__li495644512588"><span>Contact the <span id="ALM-12042__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="6" id="ALM-12042__ol3443027411326"><li id="ALM-12042__li1843685711310"><a name="ALM-12042__li1843685711310"></a><a name="li1843685711310"></a><span>On the <span id="ALM-12042__text34789336432">MRS</span> Manager portal, choose <strong id="ALM-12042__b354163311310">O&amp;M</strong> &gt; <strong id="ALM-12042__b3187470111310">Log &gt; Download</strong>.</span></li><li id="ALM-12042__li3405016711310"><span>Select <strong id="ALM-12042__b3171399011310">NodeAgent</strong> from the <strong id="ALM-12042__b1699046211310">Service</strong> and click <strong id="ALM-12042__b3991118545">OK</strong>.</span></li><li id="ALM-12042__li1145664103113"><span>Click <span><img id="ALM-12042__image1945644173117" src="en-us_image_0000001532927502.png"></span> in the upper right corner, and set <strong id="ALM-12042__b6456941173117">Start Date</strong> and <strong id="ALM-12042__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12042__b13456164113319">Download</strong>.</span></li><li id="ALM-12042__li495644512588"><span>Contact the <span id="ALM-12042__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12042__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12042__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12042__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12042__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -73,8 +73,8 @@
<div class="section" id="ALM-12045__section56798701"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12045__ul27994219"><li id="ALM-12045__li37441695101640">An OS exception occurs.</li><li id="ALM-12045__li60731574192851">The NICs are bonded in active/standby mode.</li><li id="ALM-12045__li50621380">The alarm threshold is improperly configured.</li><li id="ALM-12045__li52939239">The network quality is poor.</li></ul> <div class="section" id="ALM-12045__section56798701"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12045__ul27994219"><li id="ALM-12045__li37441695101640">An OS exception occurs.</li><li id="ALM-12045__li60731574192851">The NICs are bonded in active/standby mode.</li><li id="ALM-12045__li50621380">The alarm threshold is improperly configured.</li><li id="ALM-12045__li52939239">The network quality is poor.</li></ul>
</div> </div>
<div class="section" id="ALM-12045__section41426264"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12045__p60550233154039"><strong id="ALM-12045__b20378211155946">View the network packet dropped rate.</strong></p> <div class="section" id="ALM-12045__section41426264"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12045__p60550233154039"><strong id="ALM-12045__b20378211155946">View the network packet dropped rate.</strong></p>
<ol id="ALM-12045__ol54177744154120"><li id="ALM-12045__li34357272165726"><span>On FusionInsight Manager, choose <strong id="ALM-12045__b8597763200">O&amp;M</strong> &gt; <strong id="ALM-12045__b1760510652010">Alarm</strong> &gt; <strong id="ALM-12045__b19605968206">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12045__image168221113135319" src="en-us_image_0000001583087417.png"></span> in the row containing the alarm, and view the name of the host for which the alarm is generated and the NIC name.</span></li><li id="ALM-12045__li17837656154120"><span>Log in to the alarm node as user <strong id="ALM-12045__b35564051154120">omm</strong>, and run the <strong id="ALM-12045__b143893142219">/sbin/ifconfig </strong><em id="ALM-12045__i07461520122210">NIC name</em> command to check whether packet loss occurs on the network.</span><p><p id="ALM-12045__p897517249258"><span><img id="ALM-12045__image14835549449" src="en-us_image_0000001532767498.png"></span></p> <ol id="ALM-12045__ol54177744154120"><li id="ALM-12045__li34357272165726"><span>On <span id="ALM-12045__text34789336432">MRS</span> Manager, choose <strong id="ALM-12045__b8597763200">O&amp;M</strong> &gt; <strong id="ALM-12045__b1760510652010">Alarm</strong> &gt; <strong id="ALM-12045__b19605968206">Alarms</strong>. On the page that is displayed, click <span><img id="ALM-12045__image168221113135319" src="en-us_image_0000001583087417.png"></span> in the row containing the alarm, and view the name of the host for which the alarm is generated and the NIC name.</span></li><li id="ALM-12045__li17837656154120"><span>Log in to the alarm node as user <strong id="ALM-12045__b35564051154120">omm</strong>, and run the <strong id="ALM-12045__b143893142219">/sbin/ifconfig </strong><em id="ALM-12045__i07461520122210">NIC name</em> command to check whether packet loss occurs on the network.</span><p><p id="ALM-12045__p897517249258"><span><img id="ALM-12045__image14835549449" src="en-us_image_0000001532767498.png"></span></p>
<div class="note" id="ALM-12045__note5975624192520"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-12045__ul29750248258"><li class="text" id="ALM-12045__li13975142415259"><em id="ALM-12045__i182770239944413">IP address of the node for which the alarm is generated</em>: Query the IP address of the node for which the alarm is generated on the <strong id="ALM-12045__b76887363443">Hosts</strong> page of FusionInsight Manager based on the value of <strong id="ALM-12045__b37108310144413">HostName</strong> in the alarm location information. Check both the IP addresses of the management plane and service plane.</li><li id="ALM-12045__li19975124182513">Packet loss rate = (Number of dropped packets/Total number of received packets) x 100%. If the packet loss rate is greater than the system threshold (0.5% by default), read packets are dropped.</li></ul> <div class="note" id="ALM-12045__note5975624192520"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-12045__ul29750248258"><li class="text" id="ALM-12045__li13975142415259"><em id="ALM-12045__i182770239944413">IP address of the node for which the alarm is generated</em>: Query the IP address of the node for which the alarm is generated on the <strong id="ALM-12045__b76887363443">Hosts</strong> page of <span id="ALM-12045__text58491222184512">MRS</span> Manager based on the value of <strong id="ALM-12045__b37108310144413">HostName</strong> in the alarm location information. Check both the IP addresses of the management plane and service plane.</li><li id="ALM-12045__li19975124182513">Packet loss rate = (Number of dropped packets/Total number of received packets) x 100%. If the packet loss rate is greater than the system threshold (0.5% by default), read packets are dropped.</li></ul>
</div></div> </div></div>
<ul id="ALM-12045__ul12976132492510"><li id="ALM-12045__li1097522414255">If yes, go to <a href="#ALM-12045__li4196511811134">11</a>.</li><li id="ALM-12045__li297652462516">If no, go to <a href="#ALM-12045__li6542838717657">3</a>.</li></ul> <ul id="ALM-12045__ul12976132492510"><li id="ALM-12045__li1097522414255">If yes, go to <a href="#ALM-12045__li4196511811134">11</a>.</li><li id="ALM-12045__li297652462516">If no, go to <a href="#ALM-12045__li6542838717657">3</a>.</li></ul>
</p></li></ol> </p></li></ol>
@ -92,9 +92,9 @@ Red Hat Enterprise Linux Server release<strong id="ALM-12045__b26880224102544">
</p></li><li id="ALM-12045__li42309040172040"><a name="ALM-12045__li42309040172040"></a><a name="li42309040172040"></a><span>Run the <strong id="ALM-12045__b11621601153236">cat /proc/version</strong> command to check whether the SUSE kernel version is 3.0 or later.</span><p><pre class="screen" id="ALM-12045__screen50396840174541"># cat /proc/version </p></li><li id="ALM-12045__li42309040172040"><a name="ALM-12045__li42309040172040"></a><a name="li42309040172040"></a><span>Run the <strong id="ALM-12045__b11621601153236">cat /proc/version</strong> command to check whether the SUSE kernel version is 3.0 or later.</span><p><pre class="screen" id="ALM-12045__screen50396840174541"># cat /proc/version
Linux version <strong id="ALM-12045__b37899196102550">3.0.101-63-default</strong> (geeko@buildhost) (gcc version 4.3.4 [gcc-4_3-branch revision 152973] (SUSE Linux) ) #1 SMP Tue Jun 23 16:02:31 UTC 2015 (4b89d0c)</pre> Linux version <strong id="ALM-12045__b37899196102550">3.0.101-63-default</strong> (geeko@buildhost) (gcc version 4.3.4 [gcc-4_3-branch revision 152973] (SUSE Linux) ) #1 SMP Tue Jun 23 16:02:31 UTC 2015 (4b89d0c)</pre>
<ul id="ALM-12045__ul62847380172126"><li id="ALM-12045__li9858303195115">If yes, the alarm sending function cannot be enabled. Go to <a href="#ALM-12045__li43950618195120">7</a>.</li><li id="ALM-12045__li5930366195117">If no, go to <a href="#ALM-12045__li4196511811134">11</a>.</li></ul> <ul id="ALM-12045__ul62847380172126"><li id="ALM-12045__li9858303195115">If yes, the alarm sending function cannot be enabled. Go to <a href="#ALM-12045__li43950618195120">7</a>.</li><li id="ALM-12045__li5930366195117">If no, go to <a href="#ALM-12045__li4196511811134">11</a>.</li></ul>
</p></li><li id="ALM-12045__li43950618195120"><a name="ALM-12045__li43950618195120"></a><a name="li43950618195120"></a><span>Log in to FusionInsight Manager and choose <strong id="ALM-12045__b167130161044413">O&amp;M</strong> &gt; <strong id="ALM-12045__b157735138144413">Alarm</strong> &gt; <strong id="ALM-12045__b156701955944413">Threshold Configuration</strong>.</span></li></ol><ol start="8" id="ALM-12045__ol26457910172340"><li id="ALM-12045__li26465420174815"><span>In the navigation tree of the <strong id="ALM-12045__b478911510483">Thresholds</strong> page, choose <em id="ALM-12045__i594510221489">Name of the desired cluster</em> &gt; <strong id="ALM-12045__b167061042124815">Host</strong> &gt; <strong id="ALM-12045__b6981184610481">Network Reading</strong> &gt; <strong id="ALM-12045__b9341172144913">Read Packet Dropped Rate</strong>. In the area on the right, check whether the <strong id="ALM-12045__b1278202219498">Switch</strong> is toggled on.</span><p><ul id="ALM-12045__ul20313429174820"><li id="ALM-12045__li9894347174820">If yes, the alarm sending function is enabled. Go to <a href="#ALM-12045__li38517503111027">9</a>.</li><li id="ALM-12045__li56297179194352">If no, the alarm sending function is disabled. Go to <a href="#ALM-12045__li16613085112024">10</a>.</li></ul> </p></li><li id="ALM-12045__li43950618195120"><a name="ALM-12045__li43950618195120"></a><a name="li43950618195120"></a><span>Log in to <span id="ALM-12045__text155134244459">MRS</span> Manager and choose <strong id="ALM-12045__b167130161044413">O&amp;M</strong> &gt; <strong id="ALM-12045__b157735138144413">Alarm</strong> &gt; <strong id="ALM-12045__b156701955944413">Threshold Configuration</strong>.</span></li></ol><ol start="8" id="ALM-12045__ol26457910172340"><li id="ALM-12045__li26465420174815"><span>In the navigation tree of the <strong id="ALM-12045__b478911510483">Thresholds</strong> page, choose <em id="ALM-12045__i594510221489">Name of the desired cluster</em> &gt; <strong id="ALM-12045__b167061042124815">Host</strong> &gt; <strong id="ALM-12045__b6981184610481">Network Reading</strong> &gt; <strong id="ALM-12045__b9341172144913">Read Packet Dropped Rate</strong>. In the area on the right, check whether the <strong id="ALM-12045__b1278202219498">Switch</strong> is toggled on.</span><p><ul id="ALM-12045__ul20313429174820"><li id="ALM-12045__li9894347174820">If yes, the alarm sending function is enabled. Go to <a href="#ALM-12045__li38517503111027">9</a>.</li><li id="ALM-12045__li56297179194352">If no, the alarm sending function is disabled. Go to <a href="#ALM-12045__li16613085112024">10</a>.</li></ul>
</p></li><li id="ALM-12045__li38517503111027"><a name="ALM-12045__li38517503111027"></a><a name="li38517503111027"></a><span>In the area on the right, toggle <strong id="ALM-12045__b1172691785113">Switch</strong> off to disable the checking of <strong id="ALM-12045__b1523917125216">Network Read Packet Dropped Rate Exceeds the Threshold</strong>.</span><p><p id="ALM-12045__p11736263111027"><span><img id="ALM-12045__image828012285713" src="en-us_image_0000001532607762.png"></span></p> </p></li><li id="ALM-12045__li38517503111027"><a name="ALM-12045__li38517503111027"></a><a name="li38517503111027"></a><span>In the area on the right, toggle <strong id="ALM-12045__b1172691785113">Switch</strong> off to disable the checking of <strong id="ALM-12045__b1523917125216">Network Read Packet Dropped Rate Exceeds the Threshold</strong>.</span><p><p id="ALM-12045__p11736263111027"><span><img id="ALM-12045__image828012285713" src="en-us_image_0000001532607762.png"></span></p>
</p></li><li id="ALM-12045__li16613085112024"><a name="ALM-12045__li16613085112024"></a><a name="li16613085112024"></a><span>On the <strong id="ALM-12045__b16749813195314">Alarm</strong> page of FusionInsight Manager, search for alarm <strong id="ALM-12045__b444015317534">12045</strong> and manually clear the alarm if it is not automatically cleared. No further action is required.</span><p><p id="ALM-12045__p1861091166"><span><img id="ALM-12045__image11618931616" src="en-us_image_0000001532448274.png"></span></p> </p></li><li id="ALM-12045__li16613085112024"><a name="ALM-12045__li16613085112024"></a><a name="li16613085112024"></a><span>On the <strong id="ALM-12045__b16749813195314">Alarm</strong> page of <span id="ALM-12045__text3177142615458">MRS</span> Manager, search for alarm <strong id="ALM-12045__b444015317534">12045</strong> and manually clear the alarm if it is not automatically cleared. No further action is required.</span><p><p id="ALM-12045__p1861091166"><span><img id="ALM-12045__image11618931616" src="en-us_image_0000001532448274.png"></span></p>
<div class="note" id="ALM-12045__note60160766112035"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12045__p4575985112035">ID of the Network Read Packet Dropped Rate Exceeds the Threshold alarm is <strong id="ALM-12045__b1878673214337">12045</strong>.</p> <div class="note" id="ALM-12045__note60160766112035"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12045__p4575985112035">ID of the Network Read Packet Dropped Rate Exceeds the Threshold alarm is <strong id="ALM-12045__b1878673214337">12045</strong>.</p>
</div></div> </div></div>
</p></li></ol> </p></li></ol>
@ -137,7 +137,7 @@ Slave queue ID: 0</pre>
</li></ul> </li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12045__p60219992"><strong id="ALM-12045__b47522666112832">Check whether the threshold is set properly.</strong></p> <p class="tableheading" id="ALM-12045__p60219992"><strong id="ALM-12045__b47522666112832">Check whether the threshold is set properly.</strong></p>
<ol start="14" id="ALM-12045__ol16493433173222"><li id="ALM-12045__li61276131112834"><a name="ALM-12045__li61276131112834"></a><a name="li61276131112834"></a><span>Log in to FusionInsight Manager, choose <strong id="ALM-12045__b659184595419">O&amp;M</strong> &gt; <strong id="ALM-12045__b57512047155419">Alarm</strong> &gt; <strong id="ALM-12045__b23011451552">Thresholds</strong> &gt; <em id="ALM-12045__i18305310145510">Name of the desired cluster</em> &gt; <strong id="ALM-12045__b6129181516557">Host</strong> &gt; <strong id="ALM-12045__b882618236551">Network Reading</strong> &gt; <strong id="ALM-12045__b135892028105514">Read Packet Dropped Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12045__b144531924155615">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12045__ul36634620112834"><li id="ALM-12045__li23616603112834">If yes, go to <a href="#ALM-12045__li56023883112834">17</a>.</li><li id="ALM-12045__li33896675112834">If no, go to <a href="#ALM-12045__li47653126112834">15</a>.</li></ul> <ol start="14" id="ALM-12045__ol16493433173222"><li id="ALM-12045__li61276131112834"><a name="ALM-12045__li61276131112834"></a><a name="li61276131112834"></a><span>Log in to <span id="ALM-12045__text1611192724519">MRS</span> Manager, choose <strong id="ALM-12045__b659184595419">O&amp;M</strong> &gt; <strong id="ALM-12045__b57512047155419">Alarm</strong> &gt; <strong id="ALM-12045__b23011451552">Thresholds</strong> &gt; <em id="ALM-12045__i18305310145510">Name of the desired cluster</em> &gt; <strong id="ALM-12045__b6129181516557">Host</strong> &gt; <strong id="ALM-12045__b882618236551">Network Reading</strong> &gt; <strong id="ALM-12045__b135892028105514">Read Packet Dropped Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12045__b144531924155615">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12045__ul36634620112834"><li id="ALM-12045__li23616603112834">If yes, go to <a href="#ALM-12045__li56023883112834">17</a>.</li><li id="ALM-12045__li33896675112834">If no, go to <a href="#ALM-12045__li47653126112834">15</a>.</li></ul>
</p></li></ol><ol start="15" id="ALM-12045__ol13032980174025"><li id="ALM-12045__li47653126112834"><a name="ALM-12045__li47653126112834"></a><a name="li47653126112834"></a><span>Choose <strong id="ALM-12045__b66788575566">O&amp;M</strong> &gt; <strong id="ALM-12045__b9758759195615">Alarm</strong> &gt; <strong id="ALM-12045__b53403618572">Thresholds</strong> &gt; <em id="ALM-12045__i666599145719">Name of the desired cluster</em> &gt; <strong id="ALM-12045__b16662161395714">Host</strong> &gt; <strong id="ALM-12045__b182811922155712">Network Reading</strong> &gt; <strong id="ALM-12045__b489452945716">Read Packet Dropped Rate</strong>. Click <strong id="ALM-12045__b204309457574">Modify</strong> in the <strong id="ALM-12045__b320605515717">Operation</strong> column to change the threshold. See <a href="#ALM-12045__fig52784093112834">Figure 1</a>.</span><p><div class="fignone" id="ALM-12045__fig52784093112834"><a name="ALM-12045__fig52784093112834"></a><a name="fig52784093112834"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12045__image956695784115" src="en-us_image_0000001582927657.png"></span></div> </p></li></ol><ol start="15" id="ALM-12045__ol13032980174025"><li id="ALM-12045__li47653126112834"><a name="ALM-12045__li47653126112834"></a><a name="li47653126112834"></a><span>Choose <strong id="ALM-12045__b66788575566">O&amp;M</strong> &gt; <strong id="ALM-12045__b9758759195615">Alarm</strong> &gt; <strong id="ALM-12045__b53403618572">Thresholds</strong> &gt; <em id="ALM-12045__i666599145719">Name of the desired cluster</em> &gt; <strong id="ALM-12045__b16662161395714">Host</strong> &gt; <strong id="ALM-12045__b182811922155712">Network Reading</strong> &gt; <strong id="ALM-12045__b489452945716">Read Packet Dropped Rate</strong>. Click <strong id="ALM-12045__b204309457574">Modify</strong> in the <strong id="ALM-12045__b320605515717">Operation</strong> column to change the threshold. See <a href="#ALM-12045__fig52784093112834">Figure 1</a>.</span><p><div class="fignone" id="ALM-12045__fig52784093112834"><a name="ALM-12045__fig52784093112834"></a><a name="fig52784093112834"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12045__image956695784115" src="en-us_image_0000001582927657.png"></span></div>
</p></li><li id="ALM-12045__li20285900112834"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12045__ul59074262112834"><li id="ALM-12045__li26224954112834">If yes, no further action is required.</li><li id="ALM-12045__li43846509112834">If no, go to <a href="#ALM-12045__li56023883112834">17</a>.</li></ul> </p></li><li id="ALM-12045__li20285900112834"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12045__ul59074262112834"><li id="ALM-12045__li26224954112834">If yes, no further action is required.</li><li id="ALM-12045__li43846509112834">If no, go to <a href="#ALM-12045__li56023883112834">17</a>.</li></ul>
</p></li></ol> </p></li></ol>
@ -146,7 +146,7 @@ Slave queue ID: 0</pre>
</p></li><li id="ALM-12045__li4503547112834"><a name="ALM-12045__li4503547112834"></a><a name="li4503547112834"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12045__ul17454193112834"><li id="ALM-12045__li34452907112834">If yes, no further action is required.</li><li id="ALM-12045__li39222057112834">If no, go to <a href="#ALM-12045__li40531926112834">19</a>.</li></ul> </p></li><li id="ALM-12045__li4503547112834"><a name="ALM-12045__li4503547112834"></a><a name="li4503547112834"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12045__ul17454193112834"><li id="ALM-12045__li34452907112834">If yes, no further action is required.</li><li id="ALM-12045__li39222057112834">If no, go to <a href="#ALM-12045__li40531926112834">19</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12045__p22870015112834"><strong id="ALM-12045__b58378062112918">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12045__p22870015112834"><strong id="ALM-12045__b58378062112918">Collect the fault information.</strong></p>
<ol start="19" id="ALM-12045__ol57529826112922"><li id="ALM-12045__li40531926112834"><a name="ALM-12045__li40531926112834"></a><a name="li40531926112834"></a><span>On FusionInsight Manager of the active cluster, choose <strong id="ALM-12045__b15490161312420">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12045__b1349021382415">Log</strong> &gt; <strong id="ALM-12045__b7491161320243">Download</strong>.</span></li><li id="ALM-12045__li29243017112834"><span>Select <strong id="ALM-12045__b1473121611242">OMS</strong> for <strong id="ALM-12045__b20731016152419">Service</strong> and click <strong id="ALM-12045__b8746168242">OK</strong>.</span></li><li id="ALM-12045__li61860565112834"><span>Expand the <strong id="ALM-12045__b168351953175820">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12045__li19874180112834"><span>Click <span><img id="ALM-12045__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12045__b198664245246">Start Date</strong> and <strong id="ALM-12045__b12867324152414">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12045__b1286718244241">Download</strong>.</span></li><li id="ALM-12045__li66304723112834"><span>Contact <span id="ALM-12045__text14546632162412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="19" id="ALM-12045__ol57529826112922"><li id="ALM-12045__li40531926112834"><a name="ALM-12045__li40531926112834"></a><a name="li40531926112834"></a><span>On <span id="ALM-12045__text799516286457">MRS</span> Manager of the active cluster, choose <strong id="ALM-12045__b15490161312420">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12045__b1349021382415">Log</strong> &gt; <strong id="ALM-12045__b7491161320243">Download</strong>.</span></li><li id="ALM-12045__li29243017112834"><span>Select <strong id="ALM-12045__b1473121611242">OMS</strong> for <strong id="ALM-12045__b20731016152419">Service</strong> and click <strong id="ALM-12045__b8746168242">OK</strong>.</span></li><li id="ALM-12045__li61860565112834"><span>Expand the <strong id="ALM-12045__b168351953175820">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12045__li19874180112834"><span>Click <span><img id="ALM-12045__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12045__b198664245246">Start Date</strong> and <strong id="ALM-12045__b12867324152414">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12045__b1286718244241">Download</strong>.</span></li><li id="ALM-12045__li66304723112834"><span>Contact <span id="ALM-12045__text14546632162412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12045__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12045__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12045__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12045__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -71,7 +71,7 @@
<div class="section" id="ALM-12046__section35870633"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12046__ul29765467"><li id="ALM-12046__li66562616">The alarm threshold is improperly configured.</li><li id="ALM-12046__li62192640">The network quality is poor.</li></ul> <div class="section" id="ALM-12046__section35870633"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12046__ul29765467"><li id="ALM-12046__li66562616">The alarm threshold is improperly configured.</li><li id="ALM-12046__li62192640">The network quality is poor.</li></ul>
</div> </div>
<div class="section" id="ALM-12046__section54400241"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12046__p4439065"><strong id="ALM-12046__b488114212259">Check whether the threshold is set properly.</strong></p> <div class="section" id="ALM-12046__section54400241"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12046__p4439065"><strong id="ALM-12046__b488114212259">Check whether the threshold is set properly.</strong></p>
<ol id="ALM-12046__ol51757082114518"><li id="ALM-12046__li5429491411450"><span>Log in to FusionInsight Manager, choose <strong id="ALM-12046__b530513292591">O&amp;M</strong> &gt; <strong id="ALM-12046__b73111296591">Alarm</strong> &gt; <strong id="ALM-12046__b132072925911">Thresholds</strong> &gt; <em id="ALM-12046__i1532816298599">Name of the desired cluster</em> &gt; <strong id="ALM-12046__b534062916596">Host</strong> &gt; <strong id="ALM-12046__b14347529165911">Network Writing</strong> &gt; <strong id="ALM-12046__b13361529195913">Write Packet Dropped Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12046__b7369102916591">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12046__ul603276811450"><li id="ALM-12046__li1878771011450">If yes, go to <a href="#ALM-12046__li4369794811450">4</a>.</li><li id="ALM-12046__li4540955011450">If no, go to <a href="#ALM-12046__li5699560811450">2</a>.</li></ul> <ol id="ALM-12046__ol51757082114518"><li id="ALM-12046__li5429491411450"><span>Log in to <span id="ALM-12046__text34789336432">MRS</span> Manager, choose <strong id="ALM-12046__b530513292591">O&amp;M</strong> &gt; <strong id="ALM-12046__b73111296591">Alarm</strong> &gt; <strong id="ALM-12046__b132072925911">Thresholds</strong> &gt; <em id="ALM-12046__i1532816298599">Name of the desired cluster</em> &gt; <strong id="ALM-12046__b534062916596">Host</strong> &gt; <strong id="ALM-12046__b14347529165911">Network Writing</strong> &gt; <strong id="ALM-12046__b13361529195913">Write Packet Dropped Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12046__b7369102916591">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12046__ul603276811450"><li id="ALM-12046__li1878771011450">If yes, go to <a href="#ALM-12046__li4369794811450">4</a>.</li><li id="ALM-12046__li4540955011450">If no, go to <a href="#ALM-12046__li5699560811450">2</a>.</li></ul>
</p></li><li id="ALM-12046__li5699560811450"><a name="ALM-12046__li5699560811450"></a><a name="li5699560811450"></a><span>Choose <strong id="ALM-12046__b86275584598">O&amp;M</strong> &gt; <strong id="ALM-12046__b863815815596">Alarm</strong> &gt; <strong id="ALM-12046__b46391158155914">Thresholds</strong> &gt; <em id="ALM-12046__i17639135845918">Name of the desired cluster</em> &gt; <strong id="ALM-12046__b1639175845912">Host</strong> &gt; <strong id="ALM-12046__b1964015811598">Network Writing</strong> &gt; <strong id="ALM-12046__b17640175865912">Write Packet Dropped Rate</strong>. Click <strong id="ALM-12046__b564014589596">Modify</strong> in the <strong id="ALM-12046__b1664135816596">Operation</strong> column to change the threshold.</span><p><p class="litext" id="ALM-12046__p3581190711450">See <a href="#ALM-12046__fig153215311450">Figure 1</a>.</p> </p></li><li id="ALM-12046__li5699560811450"><a name="ALM-12046__li5699560811450"></a><a name="li5699560811450"></a><span>Choose <strong id="ALM-12046__b86275584598">O&amp;M</strong> &gt; <strong id="ALM-12046__b863815815596">Alarm</strong> &gt; <strong id="ALM-12046__b46391158155914">Thresholds</strong> &gt; <em id="ALM-12046__i17639135845918">Name of the desired cluster</em> &gt; <strong id="ALM-12046__b1639175845912">Host</strong> &gt; <strong id="ALM-12046__b1964015811598">Network Writing</strong> &gt; <strong id="ALM-12046__b17640175865912">Write Packet Dropped Rate</strong>. Click <strong id="ALM-12046__b564014589596">Modify</strong> in the <strong id="ALM-12046__b1664135816596">Operation</strong> column to change the threshold.</span><p><p class="litext" id="ALM-12046__p3581190711450">See <a href="#ALM-12046__fig153215311450">Figure 1</a>.</p>
<div class="fignone" id="ALM-12046__fig153215311450"><a name="ALM-12046__fig153215311450"></a><a name="fig153215311450"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12046__image1482785044213" src="en-us_image_0000001582807837.png"></span></div> <div class="fignone" id="ALM-12046__fig153215311450"><a name="ALM-12046__fig153215311450"></a><a name="fig153215311450"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12046__image1482785044213" src="en-us_image_0000001582807837.png"></span></div>
</p></li><li id="ALM-12046__li1629248811450"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12046__ul1759973611450"><li id="ALM-12046__li4319843211450">If yes, no further action is required.</li><li id="ALM-12046__li941206611450">If no, go to <a href="#ALM-12046__li4369794811450">4</a>.</li></ul> </p></li><li id="ALM-12046__li1629248811450"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12046__ul1759973611450"><li id="ALM-12046__li4319843211450">If yes, no further action is required.</li><li id="ALM-12046__li941206611450">If no, go to <a href="#ALM-12046__li4369794811450">4</a>.</li></ul>
@ -81,7 +81,7 @@
</p></li><li id="ALM-12046__li6056359711450"><a name="ALM-12046__li6056359711450"></a><a name="li6056359711450"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12046__ul1317526611450"><li id="ALM-12046__li5773721911450">If yes, no further action is required.</li><li id="ALM-12046__li4620316111450">If no, go to <a href="#ALM-12046__li820146511450">6</a>.</li></ul> </p></li><li id="ALM-12046__li6056359711450"><a name="ALM-12046__li6056359711450"></a><a name="li6056359711450"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12046__ul1317526611450"><li id="ALM-12046__li5773721911450">If yes, no further action is required.</li><li id="ALM-12046__li4620316111450">If no, go to <a href="#ALM-12046__li820146511450">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12046__p5146853111450"><strong id="ALM-12046__b6696662511465">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12046__p5146853111450"><strong id="ALM-12046__b6696662511465">Collect the fault information.</strong></p>
<ol start="6" id="ALM-12046__ol4187815011462"><li id="ALM-12046__li820146511450"><a name="ALM-12046__li820146511450"></a><a name="li820146511450"></a><span>On FusionInsight Manager of the active cluster, choose <strong id="ALM-12046__b82519710275">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12046__b92521274278">Log</strong> &gt; <strong id="ALM-12046__b122521742710">Download</strong>.</span></li><li id="ALM-12046__li670432911450"><span>Select <strong id="ALM-12046__b73620916276">OMS</strong> for <strong id="ALM-12046__b53624992712">Service</strong> and click <strong id="ALM-12046__b17362129182711">OK</strong>.</span></li><li id="ALM-12046__li6033896511450"><span>Expand the <strong id="ALM-12046__b1511218191705">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12046__li617977311450"><span>Click <span><img id="ALM-12046__image92961342720" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12046__b12391113112719">Start Date</strong> and <strong id="ALM-12046__b53961311278">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12046__b23914137276">Download</strong>.</span></li><li id="ALM-12046__li3079963411450"><span>Contact <span id="ALM-12046__text26871216142711">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="6" id="ALM-12046__ol4187815011462"><li id="ALM-12046__li820146511450"><a name="ALM-12046__li820146511450"></a><a name="li820146511450"></a><span>On <span id="ALM-12046__text1373532144510">MRS</span> Manager of the active cluster, choose <strong id="ALM-12046__b82519710275">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12046__b92521274278">Log</strong> &gt; <strong id="ALM-12046__b122521742710">Download</strong>.</span></li><li id="ALM-12046__li670432911450"><span>Select <strong id="ALM-12046__b73620916276">OMS</strong> for <strong id="ALM-12046__b53624992712">Service</strong> and click <strong id="ALM-12046__b17362129182711">OK</strong>.</span></li><li id="ALM-12046__li6033896511450"><span>Expand the <strong id="ALM-12046__b1511218191705">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12046__li617977311450"><span>Click <span><img id="ALM-12046__image92961342720" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12046__b12391113112719">Start Date</strong> and <strong id="ALM-12046__b53961311278">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12046__b23914137276">Download</strong>.</span></li><li id="ALM-12046__li3079963411450"><span>Contact <span id="ALM-12046__text26871216142711">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12046__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12046__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12046__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12046__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -71,7 +71,7 @@
<div class="section" id="ALM-12047__section62597753"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12047__ul16019148"><li id="ALM-12047__li9954605">The alarm threshold is improperly configured.</li><li id="ALM-12047__li22482584">The network quality is poor.</li></ul> <div class="section" id="ALM-12047__section62597753"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12047__ul16019148"><li id="ALM-12047__li9954605">The alarm threshold is improperly configured.</li><li id="ALM-12047__li22482584">The network quality is poor.</li></ul>
</div> </div>
<div class="section" id="ALM-12047__section26508869"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12047__p9150041"><strong id="ALM-12047__b48301864144321">Check whether the threshold is set properly.</strong></p> <div class="section" id="ALM-12047__section26508869"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12047__p9150041"><strong id="ALM-12047__b48301864144321">Check whether the threshold is set properly.</strong></p>
<ol id="ALM-12047__ol5621610514492"><li id="ALM-12047__li16991200144325"><span>Log in to FusionInsight Manager, choose <strong id="ALM-12047__b1096642210512">O&amp;M</strong> &gt; <strong id="ALM-12047__b2977102211513">Alarm</strong> &gt; <strong id="ALM-12047__b10994192210512">Thresholds</strong> &gt; <em id="ALM-12047__i1822231959">Name of the desired cluster</em> &gt; <strong id="ALM-12047__b614102317519">Host</strong> &gt; <strong id="ALM-12047__b121811235515">Network Reading</strong> &gt; <strong id="ALM-12047__b13271323151">Read Packet Error Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12047__b173611231556">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12047__ul54083694144325"><li id="ALM-12047__li61199409144325">If yes, go to <a href="#ALM-12047__li47122569144325">4</a>.</li><li id="ALM-12047__li58205082144325">If no, go to <a href="#ALM-12047__li18938060144325">2</a>.</li></ul> <ol id="ALM-12047__ol5621610514492"><li id="ALM-12047__li16991200144325"><span>Log in to <span id="ALM-12047__text34789336432">MRS</span> Manager, choose <strong id="ALM-12047__b1096642210512">O&amp;M</strong> &gt; <strong id="ALM-12047__b2977102211513">Alarm</strong> &gt; <strong id="ALM-12047__b10994192210512">Thresholds</strong> &gt; <em id="ALM-12047__i1822231959">Name of the desired cluster</em> &gt; <strong id="ALM-12047__b614102317519">Host</strong> &gt; <strong id="ALM-12047__b121811235515">Network Reading</strong> &gt; <strong id="ALM-12047__b13271323151">Read Packet Error Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12047__b173611231556">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12047__ul54083694144325"><li id="ALM-12047__li61199409144325">If yes, go to <a href="#ALM-12047__li47122569144325">4</a>.</li><li id="ALM-12047__li58205082144325">If no, go to <a href="#ALM-12047__li18938060144325">2</a>.</li></ul>
</p></li><li id="ALM-12047__li18938060144325"><a name="ALM-12047__li18938060144325"></a><a name="li18938060144325"></a><span>Choose <strong id="ALM-12047__b11895317762">O&amp;M</strong> &gt; <strong id="ALM-12047__b789714171965">Alarm</strong> &gt; <strong id="ALM-12047__b9898141713613">Thresholds</strong> &gt; <em id="ALM-12047__i389981710618">Name of the desired cluster</em> &gt; <strong id="ALM-12047__b179008171767">Host</strong> &gt; <strong id="ALM-12047__b109007171611">Network Reading</strong> &gt; <strong id="ALM-12047__b790117174615">Read Packet Error Rate</strong>. Click <strong id="ALM-12047__b139032017464">Modify</strong> in the <strong id="ALM-12047__b169038177610">Operation</strong> column to change the threshold.</span><p><p class="litext" id="ALM-12047__p34109930144325">See <a href="#ALM-12047__fig35859496144325">Figure 1</a>.</p> </p></li><li id="ALM-12047__li18938060144325"><a name="ALM-12047__li18938060144325"></a><a name="li18938060144325"></a><span>Choose <strong id="ALM-12047__b11895317762">O&amp;M</strong> &gt; <strong id="ALM-12047__b789714171965">Alarm</strong> &gt; <strong id="ALM-12047__b9898141713613">Thresholds</strong> &gt; <em id="ALM-12047__i389981710618">Name of the desired cluster</em> &gt; <strong id="ALM-12047__b179008171767">Host</strong> &gt; <strong id="ALM-12047__b109007171611">Network Reading</strong> &gt; <strong id="ALM-12047__b790117174615">Read Packet Error Rate</strong>. Click <strong id="ALM-12047__b139032017464">Modify</strong> in the <strong id="ALM-12047__b169038177610">Operation</strong> column to change the threshold.</span><p><p class="litext" id="ALM-12047__p34109930144325">See <a href="#ALM-12047__fig35859496144325">Figure 1</a>.</p>
<div class="fignone" id="ALM-12047__fig35859496144325"><a name="ALM-12047__fig35859496144325"></a><a name="fig35859496144325"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12047__image777621374319" src="en-us_image_0000001532767698.png"></span></div> <div class="fignone" id="ALM-12047__fig35859496144325"><a name="ALM-12047__fig35859496144325"></a><a name="fig35859496144325"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12047__image777621374319" src="en-us_image_0000001532767698.png"></span></div>
</p></li><li id="ALM-12047__li11450397144325"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12047__ul34110047144325"><li id="ALM-12047__li36224819144325">If yes, no further action is required.</li><li id="ALM-12047__li48529247144325">If no, go to <a href="#ALM-12047__li47122569144325">4</a>.</li></ul> </p></li><li id="ALM-12047__li11450397144325"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12047__ul34110047144325"><li id="ALM-12047__li36224819144325">If yes, no further action is required.</li><li id="ALM-12047__li48529247144325">If no, go to <a href="#ALM-12047__li47122569144325">4</a>.</li></ul>
@ -81,7 +81,7 @@
</p></li><li id="ALM-12047__li52164171144325"><a name="ALM-12047__li52164171144325"></a><a name="li52164171144325"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12047__ul644002144325"><li id="ALM-12047__li21449944144325">If yes, no further action is required.</li><li id="ALM-12047__li59723879144325">If no, go to <a href="#ALM-12047__li66824355144325">6</a>.</li></ul> </p></li><li id="ALM-12047__li52164171144325"><a name="ALM-12047__li52164171144325"></a><a name="li52164171144325"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12047__ul644002144325"><li id="ALM-12047__li21449944144325">If yes, no further action is required.</li><li id="ALM-12047__li59723879144325">If no, go to <a href="#ALM-12047__li66824355144325">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12047__p37260279144922"><strong id="ALM-12047__b41163092144926">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12047__p37260279144922"><strong id="ALM-12047__b41163092144926">Collect the fault information.</strong></p>
<ol start="6" id="ALM-12047__ol4946431144932"><li id="ALM-12047__li66824355144325"><a name="ALM-12047__li66824355144325"></a><a name="li66824355144325"></a><span>On FusionInsight Manager of the active cluster, choose <strong id="ALM-12047__b114185633111">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12047__b912756123118">Log</strong> &gt; <strong id="ALM-12047__b313125673116">Download</strong>.</span></li><li id="ALM-12047__li64548284144325"><span>Select <strong id="ALM-12047__b13721135814311">OMS</strong> for <strong id="ALM-12047__b8721758153120">Service</strong> and click <strong id="ALM-12047__b187221358143114">OK</strong>.</span></li><li id="ALM-12047__li44063647144325"><span>Expand the <strong id="ALM-12047__b1780712356614">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12047__li61028510144325"><span>Click <span><img id="ALM-12047__image1171914283214" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12047__b672772103210">Start Date</strong> and <strong id="ALM-12047__b9727729327">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12047__b127281226321">Download</strong>.</span></li><li id="ALM-12047__li44362264144325"><span>Contact <span id="ALM-12047__text5904144183214">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="6" id="ALM-12047__ol4946431144932"><li id="ALM-12047__li66824355144325"><a name="ALM-12047__li66824355144325"></a><a name="li66824355144325"></a><span>On <span id="ALM-12047__text13811353454">MRS</span> Manager of the active cluster, choose <strong id="ALM-12047__b114185633111">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12047__b912756123118">Log</strong> &gt; <strong id="ALM-12047__b313125673116">Download</strong>.</span></li><li id="ALM-12047__li64548284144325"><span>Select <strong id="ALM-12047__b13721135814311">OMS</strong> for <strong id="ALM-12047__b8721758153120">Service</strong> and click <strong id="ALM-12047__b187221358143114">OK</strong>.</span></li><li id="ALM-12047__li44063647144325"><span>Expand the <strong id="ALM-12047__b1780712356614">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12047__li61028510144325"><span>Click <span><img id="ALM-12047__image1171914283214" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12047__b672772103210">Start Date</strong> and <strong id="ALM-12047__b9727729327">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12047__b127281226321">Download</strong>.</span></li><li id="ALM-12047__li44362264144325"><span>Contact <span id="ALM-12047__text5904144183214">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12047__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12047__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12047__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12047__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -71,7 +71,7 @@
<div class="section" id="ALM-12048__section44045770"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12048__ul53896892"><li id="ALM-12048__li15309985">The alarm threshold is improperly configured.</li><li id="ALM-12048__li3572145">The network quality is poor.</li></ul> <div class="section" id="ALM-12048__section44045770"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12048__ul53896892"><li id="ALM-12048__li15309985">The alarm threshold is improperly configured.</li><li id="ALM-12048__li3572145">The network quality is poor.</li></ul>
</div> </div>
<div class="section" id="ALM-12048__section60867610"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12048__p20908314"><strong id="ALM-12048__b538516311339">Check whether the threshold is set properly.</strong></p> <div class="section" id="ALM-12048__section60867610"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12048__p20908314"><strong id="ALM-12048__b538516311339">Check whether the threshold is set properly.</strong></p>
<ol id="ALM-12048__ol6406395014549"><li id="ALM-12048__li11357890145357"><span>Log in to FusionInsight Manager, choose <strong id="ALM-12048__b11238448717">O&amp;M</strong> &gt; <strong id="ALM-12048__b42571744714">Alarm</strong> &gt; <strong id="ALM-12048__b132639411718">Thresholds</strong> &gt; <em id="ALM-12048__i11266204978">Name of the desired cluster</em> &gt; <strong id="ALM-12048__b1126810415718">Host</strong> &gt; <strong id="ALM-12048__b427064876">Network Writing</strong> &gt; <strong id="ALM-12048__b527544074">Write Packet Error Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12048__b152781042714">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12048__ul1261987145357"><li id="ALM-12048__li57812933145357">If yes, go to <a href="#ALM-12048__li12888339145357">4</a>.</li><li id="ALM-12048__li52336003145357">If no, go to <a href="#ALM-12048__li15963175145357">2</a>.</li></ul> <ol id="ALM-12048__ol6406395014549"><li id="ALM-12048__li11357890145357"><span>Log in to <span id="ALM-12048__text34789336432">MRS</span> Manager, choose <strong id="ALM-12048__b11238448717">O&amp;M</strong> &gt; <strong id="ALM-12048__b42571744714">Alarm</strong> &gt; <strong id="ALM-12048__b132639411718">Thresholds</strong> &gt; <em id="ALM-12048__i11266204978">Name of the desired cluster</em> &gt; <strong id="ALM-12048__b1126810415718">Host</strong> &gt; <strong id="ALM-12048__b427064876">Network Writing</strong> &gt; <strong id="ALM-12048__b527544074">Write Packet Error Rate</strong>, and check whether the alarm threshold is configured properly. The default value is <strong id="ALM-12048__b152781042714">0.5%</strong>. You can adjust the threshold as needed.</span><p><ul class="subitemlist" id="ALM-12048__ul1261987145357"><li id="ALM-12048__li57812933145357">If yes, go to <a href="#ALM-12048__li12888339145357">4</a>.</li><li id="ALM-12048__li52336003145357">If no, go to <a href="#ALM-12048__li15963175145357">2</a>.</li></ul>
</p></li><li id="ALM-12048__li15963175145357"><a name="ALM-12048__li15963175145357"></a><a name="li15963175145357"></a><span>Choose <strong id="ALM-12048__b143281531670">O&amp;M</strong> &gt; <strong id="ALM-12048__b10334143112710">Alarm</strong> &gt; <strong id="ALM-12048__b835115312714">Thresholds</strong> &gt; <em id="ALM-12048__i9353231677">Name of the desired cluster</em> &gt; <strong id="ALM-12048__b33577318710">Host</strong> &gt; <strong id="ALM-12048__b5359531876">Network Writing</strong> &gt; <strong id="ALM-12048__b11361143118716">Write Packet Error Rate</strong>. Click <strong id="ALM-12048__b236373115712">Modify</strong> in the <strong id="ALM-12048__b136519311171">Operation</strong> column to change the threshold.</span><p><p class="litext" id="ALM-12048__p47573930145357">See <a href="#ALM-12048__fig53221363145357">Figure 1</a>.</p> </p></li><li id="ALM-12048__li15963175145357"><a name="ALM-12048__li15963175145357"></a><a name="li15963175145357"></a><span>Choose <strong id="ALM-12048__b143281531670">O&amp;M</strong> &gt; <strong id="ALM-12048__b10334143112710">Alarm</strong> &gt; <strong id="ALM-12048__b835115312714">Thresholds</strong> &gt; <em id="ALM-12048__i9353231677">Name of the desired cluster</em> &gt; <strong id="ALM-12048__b33577318710">Host</strong> &gt; <strong id="ALM-12048__b5359531876">Network Writing</strong> &gt; <strong id="ALM-12048__b11361143118716">Write Packet Error Rate</strong>. Click <strong id="ALM-12048__b236373115712">Modify</strong> in the <strong id="ALM-12048__b136519311171">Operation</strong> column to change the threshold.</span><p><p class="litext" id="ALM-12048__p47573930145357">See <a href="#ALM-12048__fig53221363145357">Figure 1</a>.</p>
<div class="fignone" id="ALM-12048__fig53221363145357"><a name="ALM-12048__fig53221363145357"></a><a name="fig53221363145357"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12048__image71961316435" src="en-us_image_0000001532767658.png"></span></div> <div class="fignone" id="ALM-12048__fig53221363145357"><a name="ALM-12048__fig53221363145357"></a><a name="fig53221363145357"></a><span class="figcap"><b>Figure 1 </b>Configuring the alarm threshold</span><br><span><img id="ALM-12048__image71961316435" src="en-us_image_0000001532767658.png"></span></div>
</p></li><li id="ALM-12048__li53127101145357"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12048__ul44566628145357"><li id="ALM-12048__li9450851145357">If yes, no further action is required.</li><li id="ALM-12048__li27321468145357">If no, go to <a href="#ALM-12048__li12888339145357">4</a>.</li></ul> </p></li><li id="ALM-12048__li53127101145357"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12048__ul44566628145357"><li id="ALM-12048__li9450851145357">If yes, no further action is required.</li><li id="ALM-12048__li27321468145357">If no, go to <a href="#ALM-12048__li12888339145357">4</a>.</li></ul>
@ -81,7 +81,7 @@
</p></li><li id="ALM-12048__li60279330145357"><a name="ALM-12048__li60279330145357"></a><a name="li60279330145357"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12048__ul3229702145357"><li id="ALM-12048__li48886195145357">If yes, no further action is required.</li><li id="ALM-12048__li358855145357">If no, go to <a href="#ALM-12048__li5643066145357">6</a>.</li></ul> </p></li><li id="ALM-12048__li60279330145357"><a name="ALM-12048__li60279330145357"></a><a name="li60279330145357"></a><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12048__ul3229702145357"><li id="ALM-12048__li48886195145357">If yes, no further action is required.</li><li id="ALM-12048__li358855145357">If no, go to <a href="#ALM-12048__li5643066145357">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12048__p29067324145357"><strong id="ALM-12048__b10082732145437">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12048__p29067324145357"><strong id="ALM-12048__b10082732145437">Collect the fault information.</strong></p>
<ol start="6" id="ALM-12048__ol65647935145434"><li id="ALM-12048__li5643066145357"><a name="ALM-12048__li5643066145357"></a><a name="li5643066145357"></a><span>On FusionInsight Manager of the active cluster, choose <strong id="ALM-12048__b4624406810">O&amp;M</strong> &gt; <strong id="ALM-12048__b1867213013818">Log</strong> &gt; <strong id="ALM-12048__b1867914017813">Download</strong>.</span></li><li id="ALM-12048__li50787595145357"><span>Select <strong id="ALM-12048__b8263126183">OMS</strong> for <strong id="ALM-12048__b9277766818">Service</strong> and click <strong id="ALM-12048__b142791561189">OK</strong>.</span></li><li id="ALM-12048__li54435176145357"><span>Expand the <strong id="ALM-12048__b192997101388">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12048__li20154536145357"><span>Click <span><img id="ALM-12048__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12048__b154102151382">Start Date</strong> and <strong id="ALM-12048__b24171815687">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12048__b1742012153810">Download</strong>.</span></li><li id="ALM-12048__li21904738145357"><span>Contact <span id="ALM-12048__text1165617231785">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="6" id="ALM-12048__ol65647935145434"><li id="ALM-12048__li5643066145357"><a name="ALM-12048__li5643066145357"></a><a name="li5643066145357"></a><span>On <span id="ALM-12048__text571918394453">MRS</span> Manager of the active cluster, choose <strong id="ALM-12048__b4624406810">O&amp;M</strong> &gt; <strong id="ALM-12048__b1867213013818">Log</strong> &gt; <strong id="ALM-12048__b1867914017813">Download</strong>.</span></li><li id="ALM-12048__li50787595145357"><span>Select <strong id="ALM-12048__b8263126183">OMS</strong> for <strong id="ALM-12048__b9277766818">Service</strong> and click <strong id="ALM-12048__b142791561189">OK</strong>.</span></li><li id="ALM-12048__li54435176145357"><span>Expand the <strong id="ALM-12048__b192997101388">Hosts</strong> dialog box and select the alarm node and the active OMS node.</span></li><li id="ALM-12048__li20154536145357"><span>Click <span><img id="ALM-12048__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12048__b154102151382">Start Date</strong> and <strong id="ALM-12048__b24171815687">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-12048__b1742012153810">Download</strong>.</span></li><li id="ALM-12048__li21904738145357"><span>Contact <span id="ALM-12048__text1165617231785">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12048__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12048__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12048__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12048__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -71,18 +71,18 @@
<div class="section" id="ALM-12049__sa03918d9e6754c80bef107ff31e23284"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12049__en-us_topic_0070543623_ul1897869"><li id="ALM-12049__en-us_topic_0070543623_li17080825">The alarm threshold is set improperly.</li><li id="ALM-12049__en-us_topic_0070543623_li19509699">The network port rate cannot meet the current service requirements.</li></ul> <div class="section" id="ALM-12049__sa03918d9e6754c80bef107ff31e23284"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12049__en-us_topic_0070543623_ul1897869"><li id="ALM-12049__en-us_topic_0070543623_li17080825">The alarm threshold is set improperly.</li><li id="ALM-12049__en-us_topic_0070543623_li19509699">The network port rate cannot meet the current service requirements.</li></ul>
</div> </div>
<div class="section" id="ALM-12049__s0f9f5ec0a021434b9928f5bf4c940044"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12049__en-us_topic_0070543623_p36781757"><strong id="ALM-12049__b4092164015127">Check whether the threshold is set properly.</strong></p> <div class="section" id="ALM-12049__s0f9f5ec0a021434b9928f5bf4c940044"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12049__en-us_topic_0070543623_p36781757"><strong id="ALM-12049__b4092164015127">Check whether the threshold is set properly.</strong></p>
<ol id="ALM-12049__ol6452245415148"><li id="ALM-12049__li4670351415131"><span>On the FusionInsight Manager, choose <strong id="ALM-12049__b15915337194818">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12049__b191503711486">Thresholds</strong> &gt; <em id="ALM-12049__i189151337174819">Name of the desired cluster</em> &gt; <strong id="ALM-12049__b16915237154813">Host</strong> &gt; <strong id="ALM-12049__b1691573754820">Network Reading</strong> &gt; <strong id="ALM-12049__b10915143734811">Read Throughput Rate</strong> and check whether the alarm threshold is set properly. (By default, 80% is a proper value. However, users can configure the value as required.)</span><p><ul class="subitemlist" id="ALM-12049__ul5738506215131"><li id="ALM-12049__li2521002015131">If yes, go to <a href="#ALM-12049__li5611086815131">2</a>.</li><li id="ALM-12049__li2874573915131">If no, go to <a href="#ALM-12049__li3065917315131">4</a>.</li></ul> <ol id="ALM-12049__ol6452245415148"><li id="ALM-12049__li4670351415131"><span>On the <span id="ALM-12049__text34789336432">MRS</span> Manager, choose <strong id="ALM-12049__b15915337194818">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12049__b191503711486">Thresholds</strong> &gt; <em id="ALM-12049__i189151337174819">Name of the desired cluster</em> &gt; <strong id="ALM-12049__b16915237154813">Host</strong> &gt; <strong id="ALM-12049__b1691573754820">Network Reading</strong> &gt; <strong id="ALM-12049__b10915143734811">Read Throughput Rate</strong> and check whether the alarm threshold is set properly. (By default, 80% is a proper value. However, users can configure the value as required.)</span><p><ul class="subitemlist" id="ALM-12049__ul5738506215131"><li id="ALM-12049__li2521002015131">If yes, go to <a href="#ALM-12049__li5611086815131">2</a>.</li><li id="ALM-12049__li2874573915131">If no, go to <a href="#ALM-12049__li3065917315131">4</a>.</li></ul>
</p></li><li id="ALM-12049__li5611086815131"><a name="ALM-12049__li5611086815131"></a><a name="li5611086815131"></a><span>Based on actual usage condition, choose <strong id="ALM-12049__b07081191469">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12049__b20106143125110">Thresholds</strong> &gt; <em id="ALM-12049__i11541848175114">Name of the desired cluster</em> &gt; <strong id="ALM-12049__b47111192469">Host</strong> &gt; <strong id="ALM-12049__b2418814015131">Network Reading</strong> &gt; <strong id="ALM-12049__b1308228415131">Read Throughput Rate</strong> and click <strong id="ALM-12049__b84051320104416">Modify</strong> in the<strong id="ALM-12049__b18538823144410"> Operation</strong> column to modify the alarm threshold.</span><p><p class="litext" id="ALM-12049__p5303205615131">For details, see <a href="#ALM-12049__fig566375315131">Figure 1</a>.</p> </p></li><li id="ALM-12049__li5611086815131"><a name="ALM-12049__li5611086815131"></a><a name="li5611086815131"></a><span>Based on actual usage condition, choose <strong id="ALM-12049__b07081191469">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12049__b20106143125110">Thresholds</strong> &gt; <em id="ALM-12049__i11541848175114">Name of the desired cluster</em> &gt; <strong id="ALM-12049__b47111192469">Host</strong> &gt; <strong id="ALM-12049__b2418814015131">Network Reading</strong> &gt; <strong id="ALM-12049__b1308228415131">Read Throughput Rate</strong> and click <strong id="ALM-12049__b84051320104416">Modify</strong> in the<strong id="ALM-12049__b18538823144410"> Operation</strong> column to modify the alarm threshold.</span><p><p class="litext" id="ALM-12049__p5303205615131">For details, see <a href="#ALM-12049__fig566375315131">Figure 1</a>.</p>
<div class="fignone" id="ALM-12049__fig566375315131"><a name="ALM-12049__fig566375315131"></a><a name="fig566375315131"></a><span class="figcap"><b>Figure 1 </b>Setting alarm thresholds</span><br><span><img id="ALM-12049__image1615410501365" src="en-us_image_0000001532448486.png"></span></div> <div class="fignone" id="ALM-12049__fig566375315131"><a name="ALM-12049__fig566375315131"></a><a name="fig566375315131"></a><span class="figcap"><b>Figure 1 </b>Setting alarm thresholds</span><br><span><img id="ALM-12049__image1615410501365" src="en-us_image_0000001532448486.png"></span></div>
</p></li><li id="ALM-12049__li6085933615131"><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12049__ul5129012315131"><li id="ALM-12049__li3523576915131">If yes, no further action is required.</li><li id="ALM-12049__li3552506415131">If no, go to <a href="#ALM-12049__li3065917315131">4</a>.</li></ul> </p></li><li id="ALM-12049__li6085933615131"><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12049__ul5129012315131"><li id="ALM-12049__li3523576915131">If yes, no further action is required.</li><li id="ALM-12049__li3552506415131">If no, go to <a href="#ALM-12049__li3065917315131">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12049__p5895793115131"><strong id="ALM-12049__b5562659915153">Check whether the network port rate can meet the service requirements.</strong></p> <p class="tableheading" id="ALM-12049__p5895793115131"><strong id="ALM-12049__b5562659915153">Check whether the network port rate can meet the service requirements.</strong></p>
<ol start="4" id="ALM-12049__ol665573431527"><li id="ALM-12049__li3065917315131"><a name="ALM-12049__li3065917315131"></a><a name="li3065917315131"></a><span>On FusionInsight Manager, click <span><img id="ALM-12049__image168221113135319" src="en-us_image_0000001582927869.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host and the network port name for which the alarm is generated.</span></li><li id="ALM-12049__li36506615131"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12049__b749710315131">root</strong>. <span id="ALM-12049__text43649449460"></span></span></li><li id="ALM-12049__li1487667815131"><span>Run the <strong id="ALM-12049__b328560015131">ethtool </strong><em id="ALM-12049__i2957040015131">network port name</em> command to check the maximum speed of the current network port.</span><p><div class="note" id="ALM-12049__note4639220615131"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-12049__p6480701315131">In the VM environment, you cannot run a command to query the network port rate. It is recommended that you contact the system administrator to confirm whether the network port rate meets the requirements.</p> <ol start="4" id="ALM-12049__ol665573431527"><li id="ALM-12049__li3065917315131"><a name="ALM-12049__li3065917315131"></a><a name="li3065917315131"></a><span>On <span id="ALM-12049__text1631874219458">MRS</span> Manager, click <span><img id="ALM-12049__image168221113135319" src="en-us_image_0000001582927869.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host and the network port name for which the alarm is generated.</span></li><li id="ALM-12049__li36506615131"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12049__b749710315131">root</strong>. <span id="ALM-12049__text43649449460"></span></span></li><li id="ALM-12049__li1487667815131"><span>Run the <strong id="ALM-12049__b328560015131">ethtool </strong><em id="ALM-12049__i2957040015131">network port name</em> command to check the maximum speed of the current network port.</span><p><div class="note" id="ALM-12049__note4639220615131"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-12049__p6480701315131">In the VM environment, you cannot run a command to query the network port rate. It is recommended that you contact the system administrator to confirm whether the network port rate meets the requirements.</p>
</div></div> </div></div>
</p></li><li id="ALM-12049__li6678124515131"><span>If the network read throughput rate exceeds the threshold, contact the system administrator to increase the network port rate.</span></li><li id="ALM-12049__li3780745315131"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12049__ul6509010915131"><li id="ALM-12049__li6416030115131">If yes, no further action is required.</li><li id="ALM-12049__li2960185515131">If no, go to <a href="#ALM-12049__li4699944215131">9</a>.</li></ul> </p></li><li id="ALM-12049__li6678124515131"><span>If the network read throughput rate exceeds the threshold, contact the system administrator to increase the network port rate.</span></li><li id="ALM-12049__li3780745315131"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12049__ul6509010915131"><li id="ALM-12049__li6416030115131">If yes, no further action is required.</li><li id="ALM-12049__li2960185515131">If no, go to <a href="#ALM-12049__li4699944215131">9</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12049__p4894007015131"><strong id="ALM-12049__b3536737115217">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12049__p4894007015131"><strong id="ALM-12049__b3536737115217">Collect fault information.</strong></p>
<ol start="9" id="ALM-12049__ol2826703815214"><li id="ALM-12049__li4699944215131"><a name="ALM-12049__li4699944215131"></a><a name="li4699944215131"></a><span>On the FusionInsight Manager home page of the active cluster, choose <strong id="ALM-12049__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12049__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12049__li6522206415131"><span>Select <strong id="ALM-12049__b1352831932712">OMS</strong> from the <strong id="ALM-12049__b4885847115131">Service</strong> and click <strong id="ALM-12049__b3991118545">OK</strong>.</span></li><li id="ALM-12049__li4849583015131"><span>Set <strong id="ALM-12049__b5012766815131">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12049__li1145664103113"><span>Click <span><img id="ALM-12049__image1945644173117" src="en-us_image_0000001582807921.png"></span> in the upper right corner, and set <strong id="ALM-12049__b6456941173117">Start Date</strong> and <strong id="ALM-12049__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12049__b13456164113319">Download</strong>.</span></li><li id="ALM-12049__li495644512588"><span>Contact the <span id="ALM-12049__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="9" id="ALM-12049__ol2826703815214"><li id="ALM-12049__li4699944215131"><a name="ALM-12049__li4699944215131"></a><a name="li4699944215131"></a><span>On the <span id="ALM-12049__text1367314374513">MRS</span> Manager home page of the active cluster, choose <strong id="ALM-12049__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12049__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12049__li6522206415131"><span>Select <strong id="ALM-12049__b1352831932712">OMS</strong> from the <strong id="ALM-12049__b4885847115131">Service</strong> and click <strong id="ALM-12049__b3991118545">OK</strong>.</span></li><li id="ALM-12049__li4849583015131"><span>Set <strong id="ALM-12049__b5012766815131">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12049__li1145664103113"><span>Click <span><img id="ALM-12049__image1945644173117" src="en-us_image_0000001582807921.png"></span> in the upper right corner, and set <strong id="ALM-12049__b6456941173117">Start Date</strong> and <strong id="ALM-12049__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12049__b13456164113319">Download</strong>.</span></li><li id="ALM-12049__li495644512588"><span>Contact the <span id="ALM-12049__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12049__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12049__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12049__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12049__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -71,18 +71,18 @@
<div class="section" id="ALM-12050__sf1f91024377049c1863cc8ab14993c9d"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12050__en-us_topic_0070543624_ul15671223"><li id="ALM-12050__en-us_topic_0070543624_li6823280">The alarm threshold is set improperly.</li><li id="ALM-12050__en-us_topic_0070543624_li61409521">The network port rate cannot meet the current service requirements.</li></ul> <div class="section" id="ALM-12050__sf1f91024377049c1863cc8ab14993c9d"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12050__en-us_topic_0070543624_ul15671223"><li id="ALM-12050__en-us_topic_0070543624_li6823280">The alarm threshold is set improperly.</li><li id="ALM-12050__en-us_topic_0070543624_li61409521">The network port rate cannot meet the current service requirements.</li></ul>
</div> </div>
<div class="section" id="ALM-12050__s288b004e523b4795aa832a7ef214236d"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12050__en-us_topic_0070543624_p8115287"><strong id="ALM-12050__b4779452715650">Check whether the threshold is set properly.</strong></p> <div class="section" id="ALM-12050__s288b004e523b4795aa832a7ef214236d"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12050__en-us_topic_0070543624_p8115287"><strong id="ALM-12050__b4779452715650">Check whether the threshold is set properly.</strong></p>
<ol id="ALM-12050__ol626009901578"><li id="ALM-12050__li3381340415653"><span>On the FusionInsight Manager, choose <strong id="ALM-12050__b034142294917">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12050__b1334142294919">Thresholds</strong> &gt; <em id="ALM-12050__i63492294910">Name of the desired cluster</em> &gt; <strong id="ALM-12050__b134152244914">Host</strong> &gt; <strong id="ALM-12050__b1349229499">Network Writing</strong> &gt; <strong id="ALM-12050__b1434722204913">Write Throughput Rate</strong> and check whether the alarm threshold is set properly. (By default, 80% is a proper value. However, users can configure the value as required.)</span><p><ul class="subitemlist" id="ALM-12050__ul1867012515653"><li id="ALM-12050__li683775815653">If yes, go to <a href="#ALM-12050__li3034361015653">4</a>.</li><li id="ALM-12050__li1698753915653">If no, go to <a href="#ALM-12050__li2386220215653">2</a>.</li></ul> <ol id="ALM-12050__ol626009901578"><li id="ALM-12050__li3381340415653"><span>On the <span id="ALM-12050__text34789336432">MRS</span> Manager, choose <strong id="ALM-12050__b034142294917">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12050__b1334142294919">Thresholds</strong> &gt; <em id="ALM-12050__i63492294910">Name of the desired cluster</em> &gt; <strong id="ALM-12050__b134152244914">Host</strong> &gt; <strong id="ALM-12050__b1349229499">Network Writing</strong> &gt; <strong id="ALM-12050__b1434722204913">Write Throughput Rate</strong> and check whether the alarm threshold is set properly. (By default, 80% is a proper value. However, users can configure the value as required.)</span><p><ul class="subitemlist" id="ALM-12050__ul1867012515653"><li id="ALM-12050__li683775815653">If yes, go to <a href="#ALM-12050__li3034361015653">4</a>.</li><li id="ALM-12050__li1698753915653">If no, go to <a href="#ALM-12050__li2386220215653">2</a>.</li></ul>
</p></li><li id="ALM-12050__li2386220215653"><a name="ALM-12050__li2386220215653"></a><a name="li2386220215653"></a><span>Based on actual usage condition, choose <strong id="ALM-12050__b972065414613">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12050__b17713333531">Thresholds</strong> &gt; <em id="ALM-12050__i277113336535">Name of the desired cluster</em> &gt; <strong id="ALM-12050__b12724175415463">Host</strong> &gt; <strong id="ALM-12050__b2479886215653">Network Writing</strong> &gt; <strong id="ALM-12050__b6255081015653">Write Throughput Rate</strong> and click <strong id="ALM-12050__b84051320104416">Modify</strong> in the<strong id="ALM-12050__b18538823144410"> Operation</strong> column to modify the alarm threshold.</span><p><p class="litext" id="ALM-12050__p3345084615653">For details, see <a href="#ALM-12050__fig2514972915653">Figure 1</a>.</p> </p></li><li id="ALM-12050__li2386220215653"><a name="ALM-12050__li2386220215653"></a><a name="li2386220215653"></a><span>Based on actual usage condition, choose <strong id="ALM-12050__b972065414613">O&amp;M &gt; Alarm</strong> &gt; <strong id="ALM-12050__b17713333531">Thresholds</strong> &gt; <em id="ALM-12050__i277113336535">Name of the desired cluster</em> &gt; <strong id="ALM-12050__b12724175415463">Host</strong> &gt; <strong id="ALM-12050__b2479886215653">Network Writing</strong> &gt; <strong id="ALM-12050__b6255081015653">Write Throughput Rate</strong> and click <strong id="ALM-12050__b84051320104416">Modify</strong> in the<strong id="ALM-12050__b18538823144410"> Operation</strong> column to modify the alarm threshold.</span><p><p class="litext" id="ALM-12050__p3345084615653">For details, see <a href="#ALM-12050__fig2514972915653">Figure 1</a>.</p>
<div class="fignone" id="ALM-12050__fig2514972915653"><a name="ALM-12050__fig2514972915653"></a><a name="fig2514972915653"></a><span class="figcap"><b>Figure 1 </b>Setting alarm thresholds</span><br><span><img id="ALM-12050__image1615410501365" src="en-us_image_0000001532448282.png"></span></div> <div class="fignone" id="ALM-12050__fig2514972915653"><a name="ALM-12050__fig2514972915653"></a><a name="fig2514972915653"></a><span class="figcap"><b>Figure 1 </b>Setting alarm thresholds</span><br><span><img id="ALM-12050__image1615410501365" src="en-us_image_0000001532448282.png"></span></div>
</p></li><li id="ALM-12050__li5919843115653"><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12050__ul6204017715653"><li id="ALM-12050__li1343323115653">If yes, no further action is required.</li><li id="ALM-12050__li1434989315653">If no, go to <a href="#ALM-12050__li3034361015653">4</a>.</li></ul> </p></li><li id="ALM-12050__li5919843115653"><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12050__ul6204017715653"><li id="ALM-12050__li1343323115653">If yes, no further action is required.</li><li id="ALM-12050__li1434989315653">If no, go to <a href="#ALM-12050__li3034361015653">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12050__p2149068415653"><strong id="ALM-12050__b828937615716">Check whether the network port rate can meet the service requirements.</strong></p> <p class="tableheading" id="ALM-12050__p2149068415653"><strong id="ALM-12050__b828937615716">Check whether the network port rate can meet the service requirements.</strong></p>
<ol start="4" id="ALM-12050__ol3843532615729"><li id="ALM-12050__li3034361015653"><a name="ALM-12050__li3034361015653"></a><a name="li3034361015653"></a><span>On FusionInsight Manager, click <span><img id="ALM-12050__image168221113135319" src="en-us_image_0000001532767506.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host and the network port name for which the alarm is generated.</span></li><li id="ALM-12050__li4191332115653"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12050__b465703515653">root</strong>. <span id="ALM-12050__text43649449460"></span></span></li><li id="ALM-12050__li3191668615653"><span>Run the <strong id="ALM-12050__b4167557215653">ethtool</strong><em id="ALM-12050__i3953582915653">network port name</em> command to check the maximum speed of the current network port.</span><p><div class="note" id="ALM-12050__note4828554115653"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-12050__p2027814115653">In the VM environment, you cannot run a command to query the network port rate. It is recommended that you contact the system administrator to confirm whether the network port rate meets the requirements.</p> <ol start="4" id="ALM-12050__ol3843532615729"><li id="ALM-12050__li3034361015653"><a name="ALM-12050__li3034361015653"></a><a name="li3034361015653"></a><span>On <span id="ALM-12050__text1228747114510">MRS</span> Manager, click <span><img id="ALM-12050__image168221113135319" src="en-us_image_0000001532767506.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host and the network port name for which the alarm is generated.</span></li><li id="ALM-12050__li4191332115653"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12050__b465703515653">root</strong>. <span id="ALM-12050__text43649449460"></span></span></li><li id="ALM-12050__li3191668615653"><span>Run the <strong id="ALM-12050__b4167557215653">ethtool</strong><em id="ALM-12050__i3953582915653">network port name</em> command to check the maximum speed of the current network port.</span><p><div class="note" id="ALM-12050__note4828554115653"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-12050__p2027814115653">In the VM environment, you cannot run a command to query the network port rate. It is recommended that you contact the system administrator to confirm whether the network port rate meets the requirements.</p>
</div></div> </div></div>
</p></li><li id="ALM-12050__li1881472115653"><span>If the network write throughput rate exceeds the threshold, contact the system administrator to increase the network port rate.</span></li><li id="ALM-12050__li2938411415653"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12050__ul3018892815653"><li id="ALM-12050__li3511476815653">If yes, no further action is required.</li><li id="ALM-12050__li2572394615653">If no, go to <a href="#ALM-12050__li1329206015653">9</a>.</li></ul> </p></li><li id="ALM-12050__li1881472115653"><span>If the network write throughput rate exceeds the threshold, contact the system administrator to increase the network port rate.</span></li><li id="ALM-12050__li2938411415653"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12050__ul3018892815653"><li id="ALM-12050__li3511476815653">If yes, no further action is required.</li><li id="ALM-12050__li2572394615653">If no, go to <a href="#ALM-12050__li1329206015653">9</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12050__p326490115653"><strong id="ALM-12050__b6410918015740">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12050__p326490115653"><strong id="ALM-12050__b6410918015740">Collect fault information.</strong></p>
<ol start="9" id="ALM-12050__ol1685979415736"><li id="ALM-12050__li1329206015653"><a name="ALM-12050__li1329206015653"></a><a name="li1329206015653"></a><span>On the FusionInsight Manager home page of the active cluster, choose <strong id="ALM-12050__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12050__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12050__li3479759015653"><span>Select <strong id="ALM-12050__b1352831932712">OMS</strong> from the <strong id="ALM-12050__b291511315653">Service</strong> and click <strong id="ALM-12050__b3991118545">OK</strong>.</span></li><li id="ALM-12050__li3252315653"><span>Set <strong id="ALM-12050__b4474285615653">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12050__li1145664103113"><span>Click <span><img id="ALM-12050__image1945644173117" src="en-us_image_0000001583087425.png"></span> in the upper right corner, and set <strong id="ALM-12050__b6456941173117">Start Date</strong> and <strong id="ALM-12050__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12050__b13456164113319">Download</strong>.</span></li><li id="ALM-12050__li495644512588"><span>Contact the <span id="ALM-12050__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="9" id="ALM-12050__ol1685979415736"><li id="ALM-12050__li1329206015653"><a name="ALM-12050__li1329206015653"></a><a name="li1329206015653"></a><span>On the <span id="ALM-12050__text533084864517">MRS</span> Manager home page of the active cluster, choose <strong id="ALM-12050__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12050__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12050__li3479759015653"><span>Select <strong id="ALM-12050__b1352831932712">OMS</strong> from the <strong id="ALM-12050__b291511315653">Service</strong> and click <strong id="ALM-12050__b3991118545">OK</strong>.</span></li><li id="ALM-12050__li3252315653"><span>Set <strong id="ALM-12050__b4474285615653">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12050__li1145664103113"><span>Click <span><img id="ALM-12050__image1945644173117" src="en-us_image_0000001583087425.png"></span> in the upper right corner, and set <strong id="ALM-12050__b6456941173117">Start Date</strong> and <strong id="ALM-12050__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12050__b13456164113319">Download</strong>.</span></li><li id="ALM-12050__li495644512588"><span>Contact the <span id="ALM-12050__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12050__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12050__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12050__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12050__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -71,7 +71,7 @@
<div class="section" id="ALM-12051__s0cead5bbc9184838988c86d15e691059"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12051__p842912019417">Massive small files are stored in the disk.</p> <div class="section" id="ALM-12051__s0cead5bbc9184838988c86d15e691059"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12051__p842912019417">Massive small files are stored in the disk.</p>
</div> </div>
<div class="section" id="ALM-12051__s693cda05471c449fbbb49adf59fe9622"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12051__en-us_topic_0070543626_p60322871"><strong id="ALM-12051__b50867683151048">Massive small files are stored in the disk.</strong></p> <div class="section" id="ALM-12051__s693cda05471c449fbbb49adf59fe9622"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12051__en-us_topic_0070543626_p60322871"><strong id="ALM-12051__b50867683151048">Massive small files are stored in the disk.</strong></p>
<ol id="ALM-12051__ol7307055151059"><li id="ALM-12051__li366824151050"><span>On FusionInsight Manager, choose <strong id="ALM-12051__b3405855155015">O&amp;M &gt; Alarm &gt; Alarms</strong> and click <span><img id="ALM-12051__image168221113135319" src="en-us_image_0000001583127373.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host and the disk partition for which the alarm is generated.</span></li><li id="ALM-12051__li29712783151050"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12051__b3301420151050">root</strong>. <span id="ALM-12051__text43649449460"></span></span></li><li id="ALM-12051__li51564885151050"><span>Run the <strong id="ALM-12051__b62048598192327">df -i | grep -iE "</strong><em id="ALM-12051__i45978739192327">partition name|</em>Filesystem" command to check the current disk Inode usage.</span><p><pre class="screen" id="ALM-12051__screen3233016319146"># df -i | grep -iE "<em id="ALM-12051__i85483581988"><strong id="ALM-12051__b67335711988">xvda2</strong></em>|Filesystem" <ol id="ALM-12051__ol7307055151059"><li id="ALM-12051__li366824151050"><span>On <span id="ALM-12051__text34789336432">MRS</span> Manager, choose <strong id="ALM-12051__b3405855155015">O&amp;M &gt; Alarm &gt; Alarms</strong> and click <span><img id="ALM-12051__image168221113135319" src="en-us_image_0000001583127373.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host and the disk partition for which the alarm is generated.</span></li><li id="ALM-12051__li29712783151050"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12051__b3301420151050">root</strong>. <span id="ALM-12051__text43649449460"></span></span></li><li id="ALM-12051__li51564885151050"><span>Run the <strong id="ALM-12051__b62048598192327">df -i | grep -iE "</strong><em id="ALM-12051__i45978739192327">partition name|</em>Filesystem" command to check the current disk Inode usage.</span><p><pre class="screen" id="ALM-12051__screen3233016319146"># df -i | grep -iE "<em id="ALM-12051__i85483581988"><strong id="ALM-12051__b67335711988">xvda2</strong></em>|Filesystem"
Filesystem Inodes IUsed IFree IUse% Mounted on Filesystem Inodes IUsed IFree IUse% Mounted on
/dev/xvda2 2359296 207420 2151876 <strong id="ALM-12051__b4380300819328">9%</strong> /</pre> /dev/xvda2 2359296 207420 2151876 <strong id="ALM-12051__b4380300819328">9%</strong> /</pre>
</p></li><li id="ALM-12051__li14711322338"><span>If the Inode usage exceeds the threshold, manually check small files stored in the disk partition and confirm whether these small files can be deleted.</span><p><div class="note" id="ALM-12051__note187031221315"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12051__p17058221139">Run the <strong id="ALM-12051__b767393015319">for i in /*; do echo $i; find $i|wc -l;</strong> <strong id="ALM-12051__b176739309311">done</strong> command to query the number of files in a partition. Replace <strong id="ALM-12051__b1367310309311">/*</strong> with the specified partition.</p> </p></li><li id="ALM-12051__li14711322338"><span>If the Inode usage exceeds the threshold, manually check small files stored in the disk partition and confirm whether these small files can be deleted.</span><p><div class="note" id="ALM-12051__note187031221315"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12051__p17058221139">Run the <strong id="ALM-12051__b767393015319">for i in /*; do echo $i; find $i|wc -l;</strong> <strong id="ALM-12051__b176739309311">done</strong> command to query the number of files in a partition. Replace <strong id="ALM-12051__b1367310309311">/*</strong> with the specified partition.</p>
@ -89,7 +89,7 @@ Filesystem Inodes IUsed IFree IUse% Mounted on
</p></li><li id="ALM-12051__li52275864151050"><a name="ALM-12051__li52275864151050"></a><a name="li52275864151050"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12051__ul42070605151050"><li id="ALM-12051__li65141340151050">If yes, no further action is required.</li><li class="subitemlist" id="ALM-12051__li14419336439">If no, go to <a href="#ALM-12051__li1819875814203">6</a>.</li></ul> </p></li><li id="ALM-12051__li52275864151050"><a name="ALM-12051__li52275864151050"></a><a name="li52275864151050"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12051__ul42070605151050"><li id="ALM-12051__li65141340151050">If yes, no further action is required.</li><li class="subitemlist" id="ALM-12051__li14419336439">If no, go to <a href="#ALM-12051__li1819875814203">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12051__p32707140151050"><strong id="ALM-12051__b22914379151125">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12051__p32707140151050"><strong id="ALM-12051__b22914379151125">Collect fault information.</strong></p>
<ol start="6" id="ALM-12051__ol40476616151128"><li id="ALM-12051__li1819875814203"><a name="ALM-12051__li1819875814203"></a><a name="li1819875814203"></a><span>On the FusionInsight Manager home page of the active cluster, choose<strong id="ALM-12051__b32032649151050"> </strong><strong id="ALM-12051__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12051__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12051__li24712483151050"><span>Select <strong id="ALM-12051__b1352831932712">OMS</strong> from the <strong id="ALM-12051__b48358353151050">Service</strong> and click <strong id="ALM-12051__b3991118545">OK</strong>.</span></li><li id="ALM-12051__li55554122151050"><span>Set <strong id="ALM-12051__b21085761151050">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12051__li1145664103113"><span>Click <span><img id="ALM-12051__image1945644173117" src="en-us_image_0000001532927410.png"></span> in the upper right corner, and set <strong id="ALM-12051__b6456941173117">Start Date</strong> and <strong id="ALM-12051__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12051__b13456164113319">Download</strong>.</span></li><li id="ALM-12051__li495644512588"><span>Contact the <span id="ALM-12051__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="6" id="ALM-12051__ol40476616151128"><li id="ALM-12051__li1819875814203"><a name="ALM-12051__li1819875814203"></a><a name="li1819875814203"></a><span>On the <span id="ALM-12051__text591925064519">MRS</span> Manager home page of the active cluster, choose<strong id="ALM-12051__b32032649151050"> </strong><strong id="ALM-12051__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12051__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12051__li24712483151050"><span>Select <strong id="ALM-12051__b1352831932712">OMS</strong> from the <strong id="ALM-12051__b48358353151050">Service</strong> and click <strong id="ALM-12051__b3991118545">OK</strong>.</span></li><li id="ALM-12051__li55554122151050"><span>Set <strong id="ALM-12051__b21085761151050">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12051__li1145664103113"><span>Click <span><img id="ALM-12051__image1945644173117" src="en-us_image_0000001532927410.png"></span> in the upper right corner, and set <strong id="ALM-12051__b6456941173117">Start Date</strong> and <strong id="ALM-12051__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12051__b13456164113319">Download</strong>.</span></li><li id="ALM-12051__li495644512588"><span>Contact the <span id="ALM-12051__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12051__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12051__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12051__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12051__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -66,7 +66,7 @@
<div class="section" id="ALM-12052__sb7b610c6de7745eb88b799c8579eadf1"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12052__en-us_topic_0070543627_ul48073244"><li id="ALM-12052__en-us_topic_0070543627_li30006020">The temporary port cannot meet the current service requirements.</li><li id="ALM-12052__en-us_topic_0070543627_li1618726">The system is abnormal.</li></ul> <div class="section" id="ALM-12052__sb7b610c6de7745eb88b799c8579eadf1"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12052__en-us_topic_0070543627_ul48073244"><li id="ALM-12052__en-us_topic_0070543627_li30006020">The temporary port cannot meet the current service requirements.</li><li id="ALM-12052__en-us_topic_0070543627_li1618726">The system is abnormal.</li></ul>
</div> </div>
<div class="section" id="ALM-12052__s6decbfe8b04e489d9cf8766a9aa9271f"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12052__en-us_topic_0070543627_p64008009"><strong id="ALM-12052__b36299953151424">Expand the temporary port number range.</strong></p> <div class="section" id="ALM-12052__s6decbfe8b04e489d9cf8766a9aa9271f"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12052__en-us_topic_0070543627_p64008009"><strong id="ALM-12052__b36299953151424">Expand the temporary port number range.</strong></p>
<ol id="ALM-12052__ol4904735151436"><li id="ALM-12052__li53454689151427"><span>On FusionInsight Manager, click <span><img id="ALM-12052__image168221113135319" src="en-us_image_0000001532607970.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12052__li34862525151427"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12052__b6057410214588">omm</strong>.</span></li><li id="ALM-12052__li5292302151427"><span>Run<strong id="ALM-12052__b4455986911048"> </strong>the<strong id="ALM-12052__b6549450511048"> cat /proc/sys/net/ipv4/ip_local_port_range |cut -f 1 </strong>command to obtain the value of the start port and run the <strong id="ALM-12052__b1596735623311"><strong id="ALM-12052__b179672056113317">cat /proc/sys/net/ipv4/ip_local_port_range</strong> |cut -f 2 </strong>command to obtain the value of the end port. The total number of temporary ports is the value of the end port minus the value of the start port. If the total number of temporary ports is smaller than 28,232, the random port range of the OS is narrow. Contact the system administrator to increase the port range.</span></li><li id="ALM-12052__li235192813711"><span>Run the <strong id="ALM-12052__b1571566811118">ss -ant 2&gt;/dev/null | grep -v LISTEN | awk 'NR &gt; 2 {print $4}'|cut -d ':' -f 2 | awk '$1 &gt;"</strong><i><span class="varname" id="ALM-12052__varname1665926611118">Value of the start port</span></i><strong id="ALM-12052__b722328511118">" {print $1}' | sort -u | wc -l</strong> command to calculate the number of used temporary ports.</span></li><li id="ALM-12052__li47630726151427"><span>The formula for calculating the usage of the temporary ports is: Usage of the temporary ports = (Number of used temporary ports/Total number of temporary ports) x 100%. Check whether the temporary port usage exceeds the threshold.</span><p><ul id="ALM-12052__ul22547539165328"><li id="ALM-12052__li56893717165328">If yes, go to <a href="#ALM-12052__li39311997145458">7</a>.</li><li id="ALM-12052__li20178347165328">If no, go to <a href="#ALM-12052__li61526456151427">6</a>.</li></ul> <ol id="ALM-12052__ol4904735151436"><li id="ALM-12052__li53454689151427"><span>On <span id="ALM-12052__text34789336432">MRS</span> Manager, click <span><img id="ALM-12052__image168221113135319" src="en-us_image_0000001532607970.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12052__li34862525151427"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12052__b6057410214588">omm</strong>.</span></li><li id="ALM-12052__li5292302151427"><span>Run<strong id="ALM-12052__b4455986911048"> </strong>the<strong id="ALM-12052__b6549450511048"> cat /proc/sys/net/ipv4/ip_local_port_range |cut -f 1 </strong>command to obtain the value of the start port and run the <strong id="ALM-12052__b1596735623311"><strong id="ALM-12052__b179672056113317">cat /proc/sys/net/ipv4/ip_local_port_range</strong> |cut -f 2 </strong>command to obtain the value of the end port. The total number of temporary ports is the value of the end port minus the value of the start port. If the total number of temporary ports is smaller than 28,232, the random port range of the OS is narrow. Contact the system administrator to increase the port range.</span></li><li id="ALM-12052__li235192813711"><span>Run the <strong id="ALM-12052__b1571566811118">ss -ant 2&gt;/dev/null | grep -v LISTEN | awk 'NR &gt; 2 {print $4}'|cut -d ':' -f 2 | awk '$1 &gt;"</strong><i><span class="varname" id="ALM-12052__varname1665926611118">Value of the start port</span></i><strong id="ALM-12052__b722328511118">" {print $1}' | sort -u | wc -l</strong> command to calculate the number of used temporary ports.</span></li><li id="ALM-12052__li47630726151427"><span>The formula for calculating the usage of the temporary ports is: Usage of the temporary ports = (Number of used temporary ports/Total number of temporary ports) x 100%. Check whether the temporary port usage exceeds the threshold.</span><p><ul id="ALM-12052__ul22547539165328"><li id="ALM-12052__li56893717165328">If yes, go to <a href="#ALM-12052__li39311997145458">7</a>.</li><li id="ALM-12052__li20178347165328">If no, go to <a href="#ALM-12052__li61526456151427">6</a>.</li></ul>
</p></li><li id="ALM-12052__li61526456151427"><a name="ALM-12052__li61526456151427"></a><a name="li61526456151427"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12052__ul46327333151427"><li id="ALM-12052__li26023356151427">If yes, no further action is required.</li><li id="ALM-12052__li27517102151427">If no, go to <a href="#ALM-12052__li39311997145458">7</a>.</li></ul> </p></li><li id="ALM-12052__li61526456151427"><a name="ALM-12052__li61526456151427"></a><a name="li61526456151427"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12052__ul46327333151427"><li id="ALM-12052__li26023356151427">If yes, no further action is required.</li><li id="ALM-12052__li27517102151427">If no, go to <a href="#ALM-12052__li39311997145458">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12052__p14292813151427"><strong id="ALM-12052__b49945765151444">Check whether the system environment is abnormal.</strong></p> <p class="tableheading" id="ALM-12052__p14292813151427"><strong id="ALM-12052__b49945765151444">Check whether the system environment is abnormal.</strong></p>
@ -86,7 +86,7 @@ tcp 0 0 10-120-85-154:45435 10-120-85-154:9866 CLOSE_WAIT 94237/java
</p></li><li id="ALM-12052__li785710172156"><span>After obtaining the administrator's approval, clear the processes that occupy a large number of ports. Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12052__ul45958539151427"><li id="ALM-12052__li56769573151427">If yes, no further action is required.</li><li id="ALM-12052__li34932666151427">If no, go to <a href="#ALM-12052__li57585220151427">10</a>.</li></ul> </p></li><li id="ALM-12052__li785710172156"><span>After obtaining the administrator's approval, clear the processes that occupy a large number of ports. Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12052__ul45958539151427"><li id="ALM-12052__li56769573151427">If yes, no further action is required.</li><li id="ALM-12052__li34932666151427">If no, go to <a href="#ALM-12052__li57585220151427">10</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12052__p10973675151427"><strong id="ALM-12052__b3641674915155">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12052__p10973675151427"><strong id="ALM-12052__b3641674915155">Collect fault information.</strong></p>
<ol start="10" id="ALM-12052__ol5485290715150"><li id="ALM-12052__li57585220151427"><a name="ALM-12052__li57585220151427"></a><a name="li57585220151427"></a><span>On the FusionInsight Manager home page of the active cluster, choose <strong id="ALM-12052__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12052__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12052__li60837487151427"><span>Select <strong id="ALM-12052__b1352831932712">OMS</strong> from the <strong id="ALM-12052__b33891259151427">Service</strong> and click <strong id="ALM-12052__b3991118545">OK</strong>.</span></li><li id="ALM-12052__li28889415151427"><span>Set <strong id="ALM-12052__b10666475151427">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12052__li1145664103113"><span>Click <span><img id="ALM-12052__image1945644173117" src="en-us_image_0000001582807913.png"></span> in the upper right corner, and set <strong id="ALM-12052__b6456941173117">Start Date</strong> and <strong id="ALM-12052__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12052__b13456164113319">Download</strong>.</span></li><li id="ALM-12052__li495644512588"><span>Contact the <span id="ALM-12052__text4614151421417">O&amp;M personnel</span> and send the collected log information and files <strong id="ALM-12052__b201061554424">port_result.txt</strong> and <strong id="ALM-12052__b1210685412211">ps_result.txt</strong>. Then, delete the two residual temporary files from the environment.</span></li></ol> <ol start="10" id="ALM-12052__ol5485290715150"><li id="ALM-12052__li57585220151427"><a name="ALM-12052__li57585220151427"></a><a name="li57585220151427"></a><span>On the <span id="ALM-12052__text1975965314518">MRS</span> Manager home page of the active cluster, choose <strong id="ALM-12052__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12052__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12052__li60837487151427"><span>Select <strong id="ALM-12052__b1352831932712">OMS</strong> from the <strong id="ALM-12052__b33891259151427">Service</strong> and click <strong id="ALM-12052__b3991118545">OK</strong>.</span></li><li id="ALM-12052__li28889415151427"><span>Set <strong id="ALM-12052__b10666475151427">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12052__li1145664103113"><span>Click <span><img id="ALM-12052__image1945644173117" src="en-us_image_0000001582807913.png"></span> in the upper right corner, and set <strong id="ALM-12052__b6456941173117">Start Date</strong> and <strong id="ALM-12052__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12052__b13456164113319">Download</strong>.</span></li><li id="ALM-12052__li495644512588"><span>Contact the <span id="ALM-12052__text4614151421417">O&amp;M personnel</span> and send the collected log information and files <strong id="ALM-12052__b201061554424">port_result.txt</strong> and <strong id="ALM-12052__b1210685412211">ps_result.txt</strong>. Then, delete the two residual temporary files from the environment.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12052__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12052__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12052__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12052__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -66,11 +66,11 @@
<div class="section" id="ALM-12053__en-us_topic_0070543628_section373139"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12053__en-us_topic_0070543628_ul26937201"><li id="ALM-12053__li184022012102816">The application process is abnormal. For example, the opened file or socket is not closed.</li><li id="ALM-12053__en-us_topic_0070543628_li41108220">The number of file handles cannot meet the current service requirements.</li><li id="ALM-12053__en-us_topic_0070543628_li34429662">The system is abnormal.</li></ul> <div class="section" id="ALM-12053__en-us_topic_0070543628_section373139"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12053__en-us_topic_0070543628_ul26937201"><li id="ALM-12053__li184022012102816">The application process is abnormal. For example, the opened file or socket is not closed.</li><li id="ALM-12053__en-us_topic_0070543628_li41108220">The number of file handles cannot meet the current service requirements.</li><li id="ALM-12053__en-us_topic_0070543628_li34429662">The system is abnormal.</li></ul>
</div> </div>
<div class="section" id="ALM-12053__se041063f671f4371a7e0bb7c4da04f29"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12053__p9858548184015"><strong id="ALM-12053__b11685182963818">Check information about files opened in processes.</strong></p> <div class="section" id="ALM-12053__se041063f671f4371a7e0bb7c4da04f29"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12053__p9858548184015"><strong id="ALM-12053__b11685182963818">Check information about files opened in processes.</strong></p>
<ol id="ALM-12053__ol2107954134014"><li id="ALM-12053__li142191911124120"><span>On FusionInsight Manager, click <span><img id="ALM-12053__image1219131174117" src="en-us_image_0000001532927418.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12053__li184472141416"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12053__b294641818419">root</strong>. <span id="ALM-12053__text18701027134116"></span></span></li><li id="ALM-12053__li1762124184114"><span>Run the <strong id="ALM-12053__b06214124117">lsof -n|awk '{print $2}'|sort|uniq -c|sort -nr|more</strong> command to check the process that occupies excessive file handles.</span></li><li id="ALM-12053__li264144244316"><span>Check whether the processes in which a large number of files are opened are normal. For example, check whether there are files or sockets not closed.</span><p><ul id="ALM-12053__ul192411041445"><li id="ALM-12053__li10241144134412">If yes, go to <a href="#ALM-12053__li698311306446">5</a>.</li><li id="ALM-12053__li125435134444">If no, go to <a href="#ALM-12053__li50842733151924">7</a>.</li></ul> <ol id="ALM-12053__ol2107954134014"><li id="ALM-12053__li142191911124120"><span>On <span id="ALM-12053__text34789336432">MRS</span> Manager, click <span><img id="ALM-12053__image1219131174117" src="en-us_image_0000001532927418.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12053__li184472141416"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12053__b294641818419">root</strong>. <span id="ALM-12053__text18701027134116"></span></span></li><li id="ALM-12053__li1762124184114"><span>Run the <strong id="ALM-12053__b06214124117">lsof -n|awk '{print $2}'|sort|uniq -c|sort -nr|more</strong> command to check the process that occupies excessive file handles.</span></li><li id="ALM-12053__li264144244316"><span>Check whether the processes in which a large number of files are opened are normal. For example, check whether there are files or sockets not closed.</span><p><ul id="ALM-12053__ul192411041445"><li id="ALM-12053__li10241144134412">If yes, go to <a href="#ALM-12053__li698311306446">5</a>.</li><li id="ALM-12053__li125435134444">If no, go to <a href="#ALM-12053__li50842733151924">7</a>.</li></ul>
</p></li><li id="ALM-12053__li698311306446"><a name="ALM-12053__li698311306446"></a><a name="li698311306446"></a><span>Release the abnormal processes that occupy too many file handles.</span></li><li id="ALM-12053__li137485054416"><span>Five minutes later, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12053__ul19374750194414"><li id="ALM-12053__li33741750154420">If yes, no further action is required.</li><li id="ALM-12053__li537418505442">If no, go to <a href="#ALM-12053__li50842733151924">7</a>.</li></ul> </p></li><li id="ALM-12053__li698311306446"><a name="ALM-12053__li698311306446"></a><a name="li698311306446"></a><span>Release the abnormal processes that occupy too many file handles.</span></li><li id="ALM-12053__li137485054416"><span>Five minutes later, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12053__ul19374750194414"><li id="ALM-12053__li33741750154420">If yes, no further action is required.</li><li id="ALM-12053__li537418505442">If no, go to <a href="#ALM-12053__li50842733151924">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12053__en-us_topic_0070543628_p37339219"><strong id="ALM-12053__b50291933151922">Increase the number of file handles.</strong></p> <p class="tableheading" id="ALM-12053__en-us_topic_0070543628_p37339219"><strong id="ALM-12053__b50291933151922">Increase the number of file handles.</strong></p>
<ol start="7" id="ALM-12053__ol66890550151936"><li id="ALM-12053__li50842733151924"><a name="ALM-12053__li50842733151924"></a><a name="li50842733151924"></a><span>On FusionInsight Manager, click <span><img id="ALM-12053__image168221113135319" src="en-us_image_0000001532607746.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12053__li24620726151924"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12053__b54931419151924">root</strong>.</span></li><li id="ALM-12053__li103121715194518"><a name="ALM-12053__li103121715194518"></a><a name="li103121715194518"></a><span>Contact the system administrator to increase the number of system file handles.</span></li><li id="ALM-12053__li37165512528"><span>Run the <strong id="ALM-12053__b1690117451482">cat /proc/sys/fs/file-nr</strong> command to view the used handles and the maximum number of file handles. The first value is the number of used handles, the third value is the maximum number. Please check whether the usage exceeds the threshold.</span><p><ul class="subitemlist" id="ALM-12053__ul198522013534"><li class="subitemlist" id="ALM-12053__li816519713539">If yes, go to <a href="#ALM-12053__li103121715194518">9</a>.</li><li id="ALM-12053__li885215017534">If no, go to <a href="#ALM-12053__li133010151924">11</a>.<pre class="screen" id="ALM-12053__screen3672717115216"># cat /proc/sys/fs/file-nr <ol start="7" id="ALM-12053__ol66890550151936"><li id="ALM-12053__li50842733151924"><a name="ALM-12053__li50842733151924"></a><a name="li50842733151924"></a><span>On <span id="ALM-12053__text729965674513">MRS</span> Manager, click <span><img id="ALM-12053__image168221113135319" src="en-us_image_0000001532607746.png"></span> in the row where the alarm is located in the real-time alarm list and obtain the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12053__li24620726151924"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12053__b54931419151924">root</strong>.</span></li><li id="ALM-12053__li103121715194518"><a name="ALM-12053__li103121715194518"></a><a name="li103121715194518"></a><span>Contact the system administrator to increase the number of system file handles.</span></li><li id="ALM-12053__li37165512528"><span>Run the <strong id="ALM-12053__b1690117451482">cat /proc/sys/fs/file-nr</strong> command to view the used handles and the maximum number of file handles. The first value is the number of used handles, the third value is the maximum number. Please check whether the usage exceeds the threshold.</span><p><ul class="subitemlist" id="ALM-12053__ul198522013534"><li class="subitemlist" id="ALM-12053__li816519713539">If yes, go to <a href="#ALM-12053__li103121715194518">9</a>.</li><li id="ALM-12053__li885215017534">If no, go to <a href="#ALM-12053__li133010151924">11</a>.<pre class="screen" id="ALM-12053__screen3672717115216"># cat /proc/sys/fs/file-nr
12704 0 640000</pre> 12704 0 640000</pre>
</li></ul> </li></ul>
</p></li><li id="ALM-12053__li133010151924"><a name="ALM-12053__li133010151924"></a><a name="li133010151924"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12053__ul18228740151924"><li id="ALM-12053__li5548368151924">If yes, no further action is required.</li><li id="ALM-12053__li46764658151924">If no, go to <a href="#ALM-12053__li21666806151924">12</a>.</li></ul> </p></li><li id="ALM-12053__li133010151924"><a name="ALM-12053__li133010151924"></a><a name="li133010151924"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12053__ul18228740151924"><li id="ALM-12053__li5548368151924">If yes, no further action is required.</li><li id="ALM-12053__li46764658151924">If no, go to <a href="#ALM-12053__li21666806151924">12</a>.</li></ul>
@ -80,7 +80,7 @@
</p></li><li id="ALM-12053__li23370043151924"><a name="ALM-12053__li23370043151924"></a><a name="li23370043151924"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12053__ul19344122151924"><li id="ALM-12053__li60783531151924">If yes, no further action is required.</li><li id="ALM-12053__li24518968151924">If no, go to <a href="#ALM-12053__li58218801151924">14</a>.</li></ul> </p></li><li id="ALM-12053__li23370043151924"><a name="ALM-12053__li23370043151924"></a><a name="li23370043151924"></a><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12053__ul19344122151924"><li id="ALM-12053__li60783531151924">If yes, no further action is required.</li><li id="ALM-12053__li24518968151924">If no, go to <a href="#ALM-12053__li58218801151924">14</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12053__p39879373151924"><strong id="ALM-12053__b60486860151959">Collect fault information.</strong></p> <p class="tableheading" id="ALM-12053__p39879373151924"><strong id="ALM-12053__b60486860151959">Collect fault information.</strong></p>
<ol start="14" id="ALM-12053__ol4489551315202"><li id="ALM-12053__li58218801151924"><a name="ALM-12053__li58218801151924"></a><a name="li58218801151924"></a><span>On the FusionInsight Manager home page of the active cluster, choose <strong id="ALM-12053__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12053__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12053__li57014808151924"><span>Select <strong id="ALM-12053__b1352831932712">OMS</strong> from the <strong id="ALM-12053__b18102480151924">Service</strong> and click <strong id="ALM-12053__b3991118545">OK</strong>.</span></li><li id="ALM-12053__li54796720151924"><span>Set <strong id="ALM-12053__b43371226151924">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12053__li1145664103113"><span>Click <span><img id="ALM-12053__image1945644173117" src="en-us_image_0000001582927641.png"></span> in the upper right corner, and set <strong id="ALM-12053__b6456941173117">Start Date</strong> and <strong id="ALM-12053__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12053__b13456164113319">Download</strong>.</span></li><li id="ALM-12053__li495644512588"><span>Contact the <span id="ALM-12053__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="14" id="ALM-12053__ol4489551315202"><li id="ALM-12053__li58218801151924"><a name="ALM-12053__li58218801151924"></a><a name="li58218801151924"></a><span>On the <span id="ALM-12053__text18637125712451">MRS</span> Manager home page of the active cluster, choose <strong id="ALM-12053__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-12053__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-12053__li57014808151924"><span>Select <strong id="ALM-12053__b1352831932712">OMS</strong> from the <strong id="ALM-12053__b18102480151924">Service</strong> and click <strong id="ALM-12053__b3991118545">OK</strong>.</span></li><li id="ALM-12053__li54796720151924"><span>Set <strong id="ALM-12053__b43371226151924">Host</strong> to the node for which the alarm is generated and the active OMS node.</span></li><li id="ALM-12053__li1145664103113"><span>Click <span><img id="ALM-12053__image1945644173117" src="en-us_image_0000001582927641.png"></span> in the upper right corner, and set <strong id="ALM-12053__b6456941173117">Start Date</strong> and <strong id="ALM-12053__b11456154113318">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12053__b13456164113319">Download</strong>.</span></li><li id="ALM-12053__li495644512588"><span>Contact the <span id="ALM-12053__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12053__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12053__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12053__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12053__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -68,7 +68,7 @@
<div class="section" id="ALM-12054__section39072761"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12054__p51542282">No certificate (CA certificate, HA root certificate, HA user certificate, Gaussdb root certificate, or Gaussdb user certificate) is imported to the system, the certificate fails to be imported, or the certificate file is invalid.</p> <div class="section" id="ALM-12054__section39072761"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12054__p51542282">No certificate (CA certificate, HA root certificate, HA user certificate, Gaussdb root certificate, or Gaussdb user certificate) is imported to the system, the certificate fails to be imported, or the certificate file is invalid.</p>
</div> </div>
<div class="section" id="ALM-12054__section16110535"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12054__p14175317"><strong id="ALM-12054__b17561093983012">Check the alarm cause.</strong></p> <div class="section" id="ALM-12054__section16110535"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12054__p14175317"><strong id="ALM-12054__b17561093983012">Check the alarm cause.</strong></p>
<ol id="ALM-12054__ol15202643152315"><li id="ALM-12054__li3518787615237"><span>On FusionInsight Manager, locate the target alarm in the real-time alarm list and click <span><img id="ALM-12054__image168221113135319" src="en-us_image_0000001532448262.png"></span>.</span><p><p class="litext" id="ALM-12054__p4827967915237">View <strong id="ALM-12054__b1735515248158">Additional Information</strong> to obtain the additional information about the alarm.</p> <ol id="ALM-12054__ol15202643152315"><li id="ALM-12054__li3518787615237"><span>On <span id="ALM-12054__text34789336432">MRS</span> Manager, locate the target alarm in the real-time alarm list and click <span><img id="ALM-12054__image168221113135319" src="en-us_image_0000001532448262.png"></span>.</span><p><p class="litext" id="ALM-12054__p4827967915237">View <strong id="ALM-12054__b1735515248158">Additional Information</strong> to obtain the additional information about the alarm.</p>
<ul class="subitemlist" id="ALM-12054__ul6505776815237"><li id="ALM-12054__li3084159815237">If <strong id="ALM-12054__b10712888143012">CA Certificate</strong> is displayed in the additional alarm information, log in to the active OMS management node as user <strong id="ALM-12054__b2115831533012">omm</strong> and go to <a href="#ALM-12054__li2768003415237">2</a>.</li><li id="ALM-12054__li205560515237">If <strong id="ALM-12054__b18381976313012">HA root Certificate</strong> is displayed in the additional information, view <strong id="ALM-12054__b14553259973012">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12054__b10513863303012">omm</strong> and go to <a href="#ALM-12054__li6628516015237">3</a>.</li><li id="ALM-12054__li2214172115237">If <strong id="ALM-12054__b18169878033012">HA server Certificate</strong> is displayed in the additional information, view <strong id="ALM-12054__b6062742563012">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12054__b10652669033012">omm</strong> and go to <a href="#ALM-12054__li64457371511">4</a>.</li><li id="ALM-12054__li5926131164116">If <strong id="ALM-12054__b2672141219591">Certificate has expired</strong> is displayed in the additional information, view <strong id="ALM-12054__b105451432204">Location</strong> to obtain the name of the host for which the alarm is generated. Then, log in to the host as user <strong id="ALM-12054__b1133110114812">omm</strong> and perform <a href="#ALM-12054__li2768003415237">2</a> to <a href="#ALM-12054__li64457371511">4</a> in sequence to check whether the certificates have expired. If these certificates have not expired, check whether other certificates have been imported. If yes, import the certificate files again.</li></ul> <ul class="subitemlist" id="ALM-12054__ul6505776815237"><li id="ALM-12054__li3084159815237">If <strong id="ALM-12054__b10712888143012">CA Certificate</strong> is displayed in the additional alarm information, log in to the active OMS management node as user <strong id="ALM-12054__b2115831533012">omm</strong> and go to <a href="#ALM-12054__li2768003415237">2</a>.</li><li id="ALM-12054__li205560515237">If <strong id="ALM-12054__b18381976313012">HA root Certificate</strong> is displayed in the additional information, view <strong id="ALM-12054__b14553259973012">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12054__b10513863303012">omm</strong> and go to <a href="#ALM-12054__li6628516015237">3</a>.</li><li id="ALM-12054__li2214172115237">If <strong id="ALM-12054__b18169878033012">HA server Certificate</strong> is displayed in the additional information, view <strong id="ALM-12054__b6062742563012">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12054__b10652669033012">omm</strong> and go to <a href="#ALM-12054__li64457371511">4</a>.</li><li id="ALM-12054__li5926131164116">If <strong id="ALM-12054__b2672141219591">Certificate has expired</strong> is displayed in the additional information, view <strong id="ALM-12054__b105451432204">Location</strong> to obtain the name of the host for which the alarm is generated. Then, log in to the host as user <strong id="ALM-12054__b1133110114812">omm</strong> and perform <a href="#ALM-12054__li2768003415237">2</a> to <a href="#ALM-12054__li64457371511">4</a> in sequence to check whether the certificates have expired. If these certificates have not expired, check whether other certificates have been imported. If yes, import the certificate files again.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12054__p4864900615237"><strong id="ALM-12054__b58840153152325">Check the validity period of the certificate files in the system.</strong></p> <p class="tableheading" id="ALM-12054__p4864900615237"><strong id="ALM-12054__b58840153152325">Check the validity period of the certificate files in the system.</strong></p>
@ -97,7 +97,7 @@
<ul class="subitemlist" id="ALM-12054__ul6583370515237"><li id="ALM-12054__li5632256215237">If yes, go to <a href="#ALM-12054__li993320915237">7</a>.</li><li id="ALM-12054__li3714101715237">If no, no further action is required.</li></ul> <ul class="subitemlist" id="ALM-12054__ul6583370515237"><li id="ALM-12054__li5632256215237">If yes, go to <a href="#ALM-12054__li993320915237">7</a>.</li><li id="ALM-12054__li3714101715237">If no, no further action is required.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12054__p5563243315237"><strong id="ALM-12054__b30164211152420">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12054__p5563243315237"><strong id="ALM-12054__b30164211152420">Collect the fault information.</strong></p>
<ol start="7" id="ALM-12054__ol55826366152424"><li id="ALM-12054__li993320915237"><a name="ALM-12054__li993320915237"></a><a name="li993320915237"></a><span>On FusionInsight Manager, choose <strong id="ALM-12054__b4151203863012">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12054__b12295014603012">Log</strong> &gt; <strong id="ALM-12054__b7905950833012">Download</strong>.</span></li><li id="ALM-12054__li2229001815237"><span>In the <strong id="ALM-12054__b4980650983012">Services</strong> area, select <strong id="ALM-12054__b1450275463012">Controller</strong>, <strong id="ALM-12054__b5024522853012">OmmServer</strong>, <strong id="ALM-12054__b545910514410">OmmCore</strong>, and <strong id="ALM-12054__b16912131010417">Tomcat</strong>, and click <strong id="ALM-12054__b18481545533012">OK</strong>.</span></li><li id="ALM-12054__li6639244115237"><span>Click <span><img id="ALM-12054__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12054__b9743421579">Start Date</strong> and <strong id="ALM-12054__b157431721876">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12054__b57438212715">Download</strong>.</span></li><li id="ALM-12054__li907865015237"><span>Contact <span id="ALM-12054__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="7" id="ALM-12054__ol55826366152424"><li id="ALM-12054__li993320915237"><a name="ALM-12054__li993320915237"></a><a name="li993320915237"></a><span>On <span id="ALM-12054__text87527034610">MRS</span> Manager, choose <strong id="ALM-12054__b4151203863012">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12054__b12295014603012">Log</strong> &gt; <strong id="ALM-12054__b7905950833012">Download</strong>.</span></li><li id="ALM-12054__li2229001815237"><span>In the <strong id="ALM-12054__b4980650983012">Services</strong> area, select <strong id="ALM-12054__b1450275463012">Controller</strong>, <strong id="ALM-12054__b5024522853012">OmmServer</strong>, <strong id="ALM-12054__b545910514410">OmmCore</strong>, and <strong id="ALM-12054__b16912131010417">Tomcat</strong>, and click <strong id="ALM-12054__b18481545533012">OK</strong>.</span></li><li id="ALM-12054__li6639244115237"><span>Click <span><img id="ALM-12054__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12054__b9743421579">Start Date</strong> and <strong id="ALM-12054__b157431721876">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12054__b57438212715">Download</strong>.</span></li><li id="ALM-12054__li907865015237"><span>Contact <span id="ALM-12054__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12054__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12054__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12054__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12054__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -68,7 +68,7 @@
<div class="section" id="ALM-12055__section10108989"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12055__p63941749">The remaining validity period of a system certificate (CA certificate, HA root certificate, HA user certificate, Gaussdb root certificate, or Gaussdb user certificate) is less than 30 days.</p> <div class="section" id="ALM-12055__section10108989"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12055__p63941749">The remaining validity period of a system certificate (CA certificate, HA root certificate, HA user certificate, Gaussdb root certificate, or Gaussdb user certificate) is less than 30 days.</p>
</div> </div>
<div class="section" id="ALM-12055__section23872039"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12055__p11899166"><strong id="ALM-12055__b1887045508312">Check the alarm cause.</strong></p> <div class="section" id="ALM-12055__section23872039"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12055__p11899166"><strong id="ALM-12055__b1887045508312">Check the alarm cause.</strong></p>
<ol id="ALM-12055__ol43542277152959"><li id="ALM-12055__li17570723152950"><span>On FusionInsight Manager, locate the target alarm in the real-time alarm list and click <span><img id="ALM-12055__image168221113135319" src="en-us_image_0000001532448262.png"></span>.</span><p><p class="litext" id="ALM-12055__p14741576152950">View <strong id="ALM-12055__b2570155932812">Additional Information</strong> to obtain the additional information about the alarm.</p> <ol id="ALM-12055__ol43542277152959"><li id="ALM-12055__li17570723152950"><span>On <span id="ALM-12055__text34789336432">MRS</span> Manager, locate the target alarm in the real-time alarm list and click <span><img id="ALM-12055__image168221113135319" src="en-us_image_0000001532448262.png"></span>.</span><p><p class="litext" id="ALM-12055__p14741576152950">View <strong id="ALM-12055__b2570155932812">Additional Information</strong> to obtain the additional information about the alarm.</p>
<ul class="subitemlist" id="ALM-12055__ul7673462152950"><li id="ALM-12055__li9190972152950">If <strong id="ALM-12055__b5620522910">CA Certificate</strong> is displayed in the additional alarm information, log in to the active OMS management node as user <strong id="ALM-12055__b3677516299">omm</strong> and go to <a href="#ALM-12055__li31866665152950">2</a>.</li><li id="ALM-12055__li56441759152950">If <strong id="ALM-12055__b0156041182917">HA root Certificate</strong> is displayed in the additional information, view <strong id="ALM-12055__b121574411299">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12055__b1215714419290">omm</strong> and go to <a href="#ALM-12055__li35214520152950">3</a>.</li><li id="ALM-12055__li8309147152950">If <strong id="ALM-12055__b17901512193020">HA server Certificate</strong> is displayed in the additional information, view <strong id="ALM-12055__b1879191210300">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12055__b879181223010">omm</strong> and go to <a href="#ALM-12055__li089064874420">4</a>.</li></ul> <ul class="subitemlist" id="ALM-12055__ul7673462152950"><li id="ALM-12055__li9190972152950">If <strong id="ALM-12055__b5620522910">CA Certificate</strong> is displayed in the additional alarm information, log in to the active OMS management node as user <strong id="ALM-12055__b3677516299">omm</strong> and go to <a href="#ALM-12055__li31866665152950">2</a>.</li><li id="ALM-12055__li56441759152950">If <strong id="ALM-12055__b0156041182917">HA root Certificate</strong> is displayed in the additional information, view <strong id="ALM-12055__b121574411299">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12055__b1215714419290">omm</strong> and go to <a href="#ALM-12055__li35214520152950">3</a>.</li><li id="ALM-12055__li8309147152950">If <strong id="ALM-12055__b17901512193020">HA server Certificate</strong> is displayed in the additional information, view <strong id="ALM-12055__b1879191210300">Location</strong> to obtain the name of the host involved in this alarm. Then, log in to the host as user <strong id="ALM-12055__b879181223010">omm</strong> and go to <a href="#ALM-12055__li089064874420">4</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12055__p1952302152950"><strong id="ALM-12055__b53159753412">Check the validity period of the certificate files in the system.</strong></p> <p class="tableheading" id="ALM-12055__p1952302152950"><strong id="ALM-12055__b53159753412">Check the validity period of the certificate files in the system.</strong></p>
@ -97,7 +97,7 @@
<ul class="subitemlist" id="ALM-12055__ul176911540125116"><li id="ALM-12055__li669119403510">If yes, go to <a href="#ALM-12055__li48423894152950">7</a>.</li><li id="ALM-12055__li1869114010511">If no, no further action is required.</li></ul> <ul class="subitemlist" id="ALM-12055__ul176911540125116"><li id="ALM-12055__li669119403510">If yes, go to <a href="#ALM-12055__li48423894152950">7</a>.</li><li id="ALM-12055__li1869114010511">If no, no further action is required.</li></ul>
</p></li></ol> </p></li></ol>
<p class="tableheading" id="ALM-12055__p65221176152950"><strong id="ALM-12055__b29152840153038">Collect the fault information.</strong></p> <p class="tableheading" id="ALM-12055__p65221176152950"><strong id="ALM-12055__b29152840153038">Collect the fault information.</strong></p>
<ol start="7" id="ALM-12055__ol35401324153041"><li id="ALM-12055__li48423894152950"><a name="ALM-12055__li48423894152950"></a><a name="li48423894152950"></a><span>On FusionInsight Manager, choose <strong id="ALM-12055__b151432717503">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12055__b11520152710504">Log</strong> &gt; <strong id="ALM-12055__b10521172713504">Download</strong>.</span></li><li id="ALM-12055__li33161866152950"><span>In the <strong id="ALM-12055__b19231630205011">Services</strong> area, select <strong id="ALM-12055__b2923183065012">Controller</strong>, <strong id="ALM-12055__b169231304504">OmmServer</strong>, <strong id="ALM-12055__b14923430145017">OmmCore</strong>, and <strong id="ALM-12055__b179231830165014">Tomcat</strong>, and click <strong id="ALM-12055__b1923143012505">OK</strong>.</span></li><li id="ALM-12055__li30021345152950"><span>Click <span><img id="ALM-12055__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12055__b16249143625013">Start Date</strong> and <strong id="ALM-12055__b824993611507">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12055__b13249123612501">Download</strong>.</span></li><li id="ALM-12055__li15809856152950"><span>Contact <span id="ALM-12055__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="7" id="ALM-12055__ol35401324153041"><li id="ALM-12055__li48423894152950"><a name="ALM-12055__li48423894152950"></a><a name="li48423894152950"></a><span>On <span id="ALM-12055__text107823364617">MRS</span> Manager, choose <strong id="ALM-12055__b151432717503">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12055__b11520152710504">Log</strong> &gt; <strong id="ALM-12055__b10521172713504">Download</strong>.</span></li><li id="ALM-12055__li33161866152950"><span>In the <strong id="ALM-12055__b19231630205011">Services</strong> area, select <strong id="ALM-12055__b2923183065012">Controller</strong>, <strong id="ALM-12055__b169231304504">OmmServer</strong>, <strong id="ALM-12055__b14923430145017">OmmCore</strong>, and <strong id="ALM-12055__b179231830165014">Tomcat</strong>, and click <strong id="ALM-12055__b1923143012505">OK</strong>.</span></li><li id="ALM-12055__li30021345152950"><span>Click <span><img id="ALM-12055__image104601319175315" src="en-us_image_0000001532927350.png"></span> in the upper right corner, and set <strong id="ALM-12055__b16249143625013">Start Date</strong> and <strong id="ALM-12055__b824993611507">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12055__b13249123612501">Download</strong>.</span></li><li id="ALM-12055__li15809856152950"><span>Contact <span id="ALM-12055__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12055__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12055__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12055__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12055__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -59,10 +59,10 @@
</div> </div>
<div class="section" id="ALM-12057__section42966593568"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12057__p240915442254">Metadata is not configured with the task to periodically back up data to a third-party server.</p> <div class="section" id="ALM-12057__section42966593568"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12057__p240915442254">Metadata is not configured with the task to periodically back up data to a third-party server.</p>
</div> </div>
<div class="section" id="ALM-12057__section1525571619574"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-12057__ol449617567348"><li id="ALM-12057__li1611744911013"><span>On the FusionInsight Manager portal choose <strong id="ALM-12057__b188358153113">O&amp;M &gt; Alarm &gt; Alarms</strong>.</span></li><li id="ALM-12057__li169585911117"><span>In the alarm list, click <span><img id="ALM-12057__image168221113135319" src="en-us_image_0000001532927570.png"></span> in the row where the alarm is located and identify the data module from which the alarm is generated based on <strong id="ALM-12057__b4668102723111">Additional Information</strong>.</span></li><li id="ALM-12057__li11496856143419"><span>Choose <strong id="ALM-12057__b721210326">O&amp;M</strong> &gt; <strong id="ALM-12057__b1488442514323">Backup and Restoration &gt; Backup Management</strong> &gt; <strong id="ALM-12057__b55459305323">Create</strong>.</span></li><li id="ALM-12057__li144225714510"><span>Configure a backup task. The backup data to be configured is consistent with the data in Additional Information of the alarm.</span></li><li id="ALM-12057__li1133644161218"><span>After the backup task is created successfully, wait for two minutes and check whether the alarm is cleared.</span><p><ul id="ALM-12057__ul643195154411"><li id="ALM-12057__li5431451134410">If yes, no further action is required.</li><li id="ALM-12057__li1843551124416">If no, go to <a href="#ALM-12057__li1185962516113">6</a>.</li></ul> <div class="section" id="ALM-12057__section1525571619574"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-12057__ol449617567348"><li id="ALM-12057__li1611744911013"><span>On the <span id="ALM-12057__text34789336432">MRS</span> Manager portal choose <strong id="ALM-12057__b188358153113">O&amp;M &gt; Alarm &gt; Alarms</strong>.</span></li><li id="ALM-12057__li169585911117"><span>In the alarm list, click <span><img id="ALM-12057__image168221113135319" src="en-us_image_0000001532927570.png"></span> in the row where the alarm is located and identify the data module from which the alarm is generated based on <strong id="ALM-12057__b4668102723111">Additional Information</strong>.</span></li><li id="ALM-12057__li11496856143419"><span>Choose <strong id="ALM-12057__b721210326">O&amp;M</strong> &gt; <strong id="ALM-12057__b1488442514323">Backup and Restoration &gt; Backup Management</strong> &gt; <strong id="ALM-12057__b55459305323">Create</strong>.</span></li><li id="ALM-12057__li144225714510"><span>Configure a backup task. The backup data to be configured is consistent with the data in Additional Information of the alarm.</span></li><li id="ALM-12057__li1133644161218"><span>After the backup task is created successfully, wait for two minutes and check whether the alarm is cleared.</span><p><ul id="ALM-12057__ul643195154411"><li id="ALM-12057__li5431451134410">If yes, no further action is required.</li><li id="ALM-12057__li1843551124416">If no, go to <a href="#ALM-12057__li1185962516113">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12057__p1284212519115"><strong id="ALM-12057__b1432912914719">Collect fault information</strong></p> <p id="ALM-12057__p1284212519115"><strong id="ALM-12057__b1432912914719">Collect fault information</strong></p>
<ol start="6" id="ALM-12057__ol8860142514111"><li id="ALM-12057__li1185962516113"><a name="ALM-12057__li1185962516113"></a><a name="li1185962516113"></a><span>On FusionInsight Manager, choose <strong id="ALM-12057__b2068611561668">O&amp;M</strong> &gt; <strong id="ALM-12057__b19686105610610">Log &gt; Download</strong>.</span></li><li id="ALM-12057__li13859112516110"><span>In the <strong id="ALM-12057__b8859172516114">Service</strong> area, select <strong id="ALM-12057__b285913251016">Controller</strong> and click <strong id="ALM-12057__b3991118545">OK</strong>.</span></li><li id="ALM-12057__li4859182515115"><span>Click <span><img id="ALM-12057__image185919251512" src="en-us_image_0000001583087553.png"></span> in the upper right corner, and set <strong id="ALM-12057__b198594252011">Start Date</strong> and <strong id="ALM-12057__b58593251114">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12057__b11859025919">Download</strong>.</span></li><li id="ALM-12057__li495644512588"><span>Contact the <span id="ALM-12057__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="6" id="ALM-12057__ol8860142514111"><li id="ALM-12057__li1185962516113"><a name="ALM-12057__li1185962516113"></a><a name="li1185962516113"></a><span>On <span id="ALM-12057__text19508366462">MRS</span> Manager, choose <strong id="ALM-12057__b2068611561668">O&amp;M</strong> &gt; <strong id="ALM-12057__b19686105610610">Log &gt; Download</strong>.</span></li><li id="ALM-12057__li13859112516110"><span>In the <strong id="ALM-12057__b8859172516114">Service</strong> area, select <strong id="ALM-12057__b285913251016">Controller</strong> and click <strong id="ALM-12057__b3991118545">OK</strong>.</span></li><li id="ALM-12057__li4859182515115"><span>Click <span><img id="ALM-12057__image185919251512" src="en-us_image_0000001583087553.png"></span> in the upper right corner, and set <strong id="ALM-12057__b198594252011">Start Date</strong> and <strong id="ALM-12057__b58593251114">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12057__b11859025919">Download</strong>.</span></li><li id="ALM-12057__li495644512588"><span>Contact the <span id="ALM-12057__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12057__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12057__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12057__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12057__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -65,7 +65,7 @@
<div class="section" id="ALM-12061__section19542851121912"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12061__ul116931351161910"><li id="ALM-12061__li369312514191">The alarm threshold is improperly configured.</li><li id="ALM-12061__li176935515190">The maximum number of processes (including threads) that can be concurrently opened by user <strong id="ALM-12061__b116937517193">omm</strong> is inappropriate.</li><li id="ALM-12061__li669355121914">An excessive number of threads are opened at the same time.</li></ul> <div class="section" id="ALM-12061__section19542851121912"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12061__ul116931351161910"><li id="ALM-12061__li369312514191">The alarm threshold is improperly configured.</li><li id="ALM-12061__li176935515190">The maximum number of processes (including threads) that can be concurrently opened by user <strong id="ALM-12061__b116937517193">omm</strong> is inappropriate.</li><li id="ALM-12061__li669355121914">An excessive number of threads are opened at the same time.</li></ul>
</div> </div>
<div class="section" id="ALM-12061__section145451851131917"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12061__p12693135116199"><strong id="ALM-12061__b166935517198">Check whether the alarm threshold or alarm hit number is properly configured.</strong></p> <div class="section" id="ALM-12061__section145451851131917"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12061__p12693135116199"><strong id="ALM-12061__b166935517198">Check whether the alarm threshold or alarm hit number is properly configured.</strong></p>
<ol id="ALM-12061__ol1937419236218"><li id="ALM-12061__li63741123102117"><span>On the FusionInsight Manager, change the alarm threshold and <strong id="ALM-12061__b1936942319210">Trigger Count</strong> based on the actual CPU usage.</span><p><p id="ALM-12061__p53741023132117">Specifically, choose <strong id="ALM-12061__b12369223182120">O&amp;M </strong>&gt; <strong id="ALM-12061__b7369182362114">Alarm</strong> &gt; <strong id="ALM-12061__b183696238213">Thresholds</strong> &gt;<em id="ALM-12061__i2811143010409"> Name of the desired cluster</em> &gt; <strong id="ALM-12061__b1736902316215">Host</strong>&gt; <strong id="ALM-12061__b1369122314213">Process</strong> &gt; <strong id="ALM-12061__b1736992318217">omm Process Usage</strong> to change Trigger Count.</p> <ol id="ALM-12061__ol1937419236218"><li id="ALM-12061__li63741123102117"><span>On the <span id="ALM-12061__text34789336432">MRS</span> Manager, change the alarm threshold and <strong id="ALM-12061__b1936942319210">Trigger Count</strong> based on the actual CPU usage.</span><p><p id="ALM-12061__p53741023132117">Specifically, choose <strong id="ALM-12061__b12369223182120">O&amp;M </strong>&gt; <strong id="ALM-12061__b7369182362114">Alarm</strong> &gt; <strong id="ALM-12061__b183696238213">Thresholds</strong> &gt;<em id="ALM-12061__i2811143010409"> Name of the desired cluster</em> &gt; <strong id="ALM-12061__b1736902316215">Host</strong>&gt; <strong id="ALM-12061__b1369122314213">Process</strong> &gt; <strong id="ALM-12061__b1736992318217">omm Process Usage</strong> to change Trigger Count.</p>
<div class="note" id="ALM-12061__note1837419235216"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12061__p6374102312216">The alarm is generated when the process usage exceeds the threshold for the times specified by <strong id="ALM-12061__b1237411237214">Trigger Count</strong>.</p> <div class="note" id="ALM-12061__note1837419235216"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12061__p6374102312216">The alarm is generated when the process usage exceeds the threshold for the times specified by <strong id="ALM-12061__b1237411237214">Trigger Count</strong>.</p>
</div></div> </div></div>
<p id="ALM-12061__p1737417236213">Set the alarm threshold based on the actual process usage. To check the process usage, choose <strong id="ALM-12061__b4374172315215">O&amp;M</strong> &gt; <strong id="ALM-12061__b11374192352114">Alarm</strong> &gt; <strong id="ALM-12061__b183741423162110">Thresholds</strong> &gt; <em id="ALM-12061__i18450436164420">Name of the desired cluster</em> &gt; <strong id="ALM-12061__b2374102311219">Host</strong>&gt; <strong id="ALM-12061__b51371152474">Process</strong> &gt; <strong id="ALM-12061__b1693614974714">omm Process Usage</strong>, as shown in <a href="#ALM-12061__fig437414238216">Figure 1</a>.</p> <p id="ALM-12061__p1737417236213">Set the alarm threshold based on the actual process usage. To check the process usage, choose <strong id="ALM-12061__b4374172315215">O&amp;M</strong> &gt; <strong id="ALM-12061__b11374192352114">Alarm</strong> &gt; <strong id="ALM-12061__b183741423162110">Thresholds</strong> &gt; <em id="ALM-12061__i18450436164420">Name of the desired cluster</em> &gt; <strong id="ALM-12061__b2374102311219">Host</strong>&gt; <strong id="ALM-12061__b51371152474">Process</strong> &gt; <strong id="ALM-12061__b1693614974714">omm Process Usage</strong>, as shown in <a href="#ALM-12061__fig437414238216">Figure 1</a>.</p>
@ -73,14 +73,14 @@
</p></li><li id="ALM-12061__li33745237217"><span>2 minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12061__ul1437412317219"><li id="ALM-12061__li2374182312217">If it is, no further action is required.</li><li id="ALM-12061__li2374112315211">If it is not, go to <a href="#ALM-12061__li936717234216">3</a>.</li></ul> </p></li><li id="ALM-12061__li33745237217"><span>2 minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12061__ul1437412317219"><li id="ALM-12061__li2374182312217">If it is, no further action is required.</li><li id="ALM-12061__li2374112315211">If it is not, go to <a href="#ALM-12061__li936717234216">3</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12061__p630219198214"><strong id="ALM-12061__b6695451191916">Check whether the maximum number of processes (including threads) opened by user omm is appropriate.</strong></p> <p id="ALM-12061__p630219198214"><strong id="ALM-12061__b6695451191916">Check whether the maximum number of processes (including threads) opened by user omm is appropriate.</strong></p>
<ol start="3" id="ALM-12061__ol13367112317219"><li id="ALM-12061__li936717234216"><a name="ALM-12061__li936717234216"></a><a name="li936717234216"></a><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12061__li1136752311217"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12061__b1136717231212">root</strong>. <span id="ALM-12061__text985593916354"></span></span></li><li id="ALM-12061__li15367523112112"><span>Run the <strong id="ALM-12061__b5367122302115">su - omm</strong> command to switch to user <strong id="ALM-12061__b193671623132111">omm</strong>.</span></li><li id="ALM-12061__li8367112332111"><span>Run the <strong id="ALM-12061__b14367122392112">ulimit -u</strong> command to obtain the maximum number of threads that can be concurrently opened by user <strong id="ALM-12061__b1236732392116">omm</strong> and check whether the number is greater than or equal to 60000.</span><p><ul id="ALM-12061__ul136710230215"><li id="ALM-12061__li13367423122115">If it is, go to <a href="#ALM-12061__li293443912213">8</a>.</li><li id="ALM-12061__li2367102320214">If it is not, go to <a href="#ALM-12061__li8367152314217">7</a>.</li></ul> <ol start="3" id="ALM-12061__ol13367112317219"><li id="ALM-12061__li936717234216"><a name="ALM-12061__li936717234216"></a><a name="li936717234216"></a><span>In the alarm list on <span id="ALM-12061__text1120482214469">MRS</span> Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12061__li1136752311217"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12061__b1136717231212">root</strong>. <span id="ALM-12061__text985593916354"></span></span></li><li id="ALM-12061__li15367523112112"><span>Run the <strong id="ALM-12061__b5367122302115">su - omm</strong> command to switch to user <strong id="ALM-12061__b193671623132111">omm</strong>.</span></li><li id="ALM-12061__li8367112332111"><span>Run the <strong id="ALM-12061__b14367122392112">ulimit -u</strong> command to obtain the maximum number of threads that can be concurrently opened by user <strong id="ALM-12061__b1236732392116">omm</strong> and check whether the number is greater than or equal to 60000.</span><p><ul id="ALM-12061__ul136710230215"><li id="ALM-12061__li13367423122115">If it is, go to <a href="#ALM-12061__li293443912213">8</a>.</li><li id="ALM-12061__li2367102320214">If it is not, go to <a href="#ALM-12061__li8367152314217">7</a>.</li></ul>
</p></li><li id="ALM-12061__li8367152314217"><a name="ALM-12061__li8367152314217"></a><a name="li8367152314217"></a><span>Run the <strong id="ALM-12061__b53671823112118">ulimit -u 60000</strong> command to change the maximum number to 60000. Two minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12061__ul19367423152119"><li id="ALM-12061__li93671123122113">If it is, no further action is required.</li><li id="ALM-12061__li836702332116">If it is not, go to <a href="#ALM-12061__li1668345092117">12</a>.</li></ul> </p></li><li id="ALM-12061__li8367152314217"><a name="ALM-12061__li8367152314217"></a><a name="li8367152314217"></a><span>Run the <strong id="ALM-12061__b53671823112118">ulimit -u 60000</strong> command to change the maximum number to 60000. Two minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12061__ul19367423152119"><li id="ALM-12061__li93671123122113">If it is, no further action is required.</li><li id="ALM-12061__li836702332116">If it is not, go to <a href="#ALM-12061__li1668345092117">12</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12061__p7839436162117"><strong id="ALM-12061__b1836742382111">Check whether an excessive number of processes are opened at the same time.</strong></p> <p id="ALM-12061__p7839436162117"><strong id="ALM-12061__b1836742382111">Check whether an excessive number of processes are opened at the same time.</strong></p>
<ol start="8" id="ALM-12061__ol1093673902112"><li id="ALM-12061__li293443912213"><a name="ALM-12061__li293443912213"></a><a name="li293443912213"></a><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12061__li3934143952119"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12061__b209341539202116">root</strong>.</span></li><li id="ALM-12061__li893473922118"><span>Run the <strong id="ALM-12061__b199341039112112">ps -o nlwp, pid, lwp, args, -u omm|sort -n</strong> command to check the numbers of threads used by the system. The result is sorted based on the thread number. Analyze the top 5 thread numbers and check whether the threads are incorrectly used. If they are, contact maintenance personnel to rectify the fault. If they are not, run the <strong id="ALM-12061__b209343391212">ulimit -u</strong> command to change the maximum number to be greater than 60000.</span></li><li id="ALM-12061__li119349396211"><span>Five minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12061__ul11934203918217"><li id="ALM-12061__li29341139172111">If it is, no further action is required.</li><li id="ALM-12061__li10934539102120">If it is not, go to <a href="#ALM-12061__li1668345092117">12</a>.</li></ul> <ol start="8" id="ALM-12061__ol1093673902112"><li id="ALM-12061__li293443912213"><a name="ALM-12061__li293443912213"></a><a name="li293443912213"></a><span>In the alarm list on <span id="ALM-12061__text164151523204612">MRS</span> Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12061__li3934143952119"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12061__b209341539202116">root</strong>.</span></li><li id="ALM-12061__li893473922118"><span>Run the <strong id="ALM-12061__b199341039112112">ps -o nlwp, pid, lwp, args, -u omm|sort -n</strong> command to check the numbers of threads used by the system. The result is sorted based on the thread number. Analyze the top 5 thread numbers and check whether the threads are incorrectly used. If they are, contact maintenance personnel to rectify the fault. If they are not, run the <strong id="ALM-12061__b209343391212">ulimit -u</strong> command to change the maximum number to be greater than 60000.</span></li><li id="ALM-12061__li119349396211"><span>Five minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12061__ul11934203918217"><li id="ALM-12061__li29341139172111">If it is, no further action is required.</li><li id="ALM-12061__li10934539102120">If it is not, go to <a href="#ALM-12061__li1668345092117">12</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12061__p56917471218"><strong id="ALM-12061__b1493463982113">Collect fault information.</strong></p> <p id="ALM-12061__p56917471218"><strong id="ALM-12061__b1493463982113">Collect fault information.</strong></p>
<ol start="12" id="ALM-12061__ol18685115014216"><li id="ALM-12061__li1668345092117"><a name="ALM-12061__li1668345092117"></a><a name="li1668345092117"></a><span>On the FusionInsight Manager home page of the active clusters, choose <strong id="ALM-12061__b968317505217">O&amp;M </strong>&gt; <strong id="ALM-12061__b156836505210">Log</strong> &gt; <strong id="ALM-12061__b7683135018213">Download</strong>.</span></li><li id="ALM-12061__li868355022113"><span>Select <strong id="ALM-12061__b6683950172114">OmmServer</strong> and <strong id="ALM-12061__b468318504214">NodeAgent</strong> from the <strong id="ALM-12061__b33411729132615">Service</strong> and click <strong id="ALM-12061__b3991118545">OK</strong>.</span></li><li id="ALM-12061__li8685135062120"><span>Click <span><img id="ALM-12061__image12683135092120" src="en-us_image_0000001532927646.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12061__b136837501219">Start Date</strong> and <strong id="ALM-12061__b86832508216">End Date</strong> to 10 minutes before and after the alarm generation time respectively and click <strong id="ALM-12061__b1168545014219">OK</strong>. Then, click <strong id="ALM-12061__b13685125042113">Download</strong>.</span></li><li id="ALM-12061__li495644512588"><span>Contact the <span id="ALM-12061__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="12" id="ALM-12061__ol18685115014216"><li id="ALM-12061__li1668345092117"><a name="ALM-12061__li1668345092117"></a><a name="li1668345092117"></a><span>On the <span id="ALM-12061__text18528182417464">MRS</span> Manager home page of the active clusters, choose <strong id="ALM-12061__b968317505217">O&amp;M </strong>&gt; <strong id="ALM-12061__b156836505210">Log</strong> &gt; <strong id="ALM-12061__b7683135018213">Download</strong>.</span></li><li id="ALM-12061__li868355022113"><span>Select <strong id="ALM-12061__b6683950172114">OmmServer</strong> and <strong id="ALM-12061__b468318504214">NodeAgent</strong> from the <strong id="ALM-12061__b33411729132615">Service</strong> and click <strong id="ALM-12061__b3991118545">OK</strong>.</span></li><li id="ALM-12061__li8685135062120"><span>Click <span><img id="ALM-12061__image12683135092120" src="en-us_image_0000001532927646.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12061__b136837501219">Start Date</strong> and <strong id="ALM-12061__b86832508216">End Date</strong> to 10 minutes before and after the alarm generation time respectively and click <strong id="ALM-12061__b1168545014219">OK</strong>. Then, click <strong id="ALM-12061__b13685125042113">Download</strong>.</span></li><li id="ALM-12061__li495644512588"><span>Contact the <span id="ALM-12061__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12061__section10584175161919"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12061__p6698105111191">This alarm will be automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12061__section10584175161919"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12061__p6698105111191">This alarm will be automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -59,10 +59,10 @@
<div class="section" id="ALM-12062__section6765152119174"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12062__p12567152916541">The OMS parameter configurations mismatch with the cluster scale.</p> <div class="section" id="ALM-12062__section6765152119174"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12062__p12567152916541">The OMS parameter configurations mismatch with the cluster scale.</p>
</div> </div>
<div class="section" id="ALM-12062__section87667210173"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12062__p456793710548"><strong id="ALM-12062__b356720377542">Check whether the OMS parameter configurations match with the cluster scale.</strong></p> <div class="section" id="ALM-12062__section87667210173"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12062__p456793710548"><strong id="ALM-12062__b356720377542">Check whether the OMS parameter configurations match with the cluster scale.</strong></p>
<ol id="ALM-12062__ol87012317557"><li id="ALM-12062__li489962395514"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12062__li152261503555"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12062__b022675065516">root</strong>. <span id="ALM-12062__text985593916354"></span></span></li><li id="ALM-12062__li95861858185515"><span>Run the <strong id="ALM-12062__b19586105865511">su - omm</strong> command to switch to user <strong id="ALM-12062__b6602115865515">omm</strong>.</span></li><li id="ALM-12062__li960214583555"><span>Run the <strong id="ALM-12062__b660235865514">vi $BIGDATA_LOG_HOME/controller/scriptlog/modify_manager_param.log</strong> command to open the log file and search for the log file containing the following information: Current oms configurations cannot support <em id="ALM-12062__i260210581552">xx</em> nodes. In the information, <em id="ALM-12062__i1760210587558">xx</em> indicates the number of nodes in the cluster.</span></li><li id="ALM-12062__li1895714113811"><span>Optimize the current cluster configuration by following the instructions in <a href="#ALM-12062__section117861721171717">Optimizing Manager Configurations Based on the Number of Cluster Nodes</a>.</span></li><li id="ALM-12062__li199275175618"><span>One hour later, check whether the alarm is cleared.</span><p><ul id="ALM-12062__ul65231712185619"><li id="ALM-12062__li4861118105614">If it is, no further action is required.</li><li id="ALM-12062__li152720248562">If it is not, go to <a href="#ALM-12062__li8140111212587">7</a>.</li></ul> <ol id="ALM-12062__ol87012317557"><li id="ALM-12062__li489962395514"><span>In the alarm list on <span id="ALM-12062__text34789336432">MRS</span> Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12062__li152261503555"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12062__b022675065516">root</strong>. <span id="ALM-12062__text985593916354"></span></span></li><li id="ALM-12062__li95861858185515"><span>Run the <strong id="ALM-12062__b19586105865511">su - omm</strong> command to switch to user <strong id="ALM-12062__b6602115865515">omm</strong>.</span></li><li id="ALM-12062__li960214583555"><span>Run the <strong id="ALM-12062__b660235865514">vi $BIGDATA_LOG_HOME/controller/scriptlog/modify_manager_param.log</strong> command to open the log file and search for the log file containing the following information: Current oms configurations cannot support <em id="ALM-12062__i260210581552">xx</em> nodes. In the information, <em id="ALM-12062__i1760210587558">xx</em> indicates the number of nodes in the cluster.</span></li><li id="ALM-12062__li1895714113811"><span>Optimize the current cluster configuration by following the instructions in <a href="#ALM-12062__section117861721171717">Optimizing Manager Configurations Based on the Number of Cluster Nodes</a>.</span></li><li id="ALM-12062__li199275175618"><span>One hour later, check whether the alarm is cleared.</span><p><ul id="ALM-12062__ul65231712185619"><li id="ALM-12062__li4861118105614">If it is, no further action is required.</li><li id="ALM-12062__li152720248562">If it is not, go to <a href="#ALM-12062__li8140111212587">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12062__p13421113195811"><strong id="ALM-12062__b204218131586">Collect fault information.</strong></p> <p id="ALM-12062__p13421113195811"><strong id="ALM-12062__b204218131586">Collect fault information.</strong></p>
<ol start="7" id="ALM-12062__ol1514001219584"><li id="ALM-12062__li8140111212587"><a name="ALM-12062__li8140111212587"></a><a name="li8140111212587"></a><span>On FusionInsight Manager, choose <strong id="ALM-12062__b12140112175816">O&amp;M</strong> &gt; <strong id="ALM-12062__b114011127584">Log</strong> &gt; <strong id="ALM-12062__b141404121585">Download</strong>.</span></li><li id="ALM-12062__li9140101216585"><span>Select <strong id="ALM-12062__b15140101214581">Controller</strong> from the <strong id="ALM-12062__b214071255817">Service</strong> and click <strong id="ALM-12062__b3991118545">OK</strong>.</span></li><li id="ALM-12062__li121401712195814"><span>Click <span><img id="ALM-12062__image1914021213589" src="en-us_image_0000001532607874.png"></span> in the upper right corner, and set <strong id="ALM-12062__b15140101215811">Start Date</strong> and <strong id="ALM-12062__b121408123588">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12062__b1214091210583">Download</strong>.</span></li><li id="ALM-12062__li495644512588"><span>Contact the <span id="ALM-12062__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="7" id="ALM-12062__ol1514001219584"><li id="ALM-12062__li8140111212587"><a name="ALM-12062__li8140111212587"></a><a name="li8140111212587"></a><span>On <span id="ALM-12062__text72532719461">MRS</span> Manager, choose <strong id="ALM-12062__b12140112175816">O&amp;M</strong> &gt; <strong id="ALM-12062__b114011127584">Log</strong> &gt; <strong id="ALM-12062__b141404121585">Download</strong>.</span></li><li id="ALM-12062__li9140101216585"><span>Select <strong id="ALM-12062__b15140101214581">Controller</strong> from the <strong id="ALM-12062__b214071255817">Service</strong> and click <strong id="ALM-12062__b3991118545">OK</strong>.</span></li><li id="ALM-12062__li121401712195814"><span>Click <span><img id="ALM-12062__image1914021213589" src="en-us_image_0000001532607874.png"></span> in the upper right corner, and set <strong id="ALM-12062__b15140101215811">Start Date</strong> and <strong id="ALM-12062__b121408123588">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12062__b1214091210583">Download</strong>.</span></li><li id="ALM-12062__li495644512588"><span>Contact the <span id="ALM-12062__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12062__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12062__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12062__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12062__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -64,14 +64,14 @@
<div class="section" id="ALM-12063__section6765152119174"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12063__ul161755171417"><li id="ALM-12063__li10175717546">The permission of the disk mount directory is abnormal.</li><li id="ALM-12063__li101751172415">There are disk bad sectors.</li></ul> <div class="section" id="ALM-12063__section6765152119174"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12063__ul161755171417"><li id="ALM-12063__li10175717546">The permission of the disk mount directory is abnormal.</li><li id="ALM-12063__li101751172415">There are disk bad sectors.</li></ul>
</div> </div>
<div class="section" id="ALM-12063__section87667210173"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12063__p982615013436"><strong id="ALM-12063__b148266084312">Check whether the permission of the disk mount directory is normal.</strong></p> <div class="section" id="ALM-12063__section87667210173"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12063__p982615013436"><strong id="ALM-12063__b148266084312">Check whether the permission of the disk mount directory is normal.</strong></p>
<ol id="ALM-12063__ol153513712451"><li id="ALM-12063__li053519784510"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the IP address of the host and <strong id="ALM-12063__b9535779453">DiskName</strong> for the disk for which the alarm is generated.</span></li><li id="ALM-12063__li8535167194513"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12063__b1053519711454">root</strong>. <span id="ALM-12063__text985593916354"></span></span></li><li id="ALM-12063__li135352074456"><span>Run the <strong id="ALM-12063__b165354764512">df -h |grep DiskName</strong> command to obtain the mount point and check whether the permission of the mount directory is unwritable or unreadable.</span><p><ul id="ALM-12063__ul25357764510"><li id="ALM-12063__li753517711453">If it is, go to <a href="#ALM-12063__li1053537184512">4</a>.</li><li id="ALM-12063__li2535107184515">If it is not, go to <a href="#ALM-12063__li8140111212587">8</a>.<div class="note" id="ALM-12063__note483271124518"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12063__p681582324511">If the permission of the mount directory is 000 or the owner is <strong id="ALM-12063__b5815192354515">root</strong>, the mount directory is unreadable and unwritable.</p> <ol id="ALM-12063__ol153513712451"><li id="ALM-12063__li053519784510"><span>In the alarm list on <span id="ALM-12063__text34789336432">MRS</span> Manager, locate the row that contains the alarm, and view the IP address of the host and <strong id="ALM-12063__b9535779453">DiskName</strong> for the disk for which the alarm is generated.</span></li><li id="ALM-12063__li8535167194513"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12063__b1053519711454">root</strong>. <span id="ALM-12063__text985593916354"></span></span></li><li id="ALM-12063__li135352074456"><span>Run the <strong id="ALM-12063__b165354764512">df -h |grep DiskName</strong> command to obtain the mount point and check whether the permission of the mount directory is unwritable or unreadable.</span><p><ul id="ALM-12063__ul25357764510"><li id="ALM-12063__li753517711453">If it is, go to <a href="#ALM-12063__li1053537184512">4</a>.</li><li id="ALM-12063__li2535107184515">If it is not, go to <a href="#ALM-12063__li8140111212587">8</a>.<div class="note" id="ALM-12063__note483271124518"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12063__p681582324511">If the permission of the mount directory is 000 or the owner is <strong id="ALM-12063__b5815192354515">root</strong>, the mount directory is unreadable and unwritable.</p>
</div></div> </div></div>
</li></ul> </li></ul>
</p></li></ol><ol start="4" id="ALM-12063__ol1053510744517"><li id="ALM-12063__li1053537184512"><a name="ALM-12063__li1053537184512"></a><a name="li1053537184512"></a><span>Modify the directory permission.</span></li><li id="ALM-12063__li13535977455"><span>One hour later, check whether this alarm is cleared.</span><p><ul id="ALM-12063__ul1453518794514"><li id="ALM-12063__li453514774516">If it is, no further action is required.</li><li id="ALM-12063__li135357784518">If it is not, go to <a href="#ALM-12063__li4535871458">6</a>.</li></ul> </p></li></ol><ol start="4" id="ALM-12063__ol1053510744517"><li id="ALM-12063__li1053537184512"><a name="ALM-12063__li1053537184512"></a><a name="li1053537184512"></a><span>Modify the directory permission.</span></li><li id="ALM-12063__li13535977455"><span>One hour later, check whether this alarm is cleared.</span><p><ul id="ALM-12063__ul1453518794514"><li id="ALM-12063__li453514774516">If it is, no further action is required.</li><li id="ALM-12063__li135357784518">If it is not, go to <a href="#ALM-12063__li4535871458">6</a>.</li></ul>
</p></li><li id="ALM-12063__li4535871458"><a name="ALM-12063__li4535871458"></a><a name="li4535871458"></a><span>Contact hardware engineers to rectify the disk.</span></li><li id="ALM-12063__li1353518719457"><span>One hour later, check whether this alarm is cleared.</span><p><ul id="ALM-12063__ul6535167124514"><li id="ALM-12063__li05355711456">If it is, no further action is required.</li><li id="ALM-12063__li65354717453">If it is not, go to <a href="#ALM-12063__li8140111212587">8</a>.</li></ul> </p></li><li id="ALM-12063__li4535871458"><a name="ALM-12063__li4535871458"></a><a name="li4535871458"></a><span>Contact hardware engineers to rectify the disk.</span></li><li id="ALM-12063__li1353518719457"><span>One hour later, check whether this alarm is cleared.</span><p><ul id="ALM-12063__ul6535167124514"><li id="ALM-12063__li05355711456">If it is, no further action is required.</li><li id="ALM-12063__li65354717453">If it is not, go to <a href="#ALM-12063__li8140111212587">8</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12063__p18256224611"><strong id="ALM-12063__b42515254610">Collect fault information.</strong></p> <p id="ALM-12063__p18256224611"><strong id="ALM-12063__b42515254610">Collect fault information.</strong></p>
<ol start="8" id="ALM-12063__ol1996717458377"><li id="ALM-12063__li8140111212587"><a name="ALM-12063__li8140111212587"></a><a name="li8140111212587"></a><span>On FusionInsight Manager, choose <strong id="ALM-12063__b12140112175816">O&amp;M</strong> &gt; <strong id="ALM-12063__b114011127584">Log</strong> &gt; <strong id="ALM-12063__b141404121585">Download</strong>.</span></li><li id="ALM-12063__li9140101216585"><span>Select <strong id="ALM-12063__b069717155404">NodeAgent</strong> from the <strong id="ALM-12063__b214071255817">Service</strong> and click <strong id="ALM-12063__b3991118545">OK</strong>.</span></li><li id="ALM-12063__li296716454377"><span>Click <span><img id="ALM-12063__image109671245153716" src="en-us_image_0000001583087405.png"></span> in the upper right corner, and set <strong id="ALM-12063__b99671445103719">Start Date</strong> and <strong id="ALM-12063__b3967114563711">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12063__b2967194513374">Download</strong>.</span></li><li id="ALM-12063__li495644512588"><span>Contact the <span id="ALM-12063__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="8" id="ALM-12063__ol1996717458377"><li id="ALM-12063__li8140111212587"><a name="ALM-12063__li8140111212587"></a><a name="li8140111212587"></a><span>On <span id="ALM-12063__text7526129144614">MRS</span> Manager, choose <strong id="ALM-12063__b12140112175816">O&amp;M</strong> &gt; <strong id="ALM-12063__b114011127584">Log</strong> &gt; <strong id="ALM-12063__b141404121585">Download</strong>.</span></li><li id="ALM-12063__li9140101216585"><span>Select <strong id="ALM-12063__b069717155404">NodeAgent</strong> from the <strong id="ALM-12063__b214071255817">Service</strong> and click <strong id="ALM-12063__b3991118545">OK</strong>.</span></li><li id="ALM-12063__li296716454377"><span>Click <span><img id="ALM-12063__image109671245153716" src="en-us_image_0000001583087405.png"></span> in the upper right corner, and set <strong id="ALM-12063__b99671445103719">Start Date</strong> and <strong id="ALM-12063__b3967114563711">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12063__b2967194513374">Download</strong>.</span></li><li id="ALM-12063__li495644512588"><span>Contact the <span id="ALM-12063__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12063__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12063__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12063__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12063__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -59,11 +59,11 @@
<div class="section" id="ALM-12064__section1735461018427"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12064__p52213894213">The random port range configuration is modified.</p> <div class="section" id="ALM-12064__section1735461018427"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12064__p52213894213">The random port range configuration is modified.</p>
</div> </div>
<div class="section" id="ALM-12064__section693292174218"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12064__p158581330123713"><strong id="ALM-12064__b1585853015379">Check the random port range of the system.</strong></p> <div class="section" id="ALM-12064__section693292174218"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12064__p158581330123713"><strong id="ALM-12064__b1585853015379">Check the random port range of the system.</strong></p>
<ol id="ALM-12064__ol13967134518379"><li id="ALM-12064__li199671454371"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12064__li69671245143710"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12064__b1396794515374">root</strong>. <span id="ALM-12064__text985593916354"></span></span></li><li id="ALM-12064__li1996794533711"><span>Run the <strong id="ALM-12064__b69671845103713">cat /proc/sys/net/ipv4/ip_local_port_range</strong> command to obtain the random port range of the host and check whether the minimum value is smaller than 32768.</span><p><ul id="ALM-12064__ul1496716452370"><li id="ALM-12064__li209671245173712">If it is, go to <a href="#ALM-12064__li1796713455375">4</a>.</li><li id="ALM-12064__li89671745183711">If it is not, goto <a href="#ALM-12064__li1396704514377">7</a>.</li></ul> <ol id="ALM-12064__ol13967134518379"><li id="ALM-12064__li199671454371"><span>In the alarm list on <span id="ALM-12064__text34789336432">MRS</span> Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12064__li69671245143710"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12064__b1396794515374">root</strong>. <span id="ALM-12064__text985593916354"></span></span></li><li id="ALM-12064__li1996794533711"><span>Run the <strong id="ALM-12064__b69671845103713">cat /proc/sys/net/ipv4/ip_local_port_range</strong> command to obtain the random port range of the host and check whether the minimum value is smaller than 32768.</span><p><ul id="ALM-12064__ul1496716452370"><li id="ALM-12064__li209671245173712">If it is, go to <a href="#ALM-12064__li1796713455375">4</a>.</li><li id="ALM-12064__li89671745183711">If it is not, goto <a href="#ALM-12064__li1396704514377">7</a>.</li></ul>
</p></li><li id="ALM-12064__li1796713455375"><a name="ALM-12064__li1796713455375"></a><a name="li1796713455375"></a><span>Run the <strong id="ALM-12064__b1296734510372">vim /etc/sysctl.conf</strong> command to change the value of <strong id="ALM-12064__b1296711459372">net.ipv4.ip_local_port_range</strong> to <strong id="ALM-12064__b496794523715">32768 61000</strong>. If this parameter does not exist, add the following configuration: <strong id="ALM-12064__b129678452378">net.ipv4.ip_local_port_range = 32768 61000</strong>.</span></li><li id="ALM-12064__li79678451371"><span>Run the <strong id="ALM-12064__b11967445133718">sysctl -p /etc/sysctl.conf</strong> command for the modification to take effect.</span></li><li id="ALM-12064__li496704563711"><span>One hour later, check whether the alarm is cleared.</span><p><ul id="ALM-12064__ul16967445203711"><li id="ALM-12064__li1596784553710">If it is, no further action is required.</li><li id="ALM-12064__li1796784514375">If it is not, go to <a href="#ALM-12064__li1396704514377">7</a>.</li></ul> </p></li><li id="ALM-12064__li1796713455375"><a name="ALM-12064__li1796713455375"></a><a name="li1796713455375"></a><span>Run the <strong id="ALM-12064__b1296734510372">vim /etc/sysctl.conf</strong> command to change the value of <strong id="ALM-12064__b1296711459372">net.ipv4.ip_local_port_range</strong> to <strong id="ALM-12064__b496794523715">32768 61000</strong>. If this parameter does not exist, add the following configuration: <strong id="ALM-12064__b129678452378">net.ipv4.ip_local_port_range = 32768 61000</strong>.</span></li><li id="ALM-12064__li79678451371"><span>Run the <strong id="ALM-12064__b11967445133718">sysctl -p /etc/sysctl.conf</strong> command for the modification to take effect.</span></li><li id="ALM-12064__li496704563711"><span>One hour later, check whether the alarm is cleared.</span><p><ul id="ALM-12064__ul16967445203711"><li id="ALM-12064__li1596784553710">If it is, no further action is required.</li><li id="ALM-12064__li1796784514375">If it is not, go to <a href="#ALM-12064__li1396704514377">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12064__p23701710174214"><strong id="ALM-12064__b123701110164218">Collect fault information.</strong></p> <p id="ALM-12064__p23701710174214"><strong id="ALM-12064__b123701110164218">Collect fault information.</strong></p>
<ol start="7" id="ALM-12064__ol1996717458377"><li id="ALM-12064__li1396704514377"><a name="ALM-12064__li1396704514377"></a><a name="li1396704514377"></a><span>On FusionInsight Manager, choose <strong id="ALM-12064__b1996754543712">O&amp;M</strong> &gt; <strong id="ALM-12064__b20967645173714">Log</strong> &gt; <strong id="ALM-12064__b1496734511372">Download</strong>.</span></li><li id="ALM-12064__li1596764533717"><span>Select <strong id="ALM-12064__b13967174519376">NodeAgent</strong> for <strong id="ALM-12064__b196744553714">Service</strong> and click <strong id="ALM-12064__b3991118545">OK</strong>.</span></li><li id="ALM-12064__li296716454377"><span>Click <span><img id="ALM-12064__image109671245153716" src="en-us_image_0000001532767522.png"></span> in the upper right corner, and set <strong id="ALM-12064__b99671445103719">Start Date</strong> and <strong id="ALM-12064__b3967114563711">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12064__b2967194513374">Download</strong>.</span></li><li id="ALM-12064__li495644512588"><span>Contact the <span id="ALM-12064__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="7" id="ALM-12064__ol1996717458377"><li id="ALM-12064__li1396704514377"><a name="ALM-12064__li1396704514377"></a><a name="li1396704514377"></a><span>On <span id="ALM-12064__text9920631194617">MRS</span> Manager, choose <strong id="ALM-12064__b1996754543712">O&amp;M</strong> &gt; <strong id="ALM-12064__b20967645173714">Log</strong> &gt; <strong id="ALM-12064__b1496734511372">Download</strong>.</span></li><li id="ALM-12064__li1596764533717"><span>Select <strong id="ALM-12064__b13967174519376">NodeAgent</strong> for <strong id="ALM-12064__b196744553714">Service</strong> and click <strong id="ALM-12064__b3991118545">OK</strong>.</span></li><li id="ALM-12064__li296716454377"><span>Click <span><img id="ALM-12064__image109671245153716" src="en-us_image_0000001532767522.png"></span> in the upper right corner, and set <strong id="ALM-12064__b99671445103719">Start Date</strong> and <strong id="ALM-12064__b3967114563711">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12064__b2967194513374">Download</strong>.</span></li><li id="ALM-12064__li495644512588"><span>Contact the <span id="ALM-12064__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12064__section14385121020422"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12064__p2038591034212">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12064__section14385121020422"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12064__p2038591034212">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -59,7 +59,7 @@
<div class="section" id="ALM-12066__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12066__ul913288183510"><li id="ALM-12066__li713414815352">The <strong id="ALM-12066__b22461400518">/etc/ssh/sshd_config</strong> configuration file is damaged.</li><li id="ALM-12066__li131351185357">The password of user <strong id="ALM-12066__b10643161513517">omm</strong> has expired.</li></ul> <div class="section" id="ALM-12066__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12066__ul913288183510"><li id="ALM-12066__li713414815352">The <strong id="ALM-12066__b22461400518">/etc/ssh/sshd_config</strong> configuration file is damaged.</li><li id="ALM-12066__li131351185357">The password of user <strong id="ALM-12066__b10643161513517">omm</strong> has expired.</li></ul>
</div> </div>
<div class="section" id="ALM-12066__section071212121445"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12066__p14212204913111"><strong id="ALM-12066__b4515327657">Check the status of the /etc/ssh/sshd_config configuration file.</strong></p> <div class="section" id="ALM-12066__section071212121445"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12066__p14212204913111"><strong id="ALM-12066__b4515327657">Check the status of the /etc/ssh/sshd_config configuration file.</strong></p>
<ol id="ALM-12066__ol363257182811"><li id="ALM-12066__li263016792816"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm and click <span><img id="ALM-12066__image1663017722814" src="en-us_image_0000001532448306.png"></span> to view the host list in the alarm details.</span></li><li id="ALM-12066__li17631167192814"><span>Log in to the active OMS node as user <strong id="ALM-12066__b173458362104930">omm</strong>. <span id="ALM-12066__text38540585518"></span></span></li><li id="ALM-12066__li17631374283"><span>Run the <strong id="ALM-12066__b8591193761511">ssh</strong> command, for example, <strong id="ALM-12066__b1611013111616">ssh</strong> <strong id="ALM-12066__b461113131618"><em id="ALM-12066__i8702204181616">host2</em></strong>, on each node in the alarm details to check whether the connection fails. (<em id="ALM-12066__i1032492071610"><strong id="ALM-12066__b18558144131812">host2</strong></em> is a node other than the OMS node in the alarm details.)</span><p><ul id="ALM-12066__ul1963111718289"><li id="ALM-12066__li363117710285">If yes, go to <a href="#ALM-12066__li176321676280">4</a>.</li><li id="ALM-12066__li136319782815">If no, go to <a href="#ALM-12066__li9148131091317">6</a>.</li></ul> <ol id="ALM-12066__ol363257182811"><li id="ALM-12066__li263016792816"><span>In the alarm list on <span id="ALM-12066__text34789336432">MRS</span> Manager, locate the row that contains the alarm and click <span><img id="ALM-12066__image1663017722814" src="en-us_image_0000001532448306.png"></span> to view the host list in the alarm details.</span></li><li id="ALM-12066__li17631167192814"><span>Log in to the active OMS node as user <strong id="ALM-12066__b173458362104930">omm</strong>. <span id="ALM-12066__text38540585518"></span></span></li><li id="ALM-12066__li17631374283"><span>Run the <strong id="ALM-12066__b8591193761511">ssh</strong> command, for example, <strong id="ALM-12066__b1611013111616">ssh</strong> <strong id="ALM-12066__b461113131618"><em id="ALM-12066__i8702204181616">host2</em></strong>, on each node in the alarm details to check whether the connection fails. (<em id="ALM-12066__i1032492071610"><strong id="ALM-12066__b18558144131812">host2</strong></em> is a node other than the OMS node in the alarm details.)</span><p><ul id="ALM-12066__ul1963111718289"><li id="ALM-12066__li363117710285">If yes, go to <a href="#ALM-12066__li176321676280">4</a>.</li><li id="ALM-12066__li136319782815">If no, go to <a href="#ALM-12066__li9148131091317">6</a>.</li></ul>
</p></li><li id="ALM-12066__li176321676280"><a name="ALM-12066__li176321676280"></a><a name="li176321676280"></a><span>Open the <strong id="ALM-12066__b19350203172016">/etc/ssh/sshd_config</strong> configuration file on host2 and check whether <strong id="ALM-12066__b497416449207">AllowUsers</strong> or <strong id="ALM-12066__b683084712203">DenyUsers</strong> is configured for other nodes.</span><p><ul id="ALM-12066__ul263219711285"><li id="ALM-12066__li66323716289">If yes, go to <a href="#ALM-12066__li846318425575">5</a>.</li><li id="ALM-12066__li1763211732817">If no, contact OS experts.</li></ul> </p></li><li id="ALM-12066__li176321676280"><a name="ALM-12066__li176321676280"></a><a name="li176321676280"></a><span>Open the <strong id="ALM-12066__b19350203172016">/etc/ssh/sshd_config</strong> configuration file on host2 and check whether <strong id="ALM-12066__b497416449207">AllowUsers</strong> or <strong id="ALM-12066__b683084712203">DenyUsers</strong> is configured for other nodes.</span><p><ul id="ALM-12066__ul263219711285"><li id="ALM-12066__li66323716289">If yes, go to <a href="#ALM-12066__li846318425575">5</a>.</li><li id="ALM-12066__li1763211732817">If no, contact OS experts.</li></ul>
</p></li><li id="ALM-12066__li846318425575"><a name="ALM-12066__li846318425575"></a><a name="li846318425575"></a><span>Modify the whitelist or blacklist to ensure that user <strong id="ALM-12066__b5862624122211">omm</strong> is in the whitelist or not in the blacklist. Check whether the alarm is cleared.</span><p><ul id="ALM-12066__ul111918318587"><li id="ALM-12066__li17191331165814">If yes, no further action is required.</li><li id="ALM-12066__li15858237195817">If no, go to <a href="#ALM-12066__li9148131091317">6</a>.</li></ul> </p></li><li id="ALM-12066__li846318425575"><a name="ALM-12066__li846318425575"></a><a name="li846318425575"></a><span>Modify the whitelist or blacklist to ensure that user <strong id="ALM-12066__b5862624122211">omm</strong> is in the whitelist or not in the blacklist. Check whether the alarm is cleared.</span><p><ul id="ALM-12066__ul111918318587"><li id="ALM-12066__li17191331165814">If yes, no further action is required.</li><li id="ALM-12066__li15858237195817">If no, go to <a href="#ALM-12066__li9148131091317">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
@ -69,7 +69,7 @@
</p></li><li id="ALM-12066__li19341633125911"><span>Add the public key of user <strong id="ALM-12066__b0377113310287">omm</strong> of the peer host to the trust list of the local host. Run the <strong id="ALM-12066__b1737092382919">ssh</strong> command, for example, <strong id="ALM-12066__b6889113012290">ssh host2</strong>, on each node in the alarm details to check whether the connection fails. (<em id="ALM-12066__i81833373014"><strong id="ALM-12066__b0720241270">host2</strong></em> is a node other than the OMS node in the alarm details.)</span><p><ul id="ALM-12066__ul137211213508"><li id="ALM-12066__li153121714307">If yes, go to <a href="#ALM-12066__li106306742813">9</a>.</li><li id="ALM-12066__li7313414402">If no, check whether the alarm is cleared. If the alarm is cleared, no further action is required; otherwise, go to <a href="#ALM-12066__li106306742813">9</a>.</li></ul> </p></li><li id="ALM-12066__li19341633125911"><span>Add the public key of user <strong id="ALM-12066__b0377113310287">omm</strong> of the peer host to the trust list of the local host. Run the <strong id="ALM-12066__b1737092382919">ssh</strong> command, for example, <strong id="ALM-12066__b6889113012290">ssh host2</strong>, on each node in the alarm details to check whether the connection fails. (<em id="ALM-12066__i81833373014"><strong id="ALM-12066__b0720241270">host2</strong></em> is a node other than the OMS node in the alarm details.)</span><p><ul id="ALM-12066__ul137211213508"><li id="ALM-12066__li153121714307">If yes, go to <a href="#ALM-12066__li106306742813">9</a>.</li><li id="ALM-12066__li7313414402">If no, check whether the alarm is cleared. If the alarm is cleared, no further action is required; otherwise, go to <a href="#ALM-12066__li106306742813">9</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12066__p124132216288"><strong id="ALM-12066__b1967293410811">Collect the fault information.</strong></p> <p id="ALM-12066__p124132216288"><strong id="ALM-12066__b1967293410811">Collect the fault information.</strong></p>
<ol start="9" id="ALM-12066__ol146302742816"><li class="subitemlist" id="ALM-12066__li106306742813"><a name="ALM-12066__li106306742813"></a><a name="li106306742813"></a><span>On FusionInsight Manager, choose <strong id="ALM-12066__b140942549104930">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12066__b180541324104930">Log</strong> &gt; <strong id="ALM-12066__b1225148528104930">Download</strong>.</span></li><li id="ALM-12066__li06301476283"><span>Select <strong id="ALM-12066__b192996136104930">Controller</strong> for <strong id="ALM-12066__b345013368916">Service</strong> and click <strong id="ALM-12066__b1962404791104930">OK</strong>.</span></li><li id="ALM-12066__li126301173286"><span>Click <span><img id="ALM-12066__image863057122812" src="en-us_image_0000001583087445.png"></span> in the upper right corner to set the log collection time range. Generally, the time range is 10 minutes before and after the alarm generation time. Click <strong id="ALM-12066__b575409479104930">Download</strong>.</span></li><li id="ALM-12066__li2630274284"><span>Contact <span id="ALM-12066__text1793615574113">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="9" id="ALM-12066__ol146302742816"><li class="subitemlist" id="ALM-12066__li106306742813"><a name="ALM-12066__li106306742813"></a><a name="li106306742813"></a><span>On <span id="ALM-12066__text1497103410468">MRS</span> Manager, choose <strong id="ALM-12066__b140942549104930">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12066__b180541324104930">Log</strong> &gt; <strong id="ALM-12066__b1225148528104930">Download</strong>.</span></li><li id="ALM-12066__li06301476283"><span>Select <strong id="ALM-12066__b192996136104930">Controller</strong> for <strong id="ALM-12066__b345013368916">Service</strong> and click <strong id="ALM-12066__b1962404791104930">OK</strong>.</span></li><li id="ALM-12066__li126301173286"><span>Click <span><img id="ALM-12066__image863057122812" src="en-us_image_0000001583087445.png"></span> in the upper right corner to set the log collection time range. Generally, the time range is 10 minutes before and after the alarm generation time. Click <strong id="ALM-12066__b575409479104930">Download</strong>.</span></li><li id="ALM-12066__li2630274284"><span>Contact <span id="ALM-12066__text1793615574113">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12066__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12066__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12066__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12066__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -61,10 +61,10 @@
<div class="section" id="ALM-12067__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12067__ul12589142315014"><li id="ALM-12067__li3591142315501">The Tomcat directory permission is abnormal, and the Tomcat process is abnormal.</li></ul> <div class="section" id="ALM-12067__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12067__ul12589142315014"><li id="ALM-12067__li3591142315501">The Tomcat directory permission is abnormal, and the Tomcat process is abnormal.</li></ul>
</div> </div>
<div class="section" id="ALM-12067__section071212121445"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12067__p3197164020479"><strong id="ALM-12067__b64575930152820">Check whether the permission on the Tomcat directory is normal.</strong></p> <div class="section" id="ALM-12067__section071212121445"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12067__p3197164020479"><strong id="ALM-12067__b64575930152820">Check whether the permission on the Tomcat directory is normal.</strong></p>
<ol id="ALM-12067__ol01141266283"><li id="ALM-12067__li111412602820"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and click <span><img id="ALM-12067__image10114162611289" src="en-us_image_0000001583127457.png"></span> to view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12067__li2011432610283"><span>Log in to the alarm host as user <strong id="ALM-12067__b2011452617286">root</strong>. <span id="ALM-12067__text65184518511"></span></span></li><li id="ALM-12067__li6114182682819"><span>Run the <strong id="ALM-12067__b101141226122818">su - omm</strong> command to switch to user <strong id="ALM-12067__b1740514446548">omm</strong>.</span></li><li id="ALM-12067__li181141726192815"><span>Run the <strong id="ALM-12067__b19114226192818">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/tomcat.log</strong> command to check whether the Tomcat resource log contains keyword <strong id="ALM-12067__b61141926122811">Cannot find <em id="ALM-12067__i1163833916240">XXX</em></strong> and rectify the file permission based on the keyword.</span></li><li id="ALM-12067__li51141626202816"><span>After 5 minutes, check whether the alarm is automatically cleared. </span><p><ul class="subitemlist" id="ALM-12067__ul911415261288"><li id="ALM-12067__li911492612811">If yes, no further action is required.</li><li id="ALM-12067__li1711402612820">If no, go to <a href="#ALM-12067__li711211264288">6</a>.</li></ul> <ol id="ALM-12067__ol01141266283"><li id="ALM-12067__li111412602820"><span>In the alarm list on <span id="ALM-12067__text34789336432">MRS</span> Manager, locate the row that contains the alarm, and click <span><img id="ALM-12067__image10114162611289" src="en-us_image_0000001583127457.png"></span> to view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12067__li2011432610283"><span>Log in to the alarm host as user <strong id="ALM-12067__b2011452617286">root</strong>. <span id="ALM-12067__text65184518511"></span></span></li><li id="ALM-12067__li6114182682819"><span>Run the <strong id="ALM-12067__b101141226122818">su - omm</strong> command to switch to user <strong id="ALM-12067__b1740514446548">omm</strong>.</span></li><li id="ALM-12067__li181141726192815"><span>Run the <strong id="ALM-12067__b19114226192818">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/tomcat.log</strong> command to check whether the Tomcat resource log contains keyword <strong id="ALM-12067__b61141926122811">Cannot find <em id="ALM-12067__i1163833916240">XXX</em></strong> and rectify the file permission based on the keyword.</span></li><li id="ALM-12067__li51141626202816"><span>After 5 minutes, check whether the alarm is automatically cleared.</span><p><ul class="subitemlist" id="ALM-12067__ul911415261288"><li id="ALM-12067__li911492612811">If yes, no further action is required.</li><li id="ALM-12067__li1711402612820">If no, go to <a href="#ALM-12067__li711211264288">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12067__p124132216288"><strong id="ALM-12067__b1967293410811">Collect the fault information.</strong></p> <p id="ALM-12067__p124132216288"><strong id="ALM-12067__b1967293410811">Collect the fault information.</strong></p>
<ol start="6" id="ALM-12067__ol7112102616281"><li class="subitemlist" id="ALM-12067__li711211264288"><a name="ALM-12067__li711211264288"></a><a name="li711211264288"></a><span>On FusionInsight Manager, choose <strong id="ALM-12067__b8360182718578">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12067__b536002785718">Log</strong> &gt; <strong id="ALM-12067__b33611827205714">Download</strong>.</span></li><li id="ALM-12067__li31126266289"><span>In the <strong id="ALM-12067__b1071163118573">Services</strong> area, select <strong id="ALM-12067__b3821031155716">OmmServer</strong> and <strong id="ALM-12067__b68263135711">Tomcat</strong>, and click <strong id="ALM-12067__b682931185716">OK</strong>.</span></li><li id="ALM-12067__li2011292612815"><span>Click <span><img id="ALM-12067__image51121126122816" src="en-us_image_0000001532767558.png"></span> in the upper right corner, and set <strong id="ALM-12067__b55511722583">Start Date</strong> and <strong id="ALM-12067__b17552923588">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12067__b8552127586">Download</strong>.</span></li><li id="ALM-12067__li15112192672816"><span>Contact <span id="ALM-12067__text1694528635">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="6" id="ALM-12067__ol7112102616281"><li class="subitemlist" id="ALM-12067__li711211264288"><a name="ALM-12067__li711211264288"></a><a name="li711211264288"></a><span>On <span id="ALM-12067__text3909136134618">MRS</span> Manager, choose <strong id="ALM-12067__b8360182718578">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12067__b536002785718">Log</strong> &gt; <strong id="ALM-12067__b33611827205714">Download</strong>.</span></li><li id="ALM-12067__li31126266289"><span>In the <strong id="ALM-12067__b1071163118573">Services</strong> area, select <strong id="ALM-12067__b3821031155716">OmmServer</strong> and <strong id="ALM-12067__b68263135711">Tomcat</strong>, and click <strong id="ALM-12067__b682931185716">OK</strong>.</span></li><li id="ALM-12067__li2011292612815"><span>Click <span><img id="ALM-12067__image51121126122816" src="en-us_image_0000001532767558.png"></span> in the upper right corner, and set <strong id="ALM-12067__b55511722583">Start Date</strong> and <strong id="ALM-12067__b17552923588">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12067__b8552127586">Download</strong>.</span></li><li id="ALM-12067__li15112192672816"><span>Contact <span id="ALM-12067__text1694528635">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12067__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12067__p754913417333">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12067__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12067__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -56,16 +56,16 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12068__section2990133614335"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12068__ul25260697"><li id="ALM-12068__li26019688">The active/standby Manager switchover occurs.</li><li id="ALM-12068__li32850608">The ACS process repeatedly restarts, which may cause the FusionInsight Manager login failure.</li></ul> <div class="section" id="ALM-12068__section2990133614335"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12068__ul25260697"><li id="ALM-12068__li26019688">The active/standby Manager switchover occurs.</li><li id="ALM-12068__li32850608">The ACS process repeatedly restarts, which may cause the <span id="ALM-12068__text34789336432">MRS</span> Manager login failure.</li></ul>
</div> </div>
<div class="section" id="ALM-12068__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12068__p610083015544">The ACS process is abnormal.</p> <div class="section" id="ALM-12068__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12068__p610083015544">The ACS process is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12068__section5440125035617"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12068__p8324186"><strong id="ALM-12068__b15118501163833">Check whether the ACS process is normal.</strong></p> <div class="section" id="ALM-12068__section5440125035617"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12068__p8324186"><strong id="ALM-12068__b15118501163833">Check whether the ACS process is normal.</strong></p>
<ol id="ALM-12068__ol5558276163811"><li id="ALM-12068__li34357272165726"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and click <span><img id="ALM-12068__image168221113135319" src="en-us_image_0000001582927805.png"></span> to view the name of the host for which the alarm is generated.</span></li><li id="ALM-12068__li50024484163811"><span>Log in to the alarm host as user <strong id="ALM-12068__b1241211221169">root</strong>. <span id="ALM-12068__text1942962220620"></span></span></li><li id="ALM-12068__li17626636132716"><span>Run the <strong id="ALM-12068__b8588144553112">su - omm</strong> command and then <strong id="ALM-12068__b32015537163811">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> to check whether the status of the ACS resources managed by the HA is normal. In the single-node system, the ACS resource is in the normal state. In the dual-node system, the ACS resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul class="subitemlist" id="ALM-12068__ul66289368274"><li id="ALM-12068__li1062811360271">If yes, go to <a href="#ALM-12068__li6152360163635">6</a>.</li><li id="ALM-12068__li46281436112719">If no, go to <a href="#ALM-12068__li139657016249">4</a>.</li></ul> <ol id="ALM-12068__ol5558276163811"><li id="ALM-12068__li34357272165726"><span>In the alarm list on <span id="ALM-12068__text96841139114610">MRS</span> Manager, locate the row that contains the alarm, and click <span><img id="ALM-12068__image168221113135319" src="en-us_image_0000001582927805.png"></span> to view the name of the host for which the alarm is generated.</span></li><li id="ALM-12068__li50024484163811"><span>Log in to the alarm host as user <strong id="ALM-12068__b1241211221169">root</strong>. <span id="ALM-12068__text1942962220620"></span></span></li><li id="ALM-12068__li17626636132716"><span>Run the <strong id="ALM-12068__b8588144553112">su - omm</strong> command and then <strong id="ALM-12068__b32015537163811">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> to check whether the status of the ACS resources managed by the HA is normal. In the single-node system, the ACS resource is in the normal state. In the dual-node system, the ACS resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul class="subitemlist" id="ALM-12068__ul66289368274"><li id="ALM-12068__li1062811360271">If yes, go to <a href="#ALM-12068__li6152360163635">6</a>.</li><li id="ALM-12068__li46281436112719">If no, go to <a href="#ALM-12068__li139657016249">4</a>.</li></ul>
</p></li><li id="ALM-12068__li139657016249"><a name="ALM-12068__li139657016249"></a><a name="li139657016249"></a><span>Run the <strong id="ALM-12068__b20158102319162">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/acs.log</strong> command to check whether the ACS resource log of HA contains the keyword <strong id="ALM-12068__b12635154014714">ERROR</strong>. If yes, analyze the logs to locate the resource exception cause and fix the exception.</span></li><li id="ALM-12068__li14736019164314"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12068__ul473671984320"><li id="ALM-12068__li9736151912432">If yes, no further action is required.</li><li id="ALM-12068__li4736141910439">If no, go to <a href="#ALM-12068__li6152360163635">6</a>.</li></ul> </p></li><li id="ALM-12068__li139657016249"><a name="ALM-12068__li139657016249"></a><a name="li139657016249"></a><span>Run the <strong id="ALM-12068__b20158102319162">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/acs.log</strong> command to check whether the ACS resource log of HA contains the keyword <strong id="ALM-12068__b12635154014714">ERROR</strong>. If yes, analyze the logs to locate the resource exception cause and fix the exception.</span></li><li id="ALM-12068__li14736019164314"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12068__ul473671984320"><li id="ALM-12068__li9736151912432">If yes, no further action is required.</li><li id="ALM-12068__li4736141910439">If no, go to <a href="#ALM-12068__li6152360163635">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12068__p3652216163758"><strong id="ALM-12068__b26858758163828">Collect the fault information.</strong></p> <p id="ALM-12068__p3652216163758"><strong id="ALM-12068__b26858758163828">Collect the fault information.</strong></p>
<ol start="6" id="ALM-12068__ol26111342163819"><li id="ALM-12068__li6152360163635"><a name="ALM-12068__li6152360163635"></a><a name="li6152360163635"></a><span>On FusionInsight Manager, choose <strong id="ALM-12068__b198926401682">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12068__b16892134019819">Log</strong> &gt; <strong id="ALM-12068__b1789318401185">Download</strong>.</span></li><li id="ALM-12068__li55371246163635"><span>In the <strong id="ALM-12068__b8713343188">Services</strong> area, select <strong id="ALM-12068__b1272114432815">Controller</strong> and <strong id="ALM-12068__b1872120439817">OmmServer</strong>, and click <strong id="ALM-12068__b177222431087">OK</strong>.</span></li><li id="ALM-12068__li28579174163635"><span>Click <span><img id="ALM-12068__image69691781225" src="en-us_image_0000001532607914.png"></span> in the upper right corner, and set <strong id="ALM-12068__b1482814481884">Start Date</strong> and <strong id="ALM-12068__b68291648584">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12068__b1382914812818">Download</strong>.</span></li><li id="ALM-12068__li33211732163635"><span>Contact <span id="ALM-12068__text21221703916">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="6" id="ALM-12068__ol26111342163819"><li id="ALM-12068__li6152360163635"><a name="ALM-12068__li6152360163635"></a><a name="li6152360163635"></a><span>On <span id="ALM-12068__text209047409464">MRS</span> Manager, choose <strong id="ALM-12068__b198926401682">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12068__b16892134019819">Log</strong> &gt; <strong id="ALM-12068__b1789318401185">Download</strong>.</span></li><li id="ALM-12068__li55371246163635"><span>In the <strong id="ALM-12068__b8713343188">Services</strong> area, select <strong id="ALM-12068__b1272114432815">Controller</strong> and <strong id="ALM-12068__b1872120439817">OmmServer</strong>, and click <strong id="ALM-12068__b177222431087">OK</strong>.</span></li><li id="ALM-12068__li28579174163635"><span>Click <span><img id="ALM-12068__image69691781225" src="en-us_image_0000001532607914.png"></span> in the upper right corner, and set <strong id="ALM-12068__b1482814481884">Start Date</strong> and <strong id="ALM-12068__b68291648584">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12068__b1382914812818">Download</strong>.</span></li><li id="ALM-12068__li33211732163635"><span>Contact <span id="ALM-12068__text21221703916">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12068__section129720811223"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12068__p19973168152211">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12068__section129720811223"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12068__p19973168152211">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -56,16 +56,16 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12069__section2990133614335"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12069__ul25260697"><li id="ALM-12069__li26019688">The active/standby Manager switchover occurs.</li><li id="ALM-12069__li32850608">The AOS process repeatedly restarts, which may cause the FusionInsight Manager login failure.</li></ul> <div class="section" id="ALM-12069__section2990133614335"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12069__ul25260697"><li id="ALM-12069__li26019688">The active/standby Manager switchover occurs.</li><li id="ALM-12069__li32850608">The AOS process repeatedly restarts, which may cause the <span id="ALM-12069__text34789336432">MRS</span> Manager login failure.</li></ul>
</div> </div>
<div class="section" id="ALM-12069__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12069__p14940123162411">The AOS process is abnormal.</p> <div class="section" id="ALM-12069__section950130153414"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12069__p14940123162411">The AOS process is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12069__section1541443812244"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12069__p8324186"><strong id="ALM-12069__b15118501163833">Check whether the AOS process is normal.</strong></p> <div class="section" id="ALM-12069__section1541443812244"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12069__p8324186"><strong id="ALM-12069__b15118501163833">Check whether the AOS process is normal.</strong></p>
<ol id="ALM-12069__ol5558276163811"><li id="ALM-12069__li34357272165726"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and click <span><img id="ALM-12069__image168221113135319" src="en-us_image_0000001532448286.png"></span> to view the name of the host for which the alarm is generated.</span></li><li id="ALM-12069__li50024484163811"><span>Log in to the alarm host as user <strong id="ALM-12069__b96866141813">root</strong>. <span id="ALM-12069__text116882111811"></span></span></li><li id="ALM-12069__li17626636132716"><span>Run the <strong id="ALM-12069__b199545565144538">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the AOS resources managed by the HA is normal. In the single-node system, the AOS resource is in the normal state. In the dual-node system, the AOS resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul class="subitemlist" id="ALM-12069__ul66289368274"><li id="ALM-12069__li1062811360271">If yes, go to <a href="#ALM-12069__li6152360163635">6</a>.</li><li id="ALM-12069__li46281436112719">If no, go to <a href="#ALM-12069__li139657016249">4</a>.</li></ul> <ol id="ALM-12069__ol5558276163811"><li id="ALM-12069__li34357272165726"><span>In the alarm list on <span id="ALM-12069__text4588154344612">MRS</span> Manager, locate the row that contains the alarm, and click <span><img id="ALM-12069__image168221113135319" src="en-us_image_0000001532448286.png"></span> to view the name of the host for which the alarm is generated.</span></li><li id="ALM-12069__li50024484163811"><span>Log in to the alarm host as user <strong id="ALM-12069__b96866141813">root</strong>. <span id="ALM-12069__text116882111811"></span></span></li><li id="ALM-12069__li17626636132716"><span>Run the <strong id="ALM-12069__b199545565144538">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the AOS resources managed by the HA is normal. In the single-node system, the AOS resource is in the normal state. In the dual-node system, the AOS resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul class="subitemlist" id="ALM-12069__ul66289368274"><li id="ALM-12069__li1062811360271">If yes, go to <a href="#ALM-12069__li6152360163635">6</a>.</li><li id="ALM-12069__li46281436112719">If no, go to <a href="#ALM-12069__li139657016249">4</a>.</li></ul>
</p></li><li id="ALM-12069__li139657016249"><a name="ALM-12069__li139657016249"></a><a name="li139657016249"></a><span>Run the <strong id="ALM-12069__b15175108193211">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/aos.log</strong> command to check whether the AOS resource log of HA contains the keyword <strong id="ALM-12069__b1918314817326">ERROR</strong>. If yes, analyze the logs to locate the resource exception cause and fix the exception.</span></li><li id="ALM-12069__li14736019164314"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12069__ul473671984320"><li id="ALM-12069__li9736151912432">If yes, no further action is required.</li><li id="ALM-12069__li4736141910439">If no, go to <a href="#ALM-12069__li6152360163635">6</a>.</li></ul> </p></li><li id="ALM-12069__li139657016249"><a name="ALM-12069__li139657016249"></a><a name="li139657016249"></a><span>Run the <strong id="ALM-12069__b15175108193211">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/aos.log</strong> command to check whether the AOS resource log of HA contains the keyword <strong id="ALM-12069__b1918314817326">ERROR</strong>. If yes, analyze the logs to locate the resource exception cause and fix the exception.</span></li><li id="ALM-12069__li14736019164314"><span>After 5 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12069__ul473671984320"><li id="ALM-12069__li9736151912432">If yes, no further action is required.</li><li id="ALM-12069__li4736141910439">If no, go to <a href="#ALM-12069__li6152360163635">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12069__p3652216163758"><strong id="ALM-12069__b26858758163828">Collect the fault information.</strong></p> <p id="ALM-12069__p3652216163758"><strong id="ALM-12069__b26858758163828">Collect the fault information.</strong></p>
<ol start="6" id="ALM-12069__ol26111342163819"><li id="ALM-12069__li6152360163635"><a name="ALM-12069__li6152360163635"></a><a name="li6152360163635"></a><span>On FusionInsight Manager, choose <strong id="ALM-12069__b4651852193219">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12069__b76526528326">Log</strong> &gt; <strong id="ALM-12069__b46521552153219">Download</strong>.</span></li><li id="ALM-12069__li55371246163635"><span>In the <strong id="ALM-12069__b118685519325">Services</strong> area, select <strong id="ALM-12069__b1586165523216">Controller</strong> and <strong id="ALM-12069__b1686155512326">OmmServer</strong>, and click <strong id="ALM-12069__b5861955163217">OK</strong>.</span></li><li id="ALM-12069__li28579174163635"><span>Click <span><img id="ALM-12069__image69691781225" src="en-us_image_0000001582927665.png"></span> in the upper right corner, and set <strong id="ALM-12069__b182615123314">Start Date</strong> and <strong id="ALM-12069__b102629118330">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12069__b62621211335">Download</strong>.</span></li><li id="ALM-12069__li33211732163635"><span>Contact <span id="ALM-12069__text5719151393316">O&amp;M personnel</span> and provide the collected logs.</span></li></ol> <ol start="6" id="ALM-12069__ol26111342163819"><li id="ALM-12069__li6152360163635"><a name="ALM-12069__li6152360163635"></a><a name="li6152360163635"></a><span>On <span id="ALM-12069__text10902134410467">MRS</span> Manager, choose <strong id="ALM-12069__b4651852193219">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-12069__b76526528326">Log</strong> &gt; <strong id="ALM-12069__b46521552153219">Download</strong>.</span></li><li id="ALM-12069__li55371246163635"><span>In the <strong id="ALM-12069__b118685519325">Services</strong> area, select <strong id="ALM-12069__b1586165523216">Controller</strong> and <strong id="ALM-12069__b1686155512326">OmmServer</strong>, and click <strong id="ALM-12069__b5861955163217">OK</strong>.</span></li><li id="ALM-12069__li28579174163635"><span>Click <span><img id="ALM-12069__image69691781225" src="en-us_image_0000001582927665.png"></span> in the upper right corner, and set <strong id="ALM-12069__b182615123314">Start Date</strong> and <strong id="ALM-12069__b102629118330">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12069__b62621211335">Download</strong>.</span></li><li id="ALM-12069__li33211732163635"><span>Contact <span id="ALM-12069__text5719151393316">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12069__section129720811223"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12069__p19973168152211">This alarm is automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12069__section129720811223"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12069__p19973168152211">This alarm is automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -3,7 +3,7 @@
<h1 class="topictitle1">ALM-12070 Controller Resource Is Abnormal</h1> <h1 class="topictitle1">ALM-12070 Controller Resource Is Abnormal</h1>
<div id="body1547192931355"><div class="section" id="ALM-12070__section2747821101717"><h4 class="sectiontitle">Alarm Description</h4><p id="ALM-12070__p868415518212">HA checks the controller resources of Manager every 80 seconds. This alarm is generated when HA detects that the controller resources are abnormal for 2 consecutive times.</p> <div id="body1547192931355"><div class="section" id="ALM-12070__section2747821101717"><h4 class="sectiontitle">Alarm Description</h4><p id="ALM-12070__p868415518212">HA checks the controller resources of Manager every 80 seconds. This alarm is generated when HA detects that the controller resources are abnormal for 2 consecutive times.</p>
<p id="ALM-12070__p06843510216">This alarm is cleared when the Controller resource is normal.</p> <p id="ALM-12070__p06843510216">This alarm is cleared when the Controller resource is normal.</p>
<p id="ALM-12070__p14684251525"><strong id="ALM-12070__b06841253216">Resource Type</strong> of Controller is <strong id="ALM-12070__b1068413515211">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new Controller resources have been enabled on the new active FusionInsight Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p> <p id="ALM-12070__p14684251525"><strong id="ALM-12070__b06841253216">Resource Type</strong> of Controller is <strong id="ALM-12070__b1068413515211">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new Controller resources have been enabled on the new active <span id="ALM-12070__text34789336432">MRS</span> Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p>
</div> </div>
<div class="section" id="ALM-12070__section127478213171"><h4 class="sectiontitle">Attribute</h4> <div class="section" id="ALM-12070__section127478213171"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12070__table7749721191719" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12070__row6867152161714"><th align="left" class="cellrowborder" valign="top" width="34.34343434343434%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12070__p03908133538">Alarm ID</p> <div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12070__table7749721191719" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12070__row6867152161714"><th align="left" class="cellrowborder" valign="top" width="34.34343434343434%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12070__p03908133538">Alarm ID</p>
@ -56,16 +56,16 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12070__section1776462111715"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12070__ul57451821824"><li id="ALM-12070__li6745132115216">The active/standby FusionInsight Manager switchover occurs.</li><li id="ALM-12070__li1274512211725">The Controller process repeatedly restarts, which may cause the FusionInsight Manager login failure.</li></ul> <div class="section" id="ALM-12070__section1776462111715"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12070__ul57451821824"><li id="ALM-12070__li6745132115216">The active/standby <span id="ALM-12070__text1857694719469">MRS</span> Manager switchover occurs.</li><li id="ALM-12070__li1274512211725">The Controller process repeatedly restarts, which may cause the <span id="ALM-12070__text8961548194619">MRS</span> Manager login failure.</li></ul>
</div> </div>
<div class="section" id="ALM-12070__section6765152119174"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12070__p3608142911216">The Controller process is abnormal.</p> <div class="section" id="ALM-12070__section6765152119174"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12070__p3608142911216">The Controller process is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12070__section87667210173"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12070__p8711155114555"><strong id="ALM-12070__b598112553552">Check whether the controller process is normal.</strong></p> <div class="section" id="ALM-12070__section87667210173"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12070__p8711155114555"><strong id="ALM-12070__b598112553552">Check whether the controller process is normal.</strong></p>
<ol id="ALM-12070__ol12903923231"><li id="ALM-12070__li1890320236317"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12070__li2903122315315"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12070__b1190319231738">root</strong>. <span id="ALM-12070__text985593916354"></span></span></li><li id="ALM-12070__li47901049125519"><span>Run the <strong id="ALM-12070__b171639418567">su - omm</strong> command to switch to user <strong id="ALM-12070__b1482687105615">omm</strong>.Run the <strong id="ALM-12070__b590314231316">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the Controller resources managed by the HA is normal. In the single-node system, the Controller resource is in the normal state. In the dual-node system, the Controller resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12070__ul1490314236318"><li id="ALM-12070__li090332320310">If it is, go to <a href="#ALM-12070__li69038231234">6</a>.</li><li id="ALM-12070__li9903192317316">If it is not, go to <a href="#ALM-12070__li6903202312318">4</a>.</li></ul> <ol id="ALM-12070__ol12903923231"><li id="ALM-12070__li1890320236317"><span>In the alarm list on <span id="ALM-12070__text9261650154616">MRS</span> Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12070__li2903122315315"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12070__b1190319231738">root</strong>. <span id="ALM-12070__text985593916354"></span></span></li><li id="ALM-12070__li47901049125519"><span>Run the <strong id="ALM-12070__b171639418567">su - omm</strong> command to switch to user <strong id="ALM-12070__b1482687105615">omm</strong>.Run the <strong id="ALM-12070__b590314231316">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the Controller resources managed by the HA is normal. In the single-node system, the Controller resource is in the normal state. In the dual-node system, the Controller resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12070__ul1490314236318"><li id="ALM-12070__li090332320310">If it is, go to <a href="#ALM-12070__li69038231234">6</a>.</li><li id="ALM-12070__li9903192317316">If it is not, go to <a href="#ALM-12070__li6903202312318">4</a>.</li></ul>
</p></li><li id="ALM-12070__li6903202312318"><a name="ALM-12070__li6903202312318"></a><a name="li6903202312318"></a><span>Run the <strong id="ALM-12070__b16903112313312">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/controller.log</strong> command to view the Controller resource logs, and run the <strong id="ALM-12070__b290310231836">vi $BIGDATA_LOG_HOME/controller/controller.log </strong>command to view the Controller running logs, check whether the keyword <strong id="ALM-12070__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12070__li1590310231933"><span>Five minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12070__ul209032231431"><li id="ALM-12070__li199031823835">If it is, no further action is required.</li><li id="ALM-12070__li159039231338">If it is not, go to <a href="#ALM-12070__li69038231234">6</a>.</li></ul> </p></li><li id="ALM-12070__li6903202312318"><a name="ALM-12070__li6903202312318"></a><a name="li6903202312318"></a><span>Run the <strong id="ALM-12070__b16903112313312">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/controller.log</strong> command to view the Controller resource logs, and run the <strong id="ALM-12070__b290310231836">vi $BIGDATA_LOG_HOME/controller/controller.log </strong>command to view the Controller running logs, check whether the keyword <strong id="ALM-12070__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12070__li1590310231933"><span>Five minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12070__ul209032231431"><li id="ALM-12070__li199031823835">If it is, no further action is required.</li><li id="ALM-12070__li159039231338">If it is not, go to <a href="#ALM-12070__li69038231234">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12070__p13421113195811"><strong id="ALM-12070__b204218131586">Collect fault information.</strong></p> <p id="ALM-12070__p13421113195811"><strong id="ALM-12070__b204218131586">Collect fault information.</strong></p>
<ol start="6" id="ALM-12070__ol39031423835"><li id="ALM-12070__li69038231234"><a name="ALM-12070__li69038231234"></a><a name="li69038231234"></a><span>On FusionInsight Manager, choose <strong id="ALM-12070__b590352315317">O&amp;M</strong> &gt; <strong id="ALM-12070__b59030233320">Log</strong> &gt; <strong id="ALM-12070__b1290362318319">Download</strong>.</span></li><li id="ALM-12070__li18903202318317"><span>Select <strong id="ALM-12070__b6883925124310">Controller </strong>and<strong id="ALM-12070__b1588372554312"> OmmServe</strong> for <strong id="ALM-12070__b890312231830">Service</strong> and click <strong id="ALM-12070__b3991118545">OK</strong>.</span></li><li id="ALM-12070__li18903523531"><span>Click <span><img id="ALM-12070__image18903132310317" src="en-us_image_0000001582927629.png"></span> in the upper right corner, and set <strong id="ALM-12070__b129031823137">Start Date</strong> and <strong id="ALM-12070__b1990322312314">End Date</strong> for log collection to 1 hour before and after the alarm generation time, respectively. Then, click <strong id="ALM-12070__b4903132312320">Download</strong>.</span></li><li id="ALM-12070__li495644512588"><span>Contact the <span id="ALM-12070__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="6" id="ALM-12070__ol39031423835"><li id="ALM-12070__li69038231234"><a name="ALM-12070__li69038231234"></a><a name="li69038231234"></a><span>On <span id="ALM-12070__text452995118469">MRS</span> Manager, choose <strong id="ALM-12070__b590352315317">O&amp;M</strong> &gt; <strong id="ALM-12070__b59030233320">Log</strong> &gt; <strong id="ALM-12070__b1290362318319">Download</strong>.</span></li><li id="ALM-12070__li18903202318317"><span>Select <strong id="ALM-12070__b6883925124310">Controller </strong>and<strong id="ALM-12070__b1588372554312"> OmmServe</strong> for <strong id="ALM-12070__b890312231830">Service</strong> and click <strong id="ALM-12070__b3991118545">OK</strong>.</span></li><li id="ALM-12070__li18903523531"><span>Click <span><img id="ALM-12070__image18903132310317" src="en-us_image_0000001582927629.png"></span> in the upper right corner, and set <strong id="ALM-12070__b129031823137">Start Date</strong> and <strong id="ALM-12070__b1990322312314">End Date</strong> for log collection to 1 hour before and after the alarm generation time, respectively. Then, click <strong id="ALM-12070__b4903132312320">Download</strong>.</span></li><li id="ALM-12070__li495644512588"><span>Contact the <span id="ALM-12070__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12070__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12070__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p> <div class="section" id="ALM-12070__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12070__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div> </div>

View File

@ -3,7 +3,7 @@
<h1 class="topictitle1">ALM-12071 Httpd Resource Is Abnormal</h1> <h1 class="topictitle1">ALM-12071 Httpd Resource Is Abnormal</h1>
<div id="body1547193420658"><div class="section" id="ALM-12071__section1873012221819"><h4 class="sectiontitle">Description</h4><p id="ALM-12071__p1795612211819">HA checks the httpd resources of Manager every 120 seconds. This alarm is generated when HA detects that the httpd resources are abnormal for 10 consecutive times.</p> <div id="body1547193420658"><div class="section" id="ALM-12071__section1873012221819"><h4 class="sectiontitle">Description</h4><p id="ALM-12071__p1795612211819">HA checks the httpd resources of Manager every 120 seconds. This alarm is generated when HA detects that the httpd resources are abnormal for 10 consecutive times.</p>
<p id="ALM-12071__p14956102261815">This alarm is cleared when the httpd resource is normal.</p> <p id="ALM-12071__p14956102261815">This alarm is cleared when the httpd resource is normal.</p>
<p id="ALM-12071__p1495610221182"><strong id="ALM-12071__b9956132220185">Resource Type</strong> of httpd is <strong id="ALM-12071__b1195615225187">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new httpd resources have been enabled on the new active FusionInsight Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p> <p id="ALM-12071__p1495610221182"><strong id="ALM-12071__b9956132220185">Resource Type</strong> of httpd is <strong id="ALM-12071__b1195615225187">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new httpd resources have been enabled on the new active <span id="ALM-12071__text34789336432">MRS</span> Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p>
</div> </div>
<div class="section" id="ALM-12071__section17732172241819"><h4 class="sectiontitle">Attribute</h4> <div class="section" id="ALM-12071__section17732172241819"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12071__table3734112271815" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12071__row1195613222180"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12071__p1695610220189">Alarm ID</p> <div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12071__table3734112271815" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12071__row1195613222180"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12071__p1695610220189">Alarm ID</p>
@ -56,16 +56,16 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12071__section167631322161811"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12071__ul16957122212187"><li id="ALM-12071__li9957322141814">The active/standby FusionInsight Manager switchover occurs.</li><li id="ALM-12071__li2957162220182">The httpd process is repeatedly restarts, which may lead to the failure to visit the native service UI.</li></ul> <div class="section" id="ALM-12071__section167631322161811"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12071__ul16957122212187"><li id="ALM-12071__li9957322141814">The active/standby <span id="ALM-12071__text20387145494613">MRS</span> Manager switchover occurs.</li><li id="ALM-12071__li2957162220182">The httpd process is repeatedly restarts, which may lead to the failure to visit the native service UI.</li></ul>
</div> </div>
<div class="section" id="ALM-12071__section17770132211810"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12071__p480785161917">The httpd process is abnormal.</p> <div class="section" id="ALM-12071__section17770132211810"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12071__p480785161917">The httpd process is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12071__section17774192220180"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12071__p5957822201812"><strong id="ALM-12071__b1595714229188">Check whether the httpd process is abnormal.</strong></p> <div class="section" id="ALM-12071__section17774192220180"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12071__p5957822201812"><strong id="ALM-12071__b1595714229188">Check whether the httpd process is abnormal.</strong></p>
<ol id="ALM-12071__ol108431951181816"><li id="ALM-12071__li11843165119182"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12071__li8843105111811"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12071__b584335112186">root</strong>. <span id="ALM-12071__text985593916354"></span></span></li><li id="ALM-12071__li214051915720"><span>Run the <strong id="ALM-12071__b43699226576">su - omm</strong> command to switch to user <strong id="ALM-12071__b9147527105718">omm</strong>.</span></li><li id="ALM-12071__li4843951151819"><span>Run the <strong id="ALM-12071__b3843205114186">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the httpd resources managed by the HA is normal. In the single-node system, the httpd resource is in the normal state. In the dual-node system, the httpd resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12071__ul14843105113181"><li id="ALM-12071__li88432515181">If it is, go to <a href="#ALM-12071__li384145118188">7</a>.</li><li id="ALM-12071__li158431517188">If it is not, go to <a href="#ALM-12071__li584395101819">5</a>.</li></ul> <ol id="ALM-12071__ol108431951181816"><li id="ALM-12071__li11843165119182"><span>In the alarm list on <span id="ALM-12071__text5629155154612">MRS</span> Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12071__li8843105111811"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12071__b584335112186">root</strong>. <span id="ALM-12071__text985593916354"></span></span></li><li id="ALM-12071__li214051915720"><span>Run the <strong id="ALM-12071__b43699226576">su - omm</strong> command to switch to user <strong id="ALM-12071__b9147527105718">omm</strong>.</span></li><li id="ALM-12071__li4843951151819"><span>Run the <strong id="ALM-12071__b3843205114186">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the httpd resources managed by the HA is normal. In the single-node system, the httpd resource is in the normal state. In the dual-node system, the httpd resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12071__ul14843105113181"><li id="ALM-12071__li88432515181">If it is, go to <a href="#ALM-12071__li384145118188">7</a>.</li><li id="ALM-12071__li158431517188">If it is not, go to <a href="#ALM-12071__li584395101819">5</a>.</li></ul>
</p></li><li id="ALM-12071__li584395101819"><a name="ALM-12071__li584395101819"></a><a name="li584395101819"></a><span>Run the <strong id="ALM-12071__b6843951201818">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/httpd.log</strong> command to view the httpd resource logs, check whether the keyword <strong id="ALM-12071__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12071__li118438511180"><span>Five minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12071__ul1484315115185"><li id="ALM-12071__li3843175115182">If it is, no further action is required.</li><li id="ALM-12071__li1184355116180">If it is not, go to <a href="#ALM-12071__li384145118188">7</a>.</li></ul> </p></li><li id="ALM-12071__li584395101819"><a name="ALM-12071__li584395101819"></a><a name="li584395101819"></a><span>Run the <strong id="ALM-12071__b6843951201818">vi $BIGDATA_LOG_HOME/omm/oms/ha/scriptlog/httpd.log</strong> command to view the httpd resource logs, check whether the keyword <strong id="ALM-12071__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12071__li118438511180"><span>Five minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12071__ul1484315115185"><li id="ALM-12071__li3843175115182">If it is, no further action is required.</li><li id="ALM-12071__li1184355116180">If it is not, go to <a href="#ALM-12071__li384145118188">7</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12071__p1674954751819"><strong id="ALM-12071__b149571522171815">Collect fault information.</strong></p> <p id="ALM-12071__p1674954751819"><strong id="ALM-12071__b149571522171815">Collect fault information.</strong></p>
<ol start="7" id="ALM-12071__ol118431551101813"><li id="ALM-12071__li384145118188"><a name="ALM-12071__li384145118188"></a><a name="li384145118188"></a><span>On FusionInsight Manager, choose <strong id="ALM-12071__b884013510187">O&amp;M</strong> &gt; <strong id="ALM-12071__b384045118183">Log</strong> &gt; <strong id="ALM-12071__b8841155115188">Download</strong>.</span></li><li id="ALM-12071__li5841351151811"><span>Select <strong id="ALM-12071__b78412516184">Controller</strong> and <strong id="ALM-12071__b2841175116185">OmmServer</strong> for <strong id="ALM-12071__b18841451201818">Service</strong> and click <strong id="ALM-12071__b3991118545">OK</strong>.</span></li><li id="ALM-12071__li1684175131820"><span>Click <span><img id="ALM-12071__image1084185120186" src="en-us_image_0000001582927741.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12071__b684175111183">Start Date</strong> and <strong id="ALM-12071__b14841185112187">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12071__b8841551191812">OK</strong>. Then, click <strong id="ALM-12071__b10841155112188">Download</strong>.</span></li><li id="ALM-12071__li495644512588"><span>Contact the <span id="ALM-12071__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="7" id="ALM-12071__ol118431551101813"><li id="ALM-12071__li384145118188"><a name="ALM-12071__li384145118188"></a><a name="li384145118188"></a><span>On <span id="ALM-12071__text2970145684619">MRS</span> Manager, choose <strong id="ALM-12071__b884013510187">O&amp;M</strong> &gt; <strong id="ALM-12071__b384045118183">Log</strong> &gt; <strong id="ALM-12071__b8841155115188">Download</strong>.</span></li><li id="ALM-12071__li5841351151811"><span>Select <strong id="ALM-12071__b78412516184">Controller</strong> and <strong id="ALM-12071__b2841175116185">OmmServer</strong> for <strong id="ALM-12071__b18841451201818">Service</strong> and click <strong id="ALM-12071__b3991118545">OK</strong>.</span></li><li id="ALM-12071__li1684175131820"><span>Click <span><img id="ALM-12071__image1084185120186" src="en-us_image_0000001582927741.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12071__b684175111183">Start Date</strong> and <strong id="ALM-12071__b14841185112187">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12071__b8841551191812">OK</strong>. Then, click <strong id="ALM-12071__b10841155112188">Download</strong>.</span></li><li id="ALM-12071__li495644512588"><span>Contact the <span id="ALM-12071__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12071__section17816122101811"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12071__p1395992212185">This alarm will be automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12071__section17816122101811"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12071__p1395992212185">This alarm will be automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -3,7 +3,7 @@
<h1 class="topictitle1">ALM-12072 FloatIP Resource Is Abnormal</h1> <h1 class="topictitle1">ALM-12072 FloatIP Resource Is Abnormal</h1>
<div id="body1547193420658"><div class="section" id="ALM-12072__section626017484164"><h4 class="sectiontitle">Description</h4><p id="ALM-12072__p15433448121614">HA checks the floatip resources of Manager every 9 seconds. This alarm is generated when HA detects that the floatip resources are abnormal for 3 consecutive times.</p> <div id="body1547193420658"><div class="section" id="ALM-12072__section626017484164"><h4 class="sectiontitle">Description</h4><p id="ALM-12072__p15433448121614">HA checks the floatip resources of Manager every 9 seconds. This alarm is generated when HA detects that the floatip resources are abnormal for 3 consecutive times.</p>
<p id="ALM-12072__p15433154881618">This alarm is cleared when the FloatIP resource is normal.</p> <p id="ALM-12072__p15433154881618">This alarm is cleared when the FloatIP resource is normal.</p>
<p id="ALM-12072__p7433164891618"><strong id="ALM-12072__b4433648141611">Resource Type</strong> of FloatIP is <strong id="ALM-12072__b243317487168">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new FloatIP resources have been enabled on the new active FusionInsight Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p> <p id="ALM-12072__p7433164891618"><strong id="ALM-12072__b4433648141611">Resource Type</strong> of FloatIP is <strong id="ALM-12072__b243317487168">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new FloatIP resources have been enabled on the new active <span id="ALM-12072__text34789336432">MRS</span> Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p>
</div> </div>
<div class="section" id="ALM-12072__section626118482164"><h4 class="sectiontitle">Attribute</h4> <div class="section" id="ALM-12072__section626118482164"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12072__table12262104816161" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12072__row2433548161618"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12072__p44341548171620">Alarm ID</p> <div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12072__table12262104816161" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12072__row2433548161618"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12072__p44341548171620">Alarm ID</p>
@ -56,21 +56,21 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12072__section182791448171620"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12072__ul74355481165"><li id="ALM-12072__li1435184812166">The active/standby FusionInsight Manager switchover occurs.</li><li id="ALM-12072__li104351748121610">The FloatIP process is repeatedly restarts, which may lead to the failure to visit the native service UI.</li></ul> <div class="section" id="ALM-12072__section182791448171620"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12072__ul74355481165"><li id="ALM-12072__li1435184812166">The active/standby <span id="ALM-12072__text494625919465">MRS</span> Manager switchover occurs.</li><li id="ALM-12072__li104351748121610">The FloatIP process is repeatedly restarts, which may lead to the failure to visit the native service UI.</li></ul>
</div> </div>
<div class="section" id="ALM-12072__section15284948161619"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12072__ul11435134871614"><li id="ALM-12072__li134353483164">The floating IP address is abnormal.</li></ul> <div class="section" id="ALM-12072__section15284948161619"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-12072__ul11435134871614"><li id="ALM-12072__li134353483164">The floating IP address is abnormal.</li></ul>
</div> </div>
<div class="section" id="ALM-12072__section182871248141613"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12072__p14354485169"><strong id="ALM-12072__b154352487167">Check the floating IP address status of the active management node.</strong></p> <div class="section" id="ALM-12072__section182871248141613"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12072__p14354485169"><strong id="ALM-12072__b154352487167">Check the floating IP address status of the active management node.</strong></p>
<ol id="ALM-12072__ol1726941101718"><li id="ALM-12072__li13268519176"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the address of the host for which the alarm is generated and the resource name.</span></li><li id="ALM-12072__li326811112176"><span>Log in to the active management node as user <strong id="ALM-12072__b122688191718">root</strong>. <span id="ALM-12072__text985593916354"></span><span id="ALM-12072__text3230164484916"></span></span></li><li id="ALM-12072__li67021344840"><span>Run the following command, go to the <strong id="ALM-12072__b19704174415416">${BIGDATA_HOME}/om-server/om/sbin/</strong> directory.</span><p><p id="ALM-12072__p6644753175416"><strong id="ALM-12072__b1043540105513">su - omm</strong></p> <ol id="ALM-12072__ol1726941101718"><li id="ALM-12072__li13268519176"><span>In the alarm list on <span id="ALM-12072__text824218184714">MRS</span> Manager, locate the row that contains the alarm, and view the address of the host for which the alarm is generated and the resource name.</span></li><li id="ALM-12072__li326811112176"><span>Log in to the active management node as user <strong id="ALM-12072__b122688191718">root</strong>. <span id="ALM-12072__text985593916354"></span><span id="ALM-12072__text3230164484916"></span></span></li><li id="ALM-12072__li67021344840"><span>Run the following command, go to the <strong id="ALM-12072__b19704174415416">${BIGDATA_HOME}/om-server/om/sbin/</strong> directory.</span><p><p id="ALM-12072__p6644753175416"><strong id="ALM-12072__b1043540105513">su - omm</strong></p>
<p id="ALM-12072__p1435805011578"><strong id="ALM-12072__b18358125075711">cd </strong><strong id="ALM-12072__b113580509579">${BIGDATA_HOME}/om-server/om/sbin/</strong></p> <p id="ALM-12072__p1435805011578"><strong id="ALM-12072__b18358125075711">cd </strong><strong id="ALM-12072__b113580509579">${BIGDATA_HOME}/om-server/om/sbin/</strong></p>
</p></li><li id="ALM-12072__li62687118178"><span>Run the <strong id="ALM-12072__b14833124916718">sh status-oms.sh</strong> command, and execute the <strong id="ALM-12072__b12681519174">status-oms.sh</strong> script to check whether the floating IP address of the active FusionInsight Manager is normal. View the command output, locate the row where <strong id="ALM-12072__b202681611179">ResName</strong> is <strong id="ALM-12072__b17268111111713">floatip</strong>, and check whether the following information is displayed.</span><p><p id="ALM-12072__p202681101719">For example:</p> </p></li><li id="ALM-12072__li62687118178"><span>Run the <strong id="ALM-12072__b14833124916718">sh status-oms.sh</strong> command, and execute the <strong id="ALM-12072__b12681519174">status-oms.sh</strong> script to check whether the floating IP address of the active <span id="ALM-12072__text095515213477">MRS</span> Manager is normal. View the command output, locate the row where <strong id="ALM-12072__b202681611179">ResName</strong> is <strong id="ALM-12072__b17268111111713">floatip</strong>, and check whether the following information is displayed.</span><p><p id="ALM-12072__p202681101719">For example:</p>
<pre class="screen" id="ALM-12072__screen826841131713">10-10-10-160 floatip Normal Normal Single_active</pre> <pre class="screen" id="ALM-12072__screen826841131713">10-10-10-160 floatip Normal Normal Single_active</pre>
<ul id="ALM-12072__ul1326814118178"><li id="ALM-12072__li42686101713">If it is, go to <a href="#ALM-12072__li726861151715">8</a>.</li><li id="ALM-12072__li1726812111720">If it is not, go to <a href="#ALM-12072__li162681212172">5</a>.</li></ul> <ul id="ALM-12072__ul1326814118178"><li id="ALM-12072__li42686101713">If it is, go to <a href="#ALM-12072__li726861151715">8</a>.</li><li id="ALM-12072__li1726812111720">If it is not, go to <a href="#ALM-12072__li162681212172">5</a>.</li></ul>
</p></li><li id="ALM-12072__li162681212172"><a name="ALM-12072__li162681212172"></a><a name="li162681212172"></a><span>Run the <strong id="ALM-12072__b132681110175">ifconfig </strong>command to check whether the NIC with the floating IP address exists.</span><p><ul id="ALM-12072__ul1268171101715"><li id="ALM-12072__li192680141716">If it does, go to <a href="#ALM-12072__li726861151715">8</a>.</li><li id="ALM-12072__li226812111171">If it does not, go to <a href="#ALM-12072__li19269111111714">6</a>.</li></ul> </p></li><li id="ALM-12072__li162681212172"><a name="ALM-12072__li162681212172"></a><a name="li162681212172"></a><span>Run the <strong id="ALM-12072__b132681110175">ifconfig </strong>command to check whether the NIC with the floating IP address exists.</span><p><ul id="ALM-12072__ul1268171101715"><li id="ALM-12072__li192680141716">If it does, go to <a href="#ALM-12072__li726861151715">8</a>.</li><li id="ALM-12072__li226812111171">If it does not, go to <a href="#ALM-12072__li19269111111714">6</a>.</li></ul>
</p></li><li id="ALM-12072__li19269111111714"><a name="ALM-12072__li19269111111714"></a><a name="li19269111111714"></a><span>Run the <strong id="ALM-12072__b1426819113173">ifconfig</strong> <em id="ALM-12072__i192695111177">NIC name Floating IPaddress</em> netmask <em id="ALM-12072__i2269181181716">Subnet mask</em> command to reconfigure the NIC with the floating IP address. (For example, <strong id="ALM-12072__b2026991161713">ifconfig eth0 10.10.10.102 netmask 255.255.255.0</strong>).</span></li><li id="ALM-12072__li1426917141719"><span>Five minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12072__ul3269113174"><li id="ALM-12072__li5269101141717">If it is, no further action is required.</li><li id="ALM-12072__li152691214173">If it is not, go to <a href="#ALM-12072__li726861151715">8</a>.</li></ul> </p></li><li id="ALM-12072__li19269111111714"><a name="ALM-12072__li19269111111714"></a><a name="li19269111111714"></a><span>Run the <strong id="ALM-12072__b1426819113173">ifconfig</strong> <em id="ALM-12072__i192695111177">NIC name Floating IPaddress</em> netmask <em id="ALM-12072__i2269181181716">Subnet mask</em> command to reconfigure the NIC with the floating IP address. (For example, <strong id="ALM-12072__b2026991161713">ifconfig eth0 10.10.10.102 netmask 255.255.255.0</strong>).</span></li><li id="ALM-12072__li1426917141719"><span>Five minutes later, check whether the alarm is cleared.</span><p><ul id="ALM-12072__ul3269113174"><li id="ALM-12072__li5269101141717">If it is, no further action is required.</li><li id="ALM-12072__li152691214173">If it is not, go to <a href="#ALM-12072__li726861151715">8</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12072__p194344582164"><strong id="ALM-12072__b11436748171614">Collect fault information.</strong></p> <p id="ALM-12072__p194344582164"><strong id="ALM-12072__b11436748171614">Collect fault information.</strong></p>
<ol start="8" id="ALM-12072__ol1326817181717"><li id="ALM-12072__li726861151715"><a name="ALM-12072__li726861151715"></a><a name="li726861151715"></a><span>On FusionInsight Manager, choose <strong id="ALM-12072__b026812121711">O&amp;M</strong> &gt; <strong id="ALM-12072__b726811111719">Log</strong> &gt; <strong id="ALM-12072__b926841131719">Download</strong>.</span></li><li id="ALM-12072__li162681171713"><span>Select <strong id="ALM-12072__b17268191151713">Controller</strong> and <strong id="ALM-12072__b42681516170">OmmServer</strong> for <strong id="ALM-12072__b112681114179">Service</strong> and click <strong id="ALM-12072__b3991118545">OK</strong>.</span></li><li id="ALM-12072__li1326812151712"><span>Click <span><img id="ALM-12072__image1626812113177" src="en-us_image_0000001582807857.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12072__b1726819191714">Start Date</strong> and <strong id="ALM-12072__b182681113175">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12072__b2268161191713">OK</strong>. Then, click <strong id="ALM-12072__b1326891101719">Download</strong>.</span></li><li id="ALM-12072__li495644512588"><span>Contact the <span id="ALM-12072__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="8" id="ALM-12072__ol1326817181717"><li id="ALM-12072__li726861151715"><a name="ALM-12072__li726861151715"></a><a name="li726861151715"></a><span>On <span id="ALM-12072__text1629874104711">MRS</span> Manager, choose <strong id="ALM-12072__b026812121711">O&amp;M</strong> &gt; <strong id="ALM-12072__b726811111719">Log</strong> &gt; <strong id="ALM-12072__b926841131719">Download</strong>.</span></li><li id="ALM-12072__li162681171713"><span>Select <strong id="ALM-12072__b17268191151713">Controller</strong> and <strong id="ALM-12072__b42681516170">OmmServer</strong> for <strong id="ALM-12072__b112681114179">Service</strong> and click <strong id="ALM-12072__b3991118545">OK</strong>.</span></li><li id="ALM-12072__li1326812151712"><span>Click <span><img id="ALM-12072__image1626812113177" src="en-us_image_0000001582807857.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12072__b1726819191714">Start Date</strong> and <strong id="ALM-12072__b182681113175">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12072__b2268161191713">OK</strong>. Then, click <strong id="ALM-12072__b1326891101719">Download</strong>.</span></li><li id="ALM-12072__li495644512588"><span>Contact the <span id="ALM-12072__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12072__section1132214841620"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12072__p134361483167">This alarm will be automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12072__section1132214841620"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12072__p134361483167">This alarm will be automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -3,7 +3,7 @@
<h1 class="topictitle1">ALM-12073 CEP Resource Is Abnormal</h1> <h1 class="topictitle1">ALM-12073 CEP Resource Is Abnormal</h1>
<div id="body1547193420658"><div class="section" id="ALM-12073__section24601758201512"><h4 class="sectiontitle">Description</h4><p id="ALM-12073__p99041958181518">HA checks the cep resources of Manager every 60 seconds. This alarm is generated when HA detects that the cep resources are abnormal for 2 consecutive times.</p> <div id="body1547193420658"><div class="section" id="ALM-12073__section24601758201512"><h4 class="sectiontitle">Description</h4><p id="ALM-12073__p99041958181518">HA checks the cep resources of Manager every 60 seconds. This alarm is generated when HA detects that the cep resources are abnormal for 2 consecutive times.</p>
<p id="ALM-12073__p69041558161518">This alarm is cleared when the CEP resource is normal.</p> <p id="ALM-12073__p69041558161518">This alarm is cleared when the CEP resource is normal.</p>
<p id="ALM-12073__p1490411584158"><strong id="ALM-12073__b79043583158">Resource Type</strong> of CEP is <strong id="ALM-12073__b1904175821515">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new CEP resources have been enabled on the new active FusionInsight Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p> <p id="ALM-12073__p1490411584158"><strong id="ALM-12073__b79043583158">Resource Type</strong> of CEP is <strong id="ALM-12073__b1904175821515">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new CEP resources have been enabled on the new active <span id="ALM-12073__text34789336432">MRS</span> Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p>
</div> </div>
<div class="section" id="ALM-12073__section3467175816152"><h4 class="sectiontitle">Attribute</h4> <div class="section" id="ALM-12073__section3467175816152"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12073__table547145861512" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12073__row139041258201517"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12073__p1090465816153">Alarm ID</p> <div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12073__table547145861512" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12073__row139041258201517"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12073__p1090465816153">Alarm ID</p>
@ -56,16 +56,16 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12073__section752805841514"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12073__ul109071458171515"><li id="ALM-12073__li2907195814159">The active/standby FusionInsight Manager switchover occurs.</li><li id="ALM-12073__li12907145815158">The CEP process repeatedly restarts, causing monitoring data to be abnormal.</li></ul> <div class="section" id="ALM-12073__section752805841514"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12073__ul109071458171515"><li id="ALM-12073__li2907195814159">The active/standby <span id="ALM-12073__text19333615472">MRS</span> Manager switchover occurs.</li><li id="ALM-12073__li12907145815158">The CEP process repeatedly restarts, causing monitoring data to be abnormal.</li></ul>
</div> </div>
<div class="section" id="ALM-12073__section154117580159"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12073__p13955124019163">The CEP process is abnormal.</p> <div class="section" id="ALM-12073__section154117580159"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12073__p13955124019163">The CEP process is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12073__section355120589155"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12073__p1090845841520"><strong id="ALM-12073__b15908558141519">Check whether the CEP process is abnormal.</strong></p> <div class="section" id="ALM-12073__section355120589155"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12073__p1090845841520"><strong id="ALM-12073__b15908558141519">Check whether the CEP process is abnormal.</strong></p>
<ol id="ALM-12073__ol3262531161613"><li id="ALM-12073__li162612031121620"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12073__li8261133171620"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12073__b1026143111610">root</strong>. <span id="ALM-12073__text985593916354"></span></span></li><li id="ALM-12073__li14261163118165"><span>Run the <strong id="ALM-12073__b2261131171610">su -omm</strong> command and then the <strong id="ALM-12073__b426119312164">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the CEP resources managed by the HA is normal. In the single-node system, the CEP resource is in the normal state. In the dual-node system, the CEP resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12073__ul426133112169"><li id="ALM-12073__li172611231161613">If it is, go to <a href="#ALM-12073__li9258163110165">6</a>.</li><li id="ALM-12073__li12613317162">If it is not, go to <a href="#ALM-12073__li8262123151618">4</a>.</li></ul> <ol id="ALM-12073__ol3262531161613"><li id="ALM-12073__li162612031121620"><span>In the alarm list on <span id="ALM-12073__text5527118154718">MRS</span> Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12073__li8261133171620"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12073__b1026143111610">root</strong>. <span id="ALM-12073__text985593916354"></span></span></li><li id="ALM-12073__li14261163118165"><span>Run the <strong id="ALM-12073__b2261131171610">su -omm</strong> command and then the <strong id="ALM-12073__b426119312164">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the CEP resources managed by the HA is normal. In the single-node system, the CEP resource is in the normal state. In the dual-node system, the CEP resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12073__ul426133112169"><li id="ALM-12073__li172611231161613">If it is, go to <a href="#ALM-12073__li9258163110165">6</a>.</li><li id="ALM-12073__li12613317162">If it is not, go to <a href="#ALM-12073__li8262123151618">4</a>.</li></ul>
</p></li><li id="ALM-12073__li8262123151618"><a name="ALM-12073__li8262123151618"></a><a name="li8262123151618"></a><span>Run the <strong id="ALM-12073__b1026193171612">vi $BIGDATA_LOG_HOME/omm/oms/cep/cep.log </strong>and <strong id="ALM-12073__b1226213316168">vi $BIGDATA_LOG_HOME/omm/oms/cep/scriptlog/cep_ha.log </strong>commands to view the CEP resource logs, check whether the keyword <strong id="ALM-12073__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12073__li132629311160"><span>Five minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12073__ul6262831171619"><li id="ALM-12073__li16262153141620">If it is, no further action is required.</li><li id="ALM-12073__li826216312163">If it is not, go to <a href="#ALM-12073__li9258163110165">6</a>.</li></ul> </p></li><li id="ALM-12073__li8262123151618"><a name="ALM-12073__li8262123151618"></a><a name="li8262123151618"></a><span>Run the <strong id="ALM-12073__b1026193171612">vi $BIGDATA_LOG_HOME/omm/oms/cep/cep.log </strong>and <strong id="ALM-12073__b1226213316168">vi $BIGDATA_LOG_HOME/omm/oms/cep/scriptlog/cep_ha.log </strong>commands to view the CEP resource logs, check whether the keyword <strong id="ALM-12073__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12073__li132629311160"><span>Five minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12073__ul6262831171619"><li id="ALM-12073__li16262153141620">If it is, no further action is required.</li><li id="ALM-12073__li826216312163">If it is not, go to <a href="#ALM-12073__li9258163110165">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12073__p10254192814164"><strong id="ALM-12073__b3909105810152">Collect fault information.</strong></p> <p id="ALM-12073__p10254192814164"><strong id="ALM-12073__b3909105810152">Collect fault information.</strong></p>
<ol start="6" id="ALM-12073__ol526063113163"><li id="ALM-12073__li9258163110165"><a name="ALM-12073__li9258163110165"></a><a name="li9258163110165"></a><span>On FusionInsight Manager, choose <strong id="ALM-12073__b1125815315166">O&amp;M</strong> &gt; <strong id="ALM-12073__b1525823113164">Log</strong> &gt; <strong id="ALM-12073__b625823114166">Download</strong>.</span></li><li id="ALM-12073__li18258163151613"><span>Select <strong id="ALM-12073__b9258163115162">Controller</strong> and <strong id="ALM-12073__b22584311164">OmmServer</strong> for <strong id="ALM-12073__b2258831111615">Service</strong> and click <strong id="ALM-12073__b3991118545">OK</strong>.</span></li><li id="ALM-12073__li12260531161614"><span>Click <span><img id="ALM-12073__image126014312167" src="en-us_image_0000001532927546.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12073__b19260123119168">Start Date</strong> and <strong id="ALM-12073__b32609319168">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12073__b626015316169">OK</strong>. Then, click <strong id="ALM-12073__b1426043161612">Download</strong>.</span></li><li id="ALM-12073__li495644512588"><span>Contact the <span id="ALM-12073__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="6" id="ALM-12073__ol526063113163"><li id="ALM-12073__li9258163110165"><a name="ALM-12073__li9258163110165"></a><a name="li9258163110165"></a><span>On <span id="ALM-12073__text11127131015475">MRS</span> Manager, choose <strong id="ALM-12073__b1125815315166">O&amp;M</strong> &gt; <strong id="ALM-12073__b1525823113164">Log</strong> &gt; <strong id="ALM-12073__b625823114166">Download</strong>.</span></li><li id="ALM-12073__li18258163151613"><span>Select <strong id="ALM-12073__b9258163115162">Controller</strong> and <strong id="ALM-12073__b22584311164">OmmServer</strong> for <strong id="ALM-12073__b2258831111615">Service</strong> and click <strong id="ALM-12073__b3991118545">OK</strong>.</span></li><li id="ALM-12073__li12260531161614"><span>Click <span><img id="ALM-12073__image126014312167" src="en-us_image_0000001532927546.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12073__b19260123119168">Start Date</strong> and <strong id="ALM-12073__b32609319168">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12073__b626015316169">OK</strong>. Then, click <strong id="ALM-12073__b1426043161612">Download</strong>.</span></li><li id="ALM-12073__li495644512588"><span>Contact the <span id="ALM-12073__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12073__section9650125851520"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12073__p1909195801515">This alarm will be automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12073__section9650125851520"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12073__p1909195801515">This alarm will be automatically cleared after the fault is rectified.</p>
</div> </div>

View File

@ -3,7 +3,7 @@
<h1 class="topictitle1">ALM-12074 FMS Resource Is Abnormal</h1> <h1 class="topictitle1">ALM-12074 FMS Resource Is Abnormal</h1>
<div id="body1547193420658"><div class="section" id="ALM-12074__section1025315248149"><h4 class="sectiontitle">Description</h4><p id="ALM-12074__p12471152414146">HA checks the fms resources of Manager every 60 seconds. This alarm is generated when HA detects that the fms resources are abnormal for 2 consecutive times.</p> <div id="body1547193420658"><div class="section" id="ALM-12074__section1025315248149"><h4 class="sectiontitle">Description</h4><p id="ALM-12074__p12471152414146">HA checks the fms resources of Manager every 60 seconds. This alarm is generated when HA detects that the fms resources are abnormal for 2 consecutive times.</p>
<p id="ALM-12074__p9471162418140">This alarm is cleared when the FMS resource is normal.</p> <p id="ALM-12074__p9471162418140">This alarm is cleared when the FMS resource is normal.</p>
<p id="ALM-12074__p1471142491415"><strong id="ALM-12074__b134712246140">Resource Type</strong> of FMS is <strong id="ALM-12074__b1947192491410">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new FMS resources have been enabled on the new active FusionInsight Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p> <p id="ALM-12074__p1471142491415"><strong id="ALM-12074__b134712246140">Resource Type</strong> of FMS is <strong id="ALM-12074__b1947192491410">Single-active</strong>. Active/standby will be triggered upon resource exceptions. When this alarm is generated, the active/standby switchover is complete and new FMS resources have been enabled on the new active <span id="ALM-12074__text34789336432">MRS</span> Manager. In this case, this alarm is cleared. This alarm is used to notify users of the cause of the active/standby switchover.</p>
</div> </div>
<div class="section" id="ALM-12074__section1925572441410"><h4 class="sectiontitle">Attribute</h4> <div class="section" id="ALM-12074__section1925572441410"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12074__table12256142420143" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12074__row7471124161414"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12074__p104711624121419">Alarm ID</p> <div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12074__table12256142420143" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12074__row7471124161414"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12074__p104711624121419">Alarm ID</p>
@ -56,16 +56,16 @@
</table> </table>
</div> </div>
</div> </div>
<div class="section" id="ALM-12074__section182821024171411"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12074__ul147362419144"><li id="ALM-12074__li1847362418142">The active/standby FusionInsight Manager switchover occurs.</li><li id="ALM-12074__li20473132411147">The FMS process repeatedly restarts. As a result, alarm information may fail to be reported.</li></ul> <div class="section" id="ALM-12074__section182821024171411"><h4 class="sectiontitle">Impact on the System</h4><ul id="ALM-12074__ul147362419144"><li id="ALM-12074__li1847362418142">The active/standby <span id="ALM-12074__text131231513144710">MRS</span> Manager switchover occurs.</li><li id="ALM-12074__li20473132411147">The FMS process repeatedly restarts. As a result, alarm information may fail to be reported.</li></ul>
</div> </div>
<div class="section" id="ALM-12074__section192899247141"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12074__p360513492551">The FMS process is abnormal.</p> <div class="section" id="ALM-12074__section192899247141"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12074__p360513492551">The FMS process is abnormal.</p>
</div> </div>
<div class="section" id="ALM-12074__section11292132412149"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12074__p3473172411416"><strong id="ALM-12074__b94731524131416">Check whether the FMS process is abnormal.</strong></p> <div class="section" id="ALM-12074__section11292132412149"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12074__p3473172411416"><strong id="ALM-12074__b94731524131416">Check whether the FMS process is abnormal.</strong></p>
<ol id="ALM-12074__ol7833539131412"><li id="ALM-12074__li2083323971413"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12074__li683353971413"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12074__b78338399141">root</strong>. <span id="ALM-12074__text985593916354"></span></span></li><li id="ALM-12074__li3833339151414"><span>Run the <strong id="ALM-12074__b38330393148">su -omm</strong> command and then the <strong id="ALM-12074__b78339393148">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the FMS resources managed by the HA is normal. In the single-node system, the FMS resource is in the normal state. In the dual-node system, the FMS resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12074__ul178331739111416"><li id="ALM-12074__li198331739171419">If it is, go to <a href="#ALM-12074__li5828173931412">6</a>.</li><li id="ALM-12074__li12833193919144">If it is not, go to <a href="#ALM-12074__li1183383931416">4</a>.</li></ul> <ol id="ALM-12074__ol7833539131412"><li id="ALM-12074__li2083323971413"><span>In the alarm list on <span id="ALM-12074__text10631171416479">MRS</span> Manager, locate the row that contains the alarm, and view the name of the host for which the alarm is generated.</span></li><li id="ALM-12074__li683353971413"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-12074__b78338399141">root</strong>. <span id="ALM-12074__text985593916354"></span></span></li><li id="ALM-12074__li3833339151414"><span>Run the <strong id="ALM-12074__b38330393148">su -omm</strong> command and then the <strong id="ALM-12074__b78339393148">sh ${BIGDATA_HOME}/om-server/OMS/workspace0/ha/module/hacom/script/status_ha.sh</strong> command to check whether the status of the FMS resources managed by the HA is normal. In the single-node system, the FMS resource is in the normal state. In the dual-node system, the FMS resource is in the normal state on the active node and in the stopped state on the standby node.</span><p><ul id="ALM-12074__ul178331739111416"><li id="ALM-12074__li198331739171419">If it is, go to <a href="#ALM-12074__li5828173931412">6</a>.</li><li id="ALM-12074__li12833193919144">If it is not, go to <a href="#ALM-12074__li1183383931416">4</a>.</li></ul>
</p></li><li id="ALM-12074__li1183383931416"><a name="ALM-12074__li1183383931416"></a><a name="li1183383931416"></a><span>Run the <strong id="ALM-12074__b783323918148">vi $BIGDATA_LOG_HOME/omm/oms/fms/fms.log </strong>and <strong id="ALM-12074__b108331539101416">vi $BIGDATA_LOG_HOME/omm/oms/fms/scriptlog/fms_ha.log </strong>commands to view the FMS resource logs, check whether the keyword <strong id="ALM-12074__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12074__li4833133971410"><span>5 minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12074__ul1983383991412"><li id="ALM-12074__li983311395149">If it is, no further action is required.</li><li id="ALM-12074__li0833103914141">If it is not, go to <a href="#ALM-12074__li5828173931412">6</a>.</li></ul> </p></li><li id="ALM-12074__li1183383931416"><a name="ALM-12074__li1183383931416"></a><a name="li1183383931416"></a><span>Run the <strong id="ALM-12074__b783323918148">vi $BIGDATA_LOG_HOME/omm/oms/fms/fms.log </strong>and <strong id="ALM-12074__b108331539101416">vi $BIGDATA_LOG_HOME/omm/oms/fms/scriptlog/fms_ha.log </strong>commands to view the FMS resource logs, check whether the keyword <strong id="ALM-12074__b9187145311439">ERROR</strong> exists. Analyze the logs to locate and rectify the fault.</span></li><li id="ALM-12074__li4833133971410"><span>5 minutes later, check whether this alarm is cleared.</span><p><ul id="ALM-12074__ul1983383991412"><li id="ALM-12074__li983311395149">If it is, no further action is required.</li><li id="ALM-12074__li0833103914141">If it is not, go to <a href="#ALM-12074__li5828173931412">6</a>.</li></ul>
</p></li></ol> </p></li></ol>
<p id="ALM-12074__p590913362141"><strong id="ALM-12074__b2474172420144">Collect fault information.</strong></p> <p id="ALM-12074__p590913362141"><strong id="ALM-12074__b2474172420144">Collect fault information.</strong></p>
<ol start="6" id="ALM-12074__ol1683343918146"><li id="ALM-12074__li5828173931412"><a name="ALM-12074__li5828173931412"></a><a name="li5828173931412"></a><span>On FusionInsight Manager, choose <strong id="ALM-12074__b18828113913148">O&amp;M</strong>&gt; <strong id="ALM-12074__b98286392144">Log</strong> &gt; <strong id="ALM-12074__b13828203912147">Download</strong>.</span></li><li id="ALM-12074__li383393912140"><span>Select <strong id="ALM-12074__b3828039191417">Controller</strong> and <strong id="ALM-12074__b1583363981411">OmmServer</strong> for <strong id="ALM-12074__b17833639111417">Service</strong> and click <strong id="ALM-12074__b3991118545">OK</strong>.</span></li><li id="ALM-12074__li18833339101411"><span>Click <span><img id="ALM-12074__image1383383917144" src="en-us_image_0000001532767442.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12074__b783323981410">Start Date</strong> and <strong id="ALM-12074__b1683314393142">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12074__b08331339131417">OK</strong>. Then, click <strong id="ALM-12074__b11833103913145">Download</strong>.</span></li><li id="ALM-12074__li495644512588"><span>Contact the <span id="ALM-12074__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol> <ol start="6" id="ALM-12074__ol1683343918146"><li id="ALM-12074__li5828173931412"><a name="ALM-12074__li5828173931412"></a><a name="li5828173931412"></a><span>On <span id="ALM-12074__text496414152476">MRS</span> Manager, choose <strong id="ALM-12074__b18828113913148">O&amp;M</strong>&gt; <strong id="ALM-12074__b98286392144">Log</strong> &gt; <strong id="ALM-12074__b13828203912147">Download</strong>.</span></li><li id="ALM-12074__li383393912140"><span>Select <strong id="ALM-12074__b3828039191417">Controller</strong> and <strong id="ALM-12074__b1583363981411">OmmServer</strong> for <strong id="ALM-12074__b17833639111417">Service</strong> and click <strong id="ALM-12074__b3991118545">OK</strong>.</span></li><li id="ALM-12074__li18833339101411"><span>Click <span><img id="ALM-12074__image1383383917144" src="en-us_image_0000001532767442.png"></span> in the upper right corner. In the displayed dialog box, set <strong id="ALM-12074__b783323981410">Start Date</strong> and <strong id="ALM-12074__b1683314393142">End Date</strong> to 1 hour before and after the alarm generation time respectively and click <strong id="ALM-12074__b08331339131417">OK</strong>. Then, click <strong id="ALM-12074__b11833103913145">Download</strong>.</span></li><li id="ALM-12074__li495644512588"><span>Contact the <span id="ALM-12074__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div> </div>
<div class="section" id="ALM-12074__section13393241148"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12074__p34742024121418">This alarm will be automatically cleared after the fault is rectified.</p> <div class="section" id="ALM-12074__section13393241148"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12074__p34742024121418">This alarm will be automatically cleared after the fault is rectified.</p>
</div> </div>

Some files were not shown because too many files have changed in this diff Show More