doc-exports/docs/mrs/umn/ALM-16002.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

<a name="ALM-16002"></a><a name="ALM-16002"></a>
<h1 class="topictitle1">ALM-16002 Hive SQL Execution Success Rate Is Lower Than the Threshold</h1>
<div id="body5620840"><div class="section" id="ALM-16002__scaabba1f6537427c88e4f26794017c1b"><h4 class="sectiontitle">Description</h4><p id="ALM-16002__en-us_topic_0070543660_p19980869">The system checks the percentage of HQL statements that are executed successfully every 30 seconds. The formula is: Percentage of HQL statements that are executed successfully = Number of HQL statements that are executed successfully by Hive in a specified period/Total number of HQL statements that are executed by Hive in the same period. To view this indicator, choose <strong id="ALM-16002__b5688105512216">Cluster<em id="ALM-16002__i1568865582111"> &gt;</em></strong><em id="ALM-16002__i969285517216"> Name of the desired cluster</em> <strong id="ALM-16002__b14689855162111">&gt; Services</strong> &gt; <strong id="ALM-16002__b17567026111711">Hive &gt; Instance</strong> &gt; <em id="ALM-16002__i2561101611189">HiveServer instance</em><strong id="ALM-16002__b10565937186">.</strong> The default threshold of the percentage of HQL statements that are executed successfully is <strong id="ALM-16002__en-us_topic_0070543660_b45610096">90%</strong>. An alarm is reported when the percentage is lower than the threshold (<strong id="ALM-16002__en-us_topic_0070543660_b7837682">90%</strong> by default). The location information of the alarm shows the name of the host where the alarm is generated. The IP address of the host is the IP address of the HiveServer node.</p>
<p id="ALM-16002__en-us_topic_0070543660_p3430276">Users can modify the threshold by choosing <strong id="ALM-16002__b14283121134517"><strong id="ALM-16002__b4283711134518">O&amp;M &gt; Alarm &gt; Thresholds &gt;</strong></strong> <em id="ALM-16002__i192850112450">Name of the desired cluster</em> &gt;<strong id="ALM-16002__b10284811114518"> <strong id="ALM-16002__b1528421124517">Hive</strong></strong> &gt; <strong id="ALM-16002__en-us_topic_0070543660_b19840075">Percentage of HQL Statements That Are Executed Successfully by Hive</strong>.</p>
<p id="ALM-16002__en-us_topic_0070543660_p44342955">This alarm is cleared when the execution success rate is higher than 110% of the threshold.</p>
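<p>The reporting and clearing rules above can be illustrated with a minimal Python sketch. It is not the FusionInsight Manager implementation: the function names are hypothetical, and treating a period with no statements as 100% is an assumption. With the default threshold of 90%, the clearing point works out to 99%.</p>

```python
# Illustrative sketch only; names and the idle-period rule are assumptions.

def success_rate(succeeded: int, total: int) -> float:
    """Percentage of HQL statements executed successfully in one check period."""
    if total == 0:
        # Assumption: a period with no statements is treated as 100%.
        return 100.0
    return 100.0 * succeeded / total


def next_alarm_state(active: bool, rate: float, threshold: float = 90.0) -> bool:
    """Return True if the alarm is active after evaluating one check period."""
    if not active:
        # Reported when the rate is lower than the threshold.
        return threshold > rate
    # Cleared only when the rate is higher than 110% of the threshold.
    clear_point = threshold * 110 / 100
    return not rate > clear_point


# Example: 44 of 50 statements succeed, so the rate is 88% and the alarm is reported.
state = next_alarm_state(False, success_rate(44, 50))
```

<p>In this sketch, a rate of 95% does not clear an active alarm under the default threshold, because it is not higher than 99%.</p>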
</div>
<div class="section" id="ALM-16002__s107d6b02863845a3a14bff27bc9ee2fb"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-16002__en-us_topic_0070543660_table35009634" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-16002__en-us_topic_0070543660_row25029423"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-16002__en-us_topic_0070543660_p14117383">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-16002__en-us_topic_0070543660_p2657388">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-16002__en-us_topic_0070543660_p13921861">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-16002__en-us_topic_0070543660_row53928955"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-16002__en-us_topic_0070543660_p6169205">16002</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-16002__en-us_topic_0070543660_p29943579">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-16002__en-us_topic_0070543660_p9510871">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-16002__sed4e4f2116a1411fb1df6637b7afedf9"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-16002__en-us_topic_0070543660_table32183054" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-16002__en-us_topic_0070543660_row55498160"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-16002__en-us_topic_0070543660_p66165942">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-16002__en-us_topic_0070543660_p57841085">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-16002__row134994415328"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-16002__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-16002__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-16002__en-us_topic_0070543660_row54616300"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-16002__en-us_topic_0070543660_p61844204">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-16002__en-us_topic_0070543660_p43324645">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-16002__en-us_topic_0070543660_row54377486"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-16002__en-us_topic_0070543660_p42500206">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-16002__en-us_topic_0070543660_p19964634">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-16002__en-us_topic_0070543660_row45463985"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-16002__en-us_topic_0070543660_p58704155">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-16002__en-us_topic_0070543660_p57416134">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-16002__en-us_topic_0070543660_row46983163"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-16002__en-us_topic_0070543660_p47539882">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-16002__en-us_topic_0070543660_p25525229">Specifies the threshold triggering the alarm. If the current indicator value falls below this threshold, the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-16002__s46fc7b85217640a1bc241a40aa8a60c7"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-16002__en-us_topic_0070543660_p54277677">The system configuration and performance cannot meet service processing requirements.</p>
</div>
<div class="section" id="ALM-16002__s8428e3da6acf4b279725cdbceb7180ff"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-16002__en-us_topic_0070543660_ul34415697"><li id="ALM-16002__en-us_topic_0070543660_li41305820">A syntax error occurs in HQL statements.</li><li id="ALM-16002__en-us_topic_0070543660_li36208066">The HBase service is abnormal when a Hive on HBase task is performed.</li><li id="ALM-16002__en-us_topic_0070543660_li57437138">The Spark service is abnormal when a Hive on Spark task is performed.</li><li id="ALM-16002__en-us_topic_0070543660_li47172199">The dependent basic services, such as HDFS, Yarn, and ZooKeeper, are abnormal.</li></ul>
</div>
<div class="section" id="ALM-16002__s040fb042243d4c34a52ac2842a8ec5f4"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-16002__en-us_topic_0070543660_p62851752"><strong id="ALM-16002__b286936431454">Check whether the HQL statements comply with the HQL syntax.</strong></p>
<ol id="ALM-16002__ol552903514512"><li id="ALM-16002__li81761748141418"><span>On the FusionInsight Manager page, choose <strong id="ALM-16002__b8177548141417">O&amp;M</strong> &gt; <strong id="ALM-16002__b71771048101419">Alarm</strong> to view the alarm details and obtain the node where the alarm is generated.</span></li><li id="ALM-16002__li3922952414456"><span>Use the Hive client to log in to the HiveServer node where an alarm is reported. Query the HQL syntax provided by Apache, and check whether the HQL commands are correct. </span><p><ul class="subitemlist" id="ALM-16002__ul1876744714456"><li id="ALM-16002__li702200414456">If yes, go to <a href="#ALM-16002__li2677546914456">4</a>.</li><li id="ALM-16002__li3191143314456">If no, go to <a href="#ALM-16002__li3343432914456">3</a>.</li></ul>
<div class="note" id="ALM-16002__note435883614456"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="ALM-16002__p3031047614456">To identify the user who ran an incorrect statement, download the HiveServer audit log file of the HiveServer node where this alarm is generated. Set <strong id="ALM-16002__b3468929514456">Start Date</strong> and <strong id="ALM-16002__b4376820414456">End Date</strong> to 10 minutes before and after the alarm generation time, respectively. Open the log file and search for the <strong id="ALM-16002__b5836951714456">Result=FAIL</strong> keyword to filter the log entries for the incorrect statement, and then identify the user from <strong id="ALM-16002__b5556361314456">UserName</strong> in those entries.</p>
</div></div>
</p></li><li id="ALM-16002__li3343432914456"><a name="ALM-16002__li3343432914456"></a><a name="li3343432914456"></a><span>Enter the correct HQL statements, and check whether the command can be properly executed.</span><p><ul class="subitemlist" id="ALM-16002__ul41276914456"><li id="ALM-16002__li2347486014456">If yes, go to <a href="#ALM-16002__li5821800114456">12</a>.</li><li id="ALM-16002__li2241548414456">If no, go to <a href="#ALM-16002__li2677546914456">4</a>.</li></ul>
</p></li></ol>
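<p>The audit log search described in the note in step 2 can be scripted. The following Python sketch counts failed statements per user. It assumes that each audit entry is a single line containing <strong>Result=FAIL</strong> and a <strong>UserName=</strong><em>name</em> field; the exact layout of the audit log is not specified here, so the pattern and the function name are hypothetical and may need adjusting.</p>

```python
import re
from collections import Counter
from typing import Iterable

# Assumption: the user appears as "UserName=name" on the same line as
# "Result=FAIL". Adjust the pattern to the actual audit log layout.
USER_RE = re.compile(r"UserName=([^\s|,;]+)")


def failed_users(lines: Iterable[str]) -> Counter:
    """Count failed HQL statements per user in audit log lines."""
    counts = Counter()
    for line in lines:
        if "Result=FAIL" not in line:
            continue  # keep only entries for failed statements
        match = USER_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# Hypothetical sample lines, for illustration only.
sample = [
    "2022-12-13 12:00:01 | UserName=alice | Result=SUCCESS",
    "2022-12-13 12:00:05 | UserName=bob | Result=FAIL",
    "2022-12-13 12:00:09 | UserName=bob | Result=FAIL",
]
print(failed_users(sample).most_common())
```

<p>To process a downloaded file, pass an open file object instead of the sample list, for example <strong>failed_users(open(path))</strong>, where <em>path</em> is the location of the downloaded audit log.</p>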
<p class="tableheading" id="ALM-16002__p371492514456"><strong id="ALM-16002__b3625462014526">Check whether the HBase service is abnormal.</strong></p>
<ol start="4" id="ALM-16002__ol3652616614547"><li id="ALM-16002__li2677546914456"><a name="ALM-16002__li2677546914456"></a><a name="li2677546914456"></a><span>Check with the user who runs the HQL command whether a Hive on HBase task is performed.</span><p><ul class="subitemlist" id="ALM-16002__ul5517083514456"><li id="ALM-16002__li2382614214456">If yes, go to <a href="#ALM-16002__li1989232914456">5</a>.</li><li id="ALM-16002__li5086933514456">If no, go to <a href="#ALM-16002__li4623094014456">8</a>.</li></ul>
</p></li><li id="ALM-16002__li1989232914456"><a name="ALM-16002__li1989232914456"></a><a name="li1989232914456"></a><span>On the FusionInsight Manager page, choose <strong id="ALM-16002__b213171918166">Cluster </strong>&gt; <em id="ALM-16002__i413181911163">Name of the desired cluster</em> &gt;<strong id="ALM-16002__b12131019111614"> Services</strong>, and check in the service list whether the HBase service is normal.</span><p><ul class="subitemlist" id="ALM-16002__ul4694950114456"><li id="ALM-16002__li2132935114456">If yes, go to <a href="#ALM-16002__li4623094014456">8</a>.</li><li id="ALM-16002__li4995585314456">If no, go to <a href="#ALM-16002__li4481323314456">6</a>.</li></ul>
</p></li><li id="ALM-16002__li4481323314456"><a name="ALM-16002__li4481323314456"></a><a name="li4481323314456"></a><span>Choose <strong id="ALM-16002__b4838153716362">O&amp;M</strong> &gt; <strong id="ALM-16002__b28381637143618">Alarm</strong>, check the related alarms displayed on the alarm page, and clear them according to the related alarm help.</span></li><li id="ALM-16002__li3255046614456"><span>Enter the correct HQL statements, and check whether the command can be properly executed.</span><p><ul class="subitemlist" id="ALM-16002__ul702989314456"><li id="ALM-16002__li599328614456">If yes, go to <a href="#ALM-16002__li5821800114456">12</a>.</li><li id="ALM-16002__li1569418014456">If no, go to <a href="#ALM-16002__li4623094014456">8</a>.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-16002__p5327181614456"><strong id="ALM-16002__b19868814141755">Check whether HDFS, Yarn, and ZooKeeper are normal.</strong></p>
<ol start="8" id="ALM-16002__ol4532453314637"><li id="ALM-16002__li4623094014456"><a name="ALM-16002__li4623094014456"></a><a name="li4623094014456"></a><span>On the FusionInsight Manager page, choose <strong id="ALM-16002__b18425135013429">Cluster </strong>&gt; <em id="ALM-16002__i94271501427">Name of the desired cluster</em> &gt;<strong id="ALM-16002__b15426050144217"> Services</strong>.</span></li><li id="ALM-16002__li5945449914456"><span>In the service list, check whether services such as HDFS, Yarn, and ZooKeeper are normal.</span><p><ul class="subitemlist" id="ALM-16002__ul3643221714456"><li id="ALM-16002__li5371865514456">If yes, go to <a href="#ALM-16002__li5821800114456">12</a>.</li><li id="ALM-16002__li5624380714456">If no, go to <a href="#ALM-16002__li6532844614456">10</a>.</li></ul>
</p></li><li id="ALM-16002__li6532844614456"><a name="ALM-16002__li6532844614456"></a><a name="li6532844614456"></a><span>Check the related alarms displayed on the alarm page and clear them according to the related alarm help.</span></li><li id="ALM-16002__li4821816114456"><span>Enter the correct HQL statements, and check whether the command can be properly executed.</span><p><ul class="subitemlist" id="ALM-16002__ul2772719414456"><li id="ALM-16002__li5711278814456">If yes, go to <a href="#ALM-16002__li5821800114456">12</a>.</li><li id="ALM-16002__li6273312214456">If no, go to <a href="#ALM-16002__li2812112614456">13</a>.</li></ul>
</p></li><li id="ALM-16002__li5821800114456"><a name="ALM-16002__li5821800114456"></a><a name="li5821800114456"></a><span>After 1 minute, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-16002__ul817528114456"><li id="ALM-16002__li3131027314456">If yes, no further action is required.</li><li id="ALM-16002__li5310414714456">If no, go to <a href="#ALM-16002__li2812112614456">13</a>.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-16002__p646866614456"><strong id="ALM-16002__b2252645014648">Collect fault information.</strong></p>
<ol start="13" id="ALM-16002__ol840629514651"><li id="ALM-16002__li2812112614456"><a name="ALM-16002__li2812112614456"></a><a name="li2812112614456"></a><span>On the FusionInsight Manager home page, choose <strong id="ALM-16002__b39977366113627">O&amp;M</strong> &gt; <strong id="ALM-16002__b24251979113627">Log &gt; Download</strong>.</span></li><li id="ALM-16002__li4542161014456"><span>In <strong id="ALM-16002__b5176354714456">Service</strong>, select the following services in the required cluster:</span><p><ul class="subitemlist" id="ALM-16002__ul4978608814456"><li id="ALM-16002__li3209781314456">MapReduce</li><li id="ALM-16002__li2044486814456">Hive</li></ul>
</p></li><li id="ALM-16002__li1145664103113"><span>Click <span><img id="ALM-16002__image1945644173117" src="en-us_image_0269417377.png"></span> in the upper right corner, and set <strong id="ALM-16002__b6456941173117">Start Date</strong> and <strong id="ALM-16002__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-16002__b13456164113319">Download</strong>.</span></li><li id="ALM-16002__li2779776214456"><span>Contact the <span id="ALM-16002__text4614151421417">O&amp;M personnel</span> and send the collected logs.</span></li></ol>
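<p>The log collection window in the preceding steps is the alarm generation time minus and plus 10 minutes. As a small worked example in Python (the function name and the sample timestamp are hypothetical):</p>

```python
from datetime import datetime, timedelta


def log_window(alarm_time: datetime, margin_minutes: int = 10):
    """Return (start, end) for log collection around the alarm generation time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time, for illustration only.
start, end = log_window(datetime(2022, 12, 13, 12, 3, 34))
print(start.isoformat(sep=" "), end.isoformat(sep=" "))
```

<p>The two values correspond to <strong>Start Date</strong> and <strong>End Date</strong>, respectively.</p>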
</div>
<div class="section" id="ALM-16002__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-16002__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-16002__sf415d68f597d4e96b38074218093c1ef"><h4 class="sectiontitle">Related Information</h4><p id="ALM-16002__en-us_topic_0070543660_p49115527">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>