The system checks the percentage of the HQL statements that are executed successfully in every 30 seconds. The formula is: Percentage of HQL statements that are executed successfully = Number of HQL statements that are executed successfully by Hive in a specified period/Total number of HQL statements that are executed by Hive. This indicator can be viewed on the Cluster > Name of the desired cluster > Services > Hive > Instance > HiveServer instance . The default threshold of the percentage of HQL statements that are executed successfully is 90%. An alarm is reported when the percentage is lower than the 90%. Users can view the name of the host where an alarm is generated in the location information about the alarm. The IP address of the host is the IP address of the HiveServer node.
Users can modify the threshold by choosing O&M > Alarm > Thresholds > Name of the desired cluster > Hive > Percentage of HQL Statements That Are Executed Successfully by Hive.
This alarm is cleared when the execution success rate is higher than 110% of the threshold.
Alarm ID |
Alarm Severity |
Automatically Cleared |
---|---|---|
16002 |
Major |
Yes |
Name |
Meaning |
---|---|
Source |
Specifies the cluster for which the alarm is generated. |
ServiceName |
Specifies the service for which the alarm is generated. |
RoleName |
Specifies the role for which the alarm is generated. |
HostName |
Specifies the host for which the alarm is generated. |
Trigger Condition |
Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated. |
The system configuration and performance cannot meet service processing requirements.
Check whether the HQL statements comply with syntax.
To view the user who runs an incorrect statement, you can download the hiveserver audit log file of the HiveServer node where this alarm is generated. Start Data and End Data are 10 minutes before and after the alarm generation time respectively. Open the log file and search for the Result=FAIL keyword to filter the log information about the incorrect statement, and then view the user who runs the incorrect statement according to UserName in the log information.
Check whether the HBase service is abnormal.
Check whether the HDFS, Yarn, and ZooKeeper are normal.
Collect fault information.
After the fault is rectified, the system automatically clears this alarm.
None