This alarm is generated when the Hive warehouse space usage exceeds the specified threshold (85% by default). The system checks the Hive data warehouse space usage every 30 seconds. The indicator Percentage of HDFS Space Used by Hive to the Available Space can be viewed on the Hive service monitoring page.
To change the threshold, choose O&M > Alarm > Thresholds > Name of the desired cluster > Hive > Percentage of HDFS Space Used by Hive to the Available Space.
When the Trigger Count is 1, this alarm is cleared when the Hive warehouse space usage is less than or equal to the threshold. When the Trigger Count is greater than 1, this alarm is cleared when the Hive warehouse space usage is less than or equal to 90% of the threshold.
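The clearing rule above can be sketched as a small function. This is only an illustration of the rule as described here, not the product's implementation; the function name `is_cleared` and its parameters are hypothetical.

```python
def is_cleared(usage_pct: float, threshold_pct: float = 85.0, trigger_count: int = 1) -> bool:
    """Return True if the alarm would be cleared under the rule described above.

    usage_pct     -- current Hive warehouse space usage, in percent
    threshold_pct -- configured alarm threshold, in percent (85 by default)
    trigger_count -- configured Trigger Count (hypothetical parameter name)
    """
    if trigger_count < 1:
        raise ValueError("trigger_count must be at least 1")
    if trigger_count == 1:
        # Cleared as soon as usage is back at or below the threshold.
        clear_level = threshold_pct
    else:
        # Cleared only when usage is at or below 90% of the threshold.
        clear_level = 0.9 * threshold_pct
    return usage_pct <= clear_level


if __name__ == "__main__":
    print(is_cleared(84.0, 85.0, trigger_count=1))  # usage below the threshold
    print(is_cleared(80.0, 85.0, trigger_count=3))  # still above 90% of the threshold
```

With the default threshold of 85% and a Trigger Count greater than 1, the clearing level is 76.5%, so a usage of 80% keeps the alarm active.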
The administrator can reduce the warehouse space usage by expanding the warehouse capacity or releasing the used space.
Alarm ID | Alarm Severity | Automatically Cleared |
---|---|---|
16001 | Minor | Yes |
Name | Meaning |
---|---|
Source | Specifies the cluster for which the alarm is generated. |
ServiceName | Specifies the service for which the alarm is generated. |
RoleName | Specifies the role for which the alarm is generated. |
HostName | Specifies the host for which the alarm is generated. |
Trigger Condition | Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated. |
The system fails to write data, which causes data loss.
Expand the system configuration.
Log in to FusionInsight Manager, choose Cluster > Name of the desired cluster > Services > Hive > Configurations > All Configurations, find hive.metastore.warehouse.size.percent, and increase its value so that more HDFS capacity is available to Hive. Let A be the value of the configuration item, B the total HDFS storage space, C the alarm threshold, and D the HDFS space used by Hive. Choose A so that A x B x C > D, treating the percentages A and C as fractions of 1. The total HDFS storage space can be viewed on the HDFS NameNode page, and the HDFS space used by Hive can be viewed on the Hive monitoring page.
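As a rough aid for picking a new value, the adjustment policy A x B x C > D can be rearranged to A > D / (B x C). The sketch below assumes A and C are written as fractions (for example, 0.85 for 85%) and that B and D use the same unit. Whether the configuration item itself is entered as a percentage or a fraction is not stated here, so convert the result as needed. The function names are hypothetical.

```python
def min_size_fraction(total_hdfs: float, hive_used: float, threshold: float = 0.85) -> float:
    """Return the boundary value of A at which A x B x C equals D.

    Any value of A strictly greater than the result satisfies A x B x C > D.
    total_hdfs -- B, total HDFS storage space
    hive_used  -- D, HDFS space used by Hive (same unit as total_hdfs)
    threshold  -- C, alarm threshold as a fraction (assumed form, 0.85 = 85%)
    """
    if total_hdfs <= 0:
        raise ValueError("total_hdfs must be positive")
    if not 0 < threshold <= 1:
        raise ValueError("threshold must be a fraction in (0, 1]")
    return hive_used / (total_hdfs * threshold)


def satisfies_policy(a: float, total_hdfs: float, threshold: float, hive_used: float) -> bool:
    """Check the adjustment policy A x B x C > D for a candidate value of A."""
    return a * total_hdfs * threshold > hive_used


if __name__ == "__main__":
    # Illustrative numbers only: 100 TB of HDFS in total, 34 TB used by Hive.
    boundary = min_size_fraction(total_hdfs=100.0, hive_used=34.0)
    print(f"A must exceed {boundary:.2f}")
    print(satisfies_policy(0.5, 100.0, 0.85, 34.0))
```

With these illustrative numbers the boundary is about 0.40, so a value of 0.5 (50%) leaves headroom, while 0.3 (30%) would still leave the usage above the threshold.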
Expand the system.
Check whether the data node is normal.
Collect fault information.
After the fault is rectified, the system automatically clears this alarm.
None