The system checks the Spark2x service status every 300 seconds. This alarm is generated when the Spark2x service is unavailable.
This alarm is cleared when the Spark2x service recovers.
Alarm ID |
Alarm Severity |
Auto Clear |
---|---|---|
43001 |
Critical |
Yes |
Name |
Meaning |
---|---|
Source |
Specifies the cluster for which the alarm is generated. |
ServiceName |
Specifies the service for which the alarm is generated. |
RoleName |
Specifies the role for which the alarm is generated. |
HostName |
Specifies the host for which the alarm is generated. |
The Spark tasks submitted by users fail to be executed.
If the alarm is abnormal Spark2x assembly packet, the Spark packet is abnormal. Wait for about 10 minutes. The alarm is automatically cleared.
Check whether service unavailability alarms exist in services that Spark2x depends on.
If the multi-instance function is enabled for the cluster and multiple Spark2x services are installed, check the Spark2x service for which the alarm is generated based on the value of ServiceName in location information and check whether the Hive service is faulty. Spark2x corresponds to Hive, spark2x1 corresponds to Hive1, and other services follow the same rule.
After the alarm is cleared, wait a few minutes and check whether the alarm GuardianService Unavailable is cleared.
Collect fault information.
This alarm is automatically cleared after the fault is rectified.
None