ALM-12101 AZ Unhealthy

Description

After the AZ DR function is enabled, the system checks the AZ health status every 5 minutes. This alarm is generated when the system detects that the AZ is subhealthy or unhealthy. This alarm is cleared when the AZ becomes healthy.

Attribute

Alarm ID

Alarm Severity

Auto Clear

12101

Major

Yes

Parameters

Parameter

Meaning

Source

Specifies the cluster for which the alarm is generated.

ServiceName

Specifies the service for which the alarm is generated.

AZName

Specifies the AZ for which the alarm is generated.

HostName

Specifies the host for which the alarm is generated.

Impact on the System

The health status of an AZ is determined by whether the health status of storage resources (HDFS), computing resources (Yarn), and key roles in the AZ exceeds the configured threshold.

An AZ is subhealthy when:

An AZ is unhealthy when:

Possible Causes

Procedure

Disable the DR drill.

  1. On FusionInsight Manager, choose Cluster > Name of the desired cluster > Cross-AZ HA. The Cross-AZ HA page is displayed.
  2. In the AZ DR list, check whether Perform DR Drill in the Operation column of the AZ whose health status is Unhealthy is gray.

    • If yes, go to 4.
    • If no, go to 3.

  3. Click Restore in the Operation column of the target AZ. Wait 2 minutes and refresh the page to view the health status of the AZ. Check whether the health status is normal.

    • If yes, no further action is required.
    • If no, go to 4.

Collect the fault information.

  1. Log in to the active management node as user root.
  2. View logs of unhealthy services.

    • HDFS log files are stored in /var/log/Bigdata/hdfs/nn/hdfs-az-state.log.
    • Yarn log files are stored in /var/log/Bigdata/yarn/rm/yarn-az-state.log.
    • For other services, view the service health check logs in the corresponding service log directory.

  3. Contact O&M personnel and provide detailed log file information.

Alarm Clearing

After the fault is rectified, the system automatically clears this alarm.

Related Information

None