doc-exports/docs/mrs/umn/ALM-20002.html
Yang, Tong 2195db241c MRS UMN 20231220 version update
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2024-05-16 09:40:21 +00:00

ALM-20002 Hue Service Unavailable

Description

This alarm is generated when the Hue service is unavailable. The system checks the Hue service status every 60 seconds.

This alarm is cleared when the Hue service is normal.
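
The generate/clear behavior described above can be modeled as a small state machine. The following is a minimal sketch only, assuming one health sample per 60-second check; the function name `alarm_events` and the sample list are hypothetical, and the real detection logic inside MRS is not described in this document.

```python
from typing import Iterable, List, Tuple

CHECK_INTERVAL_S = 60  # check period stated in the description


def alarm_events(samples: Iterable[bool]) -> List[Tuple[int, str]]:
    """Turn periodic health samples (True = Hue normal) into alarm events.

    Returns (seconds since first check, "raise" | "clear") tuples.
    This is an illustrative model, not the MRS implementation.
    """
    events: List[Tuple[int, str]] = []
    active = False  # whether the alarm is currently raised
    for index, healthy in enumerate(samples):
        elapsed = index * CHECK_INTERVAL_S
        if not healthy and not active:
            events.append((elapsed, "raise"))
            active = True
        elif healthy and active:
            events.append((elapsed, "clear"))
            active = False
    return events


print(alarm_events([True, False, False, True]))
# [(60, 'raise'), (180, 'clear')]
```

In this model the alarm is raised once on the first failed check and cleared on the first check that finds the service normal again, which matches the automatic clearing noted in the attribute table.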

Attribute

Alarm ID    Alarm Severity    Automatically Cleared
20002       Critical          Yes

Parameters

Name           Meaning
Source         Specifies the cluster for which the alarm is generated.
ServiceName    Specifies the service for which the alarm is generated.
RoleName       Specifies the role for which the alarm is generated.
HostName       Specifies the host for which the alarm is generated.

Impact on the System

The system cannot provide data loading, query, and extraction services.

Possible Causes

  • The internal KrbServer service on which the Hue service depends is abnormal.
  • The internal DBService service on which the Hue service depends is abnormal.
  • The network connection to the DBService is abnormal.
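
The Procedure section checks these causes in a fixed order. As a rough sketch of that triage order, the function below maps a set of observations to the first corrective action. The `Observations` class and `first_action` function are hypothetical names introduced for illustration; the statuses themselves are read manually from MRS Manager, not from any API shown here.

```python
from dataclasses import dataclass


@dataclass
class Observations:
    """Manually gathered facts (hypothetical structure for illustration)."""
    krbserver_normal: bool      # KrbServer running status is Normal
    dbservice_normal: bool      # DBService running status is Normal
    dbservice_reachable: bool   # ping from the active Hue host succeeds


def first_action(obs: Observations) -> str:
    """Return the first corrective action, in the order the procedure checks."""
    if not obs.krbserver_normal:
        return "Restart the KrbServer service"
    if not obs.dbservice_normal:
        return "Restart the DBService"
    if not obs.dbservice_reachable:
        return "Contact the administrator to restore the network"
    return "Collect fault information"


print(first_action(Observations(True, False, True)))
# Restart the DBService
```

After each corrective action, the procedure waits several minutes and rechecks the alarm before moving to the next check.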

Procedure

Check whether the KrbServer is abnormal.

  1. On the MRS Manager home page, choose Cluster > Name of the desired cluster > Services. In the service list, check whether the KrbServer running status is Normal.

    • If yes, go to 4.
    • If no, go to 2.

  2. Restart the KrbServer service.
  3. Wait several minutes, and check whether the Hue Service Unavailable alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 4.

Check whether the DBService is abnormal.

  4. On the MRS Manager home page, choose Cluster > Name of the desired cluster > Services.
  5. In the service list, check whether the DBService running status is Normal.

    • If yes, go to 8.
    • If no, go to 6.

  6. Restart the DBService.

    To restart the service, enter the MRS Manager administrator password.

  7. Wait several minutes, and check whether the Hue Service Unavailable alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 8.

Check whether the network connection to the DBService is normal.

  8. Choose Cluster > Name of the desired cluster > Services > Hue > Instance, and record the IP address of the active Hue.
  9. Log in to the host that runs the active Hue.
  10. Run the ping command to check whether communication between the host that runs the active Hue and the hosts that run the DBService is normal. (Obtain the IP addresses of the hosts that run the DBService in the same way as the IP address of the active Hue.)

    • If yes, go to 13.
    • If no, go to 11.

  11. Contact the administrator to restore the network.
  12. Wait several minutes, and check whether the Hue Service Unavailable alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 13.
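
When several DBService hosts need to be checked, the ping test above can be scripted. The following is a minimal sketch, assuming a Linux host with the iputils `ping` command (`-c` for packet count, `-W` for the reply timeout in seconds). The function names are hypothetical, and the command runner is passed in as a parameter so the logic can be exercised without network access.

```python
import subprocess
from typing import Callable, Dict, Iterable, List


def build_ping_command(host: str, count: int = 3, timeout_s: int = 2) -> List[str]:
    """Build a Linux (iputils) ping command line for one host."""
    return ["ping", "-c", str(count), "-W", str(timeout_s), host]


def run_command(cmd: List[str]) -> int:
    """Run a command quietly and return its exit code (0 = replies received)."""
    completed = subprocess.run(
        cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    return completed.returncode


def check_dbservice_hosts(
    hosts: Iterable[str],
    runner: Callable[[List[str]], int] = run_command,
) -> Dict[str, bool]:
    """Ping each DBService host and record whether it answered."""
    return {host: runner(build_ping_command(host)) == 0 for host in hosts}


def next_step(results: Dict[str, bool]) -> int:
    """Map the ping results to the step number the procedure points to."""
    # All hosts reachable: go to 13 (collect fault information).
    # Otherwise: go to 11 (contact the administrator).
    return 13 if results and all(results.values()) else 11
```

Run on the host of the active Hue, a call such as `next_step(check_dbservice_hosts(["192.0.2.10", "192.0.2.11"]))` returns the number of the step to continue with. The two addresses are placeholders from the documentation address range; substitute the DBService host IP addresses recorded from MRS Manager.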

Collect fault information.

  13. On MRS Manager, choose O&M > Log > Download.
  14. Select the following nodes in the required cluster from the Service drop-down list:

    • Hue
    • Controller

  15. Click the icon in the upper right corner, and set Start Date and End Date for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click Download.
  16. On MRS Manager, choose Cluster > Name of the desired cluster > Services > Hue.
  17. Choose More > Restart Service, and click OK.
  18. Check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 19.

  19. Contact the O&M personnel and send the collected logs.
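
The Start Date and End Date for log collection follow directly from the alarm generation time. A minimal sketch of that arithmetic is shown below; the function name `log_collection_window` and the sample timestamp are hypothetical.

```python
from datetime import datetime, timedelta
from typing import Tuple


def log_collection_window(
    alarm_time: datetime, margin_minutes: int = 10
) -> Tuple[datetime, datetime]:
    """Return (start, end): margin_minutes before and after the alarm time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time, for illustration only.
alarm_time = datetime(2024, 1, 1, 12, 0, 0)
start, end = log_collection_window(alarm_time)
print(start, end)
# 2024-01-01 11:50:00 2024-01-01 12:10:00
```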

Alarm Clearing

After the fault is rectified, the system automatically clears this alarm.

Related Information

None