For MRS 3.3.0 and its later versions:
The collection period is 3 seconds, and the detection period is 30 or 300 seconds. This alarm is automatically cleared when neither of the preceding conditions is met for three consecutive detection periods (30 or 300 seconds).
For versions earlier than MRS 3.3.0:
This alarm is automatically cleared when the preceding conditions have not been met for 90s.
Parameters are as follows:
avgqu-sz indicates the disk queue depth.
The sum of r/s and w/s is the IOPS.
The sum of rkB/s and wkB/s is the bandwidth.
%util is the ioutil value.
Run the iostat -x -t command in the OS.
svctm = (tot_ticks_new - tot_ticks_old)/(rd_ios_new + wr_ios_new - rd_ios_old - wr_ios_old)
When the detection period is 30 seconds, if rd_ios_new + wr_ios_new - rd_ios_old - wr_ios_old = 0, then svctm = 0.
When the detection period is 300 seconds and rd_ios_new + wr_ios_new - rd_ios_old - wr_ios_old = 0, if tot_ticks_new - tot_ticks_old = 0, then svctm = 0; otherwise, the value of svctm is infinite.
The parameters can be obtained as follows:
The system runs the cat /proc/diskstats command every 3 seconds to collect data. For example:
In these two commands:
In the data collected for the first time, the number in the fourth column is the rd_ios_old value, the number in the eighth column is the wr_ios_old value, and the number in the thirteenth column is the tot_ticks_old value.
In the data collected for the second time, the number in the fourth column is the rd_ios_new value, the number in the eighth column is the wr_ios_new value, and the number in the thirteenth column is the tot_ticks_new value.
In this case, the value of svctm is as follows:
(19571460 - 19569526)/(1101553 + 28747977 - 1101553 - 28744856) = 0.6197
Alarm ID |
Alarm Severity |
Auto Clear |
---|---|---|
12180 |
Major |
Yes |
Name |
Meaning |
---|---|
Source |
Specifies the cluster or system for which the alarm is generated. |
ServiceName |
Specifies the service for which the alarm is generated. |
RoleName |
Specifies the role for which the alarm is generated. |
HostName |
Specifies the host for which the alarm is generated. |
DiskName |
Specifies the disk for which the alarm is generated. |
A continuously high I/O usage may adversely affect service operations and result in service loss.
The disk is aged.
Replace the disk.
Collect fault information.
This alarm is automatically cleared after the fault is rectified.
None