A deployed real-time service is in the Alarm state.
Predictions sent to a real-time service that is in the Alarm state may fail. Perform the following checks to locate the fault, and then deploy the service again:
If you call APIs for prediction, check whether too many prediction requests are being sent. A large number of prediction requests can put the real-time service in the Alarm state.
Check whether the inference code causes a memory overflow or memory leak.
If the model itself fails, for example because its associated resources are faulty, check the inference logs.
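The memory check above can be run locally before redeploying. The following is a minimal sketch, not part of the service itself: it uses Python's standard tracemalloc module to measure how much traced memory remains allocated after calling a prediction function many times. The names measure_memory_growth, predict, and sample are hypothetical placeholders for your own inference entry point and a representative input.

```python
import tracemalloc


def measure_memory_growth(predict, sample, iterations=100):
    """Return the net bytes still allocated after repeated predict(sample) calls.

    `predict` and `sample` are hypothetical stand-ins for your own inference
    function and a representative request payload.
    """
    tracemalloc.start()
    try:
        # Warm-up call so one-time allocations (caches, lazy imports)
        # are not counted as growth.
        predict(sample)
        before, _ = tracemalloc.get_traced_memory()
        for _ in range(iterations):
            predict(sample)
        after, _ = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return after - before
```

A result that keeps increasing as iterations is raised suggests that the inference code retains memory between requests. tracemalloc only traces allocations made through the Python allocator, so memory held by native libraries may not appear in this figure.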