Viewing Service Details

After a model is deployed as a real-time service, you can access the service page to view its details.

  1. Log in to the ModelArts management console and choose Service Deployment > Real-Time Services.
  2. On the Real-Time Services page, click the name of the target service. The service details page is displayed.

    You can view the service name and status. For details, see Table 1.

    Table 1 Real-time service parameters

    Parameter

    Description

    Name

    Name of the real-time service.

    Status

    Current status of the real-time service.

    Source

    Model source of the real-time service.

    Failed Calls/Total Calls

    Number of service calls, which is counted from the time when the service was created.

    If the number of models is changed or a service is invoked when a model is not ready, the number of calls is not counted.

    Network Configuration

    Customized network configuration of the used dedicated resource pool.

    Description

    Service description, which can be edited after you click the edit button on the right side.

    Custom Settings

    Customized configurations based on real-time service versions. This allows version-based traffic distribution policies and configurations. Enable this option and click View Settings to customize the settings. For details, see Modifying Customized Settings.

    Data Collection

    Enable this option to store the data generated when the real-time service is invoked to a specified OBS path.

    Filter

    Enable this option so that the system automatically identifies hard examples in all sample data.

    Synchronize Data

    Synchronize the collected data to a dataset for centralized management and utilization.

    Traffic Limit

    Maximum number of times a service can be accessed within a second.

  3. You can switch between tabs on the details page of a real-time service to view more details. For details, see Table 2.
    Table 2 Service details

    Parameter

    Description

    Usage Guides

    Displays the API address, model information, input parameters, and output parameters. You can click to copy the API address to call the service.

    Prediction

    Performs a prediction test on the real-time service. For details, see Testing a Service.

    Configuration Updates

    Displays Existing Configuration and Historical Updates.

    • Existing Configuration: includes the model name, version, status, traffic ratio, .
    • Historical Updates: displays historical model information.

    Monitoring

    Displays Resource Usage and Model Calls.

    • Resource Usage: includes the used and available CPU, memory, and GPU resources.
    • Model Calls: indicates the number of model calls. The statistics collection starts after the model status changes to Ready.

    Event

    Displays key operations during service use, such as the service deployment progress, detailed causes of deployment exceptions, and time points when a service is started, stopped, or modified.

    Logs

    Displays the log information about each model in the service. You can view logs generated in the latest 5 minutes, latest 30 minutes, latest 1 hour, and user-defined time segment.

    • You can select the start time and end time when defining the time segment.

Modifying Customized Settings

A customized configuration rule consists of the configuration condition (Setting), access version (Version), and customized running parameters (including Setting Name and Setting Value).

You can configure different settings with customized running parameters for different versions of a real-time service.

The priorities of customized configuration rules are in descending order. You can change the priorities by dragging the sequence of customized configuration rules.

After a rule is matched, the system will no longer match subsequent rules. A maximum of 10 configuration rules can be configured.

Table 3 Parameters for Custom Settings

Parameter

Mandatory

Description

Setting

Yes

Expression of the Spring Expression Language (SPEL) rule. Only the equal and matches expressions of the character type are supported.

Version

Yes

Access version for a customized service configuration rule. When a rule is matched, the real-time service of the version is requested.

Setting Name

No

Key of a customized running parameter, consisting of a maximum of 128 characters.

Configure this parameter if the HTTP message header is used to carry customized running parameters to a real-time service.

Setting Value

No

Value of a customized running parameter, consisting of a maximum of 256 characters.

Configure this parameter if the HTTP message header is used to carry customized running parameters to a real-time service.

Customized settings can be used in the following scenarios: