Deploying a Model as a Service

Deploying a Model

You can deploy a model as a real-time service that provides a real-time test UI and monitoring capabilities. After model training is complete, you can deploy a version with the ideal accuracy and in the Successful status as a service. The procedure is as follows:

  1. On the Train Model tab page, wait until the training status changes to Successful. Click Deploy in the Version Manager pane to deploy the model as a real-time service.
  2. In the Deploy dialog box, select resource flavor, set the Auto Stop function, and click OK to start the deployment.
    • Specifications: The GPU specifications are better, and the CPU specifications are more cost-effective.
    • Compute Nodes: The default value is 1 and cannot be changed.
    • Auto Stop: After this function is enabled and the auto stop time is set, a service automatically stops at the specified time.

    The options are 1 hour later, 2 hours later, 4 hours later, 6 hours later, and Custom. If you select Custom, you can enter any integer from 1 to 24 hours in the text box on the right.

    Figure 1 Deploying a model
  3. After the model deployment is started, view the deployment status on the Service Deployment page.

    It takes a certain period of time to deploy a model. When the status in the Version Manager pane changes from Deploying to Running, the deployment is complete.

    On the ExeML page, trained models can only be deployed as real-time services. For details about how to deploy them as batch services, see Where Are Models Generated by ExeML Stored? What Other Operations Are Supported?

Testing a Service