Requirements on Datasets

The built-in algorithms provided by ModelArts can be used for image classification, object detection, and image semantic segmentation. The requirements for the datasets vary according to the built-in algorithms used for different purposes. Before using a built-in algorithm to create a training job, you are advised to prepare a dataset based on the requirements of the algorithm.

Image Classification

The training dataset must be stored in the OBS bucket. The following shows the OBS path structure of the dataset:

|-- data_url
      |--a.jpg
      |--a.txt
      |--b.jpg
      |--b.txt
      ...

Object Detection and Locating

The training dataset must be stored in the OBS bucket. The following shows the OBS path structure of the dataset:

|-- data_url
      |--a.jpg
      |--a.xml
      |--b.jpg
      |--b.xml
      ...

Image Semantic Segmentation

The training dataset must be stored in the OBS bucket. The following shows the OBS path structure of the dataset:

|-- data_url
       |--Image
              |--a.jpg
              |--b.jpg
              ...
       |--Label
              |--a.jpg
              |--b.jpg
              ...
       |--train.txt
       |--val.txt

Description: