To improve the precision of auto labeling algorithms, you can evenly label multiple classes. ModelArts provides built-in grouping algorithms. You can enable auto grouping to improve data labeling efficiency.
Auto grouping can be understood as data labeling preprocessing. Clustering algorithms are used to cluster unlabeled images, and images are labeled or cleaned by group based on the clustering result.
For example, a user searches for XX through a search engine, downloads and uploads related images to the dataset, and then uses the auto grouping function to classify XX images, such as papers, posters, images confirmed as XX, and others. The user can quickly remove unwanted images from a group or select all images of a type and add labels to the images.
Only datasets of image classification, object detection, and image segmentation types support the auto grouping function.
You can start auto group tasks or view task history only in the All tab.
In the All tab of the dataset details page, expand Filter Criteria, set Sample Attribute to the attribute name of the auto grouping task, and set the sample attribute value to filter the grouping result.
In the All tab of the dataset details page, choose Auto Grouping > View Task History. In the View Task History dialog box, basic information about the auto grouping tasks of the current dataset is displayed.