Triplet labeling is suitable for scenarios where structured information, such as subjects, predicates, and objects, needs to be labeled in statements. With this function, not only entities in statements, but also relationships between entities can be labeled. Triplet labeling is often used in natural language processing tasks such as dependency syntax analysis and information extraction.
Text triplet labeling involves two classes of important labels: Entity Label and Relationship Label. For the Relationship Label, you need to set its Source entity and Target entity.
Before labeling, ensure that the Entity Label and Relationship Label of a dataset have been defined. For the Relationship Label, you need to set its Source entity and Target entity. The Relationship Label must be between the defined Source entity and Target entity.
For example, if two entities are labeled as Place, you cannot add any relationship label between them. If a relationship label cannot be added, a red cross is displayed.
By default, the Dashboard tab page of the current dataset version is displayed. If you need to label the dataset of another version, click the Versions tab and then click Set to Current Version in the right pane. For details, see Managing Dataset Versions.
The dataset details page displays the labeled and unlabeled text objects in the dataset. The Unlabeled tab page is displayed by default.
You cannot modify the labels of a dataset in the text triplet type on the labeling page. Instead, click Edit to enter the Modify Dataset page and modify the Entity Label and Relationship Label.
After labeling data, you can modify labeled data on the Labeled tab page.
On the dataset details page, click the Labeled tab. Select a text object in the left pane and the right pane displays the detailed label information. You can move your cursor to the entity or relationship label, and right-click to delete it. You can also click the source entity and target entity in sequence to add a relationship label.
You can click Delete Labels on Current Item at the bottom of the page to delete all labels in the selected text object.
In addition to automatically synchronizing data from Input Dataset Path, you can directly add text files on ModelArts for data labeling.
Select one or more files to be uploaded in the local environment. Only .txt and .csv files are supported. The total size of files uploaded at a time cannot exceed 8 MB.
You can quickly delete the files you want to discard.
The background of the selected text is blue. If no text is selected on the page, the Delete button is unavailable.