Named Entity Recognition

Named entity recognition assigns labels to named entities in text, such as time and locations. Before labeling, pay attention to the following:

A label name of a named entity can contain a maximum of 1,024 characters, including letters, digits, hyphens (-), underscores (_), and special characters.

Starting Labeling

  1. Log in to the ModelArts management console. In the navigation pane on the left, choose Data Management > Label Data.
  2. In the labeling job list, select a labeling type from the All type drop-down list, click the job to be performed based on the labeling type. The details page of the job is displayed.
  3. The job details page displays all data of the labeling job.

Synchronizing New Data

ModelArts automatically synchronizes data and labeling information from datasets to the labeling job.

To quickly obtain the latest data in the datasets, in the Unlabeled tab of the labeling job details page, click Synchronize New Data.

Labeling Text Files

The labeling job details page displays the Unlabeled and Labeled tabs. The Unlabeled tab is displayed by default.

  1. In the Unlabeled tab, the objects to be labeled are listed in the left pane. In the list, click the text object to be labeled, select a part of text displayed under Label Set for labeling, and select a label in the Label Set area in the right pane. Multiple labels can be added to a labeling object.

    You can repeat this operation to select objects and add labels to the objects.

  2. Click Save Current Page in the lower part of the page to complete the labeling.

Adding a Label

Viewing the Labeled Text

On the dataset details page, click the Labeled tab to view the list of the labeled text. You can also view all labels supported by the dataset in the All Labels area on the right.

Modifying Labeled Data

After labeling data, you can modify labeled data in the Labeled tab.

On the labeling job details page, click the Labeled tab, and modify the text information in the label information area on the right.

Adding a File

In addition to the data synchronized, you can directly add data on labeling job details page for labeling.

  1. On the labeling job details page, click the Unlabeled tab, click Add data in the upper left corner.
  2. Configure the data source, import mode, and other parameters, and click OK.

    For details about how to import data, see section "Importing Data".

Deleting a File

You can quickly delete the files you want to discard.

The background of the selected text is blue.

Managing Annotators

If team labeling is enabled for a labeling job, view its labeling details in the Annotator Management tab. Additionally, you can add, modify, or delete annotators.

  1. Choose Data Management > Label Data. In the My Creations or My Participations tab, view the list of all labeling jobs.
  2. Locate the target team labeling job. (The name of a team labeling job is followed by .)
  3. Choose More > Annotator Management in the Operation column. Alternatively, click the job name to go to the job details page, and choose Team Labeling > Annotator Management in the upper right corner.