doc-exports/docs/modelarts/umn/datalabel-modelarts_0015.html
Lai, Weijian 6aa966a79a ModelArts UMN 24.3.0 version
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: Lai, Weijian <laiweijian4@huawei.com>
Co-committed-by: Lai, Weijian <laiweijian4@huawei.com>
2024-11-02 09:04:52 +00:00

<a name="EN-US_TOPIC_0000002079101725"></a>
<h1 class="topictitle1">Speech Labeling</h1>
<div id="body8662426"><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p2248318172212">Model training requires a large amount of labeled data, so label the unlabeled audio files before you train a model. ModelArts allows you to label audio files in batches with one click. You can also modify the labels of audio files, or remove their labels and label the files again.</p>
<div class="section" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_section139520290612"><h4 class="sectiontitle">Starting Labeling</h4><ol id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0000001185384417_ol1332113431875"><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0000001185384417_li173221243078">Log in to the ModelArts management console. In the navigation pane on the left, choose <strong id="EN-US_TOPIC_0000002079101725__b209030229442748">Data Management</strong> &gt; <span class="parmname" id="EN-US_TOPIC_0000002079101725__parmname104662104442748"><b>Label Data</b></span>.</li><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0000001185384417_li1241123818438">In the labeling job list, select a labeling type from the <strong id="EN-US_TOPIC_0000002079101725__b34716581705">All type</strong> drop-down list, and then click the job you want to work on. The details page of the job is displayed.</li><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0000001185384417_li1766010993710">The job details page displays all data of the labeling job.</li></ol>
</div>
<div class="section" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_section616011413170"><h4 class="sectiontitle">Synchronizing New Data</h4><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0000001185384417_p114233449177">ModelArts automatically synchronizes data and labeling information from datasets to the labeling job.</p>
<p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0000001185384417_p38711841115016">To quickly obtain the latest data in the datasets, in the <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b13166443191320">Unlabeled</strong> tab of the labeling job details page, click <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b2016604312137">Synchronize New Data</strong>.</p>
</div>
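<p>Conceptually, synchronization brings in the files that exist in the dataset but are not yet in the labeling job. The following minimal sketch illustrates that idea only. It is not the ModelArts API: the function name <b>find_new_files</b> and the file paths are hypothetical, and the actual synchronization is performed by ModelArts.</p>

```python
def find_new_files(dataset_files, job_files):
    """Return dataset files that the labeling job does not contain yet (sorted)."""
    return sorted(set(dataset_files) - set(job_files))


# Hypothetical file lists, used only for illustration.
dataset_files = ["audio/a.wav", "audio/b.wav", "audio/c.wav"]
job_files = ["audio/a.wav"]

new_files = find_new_files(dataset_files, job_files)
print(new_files)
```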
<div class="section" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_section888019266174"><h4 class="sectiontitle">Labeling Audio Files</h4><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p6622193613311">The labeling job details page displays the labeled and unlabeled audio files. The <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b101071919101512">Unlabeled</strong> tab is displayed by default.</p>
<ol id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_ol875212814111"><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_li127526819118">In the audio file list in the <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b32691438075">Unlabeled</strong> tab, click the target audio file. In the area on the right, the audio file is displayed. Click <span><img id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_image1759614183158" src="figure/en-us_image_0000002043022684.png"></span> below the audio file to play the audio.</li><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_li189431247173317">In <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b098171614183">Speech Content</strong>, enter the speech content.</li><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_li122591050672">After entering the content, click <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b8280053201810">Label</strong> to complete the labeling. The audio file is automatically moved to the <span class="wintitle" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_wintitle1084459171913"><b>Labeled</b></span> tab.</li></ol>
</div>
<div class="section" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_section2958731141718"><h4 class="sectiontitle">Viewing the Labeled Audio Files</h4><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p138711071810">On the labeling job details page, click the <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b10279150132211">Labeled</strong> tab to view the list of labeled audio files. Click an audio file to view its speech content in the <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b542118518227">Speech Content</strong> text box on the right.</p>
</div>
<div class="section" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_section0534612151819"><h4 class="sectiontitle">Modifying Labeled Data</h4><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p1981864110595">After labeling data, you can modify labeled data in the <span class="wintitle" id="EN-US_TOPIC_0000002079101725__wintitle119001147185015"><b>Labeled</b></span> tab.</p>
<p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p1123204517125">On the labeling job details page, click the <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b12598413123810">Labeled</strong> tab and select the audio file to be modified from the audio file list. In the label information area on the right, modify the content of the <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b17895124643811">Speech Content</strong> text box, and click <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b889574613383">Label</strong> to complete the modification.</p>
</div>
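<p>The labeling and modification behavior described in the preceding sections can be pictured as a small state model: labeling moves a file from the unlabeled list to the labeled list, and labeling it again overwrites the stored speech content. The sketch below is an illustration under that assumption. The class <b>SpeechLabelingJob</b> and the file names are hypothetical and do not correspond to any ModelArts interface.</p>

```python
from dataclasses import dataclass, field


@dataclass
class SpeechLabelingJob:
    """Hypothetical in-memory stand-in for a speech labeling job."""
    unlabeled: set = field(default_factory=set)
    labeled: dict = field(default_factory=dict)  # file name -> speech content

    def label(self, file_name: str, speech_content: str) -> None:
        """Label a file, or overwrite the label of an already labeled file."""
        if file_name not in self.unlabeled and file_name not in self.labeled:
            raise KeyError(file_name)
        self.unlabeled.discard(file_name)         # the file leaves the Unlabeled tab
        self.labeled[file_name] = speech_content  # and is listed on the Labeled tab


job = SpeechLabelingJob(unlabeled={"hello.wav", "bye.wav"})
job.label("hello.wav", "hello world")   # first labeling
job.label("hello.wav", "hello, world")  # modifying labeled data
print(sorted(job.unlabeled), job.labeled)
```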
<div class="section" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_section15984542128"><h4 class="sectiontitle">Adding an Audio File</h4><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0170889731_p266117351147">In addition to the synchronized data, you can add data directly on the labeling job details page for labeling.</p>
<ol id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0170889731_ol429210266513"><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_en-us_topic_0170889731_li1924993814520">On the labeling job details page, click the <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b1238234910446">Unlabeled</strong> tab, and then click <strong id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_b538264913441">Add data</strong> in the upper left corner.</li><li id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_li01454543219">Configure input data and click <span class="uicontrol" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_uicontrol1238181934618"><b>OK</b></span>.<p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p2070881211273">For details about how to import data, see section "Importing Data".</p>
</li></ol>
</div>
<div class="section" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_section15379942161810"><h4 class="sectiontitle">Deleting Audio Files</h4><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p3752212171611">You can quickly delete the audio files you want to discard.</p>
<p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p1614812015518">In the <span class="wintitle" id="EN-US_TOPIC_0000002079101725__wintitle19360248120"><b>Unlabeled</b></span> or <span class="wintitle" id="EN-US_TOPIC_0000002079101725__wintitle436164913"><b>Labeled</b></span> tab, select the audio files to be deleted, and then click <span class="uicontrol" id="EN-US_TOPIC_0000002079101725__uicontrol0361241519"><b>Delete File</b></span> in the upper left corner. In the displayed dialog box, select or deselect <span class="parmname" id="EN-US_TOPIC_0000002079101725__parmname23611946115"><b>Delete the source files from OBS</b></span> as required. After confirmation, click <span class="uicontrol" id="EN-US_TOPIC_0000002079101725__uicontrol1136264612"><b>OK</b></span> to delete the audio files.</p>
<div class="note" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_note10831343207"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_p17833431906">If you select <span class="parmname" id="EN-US_TOPIC_0000002079101725__en-us_topic_0000001139944452_parmname10279252184713"><b>Delete the source files from OBS</b></span>, the source audio files stored in the corresponding OBS directory are also deleted when you delete the selected audio files. Deleting source files may affect other dataset versions or datasets that use those files. As a result, page display, training, or inference may become abnormal. Deleted data cannot be recovered. Exercise caution when performing this operation.</p>
</div></div>
</div>
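<p>Before you select <b>Delete the source files from OBS</b>, it can help to check whether other datasets still use the files you are about to delete. The sketch below shows one possible check, assuming you keep your own record of which files each dataset references. The function name, dataset names, and OBS paths are hypothetical, and no ModelArts or OBS API is called.</p>

```python
def files_shared_with_other_datasets(selected_files, other_datasets):
    """Map each selected file to the other datasets that also reference it."""
    shared = {}
    for path in selected_files:
        users = sorted(name for name, files in other_datasets.items() if path in files)
        if users:
            shared[path] = users
    return shared


# Hypothetical record of the files referenced by other datasets.
other_datasets = {
    "speech-v1": {"obs://bucket/audio/a.wav", "obs://bucket/audio/b.wav"},
    "speech-v2": {"obs://bucket/audio/b.wav"},
}
selected = ["obs://bucket/audio/b.wav", "obs://bucket/audio/c.wav"]

shared = files_shared_with_other_datasets(selected, other_datasets)
print(shared)
```

<p>Files that appear in the result are still referenced elsewhere, so deleting their source files from OBS is the case the preceding note warns about.</p>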
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="datalabel-modelarts_0013.html">Audio Labeling</a></div>
</div>
</div>