@modelarts:content
|
No
|
String
|
Speech text content, which is a default attribute dedicated to the speech label (including the speech content and speech start and end points).
|
@modelarts:end_index
|
No
|
Integer
|
End position of the text, which is a default attribute dedicated to the named entity label. The end position does not include the character corresponding to the value of end_index. Example:
If the text is "Barack Hussein Obama II (born August 4, 1961) is an attorney and politician.", start_index and end_index of Barack Hussein Obama II are 0 and 23, respectively.
If the text is "Hope is the thing with feathers", start_index and end_index of Hope are 0 and 4, respectively.
|
@modelarts:end_time
|
No
|
String
|
Speech end time, which is a default attribute dedicated to the speech start/end point label, in the format of hh:mm:ss.SSS. (hh indicates hour; mm indicates minute; ss indicates second; and SSS indicates millisecond.)
|
@modelarts:feature
|
No
|
Object
|
Shape feature, which is a default attribute dedicated to the object detection label, with type of List. The upper left corner of the image is used as the coordinate origin [0, 0]. Each coordinate point is represented by [x, y], where x indicates the horizontal coordinate and y indicates the vertical coordinate (both x and y are >=0). The format of each shape is as follows:
bndbox consists of two points, for example, [[0,10],[50,95]]. The upper left vertex of the rectangle is the first point, and the lower right vertex is the second point. That is, the x-coordinate of the first point must be less than the x-coordinate of the second point, and the y-coordinate of the first point must be less than the y-coordinate of the second point.
polygon: consists of multiple points that are connected in sequence to form a polygon, for example, [[0,100],[50,95],[10,60],[500,400]].
circle: consists of the center and radius, for example, [[100,100],[50]].
line: consists of two points, for example, [[0,100],[50,95]]. The first point is the start point, and the second point is the end point.
dashed: consists of two points, for example, [[0,100],[50,95]]. The first point is the start point, and the second point is the end point.
point: consists of one point, for example, [[0,100]].
polyline: consists of multiple points, for example, [[0,100],[50,95],[10,60],[500,400]].
|
@modelarts:from
|
No
|
String
|
ID of the head entity in the triplet relationship label, which is a default attribute dedicated to the triplet relationship label.
|
@modelarts:hard
|
No
|
String
|
Sample labeled as a hard sample or not, which is a default attribute. Options:
|
@modelarts:hard_coefficient
|
No
|
String
|
Coefficient of difficulty of each label level, which is a default attribute. The value range is [0,1].
|
@modelarts:hard_reasons
|
No
|
String
|
Reasons that the sample is a hard sample, which is a default attribute. Use a hyphen (-) to separate every two hard sample reason IDs, for example, 3-20-21-19. Options:
0: No target objects are identified.
1: The confidence is low.
2: The clustering result based on the training dataset is inconsistent with the prediction result.
3: The prediction result is greatly different from the data of the same type in the training dataset.
4: The prediction results of multiple consecutive similar images are inconsistent.
5: There is a large offset between the image resolution and the feature distribution of the training dataset.
6: There is a large offset between the aspect ratio of the image and the feature distribution of the training dataset.
7: There is a large offset between the brightness of the image and the feature distribution of the training dataset.
8: There is a large offset between the saturation of the image and the feature distribution of the training dataset.
9: There is a large offset between the color richness of the image and the feature distribution of the training dataset.
10: There is a large offset between the definition of the image and the feature distribution of the training dataset.
11: There is a large offset between the number of frames of the image and the feature distribution of the training dataset.
12: There is a large offset between the standard deviation of area of image frames and the feature distribution of the training dataset.
13: There is a large offset between the aspect ratio of image frames and the feature distribution of the training dataset.
14: There is a large offset between the area portion of image frames and the feature distribution of the training dataset.
15: There is a large offset between the edge of image frames and the feature distribution of the training dataset.
16: There is a large offset between the brightness of image frames and the feature distribution of the training dataset.
17: There is a large offset between the definition of image frames and the feature distribution of the training dataset.
18: There is a large offset between the stack of image frames and the feature distribution of the training dataset.
19: The data enhancement result based on GaussianBlur is inconsistent with the prediction result of the original image.
20: The data enhancement result based on fliplr is inconsistent with the prediction result of the original image.
21: The data enhancement result based on Crop is inconsistent with the prediction result of the original image.
22: The data enhancement result based on flipud is inconsistent with the prediction result of the original image.
23: The data enhancement result based on scale is inconsistent with the prediction result of the original image.
24: The data enhancement result based on translate is inconsistent with the prediction result of the original image.
25: The data enhancement result based on shear is inconsistent with the prediction result of the original image.
26: The data enhancement result based on superpixels is inconsistent with the prediction result of the original image.
27: The data enhancement result based on sharpen is inconsistent with the prediction result of the original image.
28: The data enhancement result based on add is inconsistent with the prediction result of the original image.
29: The data enhancement result based on invert is inconsistent with the prediction result of the original image.
30: The data is predicted to be abnormal.
|
@modelarts:shape
|
No
|
String
|
Object shape, which is a default attribute dedicated to the object detection label and is left empty by default. Options:
bndbox: rectangle
polygon: polygon
circle: circle
line: straight line
dashed: dotted line
point: point
polyline: polyline
|
@modelarts:source
|
No
|
String
|
Speech source, which is a default attribute dedicated to the speech start/end point label and can be set to a speaker or narrator.
|
@modelarts:start_index
|
No
|
Integer
|
Start position of the text, which is a default attribute dedicated to the named entity label. The start value begins from 0, including the character corresponding to the value of start_index.
|
@modelarts:start_time
|
No
|
String
|
Speech start time, which is a default attribute dedicated to the speech start/end point label, in the format of hh:mm:ss.SSS. (hh indicates hour; mm indicates minute; ss indicates second; and SSS indicates millisecond.)
|
@modelarts:to
|
No
|
String
|
ID of the tail entity in the triplet relationship label, which is a default attribute dedicated to the triplet relationship label.
|