Annotation consolidation - Amazon SageMaker AI (original) (raw)

An annotation is the result of a single worker's labeling task.Annotation consolidation combines the annotations of two or more workers into a single label for your data objects. A label, which is assigned to each object in the dataset, is a probabilistic estimate of what the true label should be. Each object in the dataset typically has multiple annotations, but only one label or set of labels.

You decide how many workers annotate each object in your dataset. Using more workers can increase the accuracy of your labels, but also increases the cost of labeling. To learn more about Ground Truth pricing, see Amazon SageMaker Ground Truth pricing.

If you use the Amazon SageMaker AI console to create a labeling job, the following are the defaults for the number of workers who can annotate objects:

When you use the CreateLabelingJob operation, you set the number of workers to annotate each data object with the NumberOfHumanWorkersPerDataObject parameter. You can override the default number of workers that annotate a data object using the console or the CreateLabelingJob operation.

Ground Truth provides an annotation consolidation function for each of its predefined labeling tasks: bounding box, image classification, name entity recognition, semantic segmentation, and text classification. These are the functions:

You can use other algorithms to consolidate annotations. For information, seeAnnotation consolidation function creation.