MACS - Multi-Annotator Captioned Soundscapes

Published July 22, 2021 | Version v1

Dataset Open

Authors/Creators

Description

This dataset contains audio captions and corresponding audio tags for 3930 audio files from the TAU Urban Acoustic Scenes 2019 development dataset, covering three acoustic scenes: airport, public square, and park. The files were annotated using a web-based tool.

Each file was annotated by multiple annotators, each of whom provided tags and a one-sentence description of the audio content.

The data also includes annotator competence estimated using MACE (Multi-Annotator Competence Estimation).

The annotation procedure, processing and analysis of the data are presented in the following papers:

Data is provided as two files:

- filename: file1.wav
  annotations:
  - annotator_id: ann_1
    sentence: caption text
    tags:
    - tag1
    - tag2
  - annotator_id: ann_2
    sentence: caption text
    tags:
    - tag1
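
A minimal sketch of how this structure might be handled in Python, assuming the YAML has already been parsed into plain lists and dicts (for instance with PyYAML's yaml.safe_load, which is a tooling assumption, not something the dataset prescribes). The records literal simply mirrors the placeholder example above, and flatten and tag_counts are hypothetical helper names.

```python
from collections import Counter

# Stand-in for the parsed YAML; values are the placeholders from the structure above.
records = [
    {
        "filename": "file1.wav",
        "annotations": [
            {"annotator_id": "ann_1", "sentence": "caption text", "tags": ["tag1", "tag2"]},
            {"annotator_id": "ann_2", "sentence": "caption text", "tags": ["tag1"]},
        ],
    }
]


def flatten(records):
    """Yield one (filename, annotator_id, sentence, tags) tuple per annotation."""
    for rec in records:
        for ann in rec["annotations"]:
            yield rec["filename"], ann["annotator_id"], ann["sentence"], list(ann.get("tags", []))


def tag_counts(records):
    """Count, per file, how many annotators used each tag."""
    counts = {}
    for filename, _, _, tags in flatten(records):
        # set() ensures an annotator who repeats a tag is counted once.
        counts.setdefault(filename, Counter()).update(set(tags))
    return counts


rows = list(flatten(records))
counts = tag_counts(records)
```

Flattening to one row per annotation keeps the per-annotator identity, which is what allows the captions and tags to be joined with the competence scores.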

id [tab] competence
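
The tab-separated competence file could be read with the standard csv module, as in the rough sketch below. Whether the file starts with a header row is not stated here, so the reader skips any row whose second field is not numeric. The sample values (0.4, 0.9), the example sentences, and the helper names read_competence and best_caption are made up for illustration.

```python
import csv
import io


def read_competence(handle):
    """Parse 'id<TAB>competence' lines into a dict, skipping non-numeric rows such as a header."""
    scores = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 2:
            continue
        try:
            scores[row[0]] = float(row[1])
        except ValueError:
            continue
    return scores


def best_caption(annotations, scores):
    """Return the sentence written by the annotator with the highest competence."""
    best = max(annotations, key=lambda a: scores.get(a["annotator_id"], 0.0))
    return best["sentence"]


# Made-up values for illustration only; real scores come from the dataset.
sample = "id\tcompetence\nann_1\t0.4\nann_2\t0.9\n"
scores = read_competence(io.StringIO(sample))

annotations = [
    {"annotator_id": "ann_1", "sentence": "first caption"},
    {"annotator_id": "ann_2", "sentence": "second caption"},
]
chosen = best_caption(annotations, scores)
```

Picking the single most competent annotator is only one possible use of the scores; weighting or filtering annotations by competence would follow the same lookup pattern.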

The audio files can be downloaded from https://zenodo.org/record/2589280 and are covered by their own license.

Files

LICENSE.txt

Files (2.8 MB)