3 datasets found

o
Data from: MACS - Multi-Annotator Captioned Soundscapes
explore.openaire.eu
producciocientifica.uv.es
+2more
Updated Jul 22, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Irene Martin Morato; Annamaria Mesaros (2021). MACS - Multi-Annotator Captioned Soundscapes [Dataset]. http://doi.org/10.5281/zenodo.5114770
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.5114770
Dataset updated
Jul 22, 2021
Authors
Irene Martin Morato; Annamaria Mesaros
Description
This is a dataset containing audio captions and corresponding audio tags for a number of 3930 audio files of the TAU Urban Acoustic Scenes 2019 development dataset (airport, public square, and park). The files were annotated using a web-based tool. Each file is annotated by multiple annotators that provided tags and a one-sentence description of the audio content. The data also includes annotator competence estimated using MACE (Multi-Annotator Competence Estimation). The annotation procedure, processing and analysis of the data are presented in the following papers: Irene Martin-Morato, Annamaria Mesaros. What is the ground truth? Reliability of multi-annotator data for audio tagging, 29th European Signal Processing Conference, EUSIPCO 2021 Irene Martin-Morato, Annamaria Mesaros. Diversity and bias in audio captioning datasets, submitted to DCASE 2021 Workshop (to be updated with arxiv link) Data is provided as two files: MACS.yaml - containing the complete annotations in the following format: - filename: file1.wav
annotations:
- annotator_id: ann_1
sentence: caption text
tags:
- tag1
- tag2
- annotator_id: ann_2
sentence: caption text
tags:
- tag1 MACS_competence.csv - containing the estimated annotator competence; for each annotator_id in the yaml file, competence is a number between 0 (considered as annotating at random) and 1 id [tab] competence The audio files can be downloaded from https://zenodo.org/record/2589280 and are covered by their own license.
P
MACS Dataset
paperswithcode.com
Updated Jun 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Irene Martin-Morato; Annamaria Mesaros (2025). MACS Dataset [Dataset]. https://paperswithcode.com/dataset/macs
Explore at:
Dataset updated
Jun 11, 2025
Authors
Irene Martin-Morato; Annamaria Mesaros
Description
This is a dataset containing audio captions and corresponding audio tags for a number of 3930 audio files of the TAU Urban Acoustic Scenes 2019 development dataset (airport, public square, and park). The files were annotated using a web-based tool. Each file is annotated by multiple annotators that provided tags and a one-sentence description of the audio content.
u
Data from: MATS - Multi-Annotator Tagged Soundscapes
producciocientifica.uv.es
data.niaid.nih.gov
+1more
Updated 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Morato, Irene Martin; Mesaros, Annamaria; Morato, Irene Martin; Mesaros, Annamaria (2021). MATS - Multi-Annotator Tagged Soundscapes [Dataset]. https://producciocientifica.uv.es/documentos/668fc44fb9e7c03b01bd97e5
Explore at:
Dataset updated
2021
Authors
Morato, Irene Martin; Mesaros, Annamaria; Morato, Irene Martin; Mesaros, Annamaria
Description
This is a dataset containing audio tags for a number of 3930 audio files of the TAU Urban Acoustic Scenes 2019 development dataset (airport, public square, and park). The files were annotated using a web-based tool, with multiple annotators providing labels for each file. The dataset contains annotations for 3930 files, annotated with the following tags: announcement jingle announcement speech adults talking birds singing children voices dog barking footsteps music siren traffic noise The annotation procedure and processing is presented in the paper: Irene Martin-Morato, Annamaria Mesaros. What is the ground truth? Reliability of multi-annotator data for audio tagging, 29th European Signal Processing Conference, EUSIPCO 2021 The dataset contains the following: raw annotations provided by 133 annotators, multiple opinions per audio file MATS_labels_full_annotations.yaml content formatted as: - filename: file1.wav
annotations:
- annotator_id: ann_1
tags:
- tag1
- tag2
- annotator_id: ann_3
tags:
- tag1
- filename: file3.wav
... processed annotations using different methods, as presented in the accompanying paper MATS_labels_majority_vote.csv
MATS_labels_union.csv
MATS_labels_mace100.csv
MATS_labels_mace100_competence60 content formatted as: filename [tab] tag1,tag2,tag3
The audio files can be downloaded from https://zenodo.org/record/2589280 and are covered by their own license.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Irene Martin Morato; Annamaria Mesaros (2021). MACS - Multi-Annotator Captioned Soundscapes [Dataset]. http://doi.org/10.5281/zenodo.5114770

Data from: MACS - Multi-Annotator Captioned Soundscapes

Explore at:

26 scholarly articles cite this dataset (View in Google Scholar)

Unique identifier

https://doi.org/10.5281/zenodo.5114770

Dataset updated

Jul 22, 2021

Authors

Irene Martin Morato; Annamaria Mesaros

Description

This is a dataset containing audio captions and corresponding audio tags for a number of 3930 audio files of the TAU Urban Acoustic Scenes 2019 development dataset (airport, public square, and park). The files were annotated using a web-based tool. Each file is annotated by multiple annotators that provided tags and a one-sentence description of the audio content. The data also includes annotator competence estimated using MACE (Multi-Annotator Competence Estimation). The annotation procedure, processing and analysis of the data are presented in the following papers: Irene Martin-Morato, Annamaria Mesaros. What is the ground truth? Reliability of multi-annotator data for audio tagging, 29th European Signal Processing Conference, EUSIPCO 2021 Irene Martin-Morato, Annamaria Mesaros. Diversity and bias in audio captioning datasets, submitted to DCASE 2021 Workshop (to be updated with arxiv link) Data is provided as two files: MACS.yaml - containing the complete annotations in the following format: - filename: file1.wav
annotations:
- annotator_id: ann_1
sentence: caption text
tags:
- tag1
- tag2
- annotator_id: ann_2
sentence: caption text
tags:
- tag1 MACS_competence.csv - containing the estimated annotator competence; for each annotator_id in the yaml file, competence is a number between 0 (considered as annotating at random) and 1 id [tab] competence The audio files can be downloaded from https://zenodo.org/record/2589280 and are covered by their own license.

Clear search

Close search

Google apps

Main menu

Data from: MACS - Multi-Annotator Captioned Soundscapes

MACS Dataset

Data from: MATS - Multi-Annotator Tagged Soundscapes

Data from: MACS - Multi-Annotator Captioned Soundscapes