5 datasets found

Z
DESED_real
data.niaid.nih.gov
zenodo.org
Updated Apr 11, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Serizel, Romain (2022). DESED_real [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_3562884
Explore at:
Dataset updated
Apr 11, 2022
Dataset provided by
Serizel, Romain
Shah, Ankit
Turpault, Nicolas
Salamon, Justin
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
If you are in this page, you have to know that the content of this page is also in github associated.

Link to the associated github repository: https://github.com/turpaultn/Desed

Link to the paper: https://hal.inria.fr/hal-02160855

Domestic Environment Sound Event Detection (DESED) dataset.

Description

This dataset is the real part of the DESED dataset. It is a subpart of Audioset.

There is the material to:

Download the metadata of the subset of Audioset used in DCASE 2019 task 4.

Files:

DESED_real_metadata.tar.gz : Annotations of subset of Audioset.

To download the associated soundfile, please visit: https://github.com/turpaultn/DESED .
IDMT-FL Dataset
zenodo.org
zip
Updated Nov 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis; Sascha Grollmisch; Sascha Grollmisch; Jakob Abeßer; Jakob Abeßer; Hanna Lukashevich; Hanna Lukashevich; David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis (2023). IDMT-FL Dataset [Dataset]. http://doi.org/10.5281/zenodo.7551584
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7551584
Dataset updated
Nov 24, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis; Sascha Grollmisch; Sascha Grollmisch; Jakob Abeßer; Jakob Abeßer; Hanna Lukashevich; Hanna Lukashevich; David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
The IDMT-DESED-FL and IDMT-URBAN-FL datasets enable research in sound event detection (SED) within a federated learning (FL) context. IDMT-DESED-FL and IDMT-URBAN-FL consist of sound events sourced from well-known DESED and URBAN-8K datasets. Each source dataset contains sound events from ten classes for the use cases of SED in domestic and urban environments, respectively. To simulate an FL scenario, the source events are mixed with background noise to generate 30.000 ten-second soundscapes which are partitioned to 100 edge devices. Each soundscape is generated by mixing up to five sound events (possibly overlapping) with background noise. Both datasets contain independent and identically distributed (IID) and non-IID versions, to provide a more real-world like distribution of event classes.

IDMT-DESED-FL sound event classes include alarm/bell/ringing, blender, cat, dog, dishes, electric shaver/toothbrush, frying, running water, speech, and vacuum cleaner. The background classes include apartment room, computer interior, computer lab, emergency staircase, and library.

IDMT-URBAN-FL sound event classes include air conditioner, car horn, children playing, dog bark, drilling, engine idling, gun shot, jackhammer, siren, and street music. Background classes for IDMT-URBAN-FL are sourced from the Isolated Urban Sound Database (IUSB), and include birds, crowd, fountain, rain, and traffic.

Due to the size of the datasets, with this download we provide the scripts and details necessary to generate the FL datasets using the source material from DESED, URBAN-8K, and IUSB.

See the above referenced paper and README contained with the data folder for further details.
Sound Event Detection results obtained with the DCASE 2021 SSep+SED Baseline...
plos.figshare.com
xls
Updated Jul 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano (2024). Sound Event Detection results obtained with the DCASE 2021 SSep+SED Baseline system (SSep-SED Bs) and JSS model fusions over the DESED Validation (dev-test) and Public evaluation sets, in terms of PSDS1, PSDS2, and event-based F1 score. [Dataset]. http://doi.org/10.1371/journal.pone.0303994.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0303994.t003
Dataset updated
Jul 5, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
All the models in each fusion use Teacher model selection. The best results for each metric/dataset are highlighted in bold.
f
Sound Event Detection results obtained with the DCASE 2021 SED Baseline...
plos.figshare.com
xls
Updated Jul 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano (2024). Sound Event Detection results obtained with the DCASE 2021 SED Baseline system (SED Bs) and the Joint Source Separation + Sound Event Detection proposed methods: Initial model (S0), Two-stage Training (S1, S2), and Joint Training (JT). [Dataset]. http://doi.org/10.1371/journal.pone.0303994.t004
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0303994.t004
Dataset updated
Jul 5, 2024
Dataset provided by
PLOS ONE
Authors
Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Results are provided over the DESED Public overlap set [20], in terms of PSDS1, PSDS2 and event-based F1 score. Two pre-training methods for Source Separation (FUSS and DESED) and two model selection criteria (Student and Teacher) are compared. The best results for each metric are highlighted in bold.
h
DESEDpublic_eval
huggingface.co
Updated Oct 10, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shi Qundong (2024). DESEDpublic_eval [Dataset]. https://huggingface.co/datasets/TwinkStart/DESEDpublic_eval
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 10, 2024
Authors
Shi Qundong
Description
This dataset only contains test data, which is integrated into UltraEval-Audio(https://github.com/OpenBMB/UltraEval-Audio) framework.

python audio_evals/main.py --dataset desed --model gpt4o_audio

🚀超凡体验，尽在UltraEval-Audio🚀

UltraEval-Audio——全球首个同时支持语音理解和语音生成评估的开源框架，专为语音大模型评估打造，集合了34项权威Benchmark，覆盖语音、声音、医疗及音乐四大领域，支持十种语言，涵盖十二类任务。选择UltraEval-Audio，您将体验到前所未有的便捷与高效：

一键式基准管理 📥：告别繁琐的手动下载与数据处理，UltraEval-Audio为您自动化完成这一切，轻松获取所需基准测试数据。内置评估利器… See the full description on the dataset page: https://huggingface.co/datasets/TwinkStart/DESEDpublic_eval.
Not seeing a result you expected?
Learn how you can add new datasets to our index.