Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
If you are in this page, you have to know that the content of this page is also in github associated.
Link to the associated github repository: https://github.com/turpaultn/Desed
Link to the paper: https://hal.inria.fr/hal-02160855
Domestic Environment Sound Event Detection (DESED) dataset.
Description
This dataset is the real part of the DESED dataset. It is a subpart of Audioset.
There is the material to:
Download the metadata of the subset of Audioset used in DCASE 2019 task 4.
Files:
DESED_real_metadata.tar.gz : Annotations of subset of Audioset.
To download the associated soundfile, please visit: https://github.com/turpaultn/DESED .
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
The IDMT-DESED-FL and IDMT-URBAN-FL datasets enable research in sound event detection (SED) within a federated learning (FL) context. IDMT-DESED-FL and IDMT-URBAN-FL consist of sound events sourced from well-known DESED and URBAN-8K datasets. Each source dataset contains sound events from ten classes for the use cases of SED in domestic and urban environments, respectively. To simulate an FL scenario, the source events are mixed with background noise to generate 30.000 ten-second soundscapes which are partitioned to 100 edge devices. Each soundscape is generated by mixing up to five sound events (possibly overlapping) with background noise. Both datasets contain independent and identically distributed (IID) and non-IID versions, to provide a more real-world like distribution of event classes.
Due to the size of the datasets, with this download we provide the scripts and details necessary to generate the FL datasets using the source material from DESED, URBAN-8K, and IUSB.
See the above referenced paper and README contained with the data folder for further details.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
All the models in each fusion use Teacher model selection. The best results for each metric/dataset are highlighted in bold.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Results are provided over the DESED Public overlap set [20], in terms of PSDS1, PSDS2 and event-based F1 score. Two pre-training methods for Source Separation (FUSS and DESED) and two model selection criteria (Student and Teacher) are compared. The best results for each metric are highlighted in bold.
This dataset only contains test data, which is integrated into UltraEval-Audio(https://github.com/OpenBMB/UltraEval-Audio) framework.
python audio_evals/main.py --dataset desed --model gpt4o_audio
🚀超凡体验,尽在UltraEval-Audio🚀
UltraEval-Audio——全球首个同时支持语音理解和语音生成评估的开源框架,专为语音大模型评估打造,集合了34项权威Benchmark,覆盖语音、声音、医疗及音乐四大领域,支持十种语言,涵盖十二类任务。选择UltraEval-Audio,您将体验到前所未有的便捷与高效:
一键式基准管理 📥:告别繁琐的手动下载与数据处理,UltraEval-Audio为您自动化完成这一切,轻松获取所需基准测试数据。 内置评估利器… See the full description on the dataset page: https://huggingface.co/datasets/TwinkStart/DESEDpublic_eval.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
If you are in this page, you have to know that the content of this page is also in github associated.
Link to the associated github repository: https://github.com/turpaultn/Desed
Link to the paper: https://hal.inria.fr/hal-02160855
Domestic Environment Sound Event Detection (DESED) dataset.
Description
This dataset is the real part of the DESED dataset. It is a subpart of Audioset.
There is the material to:
Download the metadata of the subset of Audioset used in DCASE 2019 task 4.
Files:
DESED_real_metadata.tar.gz : Annotations of subset of Audioset.
To download the associated soundfile, please visit: https://github.com/turpaultn/DESED .