5 datasets found
  1. Z

    DESED_real

    • data.niaid.nih.gov
    • zenodo.org
    Updated Apr 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Serizel, Romain (2022). DESED_real [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_3562884
    Explore at:
    Dataset updated
    Apr 11, 2022
    Dataset provided by
    Serizel, Romain
    Shah, Ankit
    Turpault, Nicolas
    Salamon, Justin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    If you are in this page, you have to know that the content of this page is also in github associated.

    Link to the associated github repository: https://github.com/turpaultn/Desed

    Link to the paper: https://hal.inria.fr/hal-02160855

    Domestic Environment Sound Event Detection (DESED) dataset.

    Description

    This dataset is the real part of the DESED dataset. It is a subpart of Audioset.

    There is the material to:

    Download the metadata of the subset of Audioset used in DCASE 2019 task 4.

    Files:

    DESED_real_metadata.tar.gz : Annotations of subset of Audioset.

    To download the associated soundfile, please visit: https://github.com/turpaultn/DESED .

  2. IDMT-FL Dataset

    • zenodo.org
    zip
    Updated Nov 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis; Sascha Grollmisch; Sascha Grollmisch; Jakob Abeßer; Jakob Abeßer; Hanna Lukashevich; Hanna Lukashevich; David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis (2023). IDMT-FL Dataset [Dataset]. http://doi.org/10.5281/zenodo.7551584
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 24, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis; Sascha Grollmisch; Sascha Grollmisch; Jakob Abeßer; Jakob Abeßer; Hanna Lukashevich; Hanna Lukashevich; David S. Johnson; Wolfgang Lorenz; Michael Taenzer; Stylianos Mimilakis
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    The IDMT-DESED-FL and IDMT-URBAN-FL datasets enable research in sound event detection (SED) within a federated learning (FL) context. IDMT-DESED-FL and IDMT-URBAN-FL consist of sound events sourced from well-known DESED and URBAN-8K datasets. Each source dataset contains sound events from ten classes for the use cases of SED in domestic and urban environments, respectively. To simulate an FL scenario, the source events are mixed with background noise to generate 30.000 ten-second soundscapes which are partitioned to 100 edge devices. Each soundscape is generated by mixing up to five sound events (possibly overlapping) with background noise. Both datasets contain independent and identically distributed (IID) and non-IID versions, to provide a more real-world like distribution of event classes.

    • IDMT-DESED-FL sound event classes include alarm/bell/ringing, blender, cat, dog, dishes, electric shaver/toothbrush, frying, running water, speech, and vacuum cleaner. The background classes include apartment room, computer interior, computer lab, emergency staircase, and library.
    • IDMT-URBAN-FL sound event classes include air conditioner, car horn, children playing, dog bark, drilling, engine idling, gun shot, jackhammer, siren, and street music. Background classes for IDMT-URBAN-FL are sourced from the Isolated Urban Sound Database (IUSB), and include birds, crowd, fountain, rain, and traffic.

    Due to the size of the datasets, with this download we provide the scripts and details necessary to generate the FL datasets using the source material from DESED, URBAN-8K, and IUSB.

    See the above referenced paper and README contained with the data folder for further details.

  3. Sound Event Detection results obtained with the DCASE 2021 SSep+SED Baseline...

    • plos.figshare.com
    xls
    Updated Jul 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano (2024). Sound Event Detection results obtained with the DCASE 2021 SSep+SED Baseline system (SSep-SED Bs) and JSS model fusions over the DESED Validation (dev-test) and Public evaluation sets, in terms of PSDS1, PSDS2, and event-based F1 score. [Dataset]. http://doi.org/10.1371/journal.pone.0303994.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jul 5, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All the models in each fusion use Teacher model selection. The best results for each metric/dataset are highlighted in bold.

  4. f

    Sound Event Detection results obtained with the DCASE 2021 SED Baseline...

    • plos.figshare.com
    xls
    Updated Jul 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano (2024). Sound Event Detection results obtained with the DCASE 2021 SED Baseline system (SED Bs) and the Joint Source Separation + Sound Event Detection proposed methods: Initial model (S0), Two-stage Training (S1, S2), and Joint Training (JT). [Dataset]. http://doi.org/10.1371/journal.pone.0303994.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jul 5, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Diego de Benito-Gorrón; Katerina Zmolikova; Doroteo T. Toledano
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Results are provided over the DESED Public overlap set [20], in terms of PSDS1, PSDS2 and event-based F1 score. Two pre-training methods for Source Separation (FUSS and DESED) and two model selection criteria (Student and Teacher) are compared. The best results for each metric are highlighted in bold.

  5. h

    DESEDpublic_eval

    • huggingface.co
    Updated Oct 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shi Qundong (2024). DESEDpublic_eval [Dataset]. https://huggingface.co/datasets/TwinkStart/DESEDpublic_eval
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 10, 2024
    Authors
    Shi Qundong
    Description

    This dataset only contains test data, which is integrated into UltraEval-Audio(https://github.com/OpenBMB/UltraEval-Audio) framework.

    python audio_evals/main.py --dataset desed --model gpt4o_audio

      🚀超凡体验,尽在UltraEval-Audio🚀
    

    UltraEval-Audio——全球首个同时支持语音理解和语音生成评估的开源框架,专为语音大模型评估打造,集合了34项权威Benchmark,覆盖语音、声音、医疗及音乐四大领域,支持十种语言,涵盖十二类任务。选择UltraEval-Audio,您将体验到前所未有的便捷与高效:

    一键式基准管理 📥:告别繁琐的手动下载与数据处理,UltraEval-Audio为您自动化完成这一切,轻松获取所需基准测试数据。 内置评估利器… See the full description on the dataset page: https://huggingface.co/datasets/TwinkStart/DESEDpublic_eval.

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Serizel, Romain (2022). DESED_real [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_3562884

DESED_real

Explore at:
Dataset updated
Apr 11, 2022
Dataset provided by
Serizel, Romain
Shah, Ankit
Turpault, Nicolas
Salamon, Justin
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

If you are in this page, you have to know that the content of this page is also in github associated.

Link to the associated github repository: https://github.com/turpaultn/Desed

Link to the paper: https://hal.inria.fr/hal-02160855

Domestic Environment Sound Event Detection (DESED) dataset.

Description

This dataset is the real part of the DESED dataset. It is a subpart of Audioset.

There is the material to:

Download the metadata of the subset of Audioset used in DCASE 2019 task 4.

Files:

DESED_real_metadata.tar.gz : Annotations of subset of Audioset.

To download the associated soundfile, please visit: https://github.com/turpaultn/DESED .

Search
Clear search
Close search
Google apps
Main menu