2 datasets found

WaivOps WRLD-SMB: Open Audio Resources for Machine Learning in Music
data.niaid.nih.gov
zenodo.org
Updated Oct 12, 2024
Cite
Patchbanks (2024). WaivOps WRLD-SMB: Open Audio Resources for Machine Learning in Music [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13921289
Dataset updated
Oct 12, 2024
Dataset provided by
Patchbanks
WaivOps
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
WRLD-SMB Dataset

WRLD-SMB is an open audio dataset featuring a collection of synthetic drum recordings in the style of Brazilian samba music. It includes 1,100 audio loops recorded in uncompressed stereo WAV format, along with paired JSON files intended for the supervised training of generative AI audio models.

Overview

This dataset was developed using multi-velocity audio samples and a paired MIDI dataset. It is intended for training or fine-tuning AI models to learn high-performance drum notation, with the aim of replicating the live sound of a small drum ensemble. To facilitate augmentation and supervised training with labeled audio data, a dropout technique was applied to the rendered audio files to generate variational mixes of the drum tracks.
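The description does not say how the dropout step was implemented. As a rough illustration only, the sketch below assumes that "dropout" means leaving some instrument tracks out of each mix. The track names, sample values and the `dropout_mix` helper are all hypothetical, not taken from the dataset.

```python
import random


def dropout_mix(stems, drop_prob=0.3, rng=None):
    """Sum a random subset of equal-length stems into one mix.

    stems: dict mapping a track name to a list of float samples.
    Each stem is dropped independently with probability drop_prob;
    at least one stem is always kept so the mix is never silent.
    Returns (mix, kept_names).
    """
    rng = rng or random.Random()
    names = sorted(stems)
    kept = [name for name in names if rng.random() >= drop_prob]
    if not kept:
        kept = [rng.choice(names)]
    length = len(stems[names[0]])
    mix = [sum(stems[name][i] for name in kept) for i in range(length)]
    return mix, kept


# Hypothetical three-track ensemble with toy sample values.
stems = {
    "surdo": [0.5, 0.0, 0.5, 0.0],
    "tamborim": [0.0, 0.2, 0.0, 0.2],
    "shaker": [0.1, 0.1, 0.1, 0.1],
}

rng = random.Random(0)  # seeded so the variations are reproducible
variations = [dropout_mix(stems, drop_prob=0.3, rng=rng) for _ in range(4)]
for mix, kept in variations:
    print(kept, mix)
```

Returning the list of kept tracks alongside each mix keeps a label paired with the audio, which is what supervised training on variational mixes would need.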

The primary purpose of this dataset is to provide accessible content for machine learning applications in music and audio. Potential use cases include generative music, feature extraction, tempo detection, audio classification, rhythm analysis, drum synthesis, music information retrieval (MIR), sound design and signal processing.

Specifications

1,100 audio loops (approximately 5.5 hours)

16-bit, 44.1 kHz WAV format

Tempo range: 90–120 BPM

Paired label data (WAV + JSON)

Variational drum patterns

Subgenre styles (Traditional and modern samba, bossa nova, fusion)
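As a back-of-the-envelope check on the specifications above, the following arithmetic derives the average loop length and the approximate size of the raw audio. It assumes stereo files (as stated in the description), ignores WAV header overhead, and treats the "approximately 5.5 hours" figure as exact, so the results are estimates only.

```python
SAMPLE_RATE_HZ = 44_100
BYTES_PER_SAMPLE = 2   # 16-bit PCM
CHANNELS = 2           # stereo
NUM_LOOPS = 1_100
TOTAL_HOURS = 5.5      # approximate figure from the specifications

total_seconds = TOTAL_HOURS * 3600
avg_loop_seconds = total_seconds / NUM_LOOPS
bytes_per_second = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS
approx_total_bytes = total_seconds * bytes_per_second

print(f"average loop length: {avg_loop_seconds:.1f} s")
print(f"PCM data rate: {bytes_per_second} bytes/s")
print(f"approximate audio payload: {approx_total_bytes / 1e9:.2f} GB")
```

Under these assumptions a loop averages about 18 seconds and the full set of audio is roughly 3.5 GB.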

A JSON file is provided for referencing and converting MIDI note numbers to text labels. You can update the text labels to suit your preferences.
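The layout of the mapping file is not described here, so the sketch below assumes a flat JSON object whose keys are MIDI note numbers and whose values are text labels. The note numbers, the labels and both helper functions are hypothetical, and the real file may be organised differently.

```python
import json

# Hypothetical file contents; the real keys and labels may differ.
note_map_json = '{"36": "kick", "38": "snare", "42": "closed_hat"}'


def load_note_map(text):
    """Parse a JSON object of note numbers to labels, using int keys."""
    return {int(note): label for note, label in json.loads(text).items()}


def label_notes(notes, note_map, unknown="unlabeled"):
    """Translate MIDI note numbers to text labels, flagging unmapped notes."""
    return [note_map.get(note, unknown) for note in notes]


note_map = load_note_map(note_map_json)
note_map[36] = "surdo"  # update a label to suit your preferences

labels = label_notes([36, 42, 99], note_map)
print(labels)
```

JSON object keys are always strings, so converting them to integers once at load time avoids repeated casts when looking up note numbers.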

License

This dataset was compiled by WaivOps, a crowdsourced music project managed by the sound label company Patchbanks. All recordings have been compiled by verified sources for copyright clearance.

The WRLD-SMB dataset is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0).

Additional Info

For audio examples or more information about this dataset, please refer to the GitHub repository.
LSD4WSD : An Open Dataset for Wet Snow Detection with SAR Data and Physical...
explore.openaire.eu
zenodo.org
Updated Jan 1, 2023
Cite
Matthieu Gallet; Abdourrahmane Atto; Fatima Karbou; Emmanuel Trouvé (2023). LSD4WSD : An Open Dataset for Wet Snow Detection with SAR Data and Physical Labelling [Dataset]. http://doi.org/10.5281/zenodo.8111484
Unique identifier
https://doi.org/10.5281/zenodo.8111484
Dataset updated
Jan 1, 2023
Authors
Matthieu Gallet; Abdourrahmane Atto; Fatima Karbou; Emmanuel Trouvé
Description
LSD4WSD V2.0: Learning SAR Dataset for Wet Snow Detection - Full Analysis Version.

The aim of this dataset is to provide a basis for machine learning approaches to wet snow detection. It is based on Sentinel-1 SAR GRD satellite images acquired between August 2020 and August 2021 over the French Alps. This new version is no longer restricted to a classification task, and provides a set of metadata for each sample.

Modifications and improvements in version 2.0.0:

Number of massifs: 7 new massifs added to cover all the Sentinel-1 images (cf. info.pdf).
Acquisition: images from the descending pass added to those originally used from the ascending pass.
Sample: sample size reduced to 15 by 15 to facilitate evaluation at the central pixel.
Sample: increased density of extracted windows, with approximately 500 meters between window centers.
Sample: logarithmic pre-processing removed.
Sample: normalisation pre-processing removed.
Labels: new structure for the labels, a dictionary with the keys topography, metadata and physics.
Labels, physics: direct information from the CROCUS model added for 3 simulations: Liquid Water Content, snow height and minimum snowpack temperature.
Labels, topography: information on the slope, altitude and average orientation of the sample.
Labels, metadata: information on the date of the sample, the mountain massif and the run (ascending or descending).
Dataset: train/test split removed. We leave it up to the user to use the Group Kfold method to validate models using the alpine massif information.

In total, the dataset consists of 2,467,516 samples of size 15 by 15 by 9.
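To illustrate the grouped validation suggested above, here is a minimal standard-library sketch that builds folds in which no massif appears on both the training and the test side. The massif labels are placeholders and `group_kfold` is a hypothetical helper, not part of the dataset's tooling. In practice, scikit-learn's `GroupKFold` (in `sklearn.model_selection`) provides the same guarantee.

```python
from collections import defaultdict


def group_kfold(groups, n_splits):
    """Yield (train_idx, test_idx) pairs so that no group spans both sides.

    Groups are assigned greedily, largest first, to the currently
    smallest fold, which keeps fold sizes roughly balanced.
    """
    members = defaultdict(list)
    for idx, group in enumerate(groups):
        members[group].append(idx)
    if n_splits > len(members):
        raise ValueError("n_splits cannot exceed the number of groups")

    folds = [[] for _ in range(n_splits)]
    for group in sorted(members, key=lambda g: (-len(members[g]), g)):
        min(folds, key=len).extend(members[group])

    for k in range(n_splits):
        test = sorted(folds[k])
        train = sorted(i for j, fold in enumerate(folds) if j != k for i in fold)
        yield train, test


# Placeholder massif labels for eight samples (not real dataset values).
massifs = ["massif_a", "massif_a", "massif_b", "massif_b",
           "massif_b", "massif_c", "massif_d", "massif_d"]

splits = list(group_kfold(massifs, n_splits=2))
for train, test in splits:
    print("train:", train, "test:", test)
```

Splitting by massif rather than by sample matters here because the extracted windows are densely spaced, so neighbouring samples from the same massif are likely to be strongly correlated.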
For each sample, 9 metadata values are provided, drawing in particular on the Crocus physical model:

topography: elevation (meters, average), orientation (degrees, average), slope (degrees, average)
metadata: name of the alpine massif, date of acquisition, type of acquisition (ascending/descending)
physics: Liquid Water Content (kg/m2), snow height (m), minimum snowpack temperature (degrees Celsius)

The 9 channels are in the following order:

Sentinel-1 polarimetric channels: VV, VH and the combination C: VV/VH in linear scale
Topographical features: altitude, orientation, slope
Polarimetric ratios with a reference summer image: VV/VVref, VH/VHref, C/Cref

The reference image is that of August 9th 2020, selected as a snow-free reference (cf. Nagler et al.).

An overview of the distribution and a summary of the sample statistics can be found in the file info.pdf. The data is stored in .hdf5 format with gzip compression. We provide a Python script, dataset_load.py, to read and query the data. It is based on the h5py, numpy and pandas libraries, and allows selecting part or all of the dataset using queries on the metadata. The script is documented and can be used as described in the README.md file. The processing chain is available at the following GitHub address.

The authors would like to acknowledge the support from the National Centre for Space Studies (CNES) in providing computing facilities and access to SAR images via the PEPS platform. The authors would like to deeply thank Mathieu Fructus for running the Crocus simulations.
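The channel order given above can be captured in a small lookup, which is convenient when evaluating at the central pixel of a window. This sketch assumes a sample is indexed as [row][column][channel], matching the stated 15 by 15 by 9 size; the actual axis order in the files should be checked against README.md. The toy sample and the `central_pixel` helper are hypothetical.

```python
# Channel names in the order given in the dataset description.
CHANNELS = (
    "VV", "VH", "C",                     # Sentinel-1 polarimetric channels
    "altitude", "orientation", "slope",  # topographical features
    "VV/VVref", "VH/VHref", "C/Cref",    # ratios with the summer reference
)
WINDOW = 15
CENTER = WINDOW // 2  # index of the central pixel along each spatial axis


def central_pixel(sample):
    """Return {channel name: value} at the central pixel of one sample.

    Assumes the sample is indexed as [row][column][channel].
    """
    pixel = sample[CENTER][CENTER]
    if len(pixel) != len(CHANNELS):
        raise ValueError(f"expected {len(CHANNELS)} channels, got {len(pixel)}")
    return dict(zip(CHANNELS, (float(value) for value in pixel)))


# Toy sample in which every pixel simply stores its own channel index.
sample = [[[float(c) for c in range(len(CHANNELS))]
           for _ in range(WINDOW)]
          for _ in range(WINDOW)]

center = central_pixel(sample)
print(center["VH"], center["C/Cref"])
```

Because the helper only uses plain indexing, the same function also works on a NumPy array of shape (15, 15, 9).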
Erratum: In the dataloader file, the name of the "aquisition" column must be added twice; see the correction below:

dtst_ld = Dataset_loader(path_dataset, shuffle=False, descrp=["date", "massif", "aquisition", "aquisition", "elevation", "slope", "orientation", "tmin", "hsnow", "tel"])

If you have any comments, questions or suggestions, please contact the authors:
matthieu.gallet@univ-smb.fr
fatima.karbou@meteo.fr
abdourrahmane.atto@univ-smb.fr
emmanuel.trouve@univ-smb.fr