Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
----------This is an outdated version of the dataset. Please download the newest version.----------
If you make use of GuitarSet for academic purposes, please cite the following publication:
Q. Xi, R. Bittner, J. Pauwels, X. Ye, and J. P. Bello, "Guitarset: A Dataset for Guitar Transcription", in 19th International Society for Music Information Retrieval Conference, Paris, France, Sept. 2018.
This project was led by Qingyang Xi at NYU's Music and Audio Research Lab, along with Rachel Bittner, Xuzhou Ye, and Juan Pablo Bello from the same lab, as well as Johan Pauwels at the Centre for Digital Music at Queen Mary University of London.
We present GuitarSet, a dataset that provides high-quality guitar recordings alongside rich annotations and metadata.
In particular, by recording guitars with a hexaphonic pickup, we are able not only to provide recordings of the individual strings but also to largely automate the expensive annotation process, thereby providing rich annotations.
The dataset contains recordings of a variety of musical excerpts played on an acoustic guitar, along with time-aligned annotations of pitch contours, string and fret positions, chords, beats, downbeats, and playing style.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Jukebox Embeddings for the GuitarSet Dataset
Repo with Colab notebook used to extract the embeddings.
Overview
This dataset extends the GuitarSet Dataset by providing embeddings for each audio file.
Original GuitarSet Dataset
Link to the official site. GuitarSet is a dataset that provides high-quality guitar recordings alongside rich annotations and metadata. By recording guitars using a hexaphonic pickup, it provides recordings of individual strings and largely… See the full description on the dataset page: https://huggingface.co/datasets/jonflynn/guitarset_jukebox_embeddings.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A dataset of guitars with the necks of the guitars annotated, images are similar to what would be expected from a video guitar tutorial.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This collection contains mel spectrograms and annotations of 16 datasets for beat and downbeat tracking. All datasets have been used in "Beat This! Accurate beat tracking without DBN postprocessing" (Foscarin/Schlüter/Widmer, ISMIR 2024) and prior publications by other authors, but for many of these datasets, audio data is not publicly available. By publishing the spectrograms, we invite other researchers to improve the state of the art in beat and downbeat tracking.
Spectrograms for the following datasets are included in the collection:
If given, links in the above list point to locations for obtaining the original audio.
The corresponding annotations are available at https://github.com/CPJKU/beat_this_annotations. A snapshot of v1.0 is included in this collection as beat_this_annotations.zip, but you may want to use a later release.
Spectrograms are computed from monophonic audio at a sample rate of 22050 Hz with a window size of 1024 and a hop size of 441 samples (yielding 50 frames per second), processed with a mel filterbank of 128 bands from 30 Hz to 11 kHz, with magnitudes scaled as ln(1 + 1000x). They are provided in half-precision floating-point format. Spectrograms can be reproduced with torchaudio 2.3.1 from a 22050 Hz waveform tensor (resampled with soxr.resample(), if needed) via:
melspect = torchaudio.transforms.MelSpectrogram(
    sample_rate=22050, n_fft=1024, hop_length=441,
    f_min=30, f_max=11000, n_mels=128,
    mel_scale='slaney', normalized='frame_length', power=1,
)(waveform).mul(1000).log1p()
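The 50 frames-per-second figure stated above follows directly from the sample rate and hop size; a quick sanity check in plain Python (no torchaudio required):

```python
# One spectrogram frame is emitted per hop of 441 samples,
# so the frame rate is sample_rate / hop_length.
sample_rate = 22050
hop_length = 441
frames_per_second = sample_rate / hop_length
print(frames_per_second)  # 50.0
```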
For each dataset, a compressed .zip file is provided, which in turn holds an uncompressed .npz file. The .npz file holds a set of numpy arrays in subdirectories named after the annotations. Each subdirectory contains a spectrogram of the original audio file ("track.npy"), 11 pitch-shifted versions from -5 to +6 semitones ("track_ps-5.npy" to "track_ps6.npy"), and 10 time-stretched versions from -20% to +20% ("track_ts-20.npy" to "track_ts20.npy"), except for gtzan.npz, which is designated for testing and only holds the original audio files. The .npz files can be loaded in numpy via np.load(), or unzipped into a set of .npy files that can again be loaded via np.load(). We also provide code to load .npz files as memory maps for more efficiency.
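A minimal sketch of reading such an archive with np.load(). The array names ("example/track", "example/track_ps-5") follow the naming scheme described above but are assumptions here; the tiny in-memory archive is synthetic stand-in data, not part of the real collection:

```python
import io
import numpy as np

# Build a tiny synthetic .npz mimicking the described layout:
# half-precision mel spectrograms of shape (n_mels, n_frames),
# stored under subdirectory-style names.
buf = io.BytesIO()
np.savez(buf, **{
    "example/track": np.zeros((128, 500), dtype=np.float16),
    "example/track_ps-5": np.zeros((128, 500), dtype=np.float16),
})
buf.seek(0)

# np.load() on an .npz returns a lazy NpzFile; arrays are read on access.
data = np.load(buf)
print(sorted(data.files))           # lists the stored array names
spect = data["example/track"]       # mel spectrogram, (n_mels, n_frames)
print(spect.dtype, spect.shape)
data.close()
```

The same np.load() call also reads individual .npy files extracted from the archive, as the description notes.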