24 datasets found

h
voxceleb
huggingface.co
Updated Aug 27, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paul C (2023). voxceleb [Dataset]. http://doi.org/10.57967/hf/0999
Explore at:
Unique identifier
https://doi.org/10.57967/hf/0999
Dataset updated
Aug 27, 2023
Authors
Paul C
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset includes both VoxCeleb and VoxCeleb2

Multipart Zips

Already joined zips for convenience but these specified files are NOT part of the original datasets vox2_mp4_1.zip - vox2_mp4_6.zip vox2_aac_1.zip - vox2_aac_2.zip

Joining Zip

cat vox1_dev* > vox1_dev_wav.zip

cat vox2_dev_aac* > vox2_aac.zip

cat vox2_dev_mp4* > vox2_mp4.zip

Citation Information

@article{Nagrani19, author = "Arsha Nagrani and Joon~Son Chung and Weidi Xie and Andrew… See the full description on the dataset page: https://huggingface.co/datasets/ProgramComputer/voxceleb.
T
voxceleb
tensorflow.org
Updated Dec 6, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). voxceleb [Dataset]. https://www.tensorflow.org/datasets/catalog/voxceleb
Explore at:
Dataset updated
Dec 6, 2022
Description
An large scale dataset for speaker identification. This data is collected from over 1,251 speakers, with over 150k samples in total. This release contains the audio part of the voxceleb1.1 dataset.

To use this dataset:

import tensorflow_datasets as tfds ds = tfds.load('voxceleb', split='train') for ex in ds.take(4): print(ex)

See the guide for more informations on tensorflow_datasets.
h
VoxCeleb
huggingface.co
Updated Oct 10, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shi Qundong (2024). VoxCeleb [Dataset]. https://huggingface.co/datasets/TwinkStart/VoxCeleb
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 10, 2024
Authors
Shi Qundong
Description
This dataset only contains test data, which is integrated into UltraEval-Audio(https://github.com/OpenBMB/UltraEval-Audio) framework.

python audio_evals/main.py --dataset voxceleb1 --model gpt4o_audio

python audio_evals/main.py --dataset voxceleb2 --model gpt4o_audio

🚀超凡体验，尽在UltraEval-Audio🚀

UltraEval-Audio——全球首个同时支持语音理解和语音生成评估的开源框架，专为语音大模型评估打造，集合了34项权威Benchmark，覆盖语音、声音、医疗及音乐四大领域，支持十种语言，涵盖十二类任务。选择UltraEval-Audio，您将体验到前所未有的便捷与高效：

一键式基准管理… See the full description on the dataset page: https://huggingface.co/datasets/TwinkStart/VoxCeleb.
a
voxceleb
academictorrents.com
bittorrent
Updated Apr 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Joon Son Chung and Arsha Nagrani and Andrew Zisserman (2024). voxceleb [Dataset]. https://academictorrents.com/details/bdd9f57a6f47aa197f502b68bc0195f5ac786ec4
Explore at:
bittorrent(274288526425)Available download formats
Dataset updated
Apr 3, 2024
Dataset authored and provided by
Joon Son Chung and Arsha Nagrani and Andrew Zisserman
License
https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified
Description
This torrent shares the VoxCeleb1 and VoxCeleb2 datasets. The original dataset creators do not provide access to the dataset anymore. To ensure papers in the field of speaker recognition can be reproduced (many have used VoxCeleb in recent years) the data should be available for academic purposes. The audio data is stored as mono-channel, 16000hz, signed 16-bit (little-endian) PCM wav files. This torrent does not include video data.
h
voxceleb
huggingface.co
Updated May 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
martin (2025). voxceleb [Dataset]. https://huggingface.co/datasets/Drazic/voxceleb
Explore at:
Dataset updated
May 28, 2025
Authors
martin
Description
Drazic/voxceleb dataset hosted on Hugging Face and contributed by the HF Datasets community
h
SentimentAnalysis_SLUE-VoxCeleb
huggingface.co
Updated Aug 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dynamic-SUPERB (2024). SentimentAnalysis_SLUE-VoxCeleb [Dataset]. https://huggingface.co/datasets/DynamicSuperb/SentimentAnalysis_SLUE-VoxCeleb
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 23, 2024
Dataset authored and provided by
Dynamic-SUPERB
Description
DynamicSuperb/SentimentAnalysis_SLUE-VoxCeleb dataset hosted on Hugging Face and contributed by the HF Datasets community
Voxceleb-1-Dataset
kaggle.com
zip
Updated May 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Arya Gokhale (2025). Voxceleb-1-Dataset [Dataset]. https://www.kaggle.com/datasets/aryagokh/voxceleb-1-dataset
Explore at:
zip(0 bytes)Available download formats
Dataset updated
May 12, 2025
Authors
Arya Gokhale
Description
Dataset

This dataset was created by Arya Gokhale

Contents
VoxCeleb with Noisy Labels
zenodo.org
txt
Updated Jul 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zhihua Fang; Zhihua Fang (2025). VoxCeleb with Noisy Labels [Dataset]. http://doi.org/10.5281/zenodo.15927629
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.15927629
Dataset updated
Jul 15, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Zhihua Fang; Zhihua Fang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The training lists with noisy labels based on the VoxCeleb dataset

The clean training list for VoxCeleb1 is vox1_clean.txt.

The clean training list for VoxCeleb2 is vox2_clean.txt.

The noisy training lists for VoxCeleb1 are formatted as vox1_[noisy_type]_[noisy_rate].txt.

The noisy training lists for VoxCeleb2 are formatted as vox2_[noisy_type]_[noisy_rate].txt.

The noisy training lists for VoxCeleb1K-O are formatted as vox1k_[noisy_type]_[noisy_rate].txt.

The noisy training lists for VoxCeleb5K-O are formatted as vox5k_[noisy_type]_[noisy_rate].txt.

The evaluation lists are vox_O.txt, vox_E.txt, and vox_H.txt.
h
voxceleb-language-metadata
huggingface.co
Updated Apr 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
John Backsund (2025). voxceleb-language-metadata [Dataset]. https://huggingface.co/datasets/johbac/voxceleb-language-metadata
Explore at:
Dataset updated
Apr 6, 2025
Authors
John Backsund
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
VoxCeleb2 Language-Detected Subset

This dataset is a language-labeled version of the VoxCeleb2 speaker identification dataset. It was created using the ProgramComputer/voxceleb Hugging Face dataset and the speechbrain/lang-id-voxlingua107-ecapa language identification model.

Dataset Contents

The dataset consists of two CSV files:

audio_clips_meta_data.csvContains metadata for each audio clip, including:

clip_id: Unique identifier for the audio clip. speaker_id: ID… See the full description on the dataset page: https://huggingface.co/datasets/johbac/voxceleb-language-metadata.
voxceleb
kaggle.com
zip
Updated Dec 4, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yosra Hashem (2020). voxceleb [Dataset]. https://www.kaggle.com/yosrahashem/voxceleb
Explore at:
zip(13979898 bytes)Available download formats
Dataset updated
Dec 4, 2020
Authors
Yosra Hashem
Description
Dataset

This dataset was created by Yosra Hashem

Contents

It contains the following files:
voxceleb
kaggle.com
Updated Jan 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
K Tharun Chowdary (2024). voxceleb [Dataset]. https://www.kaggle.com/datasets/ktharunchowdary/voxceleb/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 2, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
K Tharun Chowdary
Description
Dataset

This dataset was created by K Tharun Chowdary

Contents
h
voxceleb-Mimi-filtered
huggingface.co
Updated May 2, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Diabolocom (2025). voxceleb-Mimi-filtered [Dataset]. https://huggingface.co/datasets/diabolocom/voxceleb-Mimi-filtered
Explore at:
Dataset updated
May 2, 2025
Dataset authored and provided by
Diabolocom
Description
diabolocom/voxceleb-Mimi-filtered dataset hosted on Hugging Face and contributed by the HF Datasets community
SLF Evaluation Dataset
zenodo.org
bin, csv
Updated Jul 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ahmed Abotaleb; Ahmed Abotaleb (2024). SLF Evaluation Dataset [Dataset]. http://doi.org/10.5281/zenodo.12706833
Explore at:
bin, csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.12706833
Dataset updated
Jul 13, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Ahmed Abotaleb; Ahmed Abotaleb
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
This dataset was constructed from the test set split of the VoxCeleb 2 dataset (VoxCeleb). The VoxCeleb 2 test set contains 118 speakers each in several different videos. To develop this dataset, only one video per speaker was selected. A face image was also extracted from the video, as well as, a low resolution face image (8x8). Age, gender and ethnicity of the person in the face image were determined using the “DeepFace” library, a face recognition and facial attribute analysis library.

This dataset can be used to evaluate speech2face, speech conditioned face generation and speech conditioned face super-resolution systems.
O
Voxceleb-3D
opendatalab.com
zip
Updated Mar 22, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
University of Southern California (2023). Voxceleb-3D [Dataset]. https://opendatalab.com/OpenDataLab/Voxceleb-3D
Explore at:
zipAvailable download formats
Dataset updated
Mar 22, 2023
Dataset provided by
University of Southern California
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
We propose a novel Voxceleb-3D dataset that includes paired voices and 3D face models. Voxceleb-3D is inherited from two widely used datasets: Voxceleb) and VGGFace, which include voice and face images of celebrities, respectively.
h
voxceleb-sentiment
huggingface.co
Updated Jul 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Massive Text Embedding Benchmark (2025). voxceleb-sentiment [Dataset]. https://huggingface.co/datasets/mteb/voxceleb-sentiment
Explore at:
Dataset updated
Jul 6, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
Description
mteb/voxceleb-sentiment dataset hosted on Hugging Face and contributed by the HF Datasets community
h
VoxCeleb-Gender
huggingface.co
Updated Jul 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ahmed Elsayed Rashad (2025). VoxCeleb-Gender [Dataset]. https://huggingface.co/datasets/ahmedelsayed/VoxCeleb-Gender
Explore at:
Dataset updated
Jul 27, 2025
Authors
Ahmed Elsayed Rashad
Description
ahmedelsayed/VoxCeleb-Gender dataset hosted on Hugging Face and contributed by the HF Datasets community
O
Voice Gender Detection
opendatalab.com
zip
Updated Apr 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Voice Gender Detection [Dataset]. https://opendatalab.com/OpenDataLab/Voice_Gender_Detection
Explore at:
zipAvailable download formats
Dataset updated
Apr 23, 2023
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Cleaned Dataset for Voice gender detection using the VoxCeleb dataset (7000+ unique speakers and utterances, 3683 males / 2312 females). The VoxCeleb is an audio-visual dataset consisting of short clips of human speech, extracted from interview videos uploaded to YouTube. VoxCeleb contains speech from speakers spanning a wide range of different ethnicities, accents, professions and ages.
h
vox2-veri-3s
huggingface.co
Updated Jul 1, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yang Wang (2023). vox2-veri-3s [Dataset]. https://huggingface.co/datasets/yangwang825/vox2-veri-3s
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 1, 2023
Authors
Yang Wang
Description
VoxCeleb 2

VoxCeleb2 contains over 1 million utterances for 6,112 celebrities, extracted from videos uploaded to YouTube.

Verification Split

train validation test

of speakers

5,994 5,994 118

of samples

982,808 109,201 36,237

Data Fields

ID (string): The ID of the sample with format
h
CapTTS-SFT-voxceleb-cleaned
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jaehoon Kang, CapTTS-SFT-voxceleb-cleaned [Dataset]. https://huggingface.co/datasets/morateng/CapTTS-SFT-voxceleb-cleaned
Explore at:
Authors
Jaehoon Kang
Description
morateng/CapTTS-SFT-voxceleb-cleaned dataset hosted on Hugging Face and contributed by the HF Datasets community
h
vox1-iden-full
huggingface.co
Updated Jun 5, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yang Wang (2023). vox1-iden-full [Dataset]. https://huggingface.co/datasets/yangwang825/vox1-iden-full
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 5, 2023
Authors
Yang Wang
Description
VoxCeleb 1

VoxCeleb1 contains over 100,000 utterances for 1,251 celebrities, extracted from videos uploaded to YouTube.

Identification Split

train validation test

of speakers

1251 1251 1251

of samples

138361 6904 8251

References

https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html

Facebook

Twitter

Click to copy link

Link copied

Cite

Paul C (2023). voxceleb [Dataset]. http://doi.org/10.57967/hf/0999

voxceleb

ProgramComputer/voxceleb

Explore at:

Unique identifier

https://doi.org/10.57967/hf/0999

Dataset updated

Aug 27, 2023

Authors

Paul C

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This dataset includes both VoxCeleb and VoxCeleb2

  Multipart Zips

Already joined zips for convenience but these specified files are NOT part of the original datasets vox2_mp4_1.zip - vox2_mp4_6.zip vox2_aac_1.zip - vox2_aac_2.zip

  Joining Zip

cat vox1_dev* > vox1_dev_wav.zip

cat vox2_dev_aac* > vox2_aac.zip

cat vox2_dev_mp4* > vox2_mp4.zip

  Citation Information

@article{Nagrani19, author = "Arsha Nagrani and Joon~Son Chung and Weidi Xie and Andrew… See the full description on the dataset page: https://huggingface.co/datasets/ProgramComputer/voxceleb.

Clear search

Close search

Google apps

Main menu

voxceleb

voxceleb

VoxCeleb

voxceleb

voxceleb

SentimentAnalysis_SLUE-VoxCeleb

Voxceleb-1-Dataset

Dataset

Contents

VoxCeleb with Noisy Labels

voxceleb-language-metadata

voxceleb

Dataset

Contents

voxceleb

Dataset

Contents

voxceleb-Mimi-filtered

SLF Evaluation Dataset

Voxceleb-3D

voxceleb-sentiment

VoxCeleb-Gender

Voice Gender Detection

vox2-veri-3s

of speakers

of samples

CapTTS-SFT-voxceleb-cleaned

vox1-iden-full

of speakers

of samples

voxceleb

ProgramComputer/voxceleb