24 datasets found
  1. h

    voxceleb

    • huggingface.co
    Updated Aug 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paul C (2023). voxceleb [Dataset]. http://doi.org/10.57967/hf/0999
    Explore at:
    Dataset updated
    Aug 27, 2023
    Authors
    Paul C
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset includes both VoxCeleb and VoxCeleb2

      Multipart Zips
    

    Already joined zips for convenience but these specified files are NOT part of the original datasets vox2_mp4_1.zip - vox2_mp4_6.zip vox2_aac_1.zip - vox2_aac_2.zip

      Joining Zip
    

    cat vox1_dev* > vox1_dev_wav.zip

    cat vox2_dev_aac* > vox2_aac.zip

    cat vox2_dev_mp4* > vox2_mp4.zip

      Citation Information
    

    @article{Nagrani19, author = "Arsha Nagrani and Joon~Son Chung and Weidi Xie and Andrew… See the full description on the dataset page: https://huggingface.co/datasets/ProgramComputer/voxceleb.

  2. T

    voxceleb

    • tensorflow.org
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). voxceleb [Dataset]. https://www.tensorflow.org/datasets/catalog/voxceleb
    Explore at:
    Dataset updated
    Dec 6, 2022
    Description

    An large scale dataset for speaker identification. This data is collected from over 1,251 speakers, with over 150k samples in total. This release contains the audio part of the voxceleb1.1 dataset.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('voxceleb', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  3. h

    VoxCeleb

    • huggingface.co
    Updated Oct 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shi Qundong (2024). VoxCeleb [Dataset]. https://huggingface.co/datasets/TwinkStart/VoxCeleb
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 10, 2024
    Authors
    Shi Qundong
    Description

    This dataset only contains test data, which is integrated into UltraEval-Audio(https://github.com/OpenBMB/UltraEval-Audio) framework.

    python audio_evals/main.py --dataset voxceleb1 --model gpt4o_audio

    python audio_evals/main.py --dataset voxceleb2 --model gpt4o_audio

      🚀超凡体验,尽在UltraEval-Audio🚀
    

    UltraEval-Audio——全球首个同时支持语音理解和语音生成评估的开源框架,专为语音大模型评估打造,集合了34项权威Benchmark,覆盖语音、声音、医疗及音乐四大领域,支持十种语言,涵盖十二类任务。选择UltraEval-Audio,您将体验到前所未有的便捷与高效:

    一键式基准管理… See the full description on the dataset page: https://huggingface.co/datasets/TwinkStart/VoxCeleb.

  4. a

    voxceleb

    • academictorrents.com
    bittorrent
    Updated Apr 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joon Son Chung and Arsha Nagrani and Andrew Zisserman (2024). voxceleb [Dataset]. https://academictorrents.com/details/bdd9f57a6f47aa197f502b68bc0195f5ac786ec4
    Explore at:
    bittorrent(274288526425)Available download formats
    Dataset updated
    Apr 3, 2024
    Dataset authored and provided by
    Joon Son Chung and Arsha Nagrani and Andrew Zisserman
    License

    https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

    Description

    This torrent shares the VoxCeleb1 and VoxCeleb2 datasets. The original dataset creators do not provide access to the dataset anymore. To ensure papers in the field of speaker recognition can be reproduced (many have used VoxCeleb in recent years) the data should be available for academic purposes. The audio data is stored as mono-channel, 16000hz, signed 16-bit (little-endian) PCM wav files. This torrent does not include video data.

  5. h

    voxceleb

    • huggingface.co
    Updated May 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    martin (2025). voxceleb [Dataset]. https://huggingface.co/datasets/Drazic/voxceleb
    Explore at:
    Dataset updated
    May 28, 2025
    Authors
    martin
    Description

    Drazic/voxceleb dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    SentimentAnalysis_SLUE-VoxCeleb

    • huggingface.co
    Updated Aug 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dynamic-SUPERB (2024). SentimentAnalysis_SLUE-VoxCeleb [Dataset]. https://huggingface.co/datasets/DynamicSuperb/SentimentAnalysis_SLUE-VoxCeleb
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 23, 2024
    Dataset authored and provided by
    Dynamic-SUPERB
    Description

    DynamicSuperb/SentimentAnalysis_SLUE-VoxCeleb dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. Voxceleb-1-Dataset

    • kaggle.com
    zip
    Updated May 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arya Gokhale (2025). Voxceleb-1-Dataset [Dataset]. https://www.kaggle.com/datasets/aryagokh/voxceleb-1-dataset
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    May 12, 2025
    Authors
    Arya Gokhale
    Description

    Dataset

    This dataset was created by Arya Gokhale

    Contents

  8. VoxCeleb with Noisy Labels

    • zenodo.org
    txt
    Updated Jul 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhihua Fang; Zhihua Fang (2025). VoxCeleb with Noisy Labels [Dataset]. http://doi.org/10.5281/zenodo.15927629
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 15, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Zhihua Fang; Zhihua Fang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The training lists with noisy labels based on the VoxCeleb dataset

    The clean training list for VoxCeleb1 is vox1_clean.txt.

    The clean training list for VoxCeleb2 is vox2_clean.txt.

    The noisy training lists for VoxCeleb1 are formatted as vox1_[noisy_type]_[noisy_rate].txt.

    The noisy training lists for VoxCeleb2 are formatted as vox2_[noisy_type]_[noisy_rate].txt.

    The noisy training lists for VoxCeleb1K-O are formatted as vox1k_[noisy_type]_[noisy_rate].txt.

    The noisy training lists for VoxCeleb5K-O are formatted as vox5k_[noisy_type]_[noisy_rate].txt.

    The evaluation lists are vox_O.txt, vox_E.txt, and vox_H.txt.

  9. h

    voxceleb-language-metadata

    • huggingface.co
    Updated Apr 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Backsund (2025). voxceleb-language-metadata [Dataset]. https://huggingface.co/datasets/johbac/voxceleb-language-metadata
    Explore at:
    Dataset updated
    Apr 6, 2025
    Authors
    John Backsund
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    VoxCeleb2 Language-Detected Subset

    This dataset is a language-labeled version of the VoxCeleb2 speaker identification dataset. It was created using the ProgramComputer/voxceleb Hugging Face dataset and the speechbrain/lang-id-voxlingua107-ecapa language identification model.

      Dataset Contents
    

    The dataset consists of two CSV files:

    audio_clips_meta_data.csvContains metadata for each audio clip, including:

    clip_id: Unique identifier for the audio clip. speaker_id: ID… See the full description on the dataset page: https://huggingface.co/datasets/johbac/voxceleb-language-metadata.

  10. voxceleb

    • kaggle.com
    zip
    Updated Dec 4, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yosra Hashem (2020). voxceleb [Dataset]. https://www.kaggle.com/yosrahashem/voxceleb
    Explore at:
    zip(13979898 bytes)Available download formats
    Dataset updated
    Dec 4, 2020
    Authors
    Yosra Hashem
    Description

    Dataset

    This dataset was created by Yosra Hashem

    Contents

    It contains the following files:

  11. voxceleb

    • kaggle.com
    Updated Jan 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    K Tharun Chowdary (2024). voxceleb [Dataset]. https://www.kaggle.com/datasets/ktharunchowdary/voxceleb/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 2, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    K Tharun Chowdary
    Description

    Dataset

    This dataset was created by K Tharun Chowdary

    Contents

  12. h

    voxceleb-Mimi-filtered

    • huggingface.co
    Updated May 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Diabolocom (2025). voxceleb-Mimi-filtered [Dataset]. https://huggingface.co/datasets/diabolocom/voxceleb-Mimi-filtered
    Explore at:
    Dataset updated
    May 2, 2025
    Dataset authored and provided by
    Diabolocom
    Description

    diabolocom/voxceleb-Mimi-filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. SLF Evaluation Dataset

    • zenodo.org
    bin, csv
    Updated Jul 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmed Abotaleb; Ahmed Abotaleb (2024). SLF Evaluation Dataset [Dataset]. http://doi.org/10.5281/zenodo.12706833
    Explore at:
    bin, csvAvailable download formats
    Dataset updated
    Jul 13, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Ahmed Abotaleb; Ahmed Abotaleb
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This dataset was constructed from the test set split of the VoxCeleb 2 dataset (VoxCeleb). The VoxCeleb 2 test set contains 118 speakers each in several different videos. To develop this dataset, only one video per speaker was selected. A face image was also extracted from the video, as well as, a low resolution face image (8x8). Age, gender and ethnicity of the person in the face image were determined using the “DeepFace” library, a face recognition and facial attribute analysis library.

    This dataset can be used to evaluate speech2face, speech conditioned face generation and speech conditioned face super-resolution systems.

  14. O

    Voxceleb-3D

    • opendatalab.com
    zip
    Updated Mar 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Southern California (2023). Voxceleb-3D [Dataset]. https://opendatalab.com/OpenDataLab/Voxceleb-3D
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 22, 2023
    Dataset provided by
    University of Southern California
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    We propose a novel Voxceleb-3D dataset that includes paired voices and 3D face models. Voxceleb-3D is inherited from two widely used datasets: Voxceleb) and VGGFace, which include voice and face images of celebrities, respectively.

  15. h

    voxceleb-sentiment

    • huggingface.co
    Updated Jul 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). voxceleb-sentiment [Dataset]. https://huggingface.co/datasets/mteb/voxceleb-sentiment
    Explore at:
    Dataset updated
    Jul 6, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    Description

    mteb/voxceleb-sentiment dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    VoxCeleb-Gender

    • huggingface.co
    Updated Jul 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmed Elsayed Rashad (2025). VoxCeleb-Gender [Dataset]. https://huggingface.co/datasets/ahmedelsayed/VoxCeleb-Gender
    Explore at:
    Dataset updated
    Jul 27, 2025
    Authors
    Ahmed Elsayed Rashad
    Description

    ahmedelsayed/VoxCeleb-Gender dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. O

    Voice Gender Detection

    • opendatalab.com
    zip
    Updated Apr 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Voice Gender Detection [Dataset]. https://opendatalab.com/OpenDataLab/Voice_Gender_Detection
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 23, 2023
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Cleaned Dataset for Voice gender detection using the VoxCeleb dataset (7000+ unique speakers and utterances, 3683 males / 2312 females). The VoxCeleb is an audio-visual dataset consisting of short clips of human speech, extracted from interview videos uploaded to YouTube. VoxCeleb contains speech from speakers spanning a wide range of different ethnicities, accents, professions and ages.

  18. h

    vox2-veri-3s

    • huggingface.co
    Updated Jul 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang Wang (2023). vox2-veri-3s [Dataset]. https://huggingface.co/datasets/yangwang825/vox2-veri-3s
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 1, 2023
    Authors
    Yang Wang
    Description

    VoxCeleb 2

    VoxCeleb2 contains over 1 million utterances for 6,112 celebrities, extracted from videos uploaded to YouTube.

      Verification Split
    

    train validation test

    of speakers

    5,994 5,994 118

    of samples

    982,808 109,201 36,237

      Data Fields
    

    ID (string): The ID of the sample with format

  19. h

    CapTTS-SFT-voxceleb-cleaned

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jaehoon Kang, CapTTS-SFT-voxceleb-cleaned [Dataset]. https://huggingface.co/datasets/morateng/CapTTS-SFT-voxceleb-cleaned
    Explore at:
    Authors
    Jaehoon Kang
    Description

    morateng/CapTTS-SFT-voxceleb-cleaned dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    vox1-iden-full

    • huggingface.co
    Updated Jun 5, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang Wang (2023). vox1-iden-full [Dataset]. https://huggingface.co/datasets/yangwang825/vox1-iden-full
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 5, 2023
    Authors
    Yang Wang
    Description

    VoxCeleb 1

    VoxCeleb1 contains over 100,000 utterances for 1,251 celebrities, extracted from videos uploaded to YouTube.

      Identification Split
    

    train validation test

    of speakers

    1251 1251 1251

    of samples

    138361 6904 8251

      References
    

    https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Paul C (2023). voxceleb [Dataset]. http://doi.org/10.57967/hf/0999

voxceleb

ProgramComputer/voxceleb

Explore at:
Dataset updated
Aug 27, 2023
Authors
Paul C
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This dataset includes both VoxCeleb and VoxCeleb2

  Multipart Zips

Already joined zips for convenience but these specified files are NOT part of the original datasets vox2_mp4_1.zip - vox2_mp4_6.zip vox2_aac_1.zip - vox2_aac_2.zip

  Joining Zip

cat vox1_dev* > vox1_dev_wav.zip

cat vox2_dev_aac* > vox2_aac.zip

cat vox2_dev_mp4* > vox2_mp4.zip

  Citation Information

@article{Nagrani19, author = "Arsha Nagrani and Joon~Son Chung and Weidi Xie and Andrew… See the full description on the dataset page: https://huggingface.co/datasets/ProgramComputer/voxceleb.

Search
Clear search
Close search
Google apps
Main menu