2 datasets found

h
libritts
huggingface.co
Updated Feb 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mythic Infinity (2024). libritts [Dataset]. https://huggingface.co/datasets/mythicinfinity/libritts
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 9, 2024
Dataset authored and provided by
Mythic Infinity
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Card for LibriTTS

LibriTTS is a multi-speaker English corpus of approximately 585 hours of read English speech at 24kHz sampling rate, prepared by Heiga Zen with the assistance of Google Speech and Google Brain team members. The LibriTTS corpus is designed for TTS research. It is derived from the original materials (mp3 audio files from LibriVox and text files from Project Gutenberg) of the LibriSpeech corpus.

Overview

This is the LibriTTS dataset, adapted… See the full description on the dataset page: https://huggingface.co/datasets/mythicinfinity/libritts.
T
libritts
tensorflow.org
opendatalab.com
Updated Dec 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). libritts [Dataset]. https://www.tensorflow.org/datasets/catalog/libritts
Explore at:
Dataset updated
Dec 13, 2022
Description
LibriTTS is a multi-speaker English corpus of approximately 585 hours of read English speech at 24kHz sampling rate, prepared by Heiga Zen with the assistance of Google Speech and Google Brain team members. The LibriTTS corpus is designed for TTS research. It is derived from the original materials (mp3 audio files from LibriVox and text files from Project Gutenberg) of the LibriSpeech corpus. The main differences from the LibriSpeech corpus are listed below:

The audio files are at 24kHz sampling rate.

The speech is split at sentence breaks.

Both original and normalized texts are included.

Contextual information (e.g., neighbouring sentences) can be extracted.

Utterances with significant background noise are excluded.

To use this dataset:

import tensorflow_datasets as tfds ds = tfds.load('libritts', split='train') for ex in ds.take(4): print(ex)

See the guide for more informations on tensorflow_datasets.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Mythic Infinity (2024). libritts [Dataset]. https://huggingface.co/datasets/mythicinfinity/libritts

libritts

mythicinfinity/libritts

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Feb 9, 2024

Dataset authored and provided by

Mythic Infinity

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Dataset Card for LibriTTS

LibriTTS is a multi-speaker English corpus of approximately 585 hours of read English speech at 24kHz sampling rate, prepared by Heiga Zen with the assistance of Google Speech and Google Brain team members. The LibriTTS corpus is designed for TTS research. It is derived from the original materials (mp3 audio files from LibriVox and text files from Project Gutenberg) of the LibriSpeech corpus.

  Overview

This is the LibriTTS dataset, adapted… See the full description on the dataset page: https://huggingface.co/datasets/mythicinfinity/libritts.

Clear search

Close search

Google apps

Main menu