Saved datasets
Last updated
Download format
Croissant
Croissant is a format for Machine Learning datasets
Learn more about this at mlcommons.org/croissant.
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Provider
Free
Cost to access
Described as free to access or have a license that allows redistribution.
100+ datasets found
  1. Hindi Speech Recognition Dataset

    • kaggle.com
    zip
    Updated Jan 21, 2026
    + more versions
  2. F

    Hindi Agent-Customer Chat Dataset for Healthcare Domain

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
  3. Customer Service Conversations in English, Hindi

    • kaggle.com
    zip
    Updated Jul 30, 2025
  4. F

    Hindi General Conversation Speech Dataset for ASR

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
  5. Hindi Call Center Conversation Dataset – Real customer-agent dialogues for...

    • datarade.ai
    .wav, .flac
    Updated Dec 8, 2023
  6. F

    Hindi Human-Human Chat Dataset for Conversational AI & NLP

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
  7. h

    Hinglish-Everyday-Conversations-1M

    • huggingface.co
    Updated Dec 1, 2024
    + more versions
  8. Hindi Children Speech Dataset – 34 Hours (Real-world Conversation &...

    • nexdata.ai
    Updated Sep 12, 2025
  9. Conversation

    • kaggle.com
    zip
    Updated Dec 28, 2023
  10. F

    Hindi Agent-Customer Chat Dataset for Delivery & Logistics

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
  11. Hindi Conversational Speech Dataset – Real support dialogues for training...

    • datarade.ai
    .wav, .flac
    Updated Dec 8, 2023
  12. F

    Hindi Agent-Customer Chat Dataset for Retail & E-Commerce

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
  13. h

    hindi-speech-recognition-dataset

    • huggingface.co
    Updated Aug 1, 2025
    + more versions
  14. s

    AI-Ready Hindi Speech Datasets – TTS, Podcasts & More

    • shaip.com
    Updated Mar 22, 2023
    + more versions
  15. Chatbot Conversation Dataset [English & Hindi] – Real customer service audio...

    • datarade.ai
    .wav, .flac
    Updated Dec 8, 2023
  16. 1003 Hours - Hindi Speech Dataset (Spontaneous Conversation)

    • nexdata.ai
    Updated Nov 11, 2023
  17. F

    Hindi Agent-Customer Chat Dataset for Telecom

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
  18. D

    Hindi Call Center

    • defined.ai
    Updated Jan 2, 2026
    + more versions
  19. F

    Hindi Agent-Customer Chat Dataset for Travel

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
  20. Hindi Banking Speech Dataset

    • kaggle.com
    zip
    Updated Jul 16, 2025
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Unidata (2026). Hindi Speech Recognition Dataset [Dataset]. https://www.kaggle.com/datasets/unidpro/hindi-speech-recognition-dataset
Organization logo

Hindi Speech Recognition Dataset

Dataset comprises 760 hours of telephone dialogues in Hindi

Explore at:
zip(291 bytes)Available download formats
Dataset updated
Jan 21, 2026
Authors
Unidata
License

Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically

Description

Hindi Speech Dataset for recognition task

Dataset comprises 760 hours of telephone dialogues in Hindi, collected from 1,000+ native speakers across various topics and domains. This dataset boasts an impressive 95% sentence accuracy rate, making it a valuable resource for advancing speech recognition technology.

By utilizing this dataset, researchers and developers can advance their understanding and capabilities in automatic speech recognition (ASR) systems, transcribing audio, and natural language processing (NLP). - Get the data

The dataset includes high-quality audio recordings with text transcriptions, making it ideal for training and evaluating speech recognition models.

💵 Buy the Dataset: This is a limited preview of the data. To access the full dataset, please contact us at https://unidata.pro to discuss your requirements and pricing options.

Metadata for the dataset

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22059654%2Fa3f375fb273dcad3fe17403bdfccb63b%2Fssssssssss.PNG?generation=1739884059328284&alt=media" alt=""> - Audio files: High-quality recordings in WAV format - Text transcriptions: Accurate and detailed transcripts for each audio segment - Speaker information: Metadata on native speakers, including gender and etc - Topics: Diverse domains such as general conversations, business and etc

This dataset is essential for anyone looking to improve speech recognition technology and develop more effective automatic speech systems.

🌐 UniData provides high-quality datasets, content moderation, data collection and annotation for your AI/ML projects

Search
Clear search
Close search
Google apps
Main menu