2 datasets found
  1. SwitchLingua_text

    • huggingface.co
    Updated May 28, 2025
    Cite
    Peng Xie (2025). SwitchLingua_text [Dataset]. https://huggingface.co/datasets/Shelton1013/SwitchLingua_text
    Authors
    Peng Xie
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Dataset Card for SwitchLingua_text

      Dataset Summary
    

    SwitchLingua is a comprehensive multilingual and multicultural code-switching dataset designed to advance research in automatic speech recognition, natural language processing, and conversational AI. The textual data for SwitchLingua was first generated using the proposed LinguaMaster framework, and the audio data was recorded by 174 bilingual speakers from diverse linguistic and cultural backgrounds to ensure high… See the full description on the dataset page: https://huggingface.co/datasets/Shelton1013/SwitchLingua_text.

  2. SwitchLingua_audio

    • huggingface.co
    Updated May 13, 2025
    Cite
    Peng Xie (2025). SwitchLingua_audio [Dataset]. https://huggingface.co/datasets/Shelton1013/SwitchLingua_audio
    Authors
    Peng Xie
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Dataset Card for SwitchLingua_text

      Dataset Summary
    

    SwitchLingua is a comprehensive multilingual and multicultural code-switching dataset designed to advance research in automatic speech recognition, natural language processing, and conversational AI. The textual data for SwitchLingua was first generated using the proposed LinguaMaster framework, and the audio data was recorded by 174 bilingual speakers from diverse linguistic and cultural backgrounds to ensure high… See the full description on the dataset page: https://huggingface.co/datasets/Shelton1013/SwitchLingua_audio.
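
    The two repositories listed above can be addressed by their Hugging Face repository IDs. The following is a minimal sketch, not taken from the dataset card: the names REPO_IDS, dataset_page_url, and load_switchlingua are hypothetical, and it assumes each repository loads with the `datasets` library without a config name or other extra arguments. Both entries are licensed CC BY-NC 4.0, so this applies to non-commercial use only.

```python
# Repository IDs copied from the citation URLs in the listing above.
REPO_IDS = {
    "text": "Shelton1013/SwitchLingua_text",
    "audio": "Shelton1013/SwitchLingua_audio",
}


def dataset_page_url(repo_id: str) -> str:
    """Return the Hugging Face dataset page URL for a repository ID."""
    return f"https://huggingface.co/datasets/{repo_id}"


def load_switchlingua(modality: str = "text"):
    """Load one SwitchLingua repository (hypothetical helper).

    Assumption: the repository loads without a config name or split
    argument; consult the dataset page if it does not.
    """
    # Imported lazily so the URL helper works without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_IDS[modality])


if __name__ == "__main__":
    # Print the dataset page for each modality; no network access needed.
    for name, repo_id in REPO_IDS.items():
        print(name, dataset_page_url(repo_id))
```

    Calling load_switchlingua("text") downloads data and needs network access, so the sketch only prints the page URLs when run directly.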



SwitchLingua_text

Shelton1013/SwitchLingua_text

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset

