2 datasets found

SwitchLingua_text
huggingface.co
Updated May 28, 2025
Cite
Peng Xie (2025). SwitchLingua_text [Dataset]. https://huggingface.co/datasets/Shelton1013/SwitchLingua_text
Authors
Peng Xie
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Dataset Card for SwitchLingua_text

Dataset Summary

SwitchLingua is a comprehensive multilingual and multicultural code-switching dataset designed to advance research in automatic speech recognition, natural language processing, and conversational AI. The textual data for SwitchLingua was first generated using the proposed LinguaMaster framework, and the audio data was recorded by 174 bilingual speakers from diverse linguistic and cultural backgrounds to ensure high… See the full description on the dataset page: https://huggingface.co/datasets/Shelton1013/SwitchLingua_text.
SwitchLingua_audio
huggingface.co
Updated May 13, 2025
Cite
Peng Xie (2025). SwitchLingua_audio [Dataset]. https://huggingface.co/datasets/Shelton1013/SwitchLingua_audio
Authors
Peng Xie
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Dataset Card for SwitchLingua_text

Dataset Summary

SwitchLingua is a comprehensive multilingual and multicultural code-switching dataset designed to advance research in automatic speech recognition, natural language processing, and conversational AI. The textual data for SwitchLingua was first generated using the proposed LinguaMaster framework, and the audio data was recorded by 174 bilingual speakers from diverse linguistic and cultural backgrounds to ensure high… See the full description on the dataset page: https://huggingface.co/datasets/Shelton1013/SwitchLingua_audio.


Shelton1013/SwitchLingua_text

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset
