https://choosealicense.com/licenses/openrail/https://choosealicense.com/licenses/openrail/
MAP-CC
π Homepage | π€ MAP-CC | π€ CHC-Bench | π€ CT-LLM | π arXiv | GitHub An open-source Chinese pretraining dataset with a scale of 800 billion tokens, offering the NLP community high-quality Chinese pretraining data.
Disclaimer
This model, developed for academic purposes, employs rigorously compliance-checked training data to uphold the highest standards of integrity and compliance. Despite our efforts, the inherent complexities of data and the broad⦠See the full description on the dataset page: https://huggingface.co/datasets/Eric-Valyu/Test-Prompt.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
MAP-CC
π Homepage | π€ MAP-CC | π€ CHC-Bench | π€ CT-LLM | π arXiv | GitHub An open-source Chinese pretraining dataset with a scale of 800 billion tokens, offering the NLP community high-quality Chinese pretraining data.
Disclaimer
This model, developed for academic purposes, employs rigorously compliance-checked training data to uphold the highest standards of integrity and compliance. Despite our efforts, the inherent complexities of data and the broad spectrum of⦠See the full description on the dataset page: https://huggingface.co/datasets/m-a-p/MAP-CC.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
https://choosealicense.com/licenses/openrail/https://choosealicense.com/licenses/openrail/
MAP-CC
π Homepage | π€ MAP-CC | π€ CHC-Bench | π€ CT-LLM | π arXiv | GitHub An open-source Chinese pretraining dataset with a scale of 800 billion tokens, offering the NLP community high-quality Chinese pretraining data.
Disclaimer
This model, developed for academic purposes, employs rigorously compliance-checked training data to uphold the highest standards of integrity and compliance. Despite our efforts, the inherent complexities of data and the broad⦠See the full description on the dataset page: https://huggingface.co/datasets/Eric-Valyu/Test-Prompt.