YFCC15M Recaption Dataset
This YFCC15M Dataset is filtered by DeCLIP and recaptioned utilize the diverse description generation framework proposed in RWKV-CLIP. The text is a list of text tokens with a length of 77, encoded using the CLIP tokenizer. You can use from clip.simple_tokenizer import SimpleTokenizer as _Tokenizer to decode it back into the original text.
Using Dataset
You can easily download and use the arxiver dataset with Hugging Face's datasets library.… See the full description on the dataset page: https://huggingface.co/datasets/Kaichengalex/YFCC15M.
5CD-AI/Vietnamese-yfcc15m-OpenAICLIP dataset hosted on Hugging Face and contributed by the HF Datasets community
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset description
Recaptioned YFCC15M by MiniCPM-Llama3-V-2_5.
Uses
See https://github.com/MIV-XJTU/FLAME.
Citation
@article{cao2024flame, title={FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training}, author={Cao, Anjia and Wei, Xing and Ma, Zhiheng}, journal={arXiv preprint arXiv:2411.11927}, year={2024} }
@article{yao2024minicpmv, title={MiniCPM-V: A GPT-4V Level MLLM on Your Phone}, author={Yao, Yuan… See the full description on the dataset page: https://huggingface.co/datasets/caj/FLAME-ReCap-YFCC15M-MiniCPM-Llama3-V-2_5.
yxchng/cc15m_yfcc15m dataset hosted on Hugging Face and contributed by the HF Datasets community
Not seeing a result you expected?
Learn how you can add new datasets to our index.
YFCC15M Recaption Dataset
This YFCC15M Dataset is filtered by DeCLIP and recaptioned utilize the diverse description generation framework proposed in RWKV-CLIP. The text is a list of text tokens with a length of 77, encoded using the CLIP tokenizer. You can use from clip.simple_tokenizer import SimpleTokenizer as _Tokenizer to decode it back into the original text.
Using Dataset
You can easily download and use the arxiver dataset with Hugging Face's datasets library.… See the full description on the dataset page: https://huggingface.co/datasets/Kaichengalex/YFCC15M.