10 datasets found
  1. LLaVA-CoT-o1-Instruct

    • huggingface.co
    Updated Feb 6, 2025
    Cite
    Fahrizal Farid (2025). LLaVA-CoT-o1-Instruct [Dataset]. https://huggingface.co/datasets/mamangracing/LLaVA-CoT-o1-Instruct
    Dataset updated
    Feb 6, 2025
    Authors
    Fahrizal Farid
    Description

    Example 1

      Input:
    

    Please answer the question below, explaining your reasoning step by step before providing the final answer. Question: Are there enough straws for every cup ? A. yes B. no

      Output:
    

    The question asks whether there are enough straws to provide one for each cup depicted in an image. To answer, we need to count the number of straws and cups separately and then compare those quantities. The image shows three… See the full description on the dataset page: https://huggingface.co/datasets/mamangracing/LLaVA-CoT-o1-Instruct.

  2. LLaVA-CoT-o1-eCoT-old

    • huggingface.co
    Updated Apr 27, 2025
    Cite
    Abdul Waheed (2025). LLaVA-CoT-o1-eCoT-old [Dataset]. https://huggingface.co/datasets/macabdul9/LLaVA-CoT-o1-eCoT-old
    Dataset updated
    Apr 27, 2025
    Authors
    Abdul Waheed
    Description

    macabdul9/LLaVA-CoT-o1-eCoT-old dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. llava-cot-20k-docvqa-chartqa

    • huggingface.co
    Updated May 11, 2025
    Cite
    MLP-VLM2 (2025). llava-cot-20k-docvqa-chartqa [Dataset]. https://huggingface.co/datasets/MLP-VLM2/llava-cot-20k-docvqa-chartqa
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    MLP-VLM2
    Description

    MLP-VLM2/llava-cot-20k-docvqa-chartqa dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. pisc-tr

    • huggingface.co
    Updated Dec 9, 2024
    Cite
    Berhan Türkü Ay (2024). pisc-tr [Dataset]. https://huggingface.co/datasets/berhaan/pisc-tr
    Dataset updated
    Dec 9, 2024
    Authors
    Berhan Türkü Ay
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset Card for CoT

      Dataset Sources
    

    Repository: LLaVA-CoT GitHub Repository
    Paper: LLaVA-CoT on arXiv

      Dataset Structure
    

    cat image.zip.part-* > image.zip  # not uploaded yet
    unzip image.zip

    The train.jsonl file contains the question-answering data and is structured in the following format:

    {
      "id": "example_id",
      "image": "example_image_path",
      "conversations": [
        {"from": "human", "value": "Lütfen resimdeki kırmızı metal nesnelerin sayısını belirtin."}…

    See the full description on the dataset page: https://huggingface.co/datasets/berhaan/pisc-tr.
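The reassembly step in this card (concatenate the `image.zip.part-*` files, then unzip) can also be done without a shell. The following is a minimal Python sketch, not the dataset author's tooling: the helper name `join_parts` and the part suffixes `aa`/`ab` are assumptions made for illustration, and the demo builds its own tiny archive because the card notes the real parts are not uploaded yet. It also assumes that sorting the part names alphabetically reproduces the order the shell glob would use.

```python
import glob
import shutil
import tempfile
import zipfile
from pathlib import Path


def join_parts(part_glob, out_path):
    """Concatenate files matching part_glob, in sorted name order, into out_path."""
    parts = sorted(glob.glob(part_glob))
    if not parts:
        raise FileNotFoundError(f"no files match {part_glob!r}")
    with open(out_path, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                shutil.copyfileobj(src, out)
    return parts


# Demo on a throwaway archive (stand-in data, not the dataset's images).
work = Path(tempfile.mkdtemp())
original = work / "original.zip"
with zipfile.ZipFile(original, "w") as zf:
    zf.writestr("images/a.txt", "stand-in for an image file")

# Split the archive into two hypothetical parts: part-aa and part-ab.
data = original.read_bytes()
half = len(data) // 2
(work / "image.zip.part-aa").write_bytes(data[:half])
(work / "image.zip.part-ab").write_bytes(data[half:])

# Equivalent of: cat image.zip.part-* > image.zip && unzip image.zip
joined = work / "image.zip"
parts = join_parts(str(work / "image.zip.part-*"), joined)
with zipfile.ZipFile(joined) as zf:
    zf.extractall(work / "extracted")
restored = (work / "extracted" / "images" / "a.txt").read_text()
```

The output name `image.zip` does not match the `image.zip.part-*` pattern, so the joined file is never picked up as one of its own inputs.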

  5. clevr-tr

    • huggingface.co
    Updated Dec 7, 2024
    Cite
    Berhan Türkü Ay (2024). clevr-tr [Dataset]. https://huggingface.co/datasets/berhaan/clevr-tr
    Dataset updated
    Dec 7, 2024
    Authors
    Berhan Türkü Ay
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset Card for CoT

      Dataset Sources
    

    Repository: LLaVA-CoT GitHub Repository
    Paper: LLaVA-CoT on arXiv

      Dataset Structure
    

    unzip image.zip

    The train.jsonl file contains the question-answering data and is structured in the following format:

    {
      "id": "example_id",
      "image": "example_image_path",
      "conversations": [
        {"from": "human", "value": "Lütfen resimdeki kırmızı metal nesnelerin sayısını belirtin."},
        {"from": "gpt", "value": "Resimde 3 kırmızı…

    See the full description on the dataset page: https://huggingface.co/datasets/berhaan/clevr-tr.
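To make the record layout concrete, here is a minimal Python sketch that reads JSONL records of this shape and pairs each human turn with the reply that follows it. It is an illustration under assumptions, not code from the dataset card: the helper names `iter_records` and `to_turn_pairs` are hypothetical, and the `gpt` value is a placeholder because the answer is truncated in the excerpt above. The human prompt is the Turkish sample from the card ("Please state the number of red metal objects in the image.").

```python
import io
import json

# Hypothetical sample mirroring the record layout described above; the
# "gpt" value is a placeholder, not text taken from the dataset.
SAMPLE_JSONL = (
    '{"id": "example_id", "image": "example_image_path", "conversations": ['
    '{"from": "human", "value": "Lütfen resimdeki kırmızı metal nesnelerin sayısını belirtin."}, '
    '{"from": "gpt", "value": "<placeholder answer>"}]}\n'
)


def iter_records(lines):
    """Yield one dict per non-empty JSONL line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def to_turn_pairs(record):
    """Pair each human turn with the gpt turn that immediately follows it."""
    turns = record["conversations"]
    pairs = []
    for prev, curr in zip(turns, turns[1:]):
        if prev["from"] == "human" and curr["from"] == "gpt":
            pairs.append((prev["value"], curr["value"]))
    return pairs


records = list(iter_records(io.StringIO(SAMPLE_JSONL)))
pairs = to_turn_pairs(records[0])
```

With a real file, the same helpers would be used as `with open("train.jsonl", encoding="utf-8") as f: records = list(iter_records(f))`.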

  6. Viet-LLaVA-CoT-o1-Instruct

    • huggingface.co
    Updated May 27, 2025
    Cite
    Fifth Civil Defender - 5CD (2025). Viet-LLaVA-CoT-o1-Instruct [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-LLaVA-CoT-o1-Instruct
    Dataset updated
    May 27, 2025
    Dataset authored and provided by
    Fifth Civil Defender - 5CD
    Description

    5CD-AI/Viet-LLaVA-CoT-o1-Instruct dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. WeThink-Multimodal-Reasoning-120K

    • huggingface.co
    Cite
    WeThink, WeThink-Multimodal-Reasoning-120K [Dataset]. https://huggingface.co/datasets/WeThink/WeThink-Multimodal-Reasoning-120K
    Authors
    WeThink
    Description

    WeThink-Multimodal-Reasoning-120K

      Image Type
    

    Image data can be accessed from https://huggingface.co/datasets/Xkev/LLaVA-CoT-100k

    Image Type              Source Dataset  Images
    General Images          COCO            25,344
                            SAM-1B          18,091
                            Visual Genome   4,441
                            GQA             3,251
                            PISC            835
                            LLaVA           134
    Text-Intensive Images   TextVQA         25,483
                            ShareTextVQA    538
                            DocVQA          4,709
                            OCR-VQA         5,142
                            ChartQA         21,781
    Scientific & Technical  GeoQA+          4,813
                            ScienceQA       4,990
                            AI2D            1,812
                            CLEVR-Math      677…

    See the full description on the dataset page: https://huggingface.co/datasets/WeThink/WeThink-Multimodal-Reasoning-120K.

  8. llavacot-r1-RL

    • huggingface.co
    Updated Apr 14, 2025
    Cite
    Ahmed Heakl (2025). llavacot-r1-RL [Dataset]. https://huggingface.co/datasets/ahmedheakl/llavacot-r1-RL
    Dataset updated
    Apr 14, 2025
    Authors
    Ahmed Heakl
    Description

    ahmedheakl/llavacot-r1-RL dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. llavacot-think

    • huggingface.co
    Updated Apr 14, 2025
    Cite
    Ahmed Heakl (2025). llavacot-think [Dataset]. https://huggingface.co/datasets/ahmedheakl/llavacot-think
    Dataset updated
    Apr 14, 2025
    Authors
    Ahmed Heakl
    Description

    ahmedheakl/llavacot-think dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. RESA-CoT-data

    • huggingface.co
    Cite
    Yifan Wang, RESA-CoT-data [Dataset]. https://huggingface.co/datasets/yfwang22/RESA-CoT-data
    Authors
    Yifan Wang
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    RESA-CoT Dataset

    The RESA-CoT dataset is a multimodal dataset designed for large language model alignment and reasoning research. It consists of image-conversation pairs in LLaVA format, enhanced with Chain-of-Thought (CoT) style reasoning to improve interpretability and alignment.

      Dataset Versions
    

    RESA

    Based on VLGuard data. Augmented using GPT-4o to generate CoT-style conversations.

    RESA-mix

    Combines RESA with 10K LLaVA-NEXT samples. Also enhanced with CoT-style… See the full description on the dataset page: https://huggingface.co/datasets/yfwang22/RESA-CoT-data.

