2 datasets found
  1. Viet-OCR-VQA

    • huggingface.co
    Updated Jul 13, 2024
    Cite
    Fifth Civil Defender - 5CD (2024). Viet-OCR-VQA [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA
    Dataset updated
    Jul 13, 2024
    Dataset authored and provided by
    Fifth Civil Defender - 5CD
    Area covered
    Vietnam
    Description

    Dataset Overview

    The dataset comprises over 137,000 images that may contain Vietnamese 🇻🇳 text. It was curated using the Gemini 1.5 Flash model, currently the leading Google model on the WildVision Arena Leaderboard for Visual Question Answering (VQA). Each image is accompanied by a detailed description and 5 self-generated questions and answers about the text in the image. In total, there are more than 822,679 individual questions, encompassing… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA.

  2. textvqa

    • huggingface.co
    • live.european-language-grid.eu
    Updated Jun 30, 2022
    Cite
    AI at Meta (2022). textvqa [Dataset]. https://huggingface.co/datasets/facebook/textvqa
    Dataset updated
    Jun 30, 2022
    Dataset authored and provided by
    AI at Meta
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality, the text present in the images, and reason over it to answer TextVQA questions. The TextVQA dataset contains 45,336 questions over 28,408 images from the OpenImages dataset.

