2 datasets found

h
Viet-OCR-VQA
huggingface.co
Updated Jul 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fifth Civil Defender - 5CD (2024). Viet-OCR-VQA [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA
Explore at:
Dataset updated
Jul 13, 2024
Dataset authored and provided by
Fifth Civil Defender - 5CD
Area covered
Vietnam
Description
Dataset Overview

The dataset comprises over 137,000 images potentially containing Vietnamese 🇻🇳 textual content. It was curated using the Gemini 1.5 Flash model, currently Google model leading on the WildVision Arena Leaderboard for Visual Question Answering (VQA). Each image is accompanied by a detailed description and 5 self-generated questions and answers related to the textual content within the image. In total, there are more than 822,679 individual questions, encompassing… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA.
h
textvqa
huggingface.co
live.european-language-grid.eu
Updated Jun 30, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AI at Meta (2022). textvqa [Dataset]. https://huggingface.co/datasets/facebook/textvqa
Explore at:
Dataset updated
Jun 30, 2022
Dataset authored and provided by
AI at Meta
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions. TextVQA dataset contains 45,336 questions over 28,408 images from the OpenImages dataset.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Fifth Civil Defender - 5CD (2024). Viet-OCR-VQA [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA

Viet-OCR-VQA

5CD-AI/Viet-OCR-VQA

Explore at:

Dataset updated

Jul 13, 2024

Dataset authored and provided by

Fifth Civil Defender - 5CD

Area covered

Vietnam

Description

Dataset Overview

The dataset comprises over 137,000 images potentially containing Vietnamese 🇻🇳 textual content. It was curated using the Gemini 1.5 Flash model, currently Google model leading on the WildVision Arena Leaderboard for Visual Question Answering (VQA). Each image is accompanied by a detailed description and 5 self-generated questions and answers related to the textual content within the image. In total, there are more than 822,679 individual questions, encompassing… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA.

Clear search

Close search

Google apps

Main menu

Viet-OCR-VQA

textvqa

Viet-OCR-VQA

5CD-AI/Viet-OCR-VQA