3 datasets found

h
Viet-OCR-VQA
huggingface.co
Updated Jul 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fifth Civil Defender - 5CD (2024). Viet-OCR-VQA [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA
Explore at:
Dataset updated
Jul 13, 2024
Dataset authored and provided by
Fifth Civil Defender - 5CD
Area covered
Việt Nam
Description
Dataset Overview

The dataset comprises over 137,000 images potentially containing Vietnamese 🇻🇳 textual content. It was curated using the Gemini 1.5 Flash model, currently Google model leading on the WildVision Arena Leaderboard for Visual Question Answering (VQA). Each image is accompanied by a detailed description and 5 self-generated questions and answers related to the textual content within the image. In total, there are more than 822,679 individual questions, encompassing… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA.
h
textvqa
huggingface.co
live.european-language-grid.eu
Updated May 23, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AI at Meta (2024). textvqa [Dataset]. https://huggingface.co/datasets/facebook/textvqa
Explore at:
Dataset updated
May 23, 2024
Dataset authored and provided by
AI at Meta
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions. TextVQA dataset contains 45,336 questions over 28,408 images from the OpenImages dataset.
h
Viet-Receipt-VQA
huggingface.co
Updated Oct 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fifth Civil Defender - 5CD (2024). Viet-Receipt-VQA [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-Receipt-VQA
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 14, 2024
Dataset authored and provided by
Fifth Civil Defender - 5CD
Area covered
Việt Nam
Description
Dataset Overview

This dataset is was collected from 2034 Vietnamese 🇻🇳 Receipts MC-OCR 2021 [1]. Each receipt has been analyzed and annotated using advanced Visual Question Answering (VQA) techniques to produce a comprehensive dataset. There is a set of 14,238 detailed descriptions, key information extraction (KIE), and query-based questions and answers generated by the Gemini 1.5 Flash model, currently Google's leading model on the WildVision Arena Leaderboard. This results in a… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-Receipt-VQA.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Fifth Civil Defender - 5CD (2024). Viet-OCR-VQA [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA

Viet-OCR-VQA

5CD-AI/Viet-OCR-VQA

Explore at:

Dataset updated

Jul 13, 2024

Dataset authored and provided by

Fifth Civil Defender - 5CD

Area covered

Việt Nam

Description

Dataset Overview

The dataset comprises over 137,000 images potentially containing Vietnamese 🇻🇳 textual content. It was curated using the Gemini 1.5 Flash model, currently Google model leading on the WildVision Arena Leaderboard for Visual Question Answering (VQA). Each image is accompanied by a detailed description and 5 self-generated questions and answers related to the textual content within the image. In total, there are more than 822,679 individual questions, encompassing… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-OCR-VQA.

Clear search

Close search

Google apps

Main menu

Viet-OCR-VQA

textvqa

Viet-Receipt-VQA

Viet-OCR-VQA

5CD-AI/Viet-OCR-VQA