DocVQA consists of 50,000 questions defined on 12,000+ document images.
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
Large-scale Multi-modality Models Evaluation Suite
Accelerating the development of large-scale multi-modality models (LMMs) with lmms-eval
🏠 Homepage | 📚 Documentation | 🤗 Huggingface Datasets
This Dataset
This is a formatted version of DocVQA. It is used in our lmms-eval pipeline to allow for one-click evaluations of large multi-modality models. @article{mathew2020docvqa, title={DocVQA: A Dataset for VQA on Document Images. CoRR abs/2007.00398 (2020)}… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/DocVQA.
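As a quick orientation, here is a minimal sketch of loading this formatted version with the datasets library. The config name, split name, and the "question"/"answers" field names are assumptions to verify against the dataset viewer, not confirmed by the card.

```python
from datasets import load_dataset

# Assumed config and split names; check the dataset viewer for the exact ones.
ds = load_dataset("lmms-lab/DocVQA", "DocVQA", split="validation")

sample = ds[0]
print(sample["question"])  # natural-language question about the document image (assumed field)
print(sample["answers"])   # list of accepted ground-truth answers (assumed field)
```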
The dataset is aimed at Visual Question Answering on multi-page industry scanned documents. The questions and answers are reused from the Single Page DocVQA (SP-DocVQA) dataset. The images correspond to the same documents as in the original dataset, extended with the preceding and following pages, up to a limit of 20 pages per document.
lmms-lab/MP-DocVQA dataset hosted on Hugging Face and contributed by the HF Datasets community
MIT License (https://opensource.org/licenses/MIT)
Dataset Description
This is the test set taken from the DocVQA dataset. It contains images collected from the UCSF Industry Documents Library; questions and answers were manually annotated.
Data Curation
To ensure homogeneity across our benchmarked datasets, we subsampled the original test set to 500 pairs and renamed its columns.
Load the dataset
from datasets import load_dataset ds =… See the full description on the dataset page: https://huggingface.co/datasets/vidore/docvqa_test_subsampled.
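The load snippet above is cut off on the card; below is a minimal completion, under the assumption that the 500 subsampled pairs live in a single test split.

```python
from datasets import load_dataset

# A single "test" split is an assumption; the card states 500 subsampled pairs.
ds = load_dataset("vidore/docvqa_test_subsampled", split="test")

print(len(ds))       # expected: 500 question-answer pairs
print(ds[0].keys())  # inspect the renamed columns
```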
MIT License (https://opensource.org/licenses/MIT)
Dataset Card for DocVQA Dataset
Dataset Summary
The DocVQA dataset, introduced in Mathew et al. (2021), consists of 50,000 questions defined on 12,000+ document images. Please visit the challenge page (https://rrc.cvc.uab.es/?ch=17) and the paper (https://arxiv.org/abs/2007.00398) for further information.
Usage
This dataset can be used with current releases of the Hugging Face datasets library. Here is an example using a custom collator to bundle… See the full description on the dataset page: https://huggingface.co/datasets/pixparse/docvqa-single-page-questions.
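The collator example is truncated on the card; the sketch below shows one plausible shape for it. The split name and the "image"/"question" column names are assumptions, since the card text is cut off.

```python
from datasets import load_dataset
from torch.utils.data import DataLoader

ds = load_dataset("pixparse/docvqa-single-page-questions", split="train")  # split name assumed

# Hypothetical collator: PIL images cannot be stacked into a tensor
# directly, so keep them in a list and batch the question strings alongside.
def collate_fn(batch):
    return {
        "images": [ex["image"] for ex in batch],        # assumed column name
        "questions": [ex["question"] for ex in batch],  # assumed column name
    }

loader = DataLoader(ds, batch_size=8, collate_fn=collate_fn)
batch = next(iter(loader))
print(len(batch["images"]), batch["questions"][0])
```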
vikhyatk/docvqa-val dataset hosted on Hugging Face and contributed by the HF Datasets community
https://rrc.cvc.uab.es/?ch=17&com=downloads
Document Visual Question Answering (DocVQA) seeks to inspire a “purpose-driven” point of view in Document Analysis and Recognition research, where the document content is extracted and used to respond to high-level tasks defined by the human consumers of this information. To this end, we organize a series of challenges and release datasets to enable machines to "understand" document images and thereby answer questions asked of them. There are 50K questions and 12K images in the dataset. Images are collected from the UCSF Industry Documents Library; questions and answers are manually annotated.
Dataset Description
This is a VQA dataset based on industrial documents, taken from the MP-DocVQA dataset.
Load the dataset
```python
from datasets import load_dataset
import csv

def load_beir_qrels(qrels_file):
    """Read BEIR-style relevance judgments from a TSV file."""
    qrels = {}
    with open(qrels_file) as f:
        tsvreader = csv.DictReader(f, delimiter="\t")
        for row in tsvreader:
            qid = row["query-id"]
            pid = row["corpus-id"]
            rel = int(row["score"])
            # The original snippet is truncated after this condition; the
            # branches below follow the standard BEIR qrels-loading pattern.
            if qid in qrels:
                qrels[qid][pid] = rel
            else:
                qrels[qid] = {pid: rel}
    return qrels
```

… See the full description on the dataset page: https://huggingface.co/datasets/openbmb/VisRAG-Ret-Test-MP-DocVQA.
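A short usage sketch for the helper above, assuming a BEIR-style qrels TSV with query-id / corpus-id / score columns; the file name here is hypothetical.

```python
# Hypothetical file name; the actual qrels TSV ships with the dataset repo.
qrels = load_beir_qrels("mp-docvqa-qrels.tsv")

# qrels maps each query id to {passage id: relevance score},
# which is the shape retrieval metrics such as nDCG@k expect.
for qid, judged in list(qrels.items())[:3]:
    print(qid, judged)
```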
The dataset used for testing the Vary-base model, combining the DocVQA and ChartQA datasets.
InfographicVQA is a dataset that comprises a diverse collection of infographics along with natural language questions and answers annotations. The collected questions require methods to jointly reason over the document layout, textual content, graphical elements, and data visualizations. We curate the dataset with emphasis on questions that require elementary reasoning and basic arithmetic skills.
TextVQA is a dataset to benchmark visual reasoning based on text in images. TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions.
Statistics
- 28,408 images from OpenImages
- 45,336 questions
- 453,360 ground truth answers
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
Dataset description
The doc-vqa dataset integrates images from the Infographic_vqa dataset, sourced from HuggingFaceM4's The Cauldron, as well as images from the AFTDB (Arxiv Figure Table Database) dataset curated by cmarkea. It consists of pairs of images and corresponding text, with each image linked to an average of five questions and answers available in both English and French. These questions and answers were generated using Gemini 1.5 Pro, thereby… See the full description on the dataset page: https://huggingface.co/datasets/cmarkea/doc-vqa.
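Given the card's description (each image paired with roughly five bilingual QA pairs), here is a loading sketch; the split name and the column layout shown ("qa" with "en"/"fr" keys) are assumptions to verify in the dataset viewer.

```python
from datasets import load_dataset

ds = load_dataset("cmarkea/doc-vqa", split="train")  # split name assumed

sample = ds[0]
print(sample["qa"]["en"][0])  # first English question-answer pair (assumed schema)
print(sample["qa"]["fr"][0])  # its French counterpart (assumed schema)
```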
RIPS-Goog-23/DocVQA dataset hosted on Hugging Face and contributed by the HF Datasets community
plaguss/docvqa-test dataset hosted on Hugging Face and contributed by the HF Datasets community
HuggingFaceM4/DocumentVQA dataset hosted on Hugging Face and contributed by the HF Datasets community
Dataset Card for "docvqa"
More information needed.
Disclaimer
This dataset may contain publicly available images or text data. All data is provided for research and educational purposes only. If you are the rights holder of any content and have concerns regarding intellectual property or copyright, please contact us at "support-data (at) jina.ai" for removal. We do not collect or process personal, sensitive, or private information intentionally. If you believe this dataset includes such content (e.g., portraits, location-linked… See the full description on the dataset page: https://huggingface.co/datasets/jinaai/docvqa.
MIT License (https://opensource.org/licenses/MIT)
llamastack/docVQA dataset hosted on Hugging Face and contributed by the HF Datasets community