54 datasets found
  1. h

    DocVQA

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LMMs-Lab, DocVQA [Dataset]. https://huggingface.co/datasets/lmms-lab/DocVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    LMMs-Lab
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Large-scale Multi-modality Models Evaluation Suite

    Accelerating the development of large-scale multi-modality models (LMMs) with lmms-eval

    🏠 Homepage | 📚 Documentation | 🤗 Huggingface Datasets

      This Dataset
    

    This is a formatted version of DocVQA. It is used in our lmms-eval pipeline to allow for one-click evaluations of large multi-modality models. @article{mathew2020docvqa, title={DocVQA: A Dataset for VQA on Document Images. CoRR abs/2007.00398 (2020)}… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/DocVQA.

  2. h

    docvqa

    • huggingface.co
    Updated Jun 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eliott Zemour (2023). docvqa [Dataset]. https://huggingface.co/datasets/eliolio/docvqa
    Explore at:
    Dataset updated
    Jun 12, 2023
    Authors
    Eliott Zemour
    Description

    DocVQA: A Dataset for VQA on Document Images

    The DocVQA dataset can be downloaded from the challenge page in RRC portal ("Downloads" tab).

      Dataset Structure
    

    The DocVQA comprises 50, 000 questions framed on 12,767 images. The data is split randomly in an 80−10−10 ratio to train, validation and test splits.

    Train split: 39,463 questions, 10,194 images Validation split: 5,349 questions and 1,286 images Test split has 5,188 questions and 1,287 images.

      Resources and… See the full description on the dataset page: https://huggingface.co/datasets/eliolio/docvqa.
    
  3. h

    mp-docvqa

    • huggingface.co
    Updated Feb 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rubèn Tito (2023). mp-docvqa [Dataset]. https://huggingface.co/datasets/rubentito/mp-docvqa
    Explore at:
    Dataset updated
    Feb 22, 2023
    Authors
    Rubèn Tito
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for Multipage Document Visual Question Answering (MP-DocVQA)

      Dataset Summary
    

    The dataset is aimed to perform Visual Question Answering on multipage industry scanned documents. The questions and answers are reused from Single Page DocVQA (SP-DocVQA) dataset. The images also corresponds to the same in original dataset with previous and posterior pages with a limit of up to 20 pages per document.

      Download the Dataset
    

    The dataset is not integrated with… See the full description on the dataset page: https://huggingface.co/datasets/rubentito/mp-docvqa.

  4. h

    docvqa-single-page-questions

    • huggingface.co
    Updated Mar 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pixel Parsing (2024). docvqa-single-page-questions [Dataset]. https://huggingface.co/datasets/pixparse/docvqa-single-page-questions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 29, 2024
    Dataset authored and provided by
    Pixel Parsing
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for DocVQA Dataset

      Dataset Summary
    

    DocVQA dataset is a document dataset introduced in Mathew et al. (2021) consisting of 50,000 questions defined on 12,000+ document images. Please visit the challenge page (https://rrc.cvc.uab.es/?ch=17) and paper (https://arxiv.org/abs/2007.00398) for further information.

      Usage
    

    This dataset can be used with current releases of Hugging Face datasets library. Here is an example using a custom collator to bundle… See the full description on the dataset page: https://huggingface.co/datasets/pixparse/docvqa-single-page-questions.

  5. h

    MP-DocVQA

    • huggingface.co
    Updated Oct 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LMMs-Lab (2024). MP-DocVQA [Dataset]. https://huggingface.co/datasets/lmms-lab/MP-DocVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 4, 2024
    Dataset authored and provided by
    LMMs-Lab
    Description

    lmms-lab/MP-DocVQA dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    docvqa-val

    • huggingface.co
    Updated Jan 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vik Korrapati (2025). docvqa-val [Dataset]. https://huggingface.co/datasets/vikhyatk/docvqa-val
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 5, 2025
    Authors
    Vik Korrapati
    Description

    vikhyatk/docvqa-val dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. doc-vqa

    • huggingface.co
    Updated Jun 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Credit Mutuel Arkea (2024). doc-vqa [Dataset]. https://huggingface.co/datasets/cmarkea/doc-vqa
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 18, 2024
    Dataset provided by
    Crédit Mutuel Arkéa
    Authors
    Credit Mutuel Arkea
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset description

    The doc-vqa Dataset integrates images from the Infographic_vqa dataset sourced from HuggingFaceM4 The Cauldron dataset, as well as images from the dataset AFTDB (Arxiv Figure Table Database) curated by cmarkea. This dataset consists of pairs of images and corresponding text, with each image linked to an average of five questions and answers available in both English and French. These questions and answers were generated using Gemini 1.5 Pro, thereby… See the full description on the dataset page: https://huggingface.co/datasets/cmarkea/doc-vqa.

  8. h

    VisRAG-Ret-Test-MP-DocVQA

    • huggingface.co
    Updated Oct 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenBMB (2024). VisRAG-Ret-Test-MP-DocVQA [Dataset]. https://huggingface.co/datasets/openbmb/VisRAG-Ret-Test-MP-DocVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 16, 2024
    Dataset authored and provided by
    OpenBMB
    Description

    Dataset Description

    This is a VQA dataset based on Industrial Documents from MP-DocVQA dataset from MP-DocVQA.

      Load the dataset
    

    from datasets import load_dataset import csv

    def load_beir_qrels(qrels_file): qrels = {} with open(qrels_file) as f: tsvreader = csv.DictReader(f, delimiter="\t") for row in tsvreader: qid = row["query-id"] pid = row["corpus-id"] rel = int(row["score"]) if qid in qrels:… See the full description on the dataset page: https://huggingface.co/datasets/openbmb/VisRAG-Ret-Test-MP-DocVQA.

  9. h

    DocVQA

    • huggingface.co
    Updated Aug 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RIPS-Google-23 (2023). DocVQA [Dataset]. https://huggingface.co/datasets/RIPS-Goog-23/DocVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 19, 2023
    Dataset authored and provided by
    RIPS-Google-23
    Description

    RIPS-Goog-23/DocVQA dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    DocumentVQA

    • huggingface.co
    Updated May 4, 2000
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HuggingFaceM4 (2000). DocumentVQA [Dataset]. https://huggingface.co/datasets/HuggingFaceM4/DocumentVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 4, 2000
    Dataset authored and provided by
    HuggingFaceM4
    Description

    HuggingFaceM4/DocumentVQA dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. h

    docvqa

    • huggingface.co
    Updated Jul 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jina AI (2025). docvqa [Dataset]. https://huggingface.co/datasets/jinaai/docvqa
    Explore at:
    Dataset updated
    Jul 20, 2025
    Dataset authored and provided by
    Jina AI
    Description

    Creation

    This dataset is build upon the corresponding dataset from the ViDoRe Benchmark. For more information regarding the filtering please read our paper or this discussion on github.

      Disclaimer
    

    This dataset may contain publicly available images or text data. All data is provided for research and educational purposes only. If you are the rights holder of any content and have concerns regarding intellectual property or copyright, please contact us at "support-data… See the full description on the dataset page: https://huggingface.co/datasets/jinaai/docvqa.

  12. h

    DOCVQA

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vansh Agrawal, DOCVQA [Dataset]. https://huggingface.co/datasets/Slicky325/DOCVQA
    Explore at:
    Authors
    Vansh Agrawal
    Description

    Slicky325/DOCVQA dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    docvqa

    • huggingface.co
    Updated May 4, 2000
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dan Jacobellis (2000). docvqa [Dataset]. https://huggingface.co/datasets/danjacobellis/docvqa
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 4, 2000
    Authors
    Dan Jacobellis
    Description

    danjacobellis/docvqa dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    docVQA

    • huggingface.co
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Llama Stack (2025). docVQA [Dataset]. https://huggingface.co/datasets/llamastack/docVQA
    Explore at:
    Dataset updated
    Mar 14, 2025
    Dataset authored and provided by
    Llama Stack
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    llamastack/docVQA dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    DOCVQA-Contract

    • huggingface.co
    Updated Nov 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashok Poudel (2024). DOCVQA-Contract [Dataset]. https://huggingface.co/datasets/ashokpoudel/DOCVQA-Contract
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 30, 2024
    Authors
    Ashok Poudel
    Description

    ashokpoudel/DOCVQA-Contract dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    docvqa

    • huggingface.co
    Updated Aug 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Georgios Skyrianos (2025). docvqa [Dataset]. https://huggingface.co/datasets/geoskyr/docvqa
    Explore at:
    Dataset updated
    Aug 4, 2025
    Authors
    Georgios Skyrianos
    Description

    geoskyr/docvqa dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    docvqa-10k-donut

    • huggingface.co
    Updated Oct 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HF Tuner (2025). docvqa-10k-donut [Dataset]. https://huggingface.co/datasets/hf-tuner/docvqa-10k-donut
    Explore at:
    Dataset updated
    Oct 1, 2025
    Authors
    HF Tuner
    Description

    hf-tuner/docvqa-10k-donut dataset

    This dataset is created using Tommynguyen02/doc-vqa dataset using this notebook

      Dataset Summary
    

    This dataset consists of 10k grayscale images of documents with question and ground truth answer. Only one answer with lowercase letters is selected from Tommynguyen02/doc-vqa dataset in a donut specific format.

  18. h

    DocVQA

    • huggingface.co
    Updated Apr 5, 2012
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ReplugLens (2012). DocVQA [Dataset]. https://huggingface.co/datasets/ReplugLens/DocVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 5, 2012
    Dataset authored and provided by
    ReplugLens
    Description

    ReplugLens/DocVQA dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    docvqa-train

    • huggingface.co
    Updated Jun 30, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    s076923 (2024). docvqa-train [Dataset]. https://huggingface.co/datasets/s076923/docvqa-train
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 30, 2024
    Authors
    s076923
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    s076923/docvqa-train dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    boostcamp-docvqa-v5-test

    • huggingface.co
    Updated Nov 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    seonjong Yoo (2023). boostcamp-docvqa-v5-test [Dataset]. https://huggingface.co/datasets/Ssunbell/boostcamp-docvqa-v5-test
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 2, 2023
    Authors
    seonjong Yoo
    Description

    Dataset Card for "boostcamp-docvqa-v5-test"

    More Information needed

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
LMMs-Lab, DocVQA [Dataset]. https://huggingface.co/datasets/lmms-lab/DocVQA

DocVQA

lmms-lab/DocVQA

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset authored and provided by
LMMs-Lab
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Large-scale Multi-modality Models Evaluation Suite

Accelerating the development of large-scale multi-modality models (LMMs) with lmms-eval

🏠 Homepage | 📚 Documentation | 🤗 Huggingface Datasets

  This Dataset

This is a formatted version of DocVQA. It is used in our lmms-eval pipeline to allow for one-click evaluations of large multi-modality models. @article{mathew2020docvqa, title={DocVQA: A Dataset for VQA on Document Images. CoRR abs/2007.00398 (2020)}… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/DocVQA.

Search
Clear search
Close search
Google apps
Main menu