7 datasets found
  1. P

    MP-DocVQA Dataset

    • paperswithcode.com
    Updated Apr 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny (2023). MP-DocVQA Dataset [Dataset]. https://paperswithcode.com/dataset/mp-docvqa
    Explore at:
    Dataset updated
    Apr 2, 2023
    Authors
    Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny
    Description

    The dataset is aimed to perform Visual Question Answering on multipage industry scanned documents. The questions and answers are reused from Single Page DocVQA (SP-DocVQA) dataset. The images also corresponds to the same in original dataset with previous and posterior pages with a limit of up to 20 pages per document.

  2. h

    Viet-Doc-VQA-II

    • huggingface.co
    Updated Jul 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fifth Civil Defender - 5CD (2024). Viet-Doc-VQA-II [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-Doc-VQA-II
    Explore at:
    Dataset updated
    Jul 20, 2024
    Dataset authored and provided by
    Fifth Civil Defender - 5CD
    Area covered
    Vietnam
    Description

    Dataset Overview

    This dataset is a continuation of the ongoing work from Viet Document VAQ dataset was collected from 64,765 pages of Vietnamese 🇻🇳 textbooks( Sách bài tập, chuyên đề, sách giáo án của Bộ GDĐT, Cánh Diều, Chân trời sáng tạo, Kết nối tri thức), spanning all subjects from grades 1 to 12. Each page has been analyzed and annotated using advanced Visual Question Answering (VQA) techniques to produce a comprehensive dataset. There is a set of 388,277 detailed… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-Doc-VQA-II.

  3. h

    DocVQA

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LMMs-Lab, DocVQA [Dataset]. https://huggingface.co/datasets/lmms-lab/DocVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    LMMs-Lab
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Large-scale Multi-modality Models Evaluation Suite

    Accelerating the development of large-scale multi-modality models (LMMs) with lmms-eval

    🏠 Homepage | 📚 Documentation | 🤗 Huggingface Datasets

      This Dataset
    

    This is a formatted version of DocVQA. It is used in our lmms-eval pipeline to allow for one-click evaluations of large multi-modality models. @article{mathew2020docvqa, title={DocVQA: A Dataset for VQA on Document Images. CoRR abs/2007.00398 (2020)}… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/DocVQA.

  4. h

    DocumentVQA

    • huggingface.co
    Updated May 4, 2000
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HuggingFaceM4 (2000). DocumentVQA [Dataset]. https://huggingface.co/datasets/HuggingFaceM4/DocumentVQA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 4, 2000
    Dataset authored and provided by
    HuggingFaceM4
    Description

    HuggingFaceM4/DocumentVQA dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. Document Conversion and Retrieval System (DOCRS)

    • catalog.data.gov
    • datahub.va.gov
    • +3more
    Updated Aug 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Veterans Affairs (2024). Document Conversion and Retrieval System (DOCRS) [Dataset]. https://catalog.data.gov/dataset/document-conversion-and-retrieval-system-docrs
    Explore at:
    Dataset updated
    Aug 28, 2024
    Dataset provided by
    United States Department of Veterans Affairshttp://va.gov/
    Description

    The Document Conversion and Retrieval System (DOCRS) is a repository of building construction and real property based documents that have been completed. The documents are archival in nature and the system is accessed by CFM personnel and authorized station engineering personnel. Access to these documents limited due to security concerns because many of the documents are building plans type documents for structures throughout the VA. DOCRS is a web based system hosted within the VA intranet.

  6. JA-VG-VQA-500

    • huggingface.co
    Updated May 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sakana AI (2024). JA-VG-VQA-500 [Dataset]. https://huggingface.co/datasets/SakanaAI/JA-VG-VQA-500
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 18, 2024
    Dataset authored and provided by
    Sakana AIhttps://sakana.ai/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    JA-VG-VQA-500

      Dataset Description
    

    JA-VG-VQA-500 is a 500-sample subset of Japanese Visual Genome VQA dataset. This dataset was used in the evaluation of EvoVLM-JP-v1-7B. Please refer to our report and blog for more details. We are grateful to the developers for making the dataset available under Creative Commons Attribution 4.0 License.

    Visual Genome Japanese Visual Genome VQA dataset

      Usage
    

    Use the code below to get started with the dataset. from datasets… See the full description on the dataset page: https://huggingface.co/datasets/SakanaAI/JA-VG-VQA-500.

  7. T

    Electronic Signature (eSig)

    • data.va.gov
    • datahub.va.gov
    • +2more
    application/rdfxml +5
    Updated Sep 12, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Electronic Signature (eSig) [Dataset]. https://www.data.va.gov/d/jksy-jd4d
    Explore at:
    csv, application/rdfxml, application/rssxml, json, xml, tsvAvailable download formats
    Dataset updated
    Sep 12, 2019
    Description

    Beginning with the Government Paperwork Elimination Act of 1998 (GPEA), the Federal government has encouraged the use of electronic / digital signatures to enable electronic transactions with agencies, while still providing a means for proof of user consent and non-repudiation. To support this capability, some means of reliable user identity management must exist. Currently, Veterans have to physically print, sign, and mail various documents that, in turn, need to be processed by VA. This process creates a huge inconvenience on the part of the veteran and a financial burden on VA. eSig enables veterans and their surrogates to digitally sign forms that require a high level of verification that the user signing the document is a legitimate and authorized user. In addition, eSig provides a mechanism for VA applications to verify the authenticity of user documents and data integrity on user forms. This capability is enabled by the eSig service. The eSig service signing process includes the following steps: 1. Form Signing Attestation: The user affirms their intent to electronically sign the document and understands re-authentication is part of that process. 2. Re-Authentication: The user must refresh their authentication by repeating the authentication process. 3. Form Signing: The form and the identity of the user are presented to the eSig service, where they are digitally bound and secured. 4. Form Storage: The signed form must be stored for later validation. In this process, the application is entirely responsible for steps 1, 2, and 4. In step 3, the application must use the eSig web service to request signing of the document. The following table lists the detailed functions offered by the eSig service.

  8. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny (2023). MP-DocVQA Dataset [Dataset]. https://paperswithcode.com/dataset/mp-docvqa

MP-DocVQA Dataset

Multipage Document Visual Question Answering

Explore at:
30 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Apr 2, 2023
Authors
Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny
Description

The dataset is aimed to perform Visual Question Answering on multipage industry scanned documents. The questions and answers are reused from Single Page DocVQA (SP-DocVQA) dataset. The images also corresponds to the same in original dataset with previous and posterior pages with a limit of up to 20 pages per document.

Search
Clear search
Close search
Google apps
Main menu