7 datasets found

P
MP-DocVQA Dataset
paperswithcode.com
Updated Apr 2, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny (2023). MP-DocVQA Dataset [Dataset]. https://paperswithcode.com/dataset/mp-docvqa
Explore at:
Dataset updated
Apr 2, 2023
Authors
Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny
Description
The dataset is aimed to perform Visual Question Answering on multipage industry scanned documents. The questions and answers are reused from Single Page DocVQA (SP-DocVQA) dataset. The images also corresponds to the same in original dataset with previous and posterior pages with a limit of up to 20 pages per document.
h
Viet-Doc-VQA-II
huggingface.co
Updated Jul 20, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fifth Civil Defender - 5CD (2024). Viet-Doc-VQA-II [Dataset]. https://huggingface.co/datasets/5CD-AI/Viet-Doc-VQA-II
Explore at:
Dataset updated
Jul 20, 2024
Dataset authored and provided by
Fifth Civil Defender - 5CD
Area covered
Vietnam
Description
Dataset Overview

This dataset is a continuation of the ongoing work from Viet Document VAQ dataset was collected from 64,765 pages of Vietnamese 🇻🇳 textbooks( Sách bài tập, chuyên đề, sách giáo án của Bộ GDĐT, Cánh Diều, Chân trời sáng tạo, Kết nối tri thức), spanning all subjects from grades 1 to 12. Each page has been analyzed and annotated using advanced Visual Question Answering (VQA) techniques to produce a comprehensive dataset. There is a set of 388,277 detailed… See the full description on the dataset page: https://huggingface.co/datasets/5CD-AI/Viet-Doc-VQA-II.
h
DocVQA
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
LMMs-Lab, DocVQA [Dataset]. https://huggingface.co/datasets/lmms-lab/DocVQA
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset authored and provided by
LMMs-Lab
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Large-scale Multi-modality Models Evaluation Suite

Accelerating the development of large-scale multi-modality models (LMMs) with lmms-eval

🏠 Homepage | 📚 Documentation | 🤗 Huggingface Datasets

This Dataset

This is a formatted version of DocVQA. It is used in our lmms-eval pipeline to allow for one-click evaluations of large multi-modality models. @article{mathew2020docvqa, title={DocVQA: A Dataset for VQA on Document Images. CoRR abs/2007.00398 (2020)}… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/DocVQA.
h
DocumentVQA
huggingface.co
Updated May 4, 2000
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
HuggingFaceM4 (2000). DocumentVQA [Dataset]. https://huggingface.co/datasets/HuggingFaceM4/DocumentVQA
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 4, 2000
Dataset authored and provided by
HuggingFaceM4
Description
HuggingFaceM4/DocumentVQA dataset hosted on Hugging Face and contributed by the HF Datasets community
Document Conversion and Retrieval System (DOCRS)
catalog.data.gov
datahub.va.gov
+3more
Updated Aug 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Veterans Affairs (2024). Document Conversion and Retrieval System (DOCRS) [Dataset]. https://catalog.data.gov/dataset/document-conversion-and-retrieval-system-docrs
Explore at:
Dataset updated
Aug 28, 2024
Dataset provided by
United States Department of Veterans Affairshttp://va.gov/
Description
The Document Conversion and Retrieval System (DOCRS) is a repository of building construction and real property based documents that have been completed. The documents are archival in nature and the system is accessed by CFM personnel and authorized station engineering personnel. Access to these documents limited due to security concerns because many of the documents are building plans type documents for structures throughout the VA. DOCRS is a web based system hosted within the VA intranet.
JA-VG-VQA-500
huggingface.co
Updated May 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sakana AI (2024). JA-VG-VQA-500 [Dataset]. https://huggingface.co/datasets/SakanaAI/JA-VG-VQA-500
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 18, 2024
Dataset authored and provided by
Sakana AIhttps://sakana.ai/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
JA-VG-VQA-500

Dataset Description

JA-VG-VQA-500 is a 500-sample subset of Japanese Visual Genome VQA dataset. This dataset was used in the evaluation of EvoVLM-JP-v1-7B. Please refer to our report and blog for more details. We are grateful to the developers for making the dataset available under Creative Commons Attribution 4.0 License.

Visual Genome Japanese Visual Genome VQA dataset

Usage

Use the code below to get started with the dataset. from datasets… See the full description on the dataset page: https://huggingface.co/datasets/SakanaAI/JA-VG-VQA-500.
T
Electronic Signature (eSig)
data.va.gov
datahub.va.gov
+2more
application/rdfxml +5
Updated Sep 12, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2019). Electronic Signature (eSig) [Dataset]. https://www.data.va.gov/d/jksy-jd4d
Explore at:
csv, application/rdfxml, application/rssxml, json, xml, tsvAvailable download formats
Dataset updated
Sep 12, 2019
Description
Beginning with the Government Paperwork Elimination Act of 1998 (GPEA), the Federal government has encouraged the use of electronic / digital signatures to enable electronic transactions with agencies, while still providing a means for proof of user consent and non-repudiation. To support this capability, some means of reliable user identity management must exist. Currently, Veterans have to physically print, sign, and mail various documents that, in turn, need to be processed by VA. This process creates a huge inconvenience on the part of the veteran and a financial burden on VA. eSig enables veterans and their surrogates to digitally sign forms that require a high level of verification that the user signing the document is a legitimate and authorized user. In addition, eSig provides a mechanism for VA applications to verify the authenticity of user documents and data integrity on user forms. This capability is enabled by the eSig service. The eSig service signing process includes the following steps: 1. Form Signing Attestation: The user affirms their intent to electronically sign the document and understands re-authentication is part of that process. 2. Re-Authentication: The user must refresh their authentication by repeating the authentication process. 3. Form Signing: The form and the identity of the user are presented to the eSig service, where they are digitally bound and secured. 4. Form Storage: The signed form must be stored for later validation. In this process, the application is entirely responsible for steps 1, 2, and 4. In step 3, the application must use the eSig web service to request signing of the document. The following table lists the detailed functions offered by the eSig service.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny (2023). MP-DocVQA Dataset [Dataset]. https://paperswithcode.com/dataset/mp-docvqa

MP-DocVQA Dataset

Multipage Document Visual Question Answering

Explore at:

30 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Apr 2, 2023

Authors

Rubèn Tito; Dimosthenis Karatzas; Ernest Valveny

Description

The dataset is aimed to perform Visual Question Answering on multipage industry scanned documents. The questions and answers are reused from Single Page DocVQA (SP-DocVQA) dataset. The images also corresponds to the same in original dataset with previous and posterior pages with a limit of up to 20 pages per document.

Clear search

Close search

Google apps

Main menu

MP-DocVQA Dataset

Viet-Doc-VQA-II

DocVQA

DocumentVQA

Document Conversion and Retrieval System (DOCRS)

JA-VG-VQA-500

Electronic Signature (eSig)

MP-DocVQA DatasetSee More Versions

Multipage Document Visual Question Answering

MP-DocVQA Dataset