Facebook
TwitterThis file contains participant response data to Likert scale, open-ended responses and self-reported time taken to complete various tasks related to the extraction exercise. This Excel file also contains: 1) Examples of the Interactive HAWC Visuals that can be created after extracting data into the template. 2) The Initial Post-Extraction Survey Tool ("Survey 1") 3) The Final Post-Pilot Survey Tool ("Survey 2") 4) Survey 2 Results: Willingness to Consider Structured Data During Publication Process (Table 2) 5) Survey 1 Results: Participant Self-Reported Time Spent Performing Various Pilot Tasks (Table 3) 6) Survey 1 Results: Summary of Technical Assistance Provided by Team Members (Table 4) 7) Survey 2 Results: Participant Responses Describing Pilot's Impact on Future Research Activities (Table 5) 8) Survey 1 Results: Initial Survey Likert Scale Results (Table 6) 9) Repeat Extraction: Comparison of the First and Second Data Extraction Experience (Among the Same Participant) 10) Survey 1 Results: Problematic & Easy Fields to Extract. This dataset is associated with the following publication: Wilkins, A., P. Whaley, A. Persad, I. Druwe, J. Lee, M. Taylor, A. Shapiro, N. Blanton, C. Lemeris, and K. Thayer. Assessing author willingness to enter study information into structured data templates as part of the manuscript submission process: A pilot study. Heliyon. Elsevier B.V., Amsterdam, NETHERLANDS, 8(3): 1-9, (2022).
Facebook
TwitterSpanish Question Answer Generation Dataset and Code
Description
This repository contains the dataset and source code developed for the article: "ESQAD: An Open Spanish Dataset for Curriculum-Aligned Question-Answer Generation in Educational Settings"
The resources include:- A Spanish QAG dataset aligned with national curricula (EVAU).- Automatically generated QAG pairs from literary and legal sources.- A pilot study subset with questions validated by teachers and students.
Dataset Structure
evau/docs/EvAU_QA.csv- Description: Manually curated questions and answers aligned with the Spanish Evaluación para el Acceso a la Universidad (EVAU).- Columns: question, answer, subject, difficulty- Purpose: Benchmark for educational QAG tasks in Spanish.quijote/docs/Quijote_QA.csv- Description: Automatically generated QAG pairs from Don Quijote de la Mancha.- Columns: question, answer, chapter, difficulty- Purpose: Evaluation of QAG performance on literary texts.legal_faqs/docs/Legal_QA.csv- Description: Questions and answers extracted and generated from FAQs related to Spanish laws (Ley 39/2015 and Ley 40/2015).- Columns: question, answer, law_reference- Purpose: Testing QAG in legal and administrative contexts.exams/exams_QA_validated.json- Description: 923 automatically generated QAG pairs evaluated by teachers and students during a pilot study:- Ratings: Clarity, complexity, pedagogical value (1–3 scale).- Difficulty: Intended vs perceived difficulty levels.- Comments: Free-text feedback from users.- Purpose: Benchmark for evaluating QAG quality with human-validated data.Citation
This dataset accompanies the article:
Badenes-Olmedo, C., Eyzaguirre-Barreda, P., Chu-Artzt, N., & Gayoso-Cabada, J. (2025). "ESQAD: A Curriculum-Aligned Spanish Dataset for Educational Question Answering" submitted to Computer Speech & Language (Elsevier, 2025)
Please cite this resource using the article (once published), or refer to this Zenodo DOI in the meantime.
License
Contact
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
IntroductionColorectal cancer (CRC) screening rates are lower among immigrant populations in the United States (US) than the general population. Immigrant communities face structural barriers that disincentivize their engagement from CRC screening. A growing body of literature has evaluated the effects of interventions aimed at increasing CRC screening engagement among various immigrant groups, but there has not yet been a systematic synthesis of this literature.ObjectiveThis review will systematically evaluate quantitative studies assessing the effects of interventions designed to increase CRC screening rates among immigrant populations residing in the US.MethodsWe will conduct a comprehensive search of English language peer-reviewed and grey literature using specific keywords and database-specific structured vocabulary on interventions to improve CRC screening rates among immigrants published in 7 databases (PubMed, Cochrane Library (Wiley), CINAHL (EBSCO), ClinicalTrials.gov, Embase (Ovid), Scopus (Elsevier), and Web of Science) from January 1, 2000 to December 31, 2024. All studies will be imported into Covidence. Two reviewers will independently screen titles, abstracts, and full-texts for inclusion and exclusion criteria. Pilot screenings and consensus discussions will ensure accuracy and agreement in study selection and data extraction. Iterative data extraction of eligible studies will include critical appraisal using the Risk of Bias 2 (ROB2) for randomized controlled trials, while other study designs will be appraised with the risk of bias in nonrandomized studies of interventions (ROBINS-I) tool. Data synthesis will disaggregate pooled effect estimates by ethnicity, to the extent possible. The study protocol was pre-registered in International Prospective Register of Systematic reviews (PROSPERO): CRD42023488183.Expected outputsThis systematic review aims to generate an exhaustive summary of the evidence base, including a description of the intervention methods and settings, target populations, recruitment and retention strategies, partnerships and collaborations, and reported outcomes. The results will provide actionable recommendations for public health practitioners, healthcare providers, and policymakers developing tailored interventions and policies aimed at improving CRC screening uptake among diverse immigrant populations in the US.
Facebook
TwitterData from manuscript entitled: "Implications for modeling anion exchange treatment of perfluoroalkyl substances in drinking water and related natural organic impacts: a pilot study". This dataset is associated with the following publication: Smith, S., D. Wahman, E. Kleiner, B. Gray, T. Sanan, E. Stebel, C. Gastaldo, E. Hughes, S. Pedigo, B. Datsov, M. Lathrop-Allen, I. Bass, J. Quinn, G. Abulikemu, J. Pressman, G. Sorial, and L. Haupert. Implications for modeling anion exchange treatment of perfluoroalkyl substances in drinking water and related natural organic impacts: a pilot study. WATER RESEARCH. Elsevier Science Ltd, New York, NY, USA, 288: 124685, (2026).
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterThis file contains participant response data to Likert scale, open-ended responses and self-reported time taken to complete various tasks related to the extraction exercise. This Excel file also contains: 1) Examples of the Interactive HAWC Visuals that can be created after extracting data into the template. 2) The Initial Post-Extraction Survey Tool ("Survey 1") 3) The Final Post-Pilot Survey Tool ("Survey 2") 4) Survey 2 Results: Willingness to Consider Structured Data During Publication Process (Table 2) 5) Survey 1 Results: Participant Self-Reported Time Spent Performing Various Pilot Tasks (Table 3) 6) Survey 1 Results: Summary of Technical Assistance Provided by Team Members (Table 4) 7) Survey 2 Results: Participant Responses Describing Pilot's Impact on Future Research Activities (Table 5) 8) Survey 1 Results: Initial Survey Likert Scale Results (Table 6) 9) Repeat Extraction: Comparison of the First and Second Data Extraction Experience (Among the Same Participant) 10) Survey 1 Results: Problematic & Easy Fields to Extract. This dataset is associated with the following publication: Wilkins, A., P. Whaley, A. Persad, I. Druwe, J. Lee, M. Taylor, A. Shapiro, N. Blanton, C. Lemeris, and K. Thayer. Assessing author willingness to enter study information into structured data templates as part of the manuscript submission process: A pilot study. Heliyon. Elsevier B.V., Amsterdam, NETHERLANDS, 8(3): 1-9, (2022).