62 datasets found

S
Test dataset of ChatGPT in medical field
scidb.cn
Updated Mar 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
robin shen (2023). Test dataset of ChatGPT in medical field [Dataset]. http://doi.org/10.57760/sciencedb.o00130.00001
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57760/sciencedb.o00130.00001
Dataset updated
Mar 3, 2023
Dataset provided by
Science Data Bank
Authors
robin shen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The researcher tests the QA capability of ChatGPT in the medical field from the following aspects:1. Test their reserve capacity for medical knowledge2. Check their ability to read literature and understand medical literature3. Test their ability of auxiliary diagnosis after reading case data4. Test its error correction ability for case data5. Test its ability to standardize medical terms6. Test their evaluation ability to experts7. Check their ability to evaluate medical institutionsThe conclusion is:ChatGPT has great potential in the application of medical and health care, and may directly replace human beings or even professionals at a certain level in some fields;The researcher preliminarily believe that ChatGPT has basic medical knowledge and the ability of multiple rounds of dialogue, and its ability to understand Chinese is not weak;ChatGPT has the ability to read, understand and correct cases;ChatGPT has the ability of information extraction and terminology standardization, and is quite excellent;ChatGPT has the reasoning ability of medical knowledge;ChatGPT has the ability of continuous learning. After continuous training, its level has improved significantly;ChatGPT does not have the academic evaluation ability of Chinese medical talents, and the results are not ideal;ChatGPT does not have the academic evaluation ability of Chinese medical institutions, and the results are not ideal;ChatGPT is an epoch-making product, which can become a useful assistant for medical diagnosis and treatment, knowledge service, literature reading, review and paper writing.
U
Data from: Dataset of the study: "Chatbots put to the test in math and logic...
researchdata.bath.ac.uk
zenodo.org
Updated May 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vagelis Plevris; George Papazafeiropoulos; Alejandro Jimenez Rios (2023). Dataset of the study: "Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard" [Dataset]. http://doi.org/10.5281/zenodo.7940781
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.7940781
Dataset updated
May 20, 2023
Dataset provided by
Zenodo
Authors
Vagelis Plevris; George Papazafeiropoulos; Alejandro Jimenez Rios
Dataset funded by
Oslo Metropolitan University
Description
This dataset contains the 30 questions that were posed to the chatbots (i) ChatGPT-3.5; (ii) ChatGPT-4; and (iii) Google Bard, in May 2023 for the study “Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard”. These 30 questions describe mathematics and logic problems that have a unique correct answer. The questions are fully described with plain text only, without the need for any images or special formatting. The questions are divided into two sets of 15 questions each (Set A and Set B). The questions of Set A are 15 “Original” problems that cannot be found online, at least in their exact wording, while Set B contains 15 “Published” problems that one can find online by searching on the internet, usually with their solution. Each question is posed three times to each chatbot.

This dataset contains the following: (i) The full set of the 30 questions, A01-A15 and B01-B15; (ii) the correct answer for each one of them; (iii) an explanation of the solution, for the problems where such an explanation is needed, (iv) the 30 (questions) × 3 (chatbots) × 3 (answers) = 270 detailed answers of the chatbots. For the published problems of Set B, we also provide a reference to the source where each problem was taken from.
Datasets .csv
figshare.com
txt
Updated Jan 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yaser Alhasawi (2024). Datasets .csv [Dataset]. http://doi.org/10.6084/m9.figshare.25053146.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.25053146.v1
Dataset updated
Jan 24, 2024
Dataset provided by
figshare
Authors
Yaser Alhasawi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The dataset for this research project was meticulously constructed to investigate the adoption of ChatGPT among students in the United States. The primary objective was to gain insights into the technological barriers and resistances faced by students in integrating ChatGPT into their information systems. The dataset was designed to capture the diverse adoption patterns among students in various public and private schools and universities across the United States. By examining adoption rates, frequency of usage, and the contexts in which ChatGPT is employed, the research sought to provide a comprehensive understanding of how students are incorporating this technology into their information systems. Moreover, by including participants from diverse educational institutions, the research sought to ensure a comprehensive representation of the student population in the United States. This approach aimed to provide nuanced insights into how factors such as educational background, institution type, and technological familiarity influence ChatGPT adoption.
i
"ChatGPT vs. Student: A Dataset for Source Classification of Computer...
ieee-dataport.org
Updated Jul 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ALI ABDULLAH S ALQAHTANI (2023). "ChatGPT vs. Student: A Dataset for Source Classification of Computer Science Answers [Dataset]. https://ieee-dataport.org/documents/chatgpt-vs-student-dataset-source-classification-computer-science-answers
Explore at:
Dataset updated
Jul 19, 2023
Authors
ALI ABDULLAH S ALQAHTANI
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
along with the corresponding answers from students and ChatGPT.
d
Replication Data for: ChatGPT on ChatGPT: An Exploratory Analysis of its...
search.dataone.org
Updated Sep 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wang, Jieshu; Kiran, Elif; S.R. Aurora (also known as Mai P. Trinh); Simeone, Michael; Lobo, José (2024). Replication Data for: ChatGPT on ChatGPT: An Exploratory Analysis of its Performance in the Public Sector Workforce [Dataset]. http://doi.org/10.7910/DVN/P3CDHS
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/P3CDHS
Dataset updated
Sep 24, 2024
Dataset provided by
Harvard Dataverse
Authors
Wang, Jieshu; Kiran, Elif; S.R. Aurora (also known as Mai P. Trinh); Simeone, Michael; Lobo, José
Description
This repository contains two datasets used in the study exploring the impact of Generative AI, specifically ChatGPT, on the public sector workforce in the United States. The datasets provide detailed information on the core tasks of public sector occupations and their estimated performance metrics, including potential for automation and augmentation by ChatGPT. These estimations are generated by OpenAI’s GPT-4 model (GPT-4-1106-preview) through OpenAI API.
r
Public data files containing the data used for the ChatGPT survey (XLSX) and...
researchdata.edu.au
figshare.mq.edu.au
Updated Sep 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peter Petocz; Matt Bower; Mark Alfano; Jodie Torrington; Jennifer Lai (2023). Public data files containing the data used for the ChatGPT survey (XLSX) and the survey containing variable selection codes (DOCX). [Dataset]. http://doi.org/10.25949/24123306.V1
Explore at:
Unique identifier
https://doi.org/10.25949/24123306.V1
Dataset updated
Sep 21, 2023
Dataset provided by
Macquarie University
Authors
Peter Petocz; Matt Bower; Mark Alfano; Jodie Torrington; Jennifer Lai
Description
This project investigated teacher attitudes towards Generative Artificial Intelligence Tools (GAITs). In excess of three hundred teachers were surveyed across a broad variety of teaching levels, demographic areas, experience levels, and disciplinary areas, to better understand how they believe teaching and assessment should change as a result of GAITs such as ChatGPT.
Teachers were invited to complete an online survey relating to their perceptions of the open Artificial Intelligence (AI) tool ChatGPT, and how it will influence what they teach and how they assess. The purpose of the study is to provide teachers, policymakers, and society at large with an understanding of the potential impact of tools such as ChatGPT on Education.
This dataset contains public data files used for the ChatGPT survey (XLSX) and the survey containing variable selection codes (DOCX). See the second sheet of the XLSX file for variable descriptions.
m
The Impact of AI and ChatGPT on Bangladeshi University Students
data.mendeley.com
Updated Jan 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Md Jhirul Islam (2025). The Impact of AI and ChatGPT on Bangladeshi University Students [Dataset]. http://doi.org/10.17632/zykphpvbr7.2
Explore at:
Unique identifier
https://doi.org/10.17632/zykphpvbr7.2
Dataset updated
Jan 6, 2025
Authors
Md Jhirul Islam
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Bangladesh
Description
The data set records the perceptions of Bangladeshi university students on the influence that AI tools, especially ChatGPT, have on their academic practices, learning experiences, and problem-solving abilities. The varying role of AI in education, which covers common usage statistics, what AI does to our creative abilities, its impact on our learning, and whether it could invade our privacy. This dataset reveals perspective on how AI tools are changing education in the country and offering valuable information for researchers, educators, policymakers, to understand trends, challenges, and opportunities in the adoption of AI in the academic contex.

Methodology Data Collection Method: Online survey using google from Participants: A total of 3,512 students from various Bangladeshi universities participated. Survey Questions:The survey included questions on demographic information, frequency of AI tool usage, perceived benefits, concerns regarding privacy, and impacts on creativity and learning.

Sampling Technique: Random sampling of university students Data Collection Period: June 2024 to December 2024

Privacy Compliance This dataset has been anonymized to remove any personally identifiable information (PII). It adheres to relevant privacy regulations to ensure the confidentiality of participants.

For further inquiries, please contact: Name: Md Jhirul Islam, Daffodil International University Email: jhirul15-4063@diu.edu.bd Phone: 01316317573
🤖 ChatGPT App Google Store Reviews
kaggle.com
Updated Nov 17, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
BwandoWando (2023). 🤖 ChatGPT App Google Store Reviews [Dataset]. http://doi.org/10.34740/kaggle/ds/4017553
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/ds/4017553
Dataset updated
Nov 17, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
BwandoWando
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fd7e02bf38f4b08df2508d6b6e42f3066%2Fchatgpt2.png?generation=1700233710310045&alt=media" alt="">

Based on their wikipedia page

ChatGPT (Chat Generative Pre-trained Transformer) is a large language model-based chatbot developed by OpenAI and launched on November 30, 2022, that enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language. Successive prompts and replies, known as prompt engineering, are considered at each conversation stage as a context.

These reviews were extracted from Google Store App

Usage

This dataset should paint a good picture on what is the public's perception of the app over the years. Using this dataset, we can do the following

Extract sentiments and trends

Identify which version of the app had the most positive feedback, the worst.

Use topic modeling to identify the pain points of the application.

(AND MANY MORE!)

Note

Images generated using Bing Image Generator
ChatGPT in Engineering Education: Survey Data on AI Usage, Learning Impact,...
figshare.com
csv
Updated May 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Davood Khodadad (2025). ChatGPT in Engineering Education: Survey Data on AI Usage, Learning Impact, and Collaboration [Dataset]. http://doi.org/10.6084/m9.figshare.28536422.v1
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.28536422.v1
Dataset updated
May 8, 2025
Dataset provided by
Figsharehttp://figshare.com/
Authors
Davood Khodadad
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Abstract:This dataset presents survey responses from first-year engineering students on their use of ChatGPT and other AI tools in a project-based learning environment. Collected as part of a study on AI’s role in engineering education, the data captures key insights into how students utilize ChatGPT for coding assistance, conceptual understanding, and collaborative work. The dataset includes responses on frequency of AI usage, perceived benefits and challenges, ethical concerns, and the impact of AI on learning outcomes and problem-solving skills.With AI increasingly integrated into education, this dataset provides valuable empirical evidence for researchers, educators, and policymakers interested in AI-assisted learning, STEM education, and academic integrity. It enables further analysis of student perceptions, responsible AI use, and the evolving role of generative AI in higher education.By making this dataset publicly available, we aim to support future research on AI literacy, pedagogy, and best practices for integrating AI into engineering and science curricula..................................................................................................................................................................Related PublicationThis dataset supports the findings presented in the following peer-reviewed article:ChatGPT in Engineering Education: A Breakthrough or a Challenge?Davood KhodadadPublished: 7 May 2025 | Physics Education, Volume 60, Number 4© 2025 The Author(s). Published by IOP Publishing LtdCitation: Davood Khodadad 2025 Phys. Educ. 60 045006DOI: 10.1088/1361-6552/add073If you use or reference this dataset, please consider citing the above publication......................................................................................................................................................................Description of the data and file structureTitle: ChatGPT in Engineering Education: Survey Data on AI Usage, Learning Impact, and CollaborationDescription of Data Collection:This dataset was collected through a survey distributed via the Canvas learning platform following the completion of group projects in an introductory engineering course. The survey aimed to investigate how students engaged with ChatGPT and other AI tools in a project-based learning environment, particularly in relation to coding, report writing, idea generation, and collaboration.The survey consisted of 15 questions:12 multiple-choice questions to capture quantitative insights on AI usage patterns, frequency, and perceived benefits.3 open-ended questions to collect qualitative perspectives on challenges, ethical concerns, and students' reflections on AI-assisted learning.Key areas assessed in the survey include:Students’ prior familiarity with AI tools before the course.Frequency and purpose of ChatGPT usage (e.g., coding assistance, conceptual learning, collaboration).Perceived benefits and limitations of using AI tools in an engineering learning environment.Ethical considerations, including concerns about over-reliance and academic integrity.The dataset provides valuable empirical insights into the evolving role of AI in STEM education and can support further research on AI-assisted learning, responsible AI usage, and best practices for integrating AI tools in engineering education.
How User Language Affects Conflict Fatality Estimates in ChatGPT, Query...
zenodo.org
data.niaid.nih.gov
text/x-python, zip
Updated Jul 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kazenwadel Daniel*; Kazenwadel Daniel*; Steinert Christoph*; Steinert Christoph* (2023). How User Language Affects Conflict Fatality Estimates in ChatGPT, Query script and dataset [Dataset]. http://doi.org/10.5281/zenodo.8181226
Explore at:
zip, text/x-pythonAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.8181226
Dataset updated
Jul 26, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Kazenwadel Daniel*; Kazenwadel Daniel*; Steinert Christoph*; Steinert Christoph*
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
*both authors contributed equally

Automated query script for automated language bias studies in GPT 3-5

Dataset of the paper "How User Language Affects Conflict Fatality Estimates in ChatGPT" preprint available on ArXiv
e
ChatGPT Usage by Gender – Survey Data
expresslegalfunding.com
html
Updated May 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Express Legal Funding (2025). ChatGPT Usage by Gender – Survey Data [Dataset]. https://expresslegalfunding.com/chatgpt-study/
Explore at:
htmlAvailable download formats
Dataset updated
May 2, 2025
Dataset authored and provided by
Express Legal Funding
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Men, Women
Description
This dataset shows how men and women in the U.S. reported using ChatGPT in a 2025 survey, including whether they followed its advice or chose not to use it.
h
awesome-chatgpt-prompts
huggingface.co
Updated Dec 15, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fatih Kadir Akın (2023). awesome-chatgpt-prompts [Dataset]. https://huggingface.co/datasets/fka/awesome-chatgpt-prompts
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 15, 2023
Authors
Fatih Kadir Akın
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
🧠 Awesome ChatGPT Prompts [CSV dataset]

This is a Dataset Repository of Awesome ChatGPT Prompts View All Prompts on GitHub

License

CC-0
f
Data from: How generative AI models such as ChatGPT can be (mis)used in SPC...
tandf.figshare.com
html
Updated Mar 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fadel M. Megahed; Ying-Ju Chen; Joshua A. Ferris; Sven Knoth; L. Allison Jones-Farmer (2024). How generative AI models such as ChatGPT can be (mis)used in SPC practice, education, and research? An exploratory study [Dataset]. http://doi.org/10.6084/m9.figshare.23532743.v1
Explore at:
htmlAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.23532743.v1
Dataset updated
Mar 6, 2024
Dataset provided by
Taylor & Francis
Authors
Fadel M. Megahed; Ying-Ju Chen; Joshua A. Ferris; Sven Knoth; L. Allison Jones-Farmer
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Generative Artificial Intelligence (AI) models such as OpenAI’s ChatGPT have the potential to revolutionize Statistical Process Control (SPC) practice, learning, and research. However, these tools are in the early stages of development and can be easily misused or misunderstood. In this paper, we give an overview of the development of Generative AI. Specifically, we explore ChatGPT’s ability to provide code, explain basic concepts, and create knowledge related to SPC practice, learning, and research. By investigating responses to structured prompts, we highlight the benefits and limitations of the results. Our study indicates that the current version of ChatGPT performs well for structured tasks, such as translating code from one language to another and explaining well-known concepts but struggles with more nuanced tasks, such as explaining less widely known terms and creating code from scratch. We find that using new AI tools may help practitioners, educators, and researchers to be more efficient and productive. However, in their current stages of development, some results are misleading and wrong. Overall, the use of generative AI models in SPC must be properly validated and used in conjunction with other methods to ensure accurate results.
f
Data Sheet 1_Large language models generating synthetic clinical datasets: a...
frontiersin.figshare.com
xlsx
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Austin A. Barr; Joshua Quan; Eddie Guo; Emre Sezgin (2025). Data Sheet 1_Large language models generating synthetic clinical datasets: a feasibility and comparative analysis with real-world perioperative data.xlsx [Dataset]. http://doi.org/10.3389/frai.2025.1533508.s001
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.3389/frai.2025.1533508.s001
Dataset updated
Feb 5, 2025
Dataset provided by
Frontiers
Authors
Austin A. Barr; Joshua Quan; Eddie Guo; Emre Sezgin
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
BackgroundClinical data is instrumental to medical research, machine learning (ML) model development, and advancing surgical care, but access is often constrained by privacy regulations and missing data. Synthetic data offers a promising solution to preserve privacy while enabling broader data access. Recent advances in large language models (LLMs) provide an opportunity to generate synthetic data with reduced reliance on domain expertise, computational resources, and pre-training.ObjectiveThis study aims to assess the feasibility of generating realistic tabular clinical data with OpenAI’s GPT-4o using zero-shot prompting, and evaluate the fidelity of LLM-generated data by comparing its statistical properties to the Vital Signs DataBase (VitalDB), a real-world open-source perioperative dataset.MethodsIn Phase 1, GPT-4o was prompted to generate a dataset with qualitative descriptions of 13 clinical parameters. The resultant data was assessed for general errors, plausibility of outputs, and cross-verification of related parameters. In Phase 2, GPT-4o was prompted to generate a dataset using descriptive statistics of the VitalDB dataset. Fidelity was assessed using two-sample t-tests, two-sample proportion tests, and 95% confidence interval (CI) overlap.ResultsIn Phase 1, GPT-4o generated a complete and structured dataset comprising 6,166 case files. The dataset was plausible in range and correctly calculated body mass index for all case files based on respective heights and weights. Statistical comparison between the LLM-generated datasets and VitalDB revealed that Phase 2 data achieved significant fidelity. Phase 2 data demonstrated statistical similarity in 12/13 (92.31%) parameters, whereby no statistically significant differences were observed in 6/6 (100.0%) categorical/binary and 6/7 (85.71%) continuous parameters. Overlap of 95% CIs were observed in 6/7 (85.71%) continuous parameters.ConclusionZero-shot prompting with GPT-4o can generate realistic tabular synthetic datasets, which can replicate key statistical properties of real-world perioperative data. This study highlights the potential of LLMs as a novel and accessible modality for synthetic data generation, which may address critical barriers in clinical data access and eliminate the need for technical expertise, extensive computational resources, and pre-training. Further research is warranted to enhance fidelity and investigate the use of LLMs to amplify and augment datasets, preserve multivariate relationships, and train robust ML models.
H
Replication Data for: Consumers' attitudes toward ChatGPT usage
dataverse.harvard.edu
Updated Mar 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
wenshuai su (2025). Replication Data for: Consumers' attitudes toward ChatGPT usage [Dataset]. http://doi.org/10.7910/DVN/OFXGOL
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/OFXGOL
Dataset updated
Mar 11, 2025
Dataset provided by
Harvard Dataverse
Authors
wenshuai su
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This dataset was collected for research purposes and contains numerical data representing consumers' attitudes toward AI usage. The data was gathered through a survey conducted from May 22 to June 12, 2024. The dataset covers responses from participants in South Korea, China, Vietnam, and Japan. Key variables include consumer perceptions, adoption patterns, and demographic information to support analysis on regional differences in AI acceptance.
e
ChatGPT Usage by Age Group – Survey Data
expresslegalfunding.com
html
Updated May 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Express Legal Funding (2025). ChatGPT Usage by Age Group – Survey Data [Dataset]. https://expresslegalfunding.com/chatgpt-study/
Explore at:
htmlAvailable download formats
Dataset updated
May 2, 2025
Dataset authored and provided by
Express Legal Funding
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
60+, 18–29, 30–44, 45–60
Description
This dataset presents ChatGPT usage patterns across different age groups, showing the percentage of users who have followed its advice, used it without following advice, or have never used it, based on a 2025 U.S. survey.
AI Tool Usage by Indian College Students 2025
kaggle.com
Updated Jun 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rakesh Kapilavayi (2025). AI Tool Usage by Indian College Students 2025 [Dataset]. https://www.kaggle.com/datasets/rakeshkapilavai/ai-tool-usage-by-indian-college-students-2025
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 9, 2025
Dataset provided by
Kaggle
Authors
Rakesh Kapilavayi
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
AI Tool Usage by Indian College Students 2025

This unique dataset, collected via a May 2025 survey, captures how 496 Indian college students use AI tools (e.g., ChatGPT, Gemini, Copilot) in academics. It includes 16 attributes like AI tool usage, trust, impact on grades, and internet access, ideal for education analytics and machine learning.

Columns

Student_Name: Anonymized student name.

College_Name: College attended.

Stream: Academic discipline (e.g., Engineering, Arts).

Year_of_Study: Year of study (1–4).

AI_Tools_Used: Tools used (e.g., ChatGPT, Gemini).

Daily_Usage_Hours: Hours spent daily on AI tools.

Use_Cases: Purposes (e.g., Assignments, Exam Prep).

Trust_in_AI_Tools: Trust level (1–5).

Impact_on_Grades: Grade impact (-3 to +3).

Do_Professors_Allow_Use: Professor approval (Yes/No).

Preferred_AI_Tool: Preferred tool.

Awareness_Level: AI awareness (1–10).

Willing_to_Pay_for_Access: Willingness to pay (Yes/No).

State: Indian state.

Device_Used: Device (e.g., Laptop, Mobile).

Internet_Access: Access quality (Poor/Medium/High).

Use Cases

Predict academic performance using AI tool usage.

Analyze trust in AI across streams or regions.

Cluster students by usage patterns.

Study digital divide via Internet_Access.

Source: Collected via Google Forms survey in May 2025, ensuring diverse representation across India. Note: First dataset of its kind on Kaggle!
m
Composing alt text using large language models: dataset in Russian
data.mendeley.com
Updated Jun 17, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yekaterina Kosova (2024). Composing alt text using large language models: dataset in Russian [Dataset]. http://doi.org/10.17632/73dptbyxbb.1
Explore at:
Unique identifier
https://doi.org/10.17632/73dptbyxbb.1
Dataset updated
Jun 17, 2024
Authors
Yekaterina Kosova
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The dataset contains the results of developing alternative text for images using chatbots based on large language models. The study was carried out in April-June 2024. Microsoft Copilot, Google Gemini, and YandexGPT chatbots were used to generate 108 text descriptions for 12 images. Descriptions were generated by chatbots using keywords specified by a person. The experts then rated the resulting descriptions on a Likert scale (from 1 to 5). The data set is presented in a Microsoft Excel table on the “Data” sheet with the following fields: record number; image number; chatbot; image type (photo, logo); request date; list of keywords; number of keywords; length of keywords; time of compilation of keywords; generated descriptions; required length of descriptions; actual length of descriptions; description generation time; usefulness; reliability; completeness; accuracy; literacy. The “Images” sheet contains links to the original images. Data set is presented in Russian.
e
ChatGPT Usage by U.S. Census Region – Survey Data
expresslegalfunding.com
html
Updated May 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Express Legal Funding (2025). ChatGPT Usage by U.S. Census Region – Survey Data [Dataset]. https://expresslegalfunding.com/chatgpt-study/
Explore at:
htmlAvailable download formats
Dataset updated
May 2, 2025
Dataset authored and provided by
Express Legal Funding
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Pacific, Mountain, New England, South Atlantic, Middle Atlantic, East North Central, East South Central, West North Central, West South Central
Description
This dataset presents ChatGPT usage patterns across U.S. Census regions, based on a 2025 nationwide survey. It tracks how often users followed, partially used, or never used ChatGPT by state region.
h
ASRS-ChatGPT
huggingface.co
Updated Jun 29, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archana Tikayat Ray (2023). ASRS-ChatGPT [Dataset]. http://doi.org/10.57967/hf/0830
Explore at:
Unique identifier
https://doi.org/10.57967/hf/0830
Dataset updated
Jun 29, 2023
Authors
Archana Tikayat Ray
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset Summary

The dataset contains a total of 9984 incident records and 9 columns. Some of the columns contain ground truth values whereas others contain information generated by ChatGPT based on the incident Narratives. The creation of this dataset is aimed at providing researchers with columns generated by using ChatGPT API which is not freely available.

Dataset Structure

The column names present in the dataset and their descriptions are provided below:

Column… See the full description on the dataset page: https://huggingface.co/datasets/archanatikayatray/ASRS-ChatGPT.

Facebook

Twitter

Click to copy link

Link copied

Cite

robin shen (2023). Test dataset of ChatGPT in medical field [Dataset]. http://doi.org/10.57760/sciencedb.o00130.00001

Test dataset of ChatGPT in medical field

Explore at:

252 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Unique identifier

https://doi.org/10.57760/sciencedb.o00130.00001

Dataset updated

Mar 3, 2023

Dataset provided by

Science Data Bank

Authors

robin shen

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The researcher tests the QA capability of ChatGPT in the medical field from the following aspects:1. Test their reserve capacity for medical knowledge2. Check their ability to read literature and understand medical literature3. Test their ability of auxiliary diagnosis after reading case data4. Test its error correction ability for case data5. Test its ability to standardize medical terms6. Test their evaluation ability to experts7. Check their ability to evaluate medical institutionsThe conclusion is:ChatGPT has great potential in the application of medical and health care, and may directly replace human beings or even professionals at a certain level in some fields;The researcher preliminarily believe that ChatGPT has basic medical knowledge and the ability of multiple rounds of dialogue, and its ability to understand Chinese is not weak;ChatGPT has the ability to read, understand and correct cases;ChatGPT has the ability of information extraction and terminology standardization, and is quite excellent;ChatGPT has the reasoning ability of medical knowledge;ChatGPT has the ability of continuous learning. After continuous training, its level has improved significantly;ChatGPT does not have the academic evaluation ability of Chinese medical talents, and the results are not ideal;ChatGPT does not have the academic evaluation ability of Chinese medical institutions, and the results are not ideal;ChatGPT is an epoch-making product, which can become a useful assistant for medical diagnosis and treatment, knowledge service, literature reading, review and paper writing.

Clear search

Close search

Google apps

Main menu

Test dataset of ChatGPT in medical field

Data from: Dataset of the study: "Chatbots put to the test in math and logic...

Datasets .csv

"ChatGPT vs. Student: A Dataset for Source Classification of Computer...

Replication Data for: ChatGPT on ChatGPT: An Exploratory Analysis of its...

Public data files containing the data used for the ChatGPT survey (XLSX) and...

The Impact of AI and ChatGPT on Bangladeshi University Students

🤖 ChatGPT App Google Store Reviews

Context

Usage

Note

ChatGPT in Engineering Education: Survey Data on AI Usage, Learning Impact,...

How User Language Affects Conflict Fatality Estimates in ChatGPT, Query...

ChatGPT Usage by Gender – Survey Data

awesome-chatgpt-prompts

Data from: How generative AI models such as ChatGPT can be (mis)used in SPC...

Data Sheet 1_Large language models generating synthetic clinical datasets: a...

Replication Data for: Consumers' attitudes toward ChatGPT usage

ChatGPT Usage by Age Group – Survey Data

AI Tool Usage by Indian College Students 2025

AI Tool Usage by Indian College Students 2025

Columns

Use Cases

Composing alt text using large language models: dataset in Russian

ChatGPT Usage by U.S. Census Region – Survey Data

ASRS-ChatGPT

Test dataset of ChatGPT in medical field