81 datasets found
  1. d

    2019 Public Data File - Students

    • catalog.data.gov
    • data.cityofnewyork.us
    • +1more
    Updated Nov 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofnewyork.us (2024). 2019 Public Data File - Students [Dataset]. https://catalog.data.gov/dataset/2019-public-data-file-students
    Explore at:
    Dataset updated
    Nov 29, 2024
    Dataset provided by
    data.cityofnewyork.us
    Description

    To collect feedback on their learning environment from families, students and teachers. Aids in facilitating the understanding of families perceptions, students, and teachers regarding their school. School leaders use feedback from the survey to reflect and make improvements to schools and programs. Each year all parents, teachers and students in grades 6-12 take the NYC School Survey. The survey is aligned to the DOE's Framework for Great Schools. It is designed to collect important information about each school's ability to support student success.

  2. d

    School Attendance by Student Group and District, 2021-2022

    • catalog.data.gov
    • data.ct.gov
    • +1more
    Updated Jun 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.ct.gov (2025). School Attendance by Student Group and District, 2021-2022 [Dataset]. https://catalog.data.gov/dataset/school-attendance-by-student-group-and-district-2021-2022
    Explore at:
    Dataset updated
    Jun 21, 2025
    Dataset provided by
    data.ct.gov
    Description

    This dataset includes the attendance rate for public school students PK-12 by student group and by district during the 2021-2022 school year. Student groups include: Students experiencing homelessness Students with disabilities Students who qualify for free/reduced lunch English learners All high needs students Non-high needs students Students by race/ethnicity (Hispanic/Latino of any race, Black or African American, White, All other races) Attendance rates are provided for each student group by district and for the state. Students who are considered high needs include students who are English language learners, who receive special education, or who qualify for free and reduced lunch. When no attendance data is displayed in a cell, data have been suppressed to safeguard student confidentiality, or to ensure that statistics based on a very small sample size are not interpreted as equally representative as those based on a sufficiently larger sample size. For more information on CSDE data suppression policies, please visit http://edsight.ct.gov/relatedreports/BDCRE%20Data%20Suppression%20Rules.pdf.

  3. Sample Student Data

    • figshare.com
    xls
    Updated Aug 2, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carrie Ellis (2022). Sample Student Data [Dataset]. http://doi.org/10.6084/m9.figshare.20419434.v1
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Aug 2, 2022
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Carrie Ellis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In "Sample Student Data", there are 6 sheets. There are three sheets with sample datasets, one for each of the three different exercise protocols described (CrP Sample Dataset, Glycolytic Dataset, Oxidative Dataset). Additionally, there are three sheets with sample graphs created using one of the three datasets (CrP Sample Graph, Glycolytic Graph, Oxidative Graph). Each dataset and graph pairs are from different subjects. · CrP Sample Dataset and CrP Sample Graph: This is an example of a dataset and graph created from an exercise protocol designed to stress the creatine phosphate system. Here, the subject was a track and field athlete who threw the shot put for the DeSales University track team. The NIRS monitor was placed on the right triceps muscle, and the student threw the shot put six times with a minute rest in between throws. Data was collected telemetrically by the NIRS device and then downloaded after the student had completed the protocol. · Glycolytic Dataset and Glycolytic Graph: This is an example of a dataset and graph created from an exercise protocol designed to stress the glycolytic energy system. In this example, the subject performed continuous squat jumps for 30 seconds, followed by a 90 second rest period, for a total of three exercise bouts. The NIRS monitor was place on the left gastrocnemius muscle. Here again, data was collected telemetrically by the NIRS device and then downloaded after he had completed the protocol. · Oxidative Dataset and Oxidative Graph: In this example, the dataset and graph are from an exercise protocol designed to stress the oxidative system. Here, the student held a sustained, light-intensity, isometric biceps contraction (pushing against a table). The NIRS monitor was attached to the left biceps muscle belly. Here, data was collected by a student observing the SmO2 values displayed on a secondary device; specifically, a smartphone with the IPSensorMan APP displaying data. The recorder student observed and recorded the data on an Excel Spreadsheet, and marked the times that exercise began and ended on the Spreadsheet.

  4. Fictional Student Performance Dataset

    • kaggle.com
    Updated Nov 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Bin Imran (2023). Fictional Student Performance Dataset [Dataset]. https://www.kaggle.com/datasets/muhammadbinimran/fictional-student-performance-dataset/discussion?sort=undefined
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 4, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muhammad Bin Imran
    Description

    Dataset Name: Fictional Student Performance Dataset

    Description: The "Fictional Student Performance Dataset" is a comprehensive collection of fictional student records designed for educational and analytical purposes. This dataset comprises 500 student profiles and their associated attributes, making it a valuable resource for exploring various aspects of student performance and data analysis.

    Attributes:

    StudentID: A unique identifier for each student, facilitating individual tracking and analysis. Name: The name of each student, ensuring the dataset's personalization. Age: The age of each student, providing demographic information. Gender: The gender of each student, offering insights into gender-based performance trends. Grade: A continuous variable representing the academic performance of students, which can be used for regression and prediction tasks. Attendance: A percentage value denoting the attendance rate of each student, enabling attendance-related analyses. FinalExamScore: A continuous variable indicating the final exam score achieved by each student, making it suitable for evaluation and prediction tasks. Use Cases:

    Educational Research: This dataset is ideal for educational institutions and researchers to analyze student performance and identify factors that influence academic outcomes. Machine Learning Practice: It is an excellent resource for data science enthusiasts and students looking to practice various machine learning techniques, such as regression, classification, and clustering. Predictive Modeling: The "Grade" and "FinalExamScore" attributes can be used to develop predictive models to forecast student performance. Gender-Based Analysis: Explore gender-based trends in student performance and attendance. Attendance Impact: Investigate the correlation between attendance and academic success. Disclaimer: Please note that this dataset is entirely fictional and created for educational and practice purposes. Any resemblance to real individuals or institutions is purely coincidental.

    Citation: If you use this dataset in your research or projects, kindly acknowledge its source as the "Fictional Student Performance Dataset"

    Data Generation: The dataset was generated using a combination of randomization and scripting to ensure that it does not contain any real or personally identifiable information.

    Feel free to explore and utilize this dataset for educational purposes, data analysis, or machine learning exercises. It is intended to foster learning and experimentation in data science.

  5. o

    Synthetic Student Performance Dataset

    • opendatabay.com
    .undefined
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Opendatabay Labs (2025). Synthetic Student Performance Dataset [Dataset]. https://www.opendatabay.com/data/synthetic/09e2de7b-9830-4337-a801-f4b8ca312c53
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Opendatabay Labs
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Education & Learning Analytics
    Description

    This synthetic Student Performance Dataset is designed as an educational resource for data science, machine learning, and education analytics applications. The dataset provides detailed information on various factors influencing students’ academic performance, including demographics, family background, extracurricular activities, and study habits. It aims to help users analyze relationships between these factors and students’ grades, providing insights into student success and well-being.

    Dataset Features:

    • Gender: Gender of the student (e.g., "Male," "Female").
    • Age: Age of the student (in years).
    • Family Size: Size of the student’s family.
    • Parental Status (Together/Apart): Whether the parents are living together or apart.
    • Mother's Education Level: Education level of the student’s mother.
    • Father's Education Level: Education level of the student’s father.
    • Mother's Job: Occupation of the student’s mother.
    • Father's Job: Occupation of the student’s father.
    • Reason for Choosing School: Primary reason for selecting the school (e.g., proximity, reputation).
    • Legal Guardian: Legal guardian of the student (e.g., "Mother," "Father," "Other").
    • Travel Time to School (in hours): Daily travel time between home and school.
    • Weekly Study Time (in hours): Hours spent studying outside school per week.
    • Number of Past Failures: Number of previously failed subjects.
    • Extra Educational Support: Whether the student receives additional educational support (e.g., "Yes," "No").
    • Family Educational Support: Whether the family provides educational support (e.g., "Yes," "No").
    • Paid Extra Classes: Whether the student takes extra paid classes (e.g., "Yes," "No").
    • Extracurricular Activities: Participation in extracurricular activities (e.g., "Yes," "No").
    • Attended Nursery School: Whether the student attended nursery school (e.g., "Yes," "No").
    • Aspiration for Higher Education: Whether the student aspires to pursue higher education (e.g., "Yes," "No").
    • Internet Access at Home: Availability of internet access at home (e.g., "Yes," "No").
    • In a Romantic Relationship: Whether the student is in a romantic relationship (e.g., "Yes," "No").
    • Quality of Family Relationships: Rated quality of relationships within the family.
    • Free Time After School: Amount of free time available after school hours.
    • Going Out with Friends: Frequency of going out with friends.
    • Workday Alcohol Consumption: Level of alcohol consumption during workdays.
    • Weekend Alcohol Consumption: Level of alcohol consumption during weekends.
    • Current Health Status: Self-reported health status of the student.
    • Number of School Absences: Total number of school days missed.
    • First Period Grade: Grade received during the first grading period.
    • Second Period Grade: Grade received during the second grading period.
    • Final Grade: Final grade achieved by the student.

    Distribution:

    https://storage.googleapis.com/opendatabay_public/images/image_725529a8-e4cb-4bee-bcca-a9adc2658dbd.png" alt="Student Performance Dataset Distribution">

    https://storage.googleapis.com/opendatabay_public/images/image_55f1fa29-442d-49ea-89a1-e90b85d8c95f.png" alt="Student Performance Data">

    Usage:

    This dataset is useful for a variety of applications, including:

    • Student Performance Analysis: To explore relationships between family background, study habits, and academic outcomes.
    • Educational Research: To identify key factors influencing student success and well-being.
    • Predictive Modeling: To build models that predict student grades or identify students at risk of underperforming.
    • Policy Making: To analyze how socioeconomic factors and family structure impact education outcomes.

    Coverage:

    This dataset is synthetic and anonymized, ensuring that it is safe for experimentation and learning without compromising any real student data.

    License:

    CCO (Public Domain)

    Who can use it:

    Data science learners: For practising data manipulation, visualization, and predictive modelling. Educators and researchers: For academic studies or teaching purposes in student analytics and education research. Education professionals: For analyzing factors that influence student success and tailoring interventions to improve outcomes.

  6. Students' Academic Performance Dataset

    • kaggle.com
    Updated Nov 26, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ibrahim Aljarah (2016). Students' Academic Performance Dataset [Dataset]. https://www.kaggle.com/aljarah/xAPI-Edu-Data/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 26, 2016
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ibrahim Aljarah
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Students' Academic Performance Dataset (xAPI-Edu-Data)

    Data Set Characteristics: Multivariate

    Number of Instances: 480

    Area: E-learning, Education, Predictive models, Educational Data Mining

    Attribute Characteristics: Integer/Categorical

    Number of Attributes: 16

    Date: 2016-11-8

    Associated Tasks: Classification

    Missing Values? No

    File formats: xAPI-Edu-Data.csv

    Source:

    Elaf Abu Amrieh, Thair Hamtini, and Ibrahim Aljarah, The University of Jordan, Amman, Jordan, http://www.Ibrahimaljarah.com www.ju.edu.jo

    Dataset Information:

    This is an educational data set which is collected from learning management system (LMS) called Kalboard 360. Kalboard 360 is a multi-agent LMS, which has been designed to facilitate learning through the use of leading-edge technology. Such system provides users with a synchronous access to educational resources from any device with Internet connection.

    The data is collected using a learner activity tracker tool, which called experience API (xAPI). The xAPI is a component of the training and learning architecture (TLA) that enables to monitor learning progress and learner’s actions like reading an article or watching a training video. The experience API helps the learning activity providers to determine the learner, activity and objects that describe a learning experience. The dataset consists of 480 student records and 16 features. The features are classified into three major categories: (1) Demographic features such as gender and nationality. (2) Academic background features such as educational stage, grade Level and section. (3) Behavioral features such as raised hand on class, opening resources, answering survey by parents, and school satisfaction.

    The dataset consists of 305 males and 175 females. The students come from different origins such as 179 students are from Kuwait, 172 students are from Jordan, 28 students from Palestine, 22 students are from Iraq, 17 students from Lebanon, 12 students from Tunis, 11 students from Saudi Arabia, 9 students from Egypt, 7 students from Syria, 6 students from USA, Iran and Libya, 4 students from Morocco and one student from Venezuela.

    The dataset is collected through two educational semesters: 245 student records are collected during the first semester and 235 student records are collected during the second semester.

    The data set includes also the school attendance feature such as the students are classified into two categories based on their absence days: 191 students exceed 7 absence days and 289 students their absence days under 7.

    This dataset includes also a new category of features; this feature is parent parturition in the educational process. Parent participation feature have two sub features: Parent Answering Survey and Parent School Satisfaction. There are 270 of the parents answered survey and 210 are not, 292 of the parents are satisfied from the school and 188 are not.

    (See the related papers for more details).

    Attributes

    1 Gender - student's gender (nominal: 'Male' or 'Female’)

    2 Nationality- student's nationality (nominal:’ Kuwait’,’ Lebanon’,’ Egypt’,’ SaudiArabia’,’ USA’,’ Jordan’,’ Venezuela’,’ Iran’,’ Tunis’,’ Morocco’,’ Syria’,’ Palestine’,’ Iraq’,’ Lybia’)

    3 Place of birth- student's Place of birth (nominal:’ Kuwait’,’ Lebanon’,’ Egypt’,’ SaudiArabia’,’ USA’,’ Jordan’,’ Venezuela’,’ Iran’,’ Tunis’,’ Morocco’,’ Syria’,’ Palestine’,’ Iraq’,’ Lybia’)

    4 Educational Stages- educational level student belongs (nominal: ‘lowerlevel’,’MiddleSchool’,’HighSchool’)

    5 Grade Levels- grade student belongs (nominal: ‘G-01’, ‘G-02’, ‘G-03’, ‘G-04’, ‘G-05’, ‘G-06’, ‘G-07’, ‘G-08’, ‘G-09’, ‘G-10’, ‘G-11’, ‘G-12 ‘)

    6 Section ID- classroom student belongs (nominal:’A’,’B’,’C’)

    7 Topic- course topic (nominal:’ English’,’ Spanish’, ‘French’,’ Arabic’,’ IT’,’ Math’,’ Chemistry’, ‘Biology’, ‘Science’,’ History’,’ Quran’,’ Geology’)

    8 Semester- school year semester (nominal:’ First’,’ Second’)

    9 Parent responsible for student (nominal:’mom’,’father’)

    10 Raised hand- how many times the student raises his/her hand on classroom (numeric:0-100)

    11- Visited resources- how many times the student visits a course content(numeric:0-100)

    12 Viewing announcements-how many times the student checks the new announcements(numeric:0-100)

    13 Discussion groups- how many times the student participate on discussion groups (numeric:0-100)

    14 Parent Answering Survey- parent answered the surveys which are provided from school or not (nominal:’Yes’,’No’)

    15 Parent School Satisfaction- the Degree of parent satisfaction from school(nominal:’Yes’,’No’)

    16 Student Absence Days-the number of absence days for each student (nominal: above-7, under-7)

    The students are classified into three numerical intervals based on their total grade/mark:

    Low-Level: i...

  7. f

    Table_1_Raw Data Visualization for Common Factorial Designs Using SPSS: A...

    • frontiersin.figshare.com
    xlsx
    Updated Jun 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Florian Loffing (2023). Table_1_Raw Data Visualization for Common Factorial Designs Using SPSS: A Syntax Collection and Tutorial.XLSX [Dataset]. http://doi.org/10.3389/fpsyg.2022.808469.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 15, 2023
    Dataset provided by
    Frontiers
    Authors
    Florian Loffing
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Transparency in data visualization is an essential ingredient for scientific communication. The traditional approach of visualizing continuous quantitative data solely in the form of summary statistics (i.e., measures of central tendency and dispersion) has repeatedly been criticized for not revealing the underlying raw data distribution. Remarkably, however, systematic and easy-to-use solutions for raw data visualization using the most commonly reported statistical software package for data analysis, IBM SPSS Statistics, are missing. Here, a comprehensive collection of more than 100 SPSS syntax files and an SPSS dataset template is presented and made freely available that allow the creation of transparent graphs for one-sample designs, for one- and two-factorial between-subject designs, for selected one- and two-factorial within-subject designs as well as for selected two-factorial mixed designs and, with some creativity, even beyond (e.g., three-factorial mixed-designs). Depending on graph type (e.g., pure dot plot, box plot, and line plot), raw data can be displayed along with standard measures of central tendency (arithmetic mean and median) and dispersion (95% CI and SD). The free-to-use syntax can also be modified to match with individual needs. A variety of example applications of syntax are illustrated in a tutorial-like fashion along with fictitious datasets accompanying this contribution. The syntax collection is hoped to provide researchers, students, teachers, and others working with SPSS a valuable tool to move towards more transparency in data visualization.

  8. P

    EdNet Dataset

    • paperswithcode.com
    Updated Apr 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Youngduck Choi; Youngnam Lee; Dongmin Shin; Junghyun Cho; Seoyon Park; Seewoo Lee; Jineon Baek; Chan Bae; Byung-soo Kim; Jaewe Heo (2023). EdNet Dataset [Dataset]. https://paperswithcode.com/dataset/ednet
    Explore at:
    Dataset updated
    Apr 4, 2023
    Authors
    Youngduck Choi; Youngnam Lee; Dongmin Shin; Junghyun Cho; Seoyon Park; Seewoo Lee; Jineon Baek; Chan Bae; Byung-soo Kim; Jaewe Heo
    Description

    A large-scale hierarchical dataset of diverse student activities collected by Santa, a multi-platform self-study solution equipped with artificial intelligence tutoring system. EdNet contains 131,441,538 interactions from 784,309 students collected over more than 2 years, which is the largest among the ITS datasets released to the public so far.

  9. MHP (Anxiety, Stress, Depression) Dataset of University Students

    • figshare.com
    application/csv
    Updated May 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mahbubul Syeed; Ashifur Rahman; Laila Akter; Kaniz Fatema; Razib Hayat Khan; Md. Rajual Karim; Md Shakhawat Hossain; Mohammad Faisal Uddin (2024). MHP (Anxiety, Stress, Depression) Dataset of University Students [Dataset]. http://doi.org/10.6084/m9.figshare.25771164.v1
    Explore at:
    application/csvAvailable download formats
    Dataset updated
    May 8, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Mahbubul Syeed; Ashifur Rahman; Laila Akter; Kaniz Fatema; Razib Hayat Khan; Md. Rajual Karim; Md Shakhawat Hossain; Mohammad Faisal Uddin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Research in assessing the Mental Health Problems (MHPs), e.g., stress, anxiety, and depression of university students has had much interest worldwide for the last decade. This article provides a large and comprehensive dataset concerning the MHPs of 2028 students from 15 top-ranked universities in Bangladesh, including 9 government/public universities and 6 private universities. To collect the data, the GAD-7 (for Anxiety), PSS-10 (for Stress), and PHQ-9 (for Depression) models are adopted to reflect equivalent academic perspectives. Additionally, student sociodemographic data are collected. The adoption of these three models are done by a team of five professors and a student psychologist to best capture the academic and socio-demographic factors that influence MHPs among university students. To conduct the survey, a google form is developed and circulated among the 15 faculty representatives from the participating universities who further circulated and conducted the survey with the students. Collected data is evaluated to ensure the sufficiency of sample size, and internal consistency and reliability of the response. Furthermore, the levels of anxiety, stress, and depression are calculated using the data to demonstrate its' applicability. This dataset can be used to measure the trajectory of students’ the mental and psychosocial stressors, to adopt required mental health and counselling services, and to conduct data intensive Machine Learning (ML) model development to predictive MPH assessment.

  10. c

    Educational Need - Datasets - CTData.org

    • data.ctdata.org
    Updated Mar 29, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). Educational Need - Datasets - CTData.org [Dataset]. http://data.ctdata.org/dataset/educational-need
    Explore at:
    Dataset updated
    Mar 29, 2016
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset provides counts and percentages of students in each district that are considered to have indicators of educational need. These indicators include being eligible for Free or Reduced Price Meals, receiving Special Education services, or are English Language Learners. This dataset also provides the total number of students for each district. Charter Districts have been entered as individual districts.

  11. c

    Free or Reduced-price Meal Eligibility - Datasets - CTData.org

    • data.ctdata.org
    Updated Mar 16, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). Free or Reduced-price Meal Eligibility - Datasets - CTData.org [Dataset]. http://data.ctdata.org/dataset/free-or-reduced-price-meal-eligibility
    Explore at:
    Dataset updated
    Mar 16, 2016
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Eligibility indicates students from families whose total income is at or below 185 percent of the poverty level. Household income below 130 percent of the poverty level qualifies students for free meals. Household income between 130 and 185 percent of the poverty level qualifies students for reduced-price meals. Connecticut State Department of Education collects data for grades PreK through 12 on a school year basis. CTdata.org carries annual school year data for grades K through 3.

  12. Udemy Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated May 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2024). Udemy Dataset [Dataset]. https://brightdata.com/products/datasets/udemy
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    May 7, 2024
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    We'll tailor a Udemy dataset to meet your unique needs, encompassing course titles, user engagement metrics, completion rates, demographic data of learners, enrollment numbers, review scores, and other pertinent metrics.

    Leverage our Udemy datasets for diverse applications to bolster strategic planning and market analysis. Scrutinizing these datasets enables organizations to grasp learner preferences and online education trends, facilitating nuanced educational program development and learning initiatives. Customize your access to the entire dataset or specific subsets as per your business requisites.

    Popular use cases involve optimizing educational content based on engagement insights, enhancing learning strategies through targeted learner segmentation, and identifying and forecasting trends to stay ahead in the online education landscape.

  13. P

    ASAP-AES Dataset

    • paperswithcode.com
    Updated Jul 20, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). ASAP-AES Dataset [Dataset]. https://paperswithcode.com/dataset/asap
    Explore at:
    Dataset updated
    Jul 20, 2022
    Description

    There are eight essay sets. Each of the sets of essays was generated from a single prompt. Selected essays range from an average length of 150 to 550 words per response. Some of the essays are dependent upon source information and others are not. All responses were written by students ranging in grade levels from Grade 7 to Grade 10. All essays were hand graded and were double-scored. Each of the eight data sets has its own unique characteristics. The variability is intended to test the limits of your scoring engine's capabilities.

  14. Student Habits vs Academic Performance

    • kaggle.com
    Updated Apr 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jayanta Nath (2025). Student Habits vs Academic Performance [Dataset]. https://www.kaggle.com/datasets/jayaantanaath/student-habits-vs-academic-performance
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 12, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Jayanta Nath
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This is a simulated dataset exploring how lifestyle habits affect academic performance in students. With 1,000 synthetic student records and 15+ features including study hours, sleep patterns, social media usage, diet quality, mental health, and final exam scores, it’s perfect for ML projects, regression analysis, clustering, and data viz. Created using realistic patterns for educational practice.

    Ever wondered how much Netflix, sleep, or TikTok scrolling affects your grades? 👀 This dataset simulates 1,000 students' daily habits—from study time to mental health—and compares them to final exam scores. It's like spying on your GPA through the lens of lifestyle. Perfect for EDA, ML practice, or just vibing with data while pretending to be productive.

  15. i

    Learning Behavior Analytics Dataset

    • ieee-dataport.org
    Updated May 18, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sherin Moussa (2022). Learning Behavior Analytics Dataset [Dataset]. https://ieee-dataport.org/open-access/learning-behavior-analytics-dataset
    Explore at:
    Dataset updated
    May 18, 2022
    Authors
    Sherin Moussa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset represents the main different unique learning behaviors that may be found in any group of learners in e-learning/educational systems. It represents 20 learners through 17 OERs.

  16. d

    NEET 2024 Results: Exam Center-wise Total Students Appeared, Scores Above...

    • dataful.in
    Updated May 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataful (Factly) (2025). NEET 2024 Results: Exam Center-wise Total Students Appeared, Scores Above 600 and 700, and Average and Median Marks by State and Examination Center, Including National Averages [Dataset]. https://dataful.in/datasets/20202
    Explore at:
    application/x-parquet, csv, xlsxAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    Dataful (Factly)
    License

    https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions

    Time period covered
    2024
    Area covered
    States, Cities of India
    Variables measured
    Marks
    Description

    This Dataset contains State and Examination Center-wise Total Students Appeared, Students Scores Above 600 and 700, and Average and Median Marks by State and Examination Center, Including National Averages

    Note: It has to be noted that the marks released by the NTA are centre-wise and hence the analysis of state-wise marks/averages is based on the state where the centre is located and not necessarily the domicile state of the student.

  17. A

    ‘Student Performance Data Set’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Mar 2, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2015). ‘Student Performance Data Set’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-student-performance-data-set-f14e/0580d4d6/?iid=079-283&v=presentation
    Explore at:
    Dataset updated
    Mar 2, 2015
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Student Performance Data Set’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/impapan/student-performance-data-set on 13 February 2022.

    --- Dataset description provided by original source is as follows ---

    About this dataset

    This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was collected by using school reports and questionnaires. Two datasets are provided regarding the performance in two distinct subjects: Mathematics (mat) and Portuguese language (por). In [Cortez and Silva, 2008], the two datasets were modeled under binary/five-level classification and regression tasks. Important note: the target attribute G3 has a strong correlation with attributes G2 and G1. This occurs because G3 is the final year grade (issued at the 3rd period), while G1 and G2 correspond to the 1st and 2nd period grades.
    

    Attribute Information:

    # Attributes for both student-mat.csv (Math course) and student-por.csv (Portuguese language course) datasets:
    1 school - student's school (binary: 'GP' - Gabriel Pereira or 'MS' - Mousinho da Silveira)
    2 sex - student's sex (binary: 'F' - female or 'M' - male)
    3 age - student's age (numeric: from 15 to 22)
    4 address - student's home address type (binary: 'U' - urban or 'R' - rural)
    5 famsize - family size (binary: 'LE3' - less or equal to 3 or 'GT3' - greater than 3)
    6 Pstatus - parent's cohabitation status (binary: 'T' - living together or 'A' - apart)
    7 Medu - mother's education (numeric: 0 - none, 1 - primary education (4th grade), 2 – 5th to 9th grade, 3 – secondary education or 4 – higher education)
    8 Fedu - father's education (numeric: 0 - none, 1 - primary education (4th grade), 2 – 5th to 9th grade, 3 – secondary education or 4 – higher education)
    9 Mjob - mother's job (nominal: 'teacher', 'health' care related, civil 'services' (e.g. administrative or police), 'at_home' or 'other')
    10 Fjob - father's job (nominal: 'teacher', 'health' care related, civil 'services' (e.g. administrative or police), 'at_home' or 'other')
    11 reason - reason to choose this school (nominal: close to 'home', school 'reputation', 'course' preference or 'other')
    12 guardian - student's guardian (nominal: 'mother', 'father' or 'other')
    13 traveltime - home to school travel time (numeric: 1 - <15 min., 2 - 15 to 30 min., 3 - 30 min. to 1 hour, or 4 - >1 hour)
    14 studytime - weekly study time (numeric: 1 - <2 hours, 2 - 2 to 5 hours, 3 - 5 to 10 hours, or 4 - >10 hours)
    15 failures - number of past class failures (numeric: n if 1<=n<3, else 4)
    16 schoolsup - extra educational support (binary: yes or no)
    17 famsup - family educational support (binary: yes or no)
    18 paid - extra paid classes within the course subject (Math or Portuguese) (binary: yes or no)
    19 activities - extra-curricular activities (binary: yes or no)
    20 nursery - attended nursery school (binary: yes or no)
    21 higher - wants to take higher education (binary: yes or no)
    22 internet - Internet access at home (binary: yes or no)
    23 romantic - with a romantic relationship (binary: yes or no)
    24 famrel - quality of family relationships (numeric: from 1 - very bad to 5 - excellent)
    25 freetime - free time after school (numeric: from 1 - very low to 5 - very high)
    26 goout - going out with friends (numeric: from 1 - very low to 5 - very high)
    27 Dalc - workday alcohol consumption (numeric: from 1 - very low to 5 - very high)
    28 Walc - weekend alcohol consumption (numeric: from 1 - very low to 5 - very high)
    29 health - current health status (numeric: from 1 - very bad to 5 - very good)
    30 absences - number of school absences (numeric: from 0 to 93)
    
    # these grades are related with the course subject, Math or Portuguese:
    31 G1 - first period grade (numeric: from 0 to 20)
    31 G2 - second period grade (numeric: from 0 to 20)
    32 G3 - final grade (numeric: from 0 to 20, output target)
    

    Acknowledgements

    If you use this dataset in your research, please credit the authors

    Citations

    P. Cortez and A. Silva. Using Data Mining to Predict Secondary School Student Performance. In A. Brito and J. Teixeira Eds., Proceedings of 5th FUture BUsiness TEChnology Conference (FUBUTEC 2008) pp. 5-12, Porto, Portugal, April, 2008, EUROSIS, ISBN 978-9077381-39-7.
    

    --- Original source retains full ownership of the source dataset ---

  18. Mental Health Dataset

    • kaggle.com
    Updated Mar 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhavik Jikadara (2024). Mental Health Dataset [Dataset]. https://www.kaggle.com/datasets/bhavikjikadara/mental-health-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 18, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bhavik Jikadara
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset appears to contain a variety of features related to text analysis, sentiment analysis, and psychological indicators, likely derived from posts or text data. Some features include readability indices such as Automated Readability Index (ARI), Coleman Liau Index, and Flesch-Kincaid Grade Level, as well as sentiment analysis scores like sentiment compound, negative, neutral, and positive scores. Additionally, there are features related to psychological aspects such as economic stress, isolation, substance use, and domestic stress. The dataset seems to cover a wide range of linguistic, psychological, and behavioural attributes, potentially suitable for analyzing mental health-related topics in online communities or text data.

    Benefits of using this dataset:

    • Insight into Mental Health: The dataset provides valuable insights into mental health by analyzing linguistic patterns, sentiment, and psychological indicators in text data. Researchers and data scientists can gain a better understanding of how mental health issues manifest in online communication.
    • Predictive Modeling: With a wide range of features, including sentiment analysis scores and psychological indicators, the dataset offers opportunities for developing predictive models to identify or predict mental health outcomes based on textual data. This can be useful for early intervention and support.
    • Community Engagement: Mental health is a topic of increasing importance, and this dataset can foster community engagement on platforms like Kaggle. Data enthusiasts, researchers, and mental health professionals can collaborate to analyze the data and develop solutions to address mental health challenges.
    • Data-driven Insights: By analyzing the dataset, users can uncover correlations and patterns between linguistic features, sentiment, and mental health indicators. These insights can inform interventions, policies, and support systems aimed at promoting mental well-being.
    • Educational Resource: The dataset can serve as a valuable educational resource for teaching and learning about mental health analytics, sentiment analysis, and text mining techniques. It provides a real-world dataset for students and practitioners to apply data science skills in a meaningful context.
  19. Data from: THE RELEVANCY OF MASSIVE HEALTH EDUCATION IN THE BRAZILIAN PRISON...

    • zenodo.org
    csv, pdf
    Updated Jul 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Janaína L. R. da S. Valentim; Janaína L. R. da S. Valentim; Sara Dias-Trindade; Sara Dias-Trindade; Eloiza da S. G. Oliveira; Eloiza da S. G. Oliveira; José A. M. Moreira; José A. M. Moreira; Felipe Fernandes; Felipe Fernandes; Manoel Honorio Romão; Manoel Honorio Romão; Philippi S. G. de Morais; Philippi S. G. de Morais; Alexandre R. Caitano; Alexandre R. Caitano; Aline P. Dias; Aline P. Dias; Carlos A. P. Oliveira; Carlos A. P. Oliveira; Karilany D. Coutinho; Karilany D. Coutinho; Ricardo B. Ceccim; Ricardo B. Ceccim; Ricardo A. de M. Valentim; Ricardo A. de M. Valentim (2024). THE RELEVANCY OF MASSIVE HEALTH EDUCATION IN THE BRAZILIAN PRISON SYSTEM: THE COURSE "HEALTH CARE FOR PEOPLE DEPRIVED OF FREEDOM" AND ITS IMPACTS [Dataset]. http://doi.org/10.5281/zenodo.6499752
    Explore at:
    csv, pdfAvailable download formats
    Dataset updated
    Jul 16, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Janaína L. R. da S. Valentim; Janaína L. R. da S. Valentim; Sara Dias-Trindade; Sara Dias-Trindade; Eloiza da S. G. Oliveira; Eloiza da S. G. Oliveira; José A. M. Moreira; José A. M. Moreira; Felipe Fernandes; Felipe Fernandes; Manoel Honorio Romão; Manoel Honorio Romão; Philippi S. G. de Morais; Philippi S. G. de Morais; Alexandre R. Caitano; Alexandre R. Caitano; Aline P. Dias; Aline P. Dias; Carlos A. P. Oliveira; Carlos A. P. Oliveira; Karilany D. Coutinho; Karilany D. Coutinho; Ricardo B. Ceccim; Ricardo B. Ceccim; Ricardo A. de M. Valentim; Ricardo A. de M. Valentim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset name: asppl_dataset_v2.csv

    Version: 2.0

    Dataset period: 06/07/2018 - 01/14/2022

    Dataset Characteristics: Multivalued

    Number of Instances: 8118

    Number of Attributes: 9

    Missing Values: Yes

    Area(s): Health and education

    Sources:

    • Virtual Learning Environment of the Brazilian Health System (AVASUS) (Brasil, 2022a);

    • Brazilian Occupational Classification (CBO) (Brasil, 2022b);

    • National Registry of Health Establishments (CNES) (Brasil, 2022c);

    • Brazilian Institute of Geography and Statistics (IBGE) (Brasil, 2022e).

    Description: The data contained in the asppl_dataset_v2.csv dataset (see Table 1) originates from participants of the technology-based educational course “Health Care for People Deprived of Freedom.” The course is available on the AVASUS (Brasil, 2022a). This dataset provides elementary data for analyzing the course’s impact and reach and the profile of its participants. In addition, it brings an update of the data presented in work by Valentim et al. (2021).

    Table 1: Description of AVASUS dataset features.

    Attributes

    Description

    datatype

    Value

    gender

    Gender of the course participant.

    Categorical.

    Feminino / Masculino / Não Informado. (In English, Female, Male or Uninformed)

    course_progress

    Percentage of completion of the course.

    Numerical.

    Range from 0 to 100.

    course_evaluation

    A score given to the course by the participant.

    Numerical.

    0, 1, 2, 3, 4, 5 or NaN.

    evaluation_commentary

    Comment made by the participant about the course.

    Categorical.

    Free text or NaN.

    region

    Brazilian region in which the participant resides.

    Categorical.

    Brazilian region according to IBGE: Norte, Nordeste, Centro-Oeste, Sudeste or Sul (In English North, Northeast, Midwest, Southeast or South).

    CNES

    The CNES code refers to the health establishment where the participant works.

    Numerical.

    CNES Code or NaN.

    health_care_level

    Identification of the health care network level for which the course participant works.

    Categorical.

    “ATENCAO PRIMARIA”,

    “MEDIA COMPLEXIDADE”,

    “ALTA COMPLEXIDADE”,

    and their possible combinations.

    (In English "PRIMARY HEALTH CARE", "SECONDARY HEALTH CARE" AND "TERTIARY HEALTH CARE")

    year_enrollment

    Year in which the course participant registered.

    Numerical.

    Year (YYYY).

    CBO

    Participant occupation.

    Categorical.

    Text coded according to the Brazilian Classification of Occupations or “Indivíduo sem afiliação formal.” (In English “Individual without formal affiliation.”)

    Dataset name: prison_syphilis_and_population_brazil.csv

    Dataset period: 2017 - 2020

    Dataset Characteristics: Multivalued

    Number of Instances: 6

    Number of Attributes: 13

    Missing Values: No

    Source:

    • National Penitentiary Department (DEPEN) (Brasil, 2022d);

    Description: The data contained in the prison_syphilis_and_population_brazil.csv dataset (see Table 2) originate from the National Penitentiary Department Information System (SISDEPEN) (Brasil, 2022d). This dataset provides data on the population and prevalence of syphilis in the Brazilian prison system. In addition, it brings a rate that represents the normalized data for purposes of comparison between the populations of each region and Brazil.

    Table 2: Description of DEPEN dataset Features.

    Attributes

    Description

    datatype

    Value

    Region

    Brazilian region in which the participant resides. In addition, the sum of the regions, which refers to Brazil.

    Categorical.

    Brazil and Brazilian region according to IBGE: North, Northeast, Midwest, Southeast or South.

    syphilis_2017

    Number of syphilis cases in the prison system in 2017.

    Numerical.

    Number of syphilis cases.

    syphilis_rate_2017

    Normalized rate of syphilis cases in 2017.

    Numerical.

    Syphilis case rate.

    syphilis_2018

    Number of syphilis cases in the prison system in 2018.

    Numerical.

    Number of syphilis cases.

    syphilis_rate_2018

    Normalized rate of syphilis cases in 2018.

    Numerical.

    Syphilis case rate.

    syphilis_2019

    Number of syphilis cases in the prison system in 2019.

    Numerical.

    Number of syphilis cases.

    syphilis_rate_2019

    Normalized rate of syphilis cases in 2019.

    Numerical.

    Syphilis case rate.

    syphilis_2020

    Number of syphilis cases in the prison system in 2020.

    Numerical.

    Number of syphilis cases.

    syphilis_rate_2020

    Normalized rate of syphilis cases in 2020.

    Numerical.

    Syphilis case rate.

    pop_2017

    Prison population in 2017.

    Numerical.

    Population number.

    pop_2018

    Prison population in 2018.

    Numerical.

    Population number.

    pop_2019

    Prison population in 2019.

    Numerical.

    Population number.

    pop_2020

    Prison population in 2020.

    Numerical.

    Population number.

    Dataset name: students_cumulative_sum.csv

    Dataset period: 2018 - 2020

    Dataset Characteristics: Multivalued

    Number of Instances: 6

    Number of Attributes: 7

    Missing Values: No

    Source:

    • Virtual Learning Environment of the Brazilian Health System (AVASUS) (Brasil, 2022a);

    • Brazilian Institute of Geography and Statistics (IBGE) (Brasil, 2022e).

    Description: The data contained in the students_cumulative_sum.csv dataset (see Table 3) originate mainly from AVASUS (Brasil, 2022a). This dataset provides data on the number of students by region and year. In addition, it brings a rate that represents the normalized data for purposes of comparison between the populations of each region and Brazil. We used population data estimated by the IBGE (Brasil, 2022e) to calculate the rate.

    Table 3: Description of Students dataset Features.

  20. c

    4-year Cohort High School Graduation Rate - Datasets - CTData.org

    • data.ctdata.org
    Updated Mar 16, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). 4-year Cohort High School Graduation Rate - Datasets - CTData.org [Dataset]. http://data.ctdata.org/dataset/4-year-cohort-high-school-graduation-rate
    Explore at:
    Dataset updated
    Mar 16, 2016
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The variable examined is graduation status after four years of high school. Early and summer graduates are considered graduates after four years. The "other" rate includes students who dropped out of high school, enrolled in a GED program, transferred to post-secondary education, or have unknown status. Special education students in school after four years but subsequently graduated are not included in the "still enrolled" rate due to Individuals with Disabilities Education Act (IDEA) restrictions. The subgroups reported are gender, race/ethnicity, English language learners, special education students, and students eligible for free or reduced-price meals (FRPM). The data replace the rate of students enrolled in 12th grade in September who graduated the following June. Connecticut State Department of Education (SDE) collects data longitudinally by four-year cohorts. SDE reports and CTdata.org carries graduation rates of four-year cohorts annually.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
data.cityofnewyork.us (2024). 2019 Public Data File - Students [Dataset]. https://catalog.data.gov/dataset/2019-public-data-file-students

2019 Public Data File - Students

Explore at:
Dataset updated
Nov 29, 2024
Dataset provided by
data.cityofnewyork.us
Description

To collect feedback on their learning environment from families, students and teachers. Aids in facilitating the understanding of families perceptions, students, and teachers regarding their school. School leaders use feedback from the survey to reflect and make improvements to schools and programs. Each year all parents, teachers and students in grades 6-12 take the NYC School Survey. The survey is aligned to the DOE's Framework for Great Schools. It is designed to collect important information about each school's ability to support student success.

Search
Clear search
Close search
Google apps
Main menu