100+ datasets found
  1. Student Performance Data Set

    • kaggle.com
    zip
    Updated Mar 27, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data-Science Sean (2020). Student Performance Data Set [Dataset]. https://www.kaggle.com/datasets/larsen0966/student-performance-data-set
    Explore at:
    zip(12353 bytes)Available download formats
    Dataset updated
    Mar 27, 2020
    Authors
    Data-Science Sean
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    If this Data Set is useful, and upvote is appreciated. This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was collected by using school reports and questionnaires. Two datasets are provided regarding the performance in two distinct subjects: Mathematics (mat) and Portuguese language (por). In [Cortez and Silva, 2008], the two datasets were modeled under binary/five-level classification and regression tasks. Important note: the target attribute G3 has a strong correlation with attributes G2 and G1. This occurs because G3 is the final year grade (issued at the 3rd period), while G1 and G2 correspond to the 1st and 2nd-period grades. It is more difficult to predict G3 without G2 and G1, but such prediction is much more useful (see paper source for more details).

  2. Student Performance Dataset

    • kaggle.com
    zip
    Updated May 26, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dev Ansodariya (2022). Student Performance Dataset [Dataset]. https://www.kaggle.com/datasets/devansodariya/student-performance-data
    Explore at:
    zip(7330 bytes)Available download formats
    Dataset updated
    May 26, 2022
    Authors
    Dev Ansodariya
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Student Performance Data was obtained in a survey of students' math course in secondary school. It consists of 33 Column Dataset Contains Features like - school ID - gender - age - size of family - Father education - Mother education - Occupation of Father and Mother - Family Relation - Health - Grades

    This dataset can be used for Regression (as target variable Grade) as well as Analysis tasks. it might contain imbalanced category features.

    Please do Upvote if this Dataset really helpful to you.

  3. Student Performance Dataset: Academic Insights 10K

    • kaggle.com
    zip
    Updated Dec 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nadeem Majeed (2024). Student Performance Dataset: Academic Insights 10K [Dataset]. https://www.kaggle.com/datasets/nadeemajeedch/students-performance-10000-clean-data-eda
    Explore at:
    zip(129033 bytes)Available download formats
    Dataset updated
    Dec 1, 2024
    Authors
    Nadeem Majeed
    License

    https://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/

    Description

    The dataset includes: Roll Number: Represent the roll number of the student.

    Gender: Useful for analyzing performance differences between male and female students.

    Race/Ethnicity: Allows analysis of academic performance trends across different racial or ethnic groups.

    Parental Level of Education: Indicates the educational background of the student's family.

    Lunch: Shows whether students receive a free or reduced lunch, which is often a socioeconomic indicator.

    Test Preparation Course: This tells whether students completed a test prep course, which could impact their performance.

    Math Score: Provides a measure of each student’s performance in math, used to calculate averages or trends across various demographics. Science Score: Evaluates students' Science knowledge, which can be analyzed to assess overall scentific knowledge of the student.

    Reading Score: Measures performance in reading, allowing for insights into literacy and comprehension levels among students.

    Writing Score: Evaluates students' writing skills, which can be analyzed to assess overall literacy and expression.

    Total Score: Shows the total number achieved by the student out of 400.

    Grade: Gade achieved by the student. "A" grade if Total marks >= 320, "B" grade if Total marks >= 250, "C" grade if Total marks >= 200, "D" grade if Total marks >= 150 and Fail if <150.

  4. Student Performance and Learning Behavior Dataset for Educational Analytics

    • zenodo.org
    bin, csv
    Updated Aug 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kamal NAJEM; Kamal NAJEM (2025). Student Performance and Learning Behavior Dataset for Educational Analytics [Dataset]. http://doi.org/10.5281/zenodo.16459132
    Explore at:
    bin, csvAvailable download formats
    Dataset updated
    Aug 13, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Kamal NAJEM; Kamal NAJEM
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jul 26, 2025
    Description

    The dataset used in this study integrates quantitative data on student learning behaviors, engagement patterns, demographics, and academic performance. It was compiled by merging two publicly available Kaggle datasets, resulting in a combined file (“merged_dataset.csv”) containing 14,003 student records with 16 attributes. All records are anonymized and contain no personally identifiable information.

    The dataset covers the following categories of variables:

    • Study behaviors and engagement: StudyHours, Attendance, Extracurricular, AssignmentCompletion, OnlineCourses, Discussions
    • Resource access and learning environment: Resources, Internet, EduTech

    • Motivation and psychological factors: Motivation, StressLevel

    • Demographic information: Gender, Age (ranging from 18 to 30 years)

    • Learning preference classification: LearningStyle

    • Academic performance indicators: ExamScore, FinalGrade

    In this study, “ExamScore” and “FinalGrade” served as the primary performance indicators. The remaining variables were used to derive behavioral and contextual profiles, which were clustered using unsupervised machine learning techniques.

    The analysis and modeling were implemented in Python through a structured Jupyter Notebook (“Project.ipynb”), which included the following main steps:

    1. Environment Setup – Import of essential libraries (NumPy, pandas, Matplotlib, Seaborn, SciPy, StatsModels, scikit-learn, imbalanced-learn) and visualization configuration.

    2. Data Import and Integration – Loading the two source CSV files, harmonizing columns, removing irrelevant attributes, aligning formats, handling missing values, and merging them into a unified dataset (merged_dataset.csv).

    3. Data Preprocessing

      • Encoding categorical variables using LabelEncoder.

      • Scaling features using both z-score standardization (for statistical tests and PCA) and Min–Max normalization (for clustering).

      • Detecting and removing duplicates.

    4. Clustering Analysis

      • Applying K-Means clustering to segment learners into distinct profiles.

      • Determining the optimal number of clusters using the Elbow Method and Silhouette Score.

      • Evaluating cluster quality with internal metrics (Silhouette Score, Davies–Bouldin Index).

    5. Dimensionality Reduction & Visualization – Using PCA for 2D/3D cluster visualization and feature importance exploration.

    6. Mapping Clusters to Learning Styles – Associating each identified cluster with the most relevant learning style model based on feature patterns and alignment scores.

    7. Statistical Analysis – Conducting ANOVA and regression to test for significant differences in performance between clusters.

    8. Interpretation & Practical Recommendations – Analyzing cluster-specific characteristics and providing implications for adaptive and mobile learning integration.

  5. G

    Student Performance Tracking

    • gomask.ai
    csv, json
    Updated Nov 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GoMask.ai (2025). Student Performance Tracking [Dataset]. https://gomask.ai/marketplace/datasets/student-performance-tracking
    Explore at:
    csv(10 MB), jsonAvailable download formats
    Dataset updated
    Nov 27, 2025
    Dataset provided by
    GoMask.ai
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Variables measured
    notes, score, gender, course_id, last_name, school_id, score_max, first_name, student_id, teacher_id, and 13 more
    Description

    This dataset provides a detailed, student-level view of academic performance, attendance, and intervention history, enabling educators to monitor progress, identify at-risk students, and tailor support strategies. It includes assessment scores, course and teacher information, intervention records, and attendance tracking, making it ideal for data-driven educational improvement and early warning systems.

  6. Student Performance & Behavior Dataset

    • kaggle.com
    zip
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mahmoud Elhemaly (2025). Student Performance & Behavior Dataset [Dataset]. https://www.kaggle.com/datasets/mahmoudelhemaly/students-grading-dataset
    Explore at:
    zip(1020509 bytes)Available download formats
    Dataset updated
    May 28, 2025
    Authors
    Mahmoud Elhemaly
    Description

    Student Performance & Behavior Dataset

    This dataset is real data of 5,000 records collected from a private learning provider. The dataset includes key attributes necessary for exploring patterns, correlations, and insights related to academic performance.

    Columns: 01. Student_ID: Unique identifier for each student. 02. First_Name: Student’s first name. 03. Last_Name: Student’s last name. 04. Email: Contact email (can be anonymized). 05. Gender: Male, Female, Other. 06. Age: The age of the student. 07. Department: Student's department (e.g., CS, Engineering, Business). 08. Attendance (%): Attendance percentage (0-100%). 09. Midterm_Score: Midterm exam score (out of 100). 10. Final_Score: Final exam score (out of 100). 11. Assignments_Avg: Average score of all assignments (out of 100). 12. Quizzes_Avg: Average quiz scores (out of 100). 13. Participation_Score: Score based on class participation (0-10). 14. Projects_Score: Project evaluation score (out of 100). 15. Total_Score: Weighted sum of all grades. 16. Grade: Letter grade (A, B, C, D, F). 17. Study_Hours_per_Week: Average study hours per week. 18. Extracurricular_Activities: Whether the student participates in extracurriculars (Yes/No). 19. Internet_Access_at_Home: Does the student have access to the internet at home? (Yes/No). 20. Parent_Education_Level: Highest education level of parents (None, High School, Bachelor's, Master's, PhD). 21. Family_Income_Level: Low, Medium, High. 22. Stress_Level (1-10): Self-reported stress level (1: Low, 10: High). 23. Sleep_Hours_per_Night: Average hours of sleep per night.

    The Attendance is not part of the Total_Score or has very minimal weight.

    Calculating the weighted sum: Total Score=a⋅Midterm+b⋅Final+c⋅Assignments+d⋅Quizzes+e⋅Participation+f⋅Projects

    ComponentWeight (%)
    Midterm15%
    Final25%
    Assignments Avg15%
    Quizzes Avg10%
    Participation5%
    Projects Score30%
    Total100%

    Dataset contains: - Missing values (nulls): in some records (e.g., Attendance, Assignments, or Parent Education Level). - Bias in some Datae (ex: grading e.g., students with high attendance get slightly better grades). - Imbalanced distributions (e.g., some departments having more students).

    Note: - The dataset is real, but I included some bias to create a greater challenge for my students. - Some Columns have been masked as the Data owner requested. "Students_Grading_Dataset_Biased.csv" contains the biased Dataset "Students Performance Dataset" Contains the masked dataset

  7. Student Performance Prediction

    • kaggle.com
    zip
    Updated Mar 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amr Maree (2025). Student Performance Prediction [Dataset]. https://www.kaggle.com/datasets/amrmaree/student-performance-prediction
    Explore at:
    zip(10981 bytes)Available download formats
    Dataset updated
    Mar 3, 2025
    Authors
    Amr Maree
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Student Performance Prediction Dataset 🎓📊

    Overview

    This dataset contains information about students' academic performance, study habits, and external factors affecting their final exam scores. It is designed for predictive modeling, data visualization, and educational analytics.

    Dataset Purpose

    This dataset is useful for:
    - Predicting student final exam scores 📈
    - Identifying key factors that impact academic performance 🎯
    - Exploring feature importance in education-related datasets 📊
    - Building machine learning models for regression and classification 🤖

    Columns Description

    Column NameDescription
    Student_IDUnique identifier for each student.
    GenderGender of the student (Male/Female).
    Study_Hours_per_WeekAverage number of study hours per week.
    Attendance_RateAttendance percentage (50% - 100%).
    Past_Exam_ScoresAverage score of previous exams (50 - 100).
    Parental_Education_LevelEducation level of parents (High School, Bachelors, Masters, PhD).
    Internet_Access_at_HomeWhether the student has internet access at home (Yes/No).
    Extracurricular_ActivitiesWhether the student participates in extracurricular activities (Yes/No).
    Final_Exam_Score (Target)The final exam score of the student (50 - 100, integer values).
    Pass_Fail (Target)The student status (Pass/Fail).

    Ideas for Notebooks 📑

    1. Regression Analysis – Predict final exam scores using machine learning models (Linear Regression, Random Forest, XGBoost).
    2. Feature Importance – Analyze which factors contribute the most to student performance.
    3. Exploratory Data Analysis (EDA) – Visualize the impact of study hours, attendance, and other features.
    4. Classification – Convert scores into categories (Pass/Fail, A/B/C/D) and build classification models.

    License & Usage

    This dataset is open for public use. Feel free to use it for learning, research, and model-building! 🚀

  8. c

    Student Performance Dataset

    • cubig.ai
    zip
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Student Performance Dataset [Dataset]. https://cubig.ai/store/products/358/student-performance-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 28, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • The Student Performance Dataset is a survey of secondary school mathematics students and is a dataset containing a variety of information in a table format, including student demographics, family environment, parents' education and occupation, health, family relationships, and grades.

    2) Data Utilization (1) Student Performance Dataset has characteristics that: • Each row contains a total of 33 different characteristics, including school ID, gender, age, family size, parents' educational level and occupation, family relationship, health status, and grades. • It is suitable for a variety of data analysis and prediction exercises, including regression analysis and categorical variable imbalance analysis, including the target variable Grade. (2) Student Performance Dataset can be used to: • Analyzing academic achievement prediction and influencing factors: It can be used to analyze the impact of various factors such as student's background, family environment, and parental characteristics on grades and to develop a grade prediction model. • Establishing educational policies and customized support strategies: Based on student-specific characteristics and grade data, it can be applied to establishing educational policies such as closing educational gaps, supporting vulnerable student groups, and providing customized learning guidance.

  9. Student performance dataset feature description.

    • plos.figshare.com
    xls
    Updated Jun 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jianwei Dong; Ruishuang Sun; Zhipeng Yan; Meilun Shi; Xinyu Bi (2025). Student performance dataset feature description. [Dataset]. http://doi.org/10.1371/journal.pone.0325713.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 18, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jianwei Dong; Ruishuang Sun; Zhipeng Yan; Meilun Shi; Xinyu Bi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Academic achievement is an important index to measure the quality of education and students’ learning outcomes. Reasonable and accurate prediction of academic achievement can help improve teachers’ educational methods. And it also provides corresponding data support for the formulation of education policies. However, traditional methods for classifying academic performance have many problems, such as low accuracy, limited ability to handle nonlinear relationships, and poor handling of data sparsity. Based on this, our study analyzes various characteristics of students, including personal information, academic performance, attendance rate, family background, extracurricular activities and etc. Our work offers a comprehensive view to understand the various factors affecting students’ academic performance. In order to improve the accuracy and robustness of student performance classification, we adopted Gaussian Distribution based Data Augmentation technique (GDO), combined with multiple Deep Learning (DL) and Machine Learning (ML) models. We explored the application of different Machine Learning and Deep Learning models in classifying student grades. And different feature combinations and data augmentation techniques were used to evaluate the performance of multiple models in classification tasks. In addition, we also checked the synthetic data’s effectiveness with variance homogeneity and P-values, and studied how the oversampling rate affects actual classification results. Research has shown that the RBFN model based on educational habit features performs the best after using GDO data augmentation. The accuracy rate is 94.12%, and the F1 score is 94.46%. These results provide valuable references for the classification of student grades and the development of intervention strategies. New methods and perspectives in the field of educational data analysis are proposed in our study. At the same time, it has also promoted innovation and development in the intelligence of the education system.

  10. c

    Student Performance (Multiple Linear Regression) Dataset

    • cubig.ai
    zip
    Updated May 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Student Performance (Multiple Linear Regression) Dataset [Dataset]. https://cubig.ai/store/products/392/student-performance-multiple-linear-regression-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 29, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Student Performance (Multiple Linear Regression) Dataset is designed to analyze the relationship between students’ learning habits and academic performance. Each sample includes key indicators related to learning, such as study hours, sleep duration, previous test scores, and the number of practice exams completed.

    2) Data Utilization (1) Characteristics of the Student Performance (Multiple Linear Regression) Dataset: • The target variable, Hours Studied, quantitatively represents the amount of time a student has invested in studying. The dataset is structured to allow modeling and inference of learning behaviors based on correlations with other variables.

    (2) Applications of the Student Performance (Multiple Linear Regression) Dataset: • AI-Based Study Time Prediction Models: The dataset can be used to develop regression models that estimate a student’s expected study time based on inputs like academic performance, sleep habits, and engagement patterns. • Behavioral Analysis and Personalized Learning Strategies: It can be applied to identify students with insufficient study time and design personalized study interventions based on academic and lifestyle patterns.

  11. Open University Learning Analytics Dataset

    • kaggle.com
    zip
    Updated Dec 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Open University Learning Analytics Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/open-university-learning-analytics-dataset
    Explore at:
    zip(44203263 bytes)Available download formats
    Dataset updated
    Dec 21, 2023
    Authors
    The Devastator
    Description

    Open University Learning Analytics Dataset

    Student Performance and Engagement Data at The Open University

    By UCI [source]

    About this dataset

    This dataset provides an intimate look into student performance and engagement. It grants researchers access to numerous salient metrics of academic performance which illuminate a broad spectrum of student behaviors: how students interact with online learning material; quantitative indicators reflecting their academic outcomes; as well as demographic data such as age group, gender, prior education level among others.

    The main objective of this dataset is to enable analysts and educators alike with empirical insights underpinning individualized learning experiences - specifically in identifying cases when students may be 'at risk'. Given that preventive early interventions have been shown to significantly mitigate chances of course or program withdrawal among struggling students - having accurate predictive measures such as this can greatly steer pedagogical strategies towards being more success oriented.

    One unique feature about this dataset is its intricate detailing. Not only does it provide overarching summaries on a per-student basis for each presented courses but it also furnishes data related to assessments (scores & submission dates) along with information on individuals' interactions within VLEs (virtual learning environments) - spanning different types like forums, content pages etc... Such comprehensive collation across multiple contextual layers helps paint an encompassing portrayal of student experience that can guide better instructional design.

    Due credit must be given when utilizing this database for research purposes through citation. Specifically referencing (Kuzilek et al., 2015) OU Analyse: Analysing At-Risk Students at The Open University published in Learning Analytics Review is required due to its seminal work related groundings regarding analysis methodologies stem from there.

    Immaterial aspects aside - it is important to note that protection of student privacy is paramount within this dataset's terms and conditions. Stringent anonymization techniques have been implemented across sensitive variables - while detailed, profiles can't be traced back to original respondents.

    How to use the dataset

    How To Use This Dataset:

    • Understanding Your Objectives: Ideal objectives for using this dataset could be to identify at-risk students before they drop out of a class or program, improving course design by analyzing how assignments contribute to final grades, or simply examining relationships between different variables and student performance.

    • Set up your Analytical Environment: Before starting any analysis make sure you have an analytical environment set up where you can load the CSV files included in this dataset. You can use Python notebooks (Jupyter), R Studio or Tableau based software in case you want visual representation as well.

    • Explore Data Individually: There are seven separate datasets available: Assessments; Courses; Student Assessment; Student Info; Vle (Virtual Learning Environment); Student Registeration and Student Vle. Load these CSVs separately into your environment and do an initial exploration of each one: find out what kind of data they contain (numerical/categorical), if they have missing values etc.

    • Merge Datasets As the core idea is to track a student’s journey through multiple courses over time, combining these datasets will provide insights from wider perspectives. One way could be merging them using common key columns such as 'code_module', 'code_presentation', & 'id_student'. But make sure that merge should depend on what question you're trying to answer.

    • Identify Key Metrics Your key metrics will depend on your objectives but might include: overall grade averages per course or assessment type/student/region/gender/age group etc., number of clicks in virtual learning environment, student registration status etc.

    • Run Your Analysis Now you can run queries to analyze the data relevant to your objectives. Try questions like: What factors most strongly predict whether a student will fail an assessment? or How does course difficulty or the number of allotments per week change students' scores?

    • Visualization: Visualizing your data can be crucial for understanding patterns and relationships between variables. Use graphs like bar plots, heatmaps, and histograms to represent different aspects of your analyses.

    • Actionable Insights: The final step is interpreting these results in ways that are meaningf...

  12. w

    Global Visual Analytics in Education Market Research Report: By Application...

    • wiseguyreports.com
    Updated Aug 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Global Visual Analytics in Education Market Research Report: By Application (Student Performance Analysis, Curriculum Development, Academic Research, Resource Allocation, Enrollment Management), By Deployment Model (On-Premises, Cloud-Based, Hybrid), By End User (K-12 Schools, Higher Education Institutions, Corporate Training, Educational Non-Profits), By Functionality (Data Visualization, Predictive Analytics, Reporting, Dashboards) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2035 [Dataset]. https://www.wiseguyreports.com/reports/visual-analytics-in-education-market
    Explore at:
    Dataset updated
    Aug 15, 2025
    License

    https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

    Time period covered
    Aug 25, 2025
    Area covered
    Global
    Description
    BASE YEAR2024
    HISTORICAL DATA2019 - 2023
    REGIONS COVEREDNorth America, Europe, APAC, South America, MEA
    REPORT COVERAGERevenue Forecast, Competitive Landscape, Growth Factors, and Trends
    MARKET SIZE 20242.29(USD Billion)
    MARKET SIZE 20252.49(USD Billion)
    MARKET SIZE 20355.8(USD Billion)
    SEGMENTS COVEREDApplication, Deployment Model, End User, Functionality, Regional
    COUNTRIES COVEREDUS, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA
    KEY MARKET DYNAMICSgrowing data-driven decision-making, increasing demand for personalized learning, enhancement of student engagement, rising need for real-time analytics, integration of AI technologies
    MARKET FORECAST UNITSUSD Billion
    KEY COMPANIES PROFILEDTableau Software, MicroStrategy, Microsoft, Alteryx, Sisense, Oracle, Domo, Google, SAP, SAS Institute, Zoho, Qlik, Looker, TIBCO Software, IBM
    MARKET FORECAST PERIOD2025 - 2035
    KEY MARKET OPPORTUNITIESIncreased adoption of online learning, Integration with AI technologies, Demand for personalized learning experiences, Growing need for data-driven decision making, Emergence of interactive educational tools
    COMPOUND ANNUAL GROWTH RATE (CAGR) 8.8% (2025 - 2035)
  13. Student Performance Analysis

    • kaggle.com
    zip
    Updated Sep 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tavershima Dzenda (2025). Student Performance Analysis [Dataset]. https://www.kaggle.com/datasets/tavershimadzenda/student-performance-analysis
    Explore at:
    zip(61680 bytes)Available download formats
    Dataset updated
    Sep 7, 2025
    Authors
    Tavershima Dzenda
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The Student Performance Dataset originates from the UCI Machine Learning Repository and contains detailed information on secondary school students’ academic performance in Portugal. It includes 395 records with 33 attributes covering demographics, family background, studytime, absences, and grades in three periods (G1, G2, G3).

    This dataset is widely used for educational research and predictive modeling. It allows analysts to explore the impact of factors such as study habits, parental education, family support, and absenteeism on students’ final outcomes.

    Key Features:

    Demographics: age, gender, school

    Family background: parental education, family support, guardian type

    Academic behavior: study time, absences, past grades

    Performance target: final grade (G3, ranging from 0–20)

  14. V

    Visual Analytics in Education Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Jul 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Visual Analytics in Education Report [Dataset]. https://www.archivemarketresearch.com/reports/visual-analytics-in-education-564100
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Jul 24, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Discover the booming Visual Analytics in Education market! Learn about its $2.5 billion (2025) size, 15% CAGR, key drivers, and top players like Oracle & Tableau. Explore market trends and projections to 2033 for informed strategic decisions in EdTech.

  15. G

    Edtech Student Course Engagement

    • gomask.ai
    csv, json
    Updated Nov 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GoMask.ai (2025). Edtech Student Course Engagement [Dataset]. https://gomask.ai/marketplace/datasets/edtech-student-course-engagement
    Explore at:
    json, csv(10 MB)Available download formats
    Dataset updated
    Nov 24, 2025
    Dataset provided by
    GoMask.ai
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Variables measured
    course_id, student_id, final_grade, engagement_id, total_modules, enrollment_date, discussion_posts, completion_status, modules_completed, average_quiz_score, and 4 more
    Description

    This dataset provides detailed engagement metrics for students enrolled in online courses, including time spent, module completion, quiz scores, discussion activity, and assignment submissions. It enables educators and edtech platforms to analyze learning behaviors, optimize course design, and deliver personalized recommendations to improve student outcomes.

  16. G

    Learning Assessment Score Trends

    • gomask.ai
    csv, json
    Updated Oct 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GoMask.ai (2025). Learning Assessment Score Trends [Dataset]. https://gomask.ai/marketplace/datasets/learning-assessment-score-trends
    Explore at:
    csv(10 MB), jsonAvailable download formats
    Dataset updated
    Oct 22, 2025
    Dataset provided by
    GoMask.ai
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Variables measured
    score, subject, comments, max_score, school_id, student_id, teacher_id, grade_level, school_name, student_name, and 6 more
    Description

    This dataset provides a comprehensive record of student assessment scores across subjects, grade levels, and curriculum standards, enabling educators to analyze performance trends and identify areas for curriculum improvement. It includes detailed information on students, teachers, schools, and assessment types, supporting longitudinal tracking and actionable insights for educational stakeholders.

  17. f

    Data Sheet 1_The application of machine learning in predicting student...

    • figshare.com
    docx
    Updated Sep 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asset Turkmenbayev; Elmira Abdykerimova; Shynggys Nurgozhayev; Guldana Karabassova; Dametken Baigozhanova (2025). Data Sheet 1_The application of machine learning in predicting student performance in university engineering programs: a rapid review.docx [Dataset]. http://doi.org/10.3389/feduc.2025.1562586.s002
    Explore at:
    docxAvailable download formats
    Dataset updated
    Sep 23, 2025
    Dataset provided by
    Frontiers
    Authors
    Asset Turkmenbayev; Elmira Abdykerimova; Shynggys Nurgozhayev; Guldana Karabassova; Dametken Baigozhanova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundIn recent years, the application of machine learning (ML) to predict student performance in engineering education has expanded significantly, yet questions remain about the consistency, reliability, and generalisability of these predictive models.ObjectiveThis rapid review aimed to systematically examine peer-reviewed studies published between January 1, 2019, and December 31, 2024, that applied machine learning (ML), artificial intelligence (AI), or deep learning (DL) methods to predict or improve academic outcomes in university engineering programs.MethodsWe searched IEEE Xplore, SpringerLink, and PubMed, identifying an initial pool of 2,933 records. After screening for eligibility based on pre-defined inclusion criteria, we selected 27 peer-reviewed studies for narrative synthesis and assessed their methodological quality using the PROBAST framework.ResultsAll 27 studies involved undergraduate engineering students and demonstrated the capability of diverse ML techniques to enhance various academic outcomes. Notably, one study found that a reinforcement learning-based intelligent tutoring system significantly improved learning efficiency in digital logic courses. Another study using AI-based real-time behavior analysis increased students’ exam scores by approximately 8.44 percentage points. An optimised support vector machine (SVM) model accurately predicted engineering students’ employability with 87.8% accuracy, outperforming traditional predictive approaches. Additionally, a longitudinally validated SVM model effectively identified at-risk students, achieving 83.9% accuracy on hold-out cohorts. Bayesian regression methods also improved early-term course grade prediction by 27% over baseline predictors. However, most studies relied on single-institution samples and lacked rigorous external validation, limiting the generalisability of their findings.ConclusionThe evidence confirms that ML methods—particularly reinforcement learning, deep learning, and optimised predictive algorithms—can substantially improve student performance and academic outcomes in engineering education. However, methodological shortcomings related to participant selection bias, sample sizes, validation practices, and transparency in reporting require further attention. Future research should prioritise multi-institutional studies, robust validation techniques, and enhanced methodological transparency to fully leverage ML’s potential in engineering education.

  18. S

    Psychological_Educational_Achievement

    • scidb.cn
    Updated Nov 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jin Song (2025). Psychological_Educational_Achievement [Dataset]. http://doi.org/10.57760/sciencedb.31201
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 7, 2025
    Dataset provided by
    Science Data Bank
    Authors
    Jin Song
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    his dataset contains 478 records and 35 features, focusing on the relationship between students' psychological traits, learning behaviors, and academic achievements. It includes basic information such as Gender, Age, Academic Discipline, Year of Study, learning-related metrics like Study Hours, Learning Resources, and psychological-academic indicators including Conscientiousness, Openness, Intrinsic/Extrinsic Motivation, CGPA, Course Grades, Professional Skills. It supports research on educational psychology, student development, and academic performance analysis.

  19. E

    Education and Learning Analytics Software and Services Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Oct 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Education and Learning Analytics Software and Services Report [Dataset]. https://www.datainsightsmarket.com/reports/education-and-learning-analytics-software-and-services-1941201
    Explore at:
    ppt, doc, pdfAvailable download formats
    Dataset updated
    Oct 17, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Explore the dynamic Education and Learning Analytics Software and Services market forecast (2025-2033). Discover key insights, market drivers, and growth trends shaping the future of education technology.

  20. G

    Online Learning Quiz Attempt Logs

    • gomask.ai
    csv, json
    Updated Nov 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GoMask.ai (2025). Online Learning Quiz Attempt Logs [Dataset]. https://gomask.ai/marketplace/datasets/online-learning-quiz-attempt-logs
    Explore at:
    json, csv(10 MB)Available download formats
    Dataset updated
    Nov 20, 2025
    Dataset provided by
    GoMask.ai
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Variables measured
    score, quiz_id, completed, max_score, attempt_id, student_id, device_type, feedback_text, location_city, adaptive_level, and 7 more
    Description

    This dataset contains detailed logs of student quiz attempts in online learning environments, including scores, time spent, hint usage, device type, and adaptive assessment levels. It enables comprehensive analysis of student performance, engagement patterns, and the effectiveness of adaptive learning strategies for edtech platforms.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Data-Science Sean (2020). Student Performance Data Set [Dataset]. https://www.kaggle.com/datasets/larsen0966/student-performance-data-set
Organization logo

Student Performance Data Set

Student achievement in secondary education of two Portuguese schools.

Explore at:
10 scholarly articles cite this dataset (View in Google Scholar)
zip(12353 bytes)Available download formats
Dataset updated
Mar 27, 2020
Authors
Data-Science Sean
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

If this Data Set is useful, and upvote is appreciated. This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was collected by using school reports and questionnaires. Two datasets are provided regarding the performance in two distinct subjects: Mathematics (mat) and Portuguese language (por). In [Cortez and Silva, 2008], the two datasets were modeled under binary/five-level classification and regression tasks. Important note: the target attribute G3 has a strong correlation with attributes G2 and G1. This occurs because G3 is the final year grade (issued at the 3rd period), while G1 and G2 correspond to the 1st and 2nd-period grades. It is more difficult to predict G3 without G2 and G1, but such prediction is much more useful (see paper source for more details).

Search
Clear search
Close search
Google apps
Main menu