100+ datasets found
  1. INTELLECT-MATH-SFT-Data

    • huggingface.co
    Updated Jan 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prime Intellect (2025). INTELLECT-MATH-SFT-Data [Dataset]. https://huggingface.co/datasets/PrimeIntellect/INTELLECT-MATH-SFT-Data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 22, 2025
    Dataset provided by
    Authors
    Prime Intellect
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    INTELLECT-MATH: Frontier Mathematical Reasoning through Better Initializations for Reinforcement Learning

    INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning. It was trained in two stages, an SFT stage, in which the model was fine-tuned on verified QwQ outputs, and an RL stage, in which the model was trained using the PRIME-RL recipe. We demonstrate that the quality of our SFT data can impact the performance and training speed of the RL stage: Due to its… See the full description on the dataset page: https://huggingface.co/datasets/PrimeIntellect/INTELLECT-MATH-SFT-Data.

  2. Math-Students Performance Data

    • kaggle.com
    Updated Apr 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adil Shamim (2025). Math-Students Performance Data [Dataset]. https://www.kaggle.com/datasets/adilshamim8/math-students
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 2, 2025
    Dataset provided by
    Kaggle
    Authors
    Adil Shamim
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    About the Math-Students Dataset

    This dataset, originally sourced from the UCI Machine Learning Repository, offers a rich collection of data on student performance in a math program. It provides detailed insights into both the academic achievements and the socio-demographic backgrounds of the students, making it an excellent resource for educational data mining and predictive analytics.

    Key Features & Attributes

    • Demographics & Background:

      • School: Identifies the student's school (e.g., Gabriel Pereira or Mousinho da Silveira).
      • Sex & Age: Basic demographic information to help explore performance trends among different groups.
      • Address & Family Size: Details about the student’s home environment, including whether they live in an urban or rural area and their family size.
    • Parental & Household Information:

      • Parental Cohabitation & Education: Data on whether parents live together and their education levels, which can correlate with student support and academic outcomes.
      • Parental Occupation: Information on the mother’s and father’s jobs, providing further context on socioeconomic factors.
    • Educational & Behavioral Variables:

      • Study Time & Failures: Weekly study time and history of past class failures help gauge academic dedication and potential challenges.
      • Support & Extracurricular Activities: Records on whether the student has received extra educational support or participates in extracurricular activities, which can influence overall performance.
      • School-Related Factors: Travel time to school, attendance (absences), and participation in additional paid classes contribute to a holistic view of the educational environment.
    • Lifestyle & Social Factors:

      • Internet Access, Free Time & Socializing: Variables like internet availability, free time, and how often students go out with friends help capture lifestyle and behavioral patterns.
      • Health & Well-being: Self-reported health status and alcohol consumption patterns during weekdays and weekends offer insights into personal well-being, which may impact academic performance.
    • Academic Performance:

      • Grades: The dataset includes three key assessments—G1 (first period grade), G2 (second period grade), and G3 (final grade). G3, the final grade, serves as the primary target variable for predictive models.

    Potential Applications

    • Predictive Modeling:
      Researchers and data scientists can build regression models to predict final grades (G3) based on the numerous socio-demographic and educational features.
    • Exploratory Data Analysis:
      The dataset is ideal for exploring relationships between family background, lifestyle choices, and academic success. For example, one could analyze how study time or parental education levels correlate with performance.
    • Educational Interventions:
      By identifying key factors that contribute to academic outcomes, educators and policymakers can develop targeted interventions to support at-risk students.
    • Comparative Studies:
      While this dataset focuses on math scores, its structure is similar to the Portuguese language course dataset. This similarity provides opportunities for cross-domain comparisons in educational research.

    Additional Insights

    • Data Complexity & Quality:
      Despite its moderate size, the dataset is rich in both categorical and numerical variables. This diversity requires careful preprocessing and feature engineering but also offers the chance to uncover complex interactions between various factors.
    • Research Impact:
      The dataset has been widely used in the field of educational data mining. Its comprehensive nature has provided a basis for numerous studies examining the interplay between academic performance and a range of external factors.
    • Historical Context:
      Originating from a study presented at the 5th FUBUTEC 2008 conference, the dataset has contributed valuable insights into secondary school performance and continues to serve as a benchmark for educational analytics research.
  3. h

    preference-data-math-stack-exchange

    • huggingface.co
    Updated Dec 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Praveen Hegde (2023). preference-data-math-stack-exchange [Dataset]. https://huggingface.co/datasets/prhegde/preference-data-math-stack-exchange
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 13, 2023
    Authors
    Praveen Hegde
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The preference dataset is derived from the stack exchange dataset which contains questions and answers from the Stack Overflow Data Dump. This contains questions and answers for various topics. For this work, we used only question and answers from math.stackexchange.com sub-folder. The questions are grouped with answers that are assigned a score corresponding to the Anthropic paper: score = log2 (1 + upvotes) rounded to the nearest integer, plus 1 if the answer was accepted by the questioner… See the full description on the dataset page: https://huggingface.co/datasets/prhegde/preference-data-math-stack-exchange.

  4. P

    MATH Dataset

    • paperswithcode.com
    • opendatalab.com
    • +1more
    Updated Mar 10, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dan Hendrycks; Collin Burns; Saurav Kadavath; Akul Arora; Steven Basart; Eric Tang; Dawn Song; Jacob Steinhardt (2021). MATH Dataset [Dataset]. https://paperswithcode.com/dataset/math
    Explore at:
    Dataset updated
    Mar 10, 2021
    Authors
    Dan Hendrycks; Collin Burns; Saurav Kadavath; Akul Arora; Steven Basart; Eric Tang; Dawn Song; Jacob Steinhardt
    Description

    MATH is a new dataset of 12,500 challenging competition mathematics problems. Each problem in MATH has a full step-by-step solution which can be used to teach models to generate answer derivations and explanations.

  5. w

    Data from: Math you can really use - every day

    • workwithdata.com
    Updated Jan 10, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2022). Math you can really use - every day [Dataset]. https://www.workwithdata.com/object/math-you-can-really-use-every-day-book-by-david-alan-herzog-0000
    Explore at:
    Dataset updated
    Jan 10, 2022
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore Math you can really use - every day through data • Key facts: author, publication date, book publisher, book series, book subjects • Real-time news, visualizations and datasets

  6. w

    Dataset of subjects of Essential math for data science : take control of...

    • workwithdata.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data, Dataset of subjects of Essential math for data science : take control of your data with fundamental linear algebra, probability, and statistics [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=j0-book&fop0=%3D&fval0=Essential+math+for+data+science+%3A+take+control+of+your+data+with+fundamental+linear+algebra%2C+probability%2C+and+statistics&j=1&j0=books
    Explore at:
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about book subjects, has 4 rows. and is filtered where the books is Essential math for data science : take control of your data with fundamental linear algebra, probability, and statistics. It features 10 columns including book subject, number of authors, number of books, earliest publication date, and latest publication date. The preview is ordered by number of books (descending).

  7. English and maths

    • gov.uk
    Updated Nov 28, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Education (2019). English and maths [Dataset]. https://www.gov.uk/government/statistical-data-sets/fe-data-library-skills-for-life
    Explore at:
    Dataset updated
    Nov 28, 2019
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Education
    Description

    English and maths (formerly Skills for Life) qualifications are designed to give people the reading, writing, maths and communication skills they need in everyday life, to operate effectively in work and to help them succeed on other training courses.

    These data provide information on participation and achievements for English and maths qualifications and are broken down into a number of key reports.

    Can’t find what you’re looking for?

    If you need help finding data please refer to the table finder tool to search for specific breakdowns available for FE statistics.

    Current data

    https://assets.publishing.service.gov.uk/media/5f0c5c923a6f4003935c2c6f/201819-Nov_EandM_Part_and_Achieve.xlsx">English and maths data tool for participation and achievements 2018/19

     <p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">MS Excel Spreadsheet</span>, <span class="gem-c-attachment_attribute">10.9 MB</span></p>
    
    
    
    
     <p class="gem-c-attachment_metadata">This file may not be suitable for users of assistive technology.</p>
     <details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">
    

    Request an accessible format.

      If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:alternative.formats@education.gov.uk" target="_blank" class="govuk-link">alternative.formats@education.gov.uk</a>. Please tell us what format you need. It will help us if you say what assistive technology you use.
    

    Archive

  8. d

    Math Test Results 2006-2012

    • catalog.data.gov
    • data.cityofnewyork.us
    Updated Nov 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofnewyork.us (2024). Math Test Results 2006-2012 [Dataset]. https://catalog.data.gov/dataset/math-test-results-2006-2012
    Explore at:
    Dataset updated
    Nov 29, 2024
    Dataset provided by
    data.cityofnewyork.us
    Description

    This report includes results for the New York State Math exams for the years 2006-2012. For the results for the New York State Math exams for the years 2013-2023, please follow this link.

  9. d

    Data from: Usable Math

    • search.dataone.org
    • dataverse.harvard.edu
    • +1more
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gattupalli, Sai Satish; Sharon Edwards; Robert Maloy (2023). Usable Math [Dataset]. http://doi.org/10.7910/DVN/SK4RDQ
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Gattupalli, Sai Satish; Sharon Edwards; Robert Maloy
    Description

    Usable Math is a free & open interactive website where you'll find learning modules designed to develop mathematical problem solving skills among young learners in grades 3 to 7. Visit UsableMath.org.

  10. h

    Skywork-OR1-RL-Data-Math

    • huggingface.co
    Updated Apr 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PE-NLP (2025). Skywork-OR1-RL-Data-Math [Dataset]. https://huggingface.co/datasets/pe-nlp/Skywork-OR1-RL-Data-Math
    Explore at:
    Dataset updated
    Apr 15, 2025
    Dataset authored and provided by
    PE-NLP
    Description

    pe-nlp/Skywork-OR1-RL-Data-Math dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. d

    ThirdGrade ELA Math Scores byTract 08032017

    • catalog.data.gov
    • detroitdata.org
    • +5more
    Updated Sep 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Driven Detroit (2024). ThirdGrade ELA Math Scores byTract 08032017 [Dataset]. https://catalog.data.gov/dataset/thirdgrade-ela-math-scores-bytract-08032017-eca07
    Explore at:
    Dataset updated
    Sep 21, 2024
    Dataset provided by
    Data Driven Detroit
    Description

    Third grade English Language Arts (ELA) and Math test results for the 2016-2017 school year by census tract for the state of Michigan. Data Driven Detroit obtained these datasets from MI School Data, for the State of the Detroit Child tool in July 2017. Test results were originally obtained on a school level and aggregated to census tract by Data Driven Detroit. Student data was suppressed when less than five students were tested per school.Click here for metadata (descriptions of the fields).

  12. w

    Data from: Math for meds : dosages and solutions

    • workwithdata.com
    Updated Jan 10, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2022). Math for meds : dosages and solutions [Dataset]. https://www.workwithdata.com/object/math-for-meds-dosages-and-solutions-book-by-margaret-witt-0000
    Explore at:
    Dataset updated
    Jan 10, 2022
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore Math for meds : dosages and solutions through data • Key facts: author, publication date, book publisher, book series, book subjects • Real-time news, visualizations and datasets

  13. w

    20 US Dollar to MATH Historical Data

    • weex.com
    Updated Apr 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WEEX (2025). 20 US Dollar to MATH Historical Data [Dataset]. https://www.weex.com/tokens/math/from-usd/20
    Explore at:
    Dataset updated
    Apr 29, 2025
    Dataset authored and provided by
    WEEX
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Historical price and volatility data for US Dollar in MATH across different time periods.

  14. G

    Guatemala PISA math scores - data, chart | TheGlobalEconomy.com

    • theglobaleconomy.com
    csv, excel, xml
    Updated Apr 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Globalen LLC (2024). Guatemala PISA math scores - data, chart | TheGlobalEconomy.com [Dataset]. www.theglobaleconomy.com/Guatemala/pisa_math_scores/
    Explore at:
    excel, xml, csvAvailable download formats
    Dataset updated
    Apr 28, 2024
    Dataset authored and provided by
    Globalen LLC
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 31, 2022
    Area covered
    Guatemala
    Description

    Guatemala: PISA math scores: The latest value from 2022 is 344.199 index points, unavailable from index points in . In comparison, the world average is 439.569 index points, based on data from 78 countries. Historically, the average for Guatemala from 2022 to 2022 is 344.199 index points. The minimum value, 344.199 index points, was reached in 2022 while the maximum of 344.199 index points was recorded in 2022.

  15. w

    Data from: A basic math approach to concepts of chemistry

    • workwithdata.com
    Updated Jan 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2022). A basic math approach to concepts of chemistry [Dataset]. https://www.workwithdata.com/object/a-basic-math-approach-to-concepts-of-chemistry-book-by-leo-michels-0000
    Explore at:
    Dataset updated
    Jan 11, 2022
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore A basic math approach to concepts of chemistry through data • Key facts: author, publication date, book publisher, book series, book subjects • Real-time news, visualizations and datasets

  16. d

    Mathematics Achievement: Year 9 Students - Dataset - data.govt.nz - discover...

    • catalogue.data.govt.nz
    Updated Aug 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Mathematics Achievement: Year 9 Students - Dataset - data.govt.nz - discover and use data [Dataset]. https://catalogue.data.govt.nz/dataset/mathematics-achievement-year-9-students
    Explore at:
    Dataset updated
    Aug 12, 2020
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Mathematics scores for Year 9 students.

  17. w

    Math-e-MATIC to US Dollar Historical Data

    • weex.com
    Updated Apr 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WEEX (2025). Math-e-MATIC to US Dollar Historical Data [Dataset]. https://www.weex.com/tokens/math-e-matic/to-usd
    Explore at:
    Dataset updated
    Apr 30, 2025
    Dataset authored and provided by
    WEEX
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    United States
    Description

    Historical price and volatility data for Math-e-MATIC in US Dollar across different time periods.

  18. w

    Data on Making maths meaningful

    • workwithdata.com
    Updated Apr 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2024). Data on Making maths meaningful [Dataset]. https://www.workwithdata.com/topic/making-maths-meaningful
    Explore at:
    Dataset updated
    Apr 15, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore Making maths meaningful through data from visualizations to datasets, all based on diverse sources.

  19. f

    Data from: A Systematic Review of Relevant Variables in the Production of...

    • figshare.com
    jpeg
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriele Gris; Livia dos Santos Palombarini; João dos Santos Carmo (2023). A Systematic Review of Relevant Variables in the Production of Errors in Mathematics [Dataset]. http://doi.org/10.6084/m9.figshare.9276161.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    SciELO journals
    Authors
    Gabriele Gris; Livia dos Santos Palombarini; João dos Santos Carmo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract Issues involving the identification, analysis, and interpretation of mistakes made by mathematics students are not recent, although much still can be investigated on this subject. The aim of the present study was to identify, from the existing literature, relevant variables in the production of errors in mathematics. A systematic review of the literature of the period between 2012 and 2017 was independently performed by two researchers to evaluate the concordance between them. We searched the ERIC, PsycArticles, SciELO and Math Educ Database databases with the descriptors error AND mathematics OR math, error AND procedure AND mathematics OR math, error pattern AND mathematics OR math, analysis of errors AND mathematics OR math, systemic error AND mathematics OR math and their correspondents in Portuguese and Spanish. A total of 415 articles were identified, of which 31 were analyzed, dealing with error production. The variables identified as responsible for producing the most common errors refer to the student's internal causes or unspecified difficulties and errors in the teaching procedures. Responsibility for error is usually attributed to the students and the main trend of the research is only to inform the production of errors, since only a few studies have indicated ways to avoid or deal with errors produced by students in a specific and descriptive way. We emphasize importance and necessity of investigating educational practices to prevent and deal with errors.

  20. w

    Data from: How to do maths

    • workwithdata.com
    Updated Feb 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). How to do maths [Dataset]. https://www.workwithdata.com/object/how-to-do-maths-book-by-elisabeth-heesom-0000
    Explore at:
    Dataset updated
    Feb 23, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore How to do maths through data • Key facts: author, publication date, book publisher, book series, book subjects • Real-time news, visualizations and datasets

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Prime Intellect (2025). INTELLECT-MATH-SFT-Data [Dataset]. https://huggingface.co/datasets/PrimeIntellect/INTELLECT-MATH-SFT-Data
Organization logo

INTELLECT-MATH-SFT-Data

PrimeIntellect/INTELLECT-MATH-SFT-Data

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 22, 2025
Dataset provided by
Authors
Prime Intellect
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

INTELLECT-MATH: Frontier Mathematical Reasoning through Better Initializations for Reinforcement Learning

INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning. It was trained in two stages, an SFT stage, in which the model was fine-tuned on verified QwQ outputs, and an RL stage, in which the model was trained using the PRIME-RL recipe. We demonstrate that the quality of our SFT data can impact the performance and training speed of the RL stage: Due to its… See the full description on the dataset page: https://huggingface.co/datasets/PrimeIntellect/INTELLECT-MATH-SFT-Data.

Search
Clear search
Close search
Google apps
Main menu