100+ datasets found
  1. h

    MATH-500

    • huggingface.co
    Updated Feb 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ricardo (2025). MATH-500 [Dataset]. https://huggingface.co/datasets/ricdomolm/MATH-500
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 7, 2025
    Authors
    Ricardo
    Description

    MATH-500 test set with the remaining 12000 examples in train. import datasets

    https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

    from math_utils import last_boxed_only_string, remove_boxed

    math = datasets.load_dataset('DigitalLearningGmbH/MATH-lighteval', 'default') math500 = datasets.load_dataset('HuggingFaceH4/MATH-500')

    convert math to math500 format

    def map_to_500(example): return { 'problem':โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/ricdomolm/MATH-500.

  2. a

    Math 500 by Model

    • artificialanalysis.ai
    Updated Jun 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artificial Analysis (2025). Math 500 by Model [Dataset]. https://artificialanalysis.ai/evaluations/math-500
    Explore at:
    Dataset updated
    Jun 28, 2025
    Dataset authored and provided by
    Artificial Analysis
    Description

    Comparison of Independently conducted by Artificial Analysis by Model

  3. h

    MATH-500

    • huggingface.co
    Updated Aug 31, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gleb Gerasimov (2025). MATH-500 [Dataset]. https://huggingface.co/datasets/gudleifrr/MATH-500
    Explore at:
    Dataset updated
    Aug 31, 2025
    Authors
    Gleb Gerasimov
    Description

    gudleifrr/MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    MATH-500-multilingual

    • huggingface.co
    Updated Sep 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdullah Bezir (2025). MATH-500-multilingual [Dataset]. https://huggingface.co/datasets/bezir/MATH-500-multilingual
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 11, 2025
    Authors
    Abdullah Bezir
    Description

    MATH-500 Multilingual Problem Set ๐ŸŒโž—

    A multilingual subset from OpenAI's MATH benchmark. Perfect for testing math skills across languages, this dataset includes same problems in English, French, Italian, Turkish and Spanish.

      ๐ŸŒ Available Languages
    

    English ๐Ÿ‡ฌ๐Ÿ‡ง
    French ๐Ÿ‡ซ๐Ÿ‡ท
    Italian ๐Ÿ‡ฎ๐Ÿ‡น
    Turkish ๐Ÿ‡น๐Ÿ‡ท Spanish ๐Ÿ‡ช๐Ÿ‡ธ

      ๐Ÿ“‚ Source & Attribution
    

    Original Dataset: Sourced from HuggingFaceH4/MATH-500.

      ๐Ÿš€ Quick Start
    

    Load the datasetโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/bezir/MATH-500-multilingual.

  5. h

    ko-math-500

    • huggingface.co
    Updated Jul 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    davidkim205 (2025). ko-math-500 [Dataset]. https://huggingface.co/datasets/davidkim205/ko-math-500
    Explore at:
    Dataset updated
    Jul 22, 2025
    Authors
    davidkim205
    Description

    ko-math-500

    ko-math-500 is a Korean-translated subset of 500 representative problems from the widely used MATH (Mathematics Aptitude Test of Heuristics) dataset, designed to evaluate the mathematical reasoning abilities of large language models. The ko-math-500 subset is based on the standard evaluation set of 500 problems used in the 2023 paper Letโ€™s Verify Step by Step for model performance comparison. The original dataset is publicly available at HuggingFaceH4/MATH-500. Theโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/davidkim205/ko-math-500.

  6. h

    MATH-500-translated

    • huggingface.co
    Updated Feb 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Appier AI Research Team (2025). MATH-500-translated [Dataset]. https://huggingface.co/datasets/appier-ai-research/MATH-500-translated
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 21, 2025
    Dataset authored and provided by
    Appier AI Research Team
    Description

    appier-ai-research/MATH-500-translated dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    MATH-500-uppercase

    • huggingface.co
    Updated Jun 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jacob Morrison (2025). MATH-500-uppercase [Dataset]. https://huggingface.co/datasets/jacobmorrison/MATH-500-uppercase
    Explore at:
    Dataset updated
    Jun 14, 2025
    Authors
    Jacob Morrison
    Description

    jacobmorrison/MATH-500-uppercase dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. a

    Math Index by Model

    • artificialanalysis.ai
    Updated May 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artificial Analysis (2024). Math Index by Model [Dataset]. https://artificialanalysis.ai/models/comparisons/mistral-small-3-2-vs-gpt-4o-2024-05-13
    Explore at:
    Dataset updated
    May 4, 2024
    Dataset authored and provided by
    Artificial Analysis
    Description

    Comparison of Represents the average of math benchmarks in the Artificial Analysis Intelligence Index (AIME 2024 & Math-500) by Model

  9. Major AI models, by math and computational reasoning

    • statista.com
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Major AI models, by math and computational reasoning [Dataset]. https://www.statista.com/statistics/1600812/ai-math-benchmarking-ranking/
    Explore at:
    Dataset updated
    Mar 14, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2025
    Area covered
    Worldwide
    Description

    In 2024, the artificial analysis math index ranked AI models based on their mathematical reasoning using benchmarks like AIME 2024 and Math-500. o1, QwQ-32B, and DeepSeek R1, led the rankings, showing the highest proficiency in mathematical problem solving.

  10. p

    Trends in Math Proficiency (2011-2023): Princeton HSD 500 School District...

    • publicschoolreview.com
    Updated Sep 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review (2025). Trends in Math Proficiency (2011-2023): Princeton HSD 500 School District vs. Illinois [Dataset]. https://www.publicschoolreview.com/illinois/princeton-hsd-500-school-district/1732700-school-district
    Explore at:
    Dataset updated
    Sep 9, 2025
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset tracks annual math proficiency from 2011 to 2023 for Princeton HSD 500 School District vs. Illinois

  11. h

    MATH-500

    • huggingface.co
    Updated Jul 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jacob Morrison (2025). MATH-500 [Dataset]. https://huggingface.co/datasets/jacobmorrison/MATH-500
    Explore at:
    Dataset updated
    Jul 1, 2025
    Authors
    Jacob Morrison
    Description

    jacobmorrison/MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. p

    Trends in Math Proficiency (2010-2011): The 500 Role Model Academy vs....

    • publicschoolreview.com
    Updated Sep 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review (2025). Trends in Math Proficiency (2010-2011): The 500 Role Model Academy vs. Florida vs. Miami-Dade School District [Dataset]. https://www.publicschoolreview.com/the-500-role-model-academy-profile
    Explore at:
    Dataset updated
    Sep 5, 2025
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Dade County School District
    Description

    This dataset tracks annual math proficiency from 2010 to 2011 for The 500 Role Model Academy vs. Florida and Miami-Dade School District

  13. a

    Intelligence Index by Model

    • artificialanalysis.ai
    Updated May 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artificial Analysis (2024). Intelligence Index by Model [Dataset]. https://artificialanalysis.ai/models/comparisons/mistral-small-3-2-vs-gpt-4o-2024-05-13
    Explore at:
    Dataset updated
    May 4, 2024
    Dataset authored and provided by
    Artificial Analysis
    Description

    Comparison of Artificial Analysis Intelligence Index incorporates 7 evaluations: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME, MATH-500 by Model

  14. f

    The table shows summary statistics of significant daily discontinuous...

    • figshare.com
    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu-Min Yen (2023). The table shows summary statistics of significant daily discontinuous quadratic variation (sum of squared intradaily jumps) of SPC500 and DJIA on the common jump days by adopting separate and pool methods. [Dataset]. http://doi.org/10.1371/journal.pone.0058365.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Yu-Min Yen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The term common jump days used here only means that the two indices both have jumps on these days. The mean and standard deviation of are calculated conditional on The quantities of price variations shown are all scaled by 10000.

  15. h

    omni-MATH-500

    • huggingface.co
    Updated Sep 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matteo Santelmo (2025). omni-MATH-500 [Dataset]. https://huggingface.co/datasets/matsant01/omni-MATH-500
    Explore at:
    Dataset updated
    Sep 1, 2025
    Authors
    Matteo Santelmo
    Description

    matsant01/omni-MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. p

    Trends in Math Proficiency (2012-2023): Cape Elizabeth Middle School vs....

    • publicschoolreview.com
    Updated Oct 15, 2011
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review (2011). Trends in Math Proficiency (2012-2023): Cape Elizabeth Middle School vs. Maine vs. Cape Elizabeth School District [Dataset]. https://www.publicschoolreview.com/cape-elizabeth-middle-school-profile
    Explore at:
    Dataset updated
    Oct 15, 2011
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Cape Elizabeth, Maine
    Description

    This dataset tracks annual math proficiency from 2012 to 2023 for Cape Elizabeth Middle School vs. Maine and Cape Elizabeth School District

  17. p

    Trends in Math Proficiency (2012-2023): Yarmouth Elementary School vs. Maine...

    • publicschoolreview.com
    Updated Sep 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review (2025). Trends in Math Proficiency (2012-2023): Yarmouth Elementary School vs. Maine vs. Yarmouth Schools School District [Dataset]. https://www.publicschoolreview.com/yarmouth-elementary-school-profile
    Explore at:
    Dataset updated
    Sep 5, 2025
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Maine, Yarmouth, Yarmouth Schools
    Description

    This dataset tracks annual math proficiency from 2012 to 2023 for Yarmouth Elementary School vs. Maine and Yarmouth Schools School District

  18. p

    Trends in Math Proficiency (2012-2023): Frank H Harrison Middle School vs....

    • publicschoolreview.com
    Updated Sep 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review (2025). Trends in Math Proficiency (2012-2023): Frank H Harrison Middle School vs. Maine vs. Yarmouth Schools School District [Dataset]. https://www.publicschoolreview.com/frank-h-harrison-middle-school-profile
    Explore at:
    Dataset updated
    Sep 5, 2025
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Maine, Yarmouth, Yarmouth Schools
    Description

    This dataset tracks annual math proficiency from 2012 to 2023 for Frank H Harrison Middle School vs. Maine and Yarmouth Schools School District

  19. f

    The table shows summary statistics of significant daily discontinuous...

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu-Min Yen (2023). The table shows summary statistics of significant daily discontinuous quadratic variation (sum of squared intradaily jumps) of SPC500 and DJIA. [Dataset]. http://doi.org/10.1371/journal.pone.0058365.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Yu-Min Yen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    FDR is controlled at level and . The quantities of price variations shown are all scaled by 10000.

  20. h

    top-3-MATH-500-questions

    • huggingface.co
    Updated Aug 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Axel Zhang (2025). top-3-MATH-500-questions [Dataset]. https://huggingface.co/datasets/Youthquake123/top-3-MATH-500-questions
    Explore at:
    Dataset updated
    Aug 31, 2025
    Authors
    Axel Zhang
    Description

    Youthquake123/top-3-MATH-500-questions dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ricardo (2025). MATH-500 [Dataset]. https://huggingface.co/datasets/ricdomolm/MATH-500

MATH-500

ricdomolm/MATH-500

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 7, 2025
Authors
Ricardo
Description

MATH-500 test set with the remaining 12000 examples in train. import datasets

https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

from math_utils import last_boxed_only_string, remove_boxed

math = datasets.load_dataset('DigitalLearningGmbH/MATH-lighteval', 'default') math500 = datasets.load_dataset('HuggingFaceH4/MATH-500')

convert math to math500 format

def map_to_500(example): return { 'problem':โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/ricdomolm/MATH-500.

Search
Clear search
Close search
Google apps
Main menu