100+ datasets found

MATH-500
huggingface.co
Updated Feb 7, 2025
Cite
Ricardo (2025). MATH-500 [Dataset]. https://huggingface.co/datasets/ricdomolm/MATH-500
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 7, 2025
Authors
Ricardo
Description
MATH-500 test set with the remaining 12000 examples in train.

    import datasets

    # https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py
    from math_utils import last_boxed_only_string, remove_boxed

    math = datasets.load_dataset('DigitalLearningGmbH/MATH-lighteval', 'default')
    math500 = datasets.load_dataset('HuggingFaceH4/MATH-500')

    # convert math to math500 format
    def map_to_500(example):
        return { 'problem':…

See the full description on the dataset page: https://huggingface.co/datasets/ricdomolm/MATH-500.
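The conversion outlined in this description can be illustrated without the external `math_utils` module. What follows is a minimal sketch under stated assumptions, not the dataset author's code: the two helpers are simplified stand-ins for the verl functions of the same name (they handle only the `\boxed{...}` form), and the field names (`type` and a `"Level N"` string on the source side; `subject`, `answer`, and an integer `level` on the MATH-500 side) are assumptions about the two dataset schemas.

```python
def last_boxed_only_string(text):
    """Return the last '\\boxed{...}' substring of text, or None if absent.

    Simplified stand-in (assumption): only the '\\boxed{' form is handled.
    """
    start = text.rfind("\\boxed{")
    if start < 0:
        return None
    depth = 0
    for i in range(start, len(text)):
        if text[i] == "{":
            depth += 1
        elif text[i] == "}":
            depth -= 1
            if depth == 0:
                return text[start:i + 1]
    return None  # braces never balanced


def remove_boxed(boxed):
    """Strip the '\\boxed{' prefix and the final '}' from a boxed string."""
    prefix = "\\boxed{"
    if not (boxed.startswith(prefix) and boxed.endswith("}")):
        raise ValueError(f"not a boxed expression: {boxed!r}")
    return boxed[len(prefix):-1]


def map_to_500(example):
    """Map one source example to a MATH-500-style record.

    All field names here are assumptions about the two schemas.
    """
    boxed = last_boxed_only_string(example["solution"])
    level_text = example["level"].replace("Level ", "")
    return {
        "problem": example["problem"],
        "solution": example["solution"],
        # The final answer is taken from the last boxed expression.
        "answer": remove_boxed(boxed) if boxed is not None else None,
        "subject": example["type"],
        # Non-numeric levels are mapped to None rather than guessed.
        "level": int(level_text) if level_text.isdigit() else None,
    }


sample = {
    "problem": "What is $1+1$?",
    "solution": "We get $1+1=\\boxed{\\frac{4}{2}}$.",
    "level": "Level 1",
    "type": "Prealgebra",
}
converted = map_to_500(sample)
print(converted["answer"], converted["level"], converted["subject"])
```

With the `datasets` library, a function like this could be passed to `Dataset.map`, for example `math['train'].map(map_to_500)`, after checking the actual column names of both datasets.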
Math 500 by Model
artificialanalysis.ai
Updated Jun 28, 2025
Cite
Artificial Analysis (2025). Math 500 by Model [Dataset]. https://artificialanalysis.ai/evaluations/math-500
Explore at:
Dataset updated
Jun 28, 2025
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Math 500 results by model; evaluations were conducted independently by Artificial Analysis.
MATH-500
huggingface.co
Updated Aug 31, 2025
Cite
Gleb Gerasimov (2025). MATH-500 [Dataset]. https://huggingface.co/datasets/gudleifrr/MATH-500
Explore at:
Dataset updated
Aug 31, 2025
Authors
Gleb Gerasimov
Description
gudleifrr/MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community
MATH-500-multilingual
huggingface.co
Updated Sep 11, 2025
Cite
Abdullah Bezir (2025). MATH-500-multilingual [Dataset]. https://huggingface.co/datasets/bezir/MATH-500-multilingual
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 11, 2025
Authors
Abdullah Bezir
Description
MATH-500 Multilingual Problem Set 🌍➗

A multilingual subset from OpenAI's MATH benchmark. Perfect for testing math skills across languages, this dataset includes the same problems in English, French, Italian, Turkish, and Spanish.

🌐 Available Languages

English 🇬🇧
French 🇫🇷
Italian 🇮🇹
Turkish 🇹🇷
Spanish 🇪🇸

📂 Source & Attribution

Original Dataset: Sourced from HuggingFaceH4/MATH-500.

🚀 Quick Start

Load the dataset… See the full description on the dataset page: https://huggingface.co/datasets/bezir/MATH-500-multilingual.
ko-math-500
huggingface.co
Updated Jul 22, 2025
Cite
davidkim205 (2025). ko-math-500 [Dataset]. https://huggingface.co/datasets/davidkim205/ko-math-500
Explore at:
Dataset updated
Jul 22, 2025
Authors
davidkim205
Description
ko-math-500

ko-math-500 is a Korean-translated subset of 500 representative problems from the widely used MATH (Mathematics Aptitude Test of Heuristics) dataset, designed to evaluate the mathematical reasoning abilities of large language models. The ko-math-500 subset is based on the standard evaluation set of 500 problems used in the 2023 paper Let’s Verify Step by Step for model performance comparison. The original dataset is publicly available at HuggingFaceH4/MATH-500. The… See the full description on the dataset page: https://huggingface.co/datasets/davidkim205/ko-math-500.
MATH-500-translated
huggingface.co
Updated Feb 21, 2025
Cite
Appier AI Research Team (2025). MATH-500-translated [Dataset]. https://huggingface.co/datasets/appier-ai-research/MATH-500-translated
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 21, 2025
Dataset authored and provided by
Appier AI Research Team
Description
appier-ai-research/MATH-500-translated dataset hosted on Hugging Face and contributed by the HF Datasets community
MATH-500-uppercase
huggingface.co
Updated Jun 14, 2025
Cite
Jacob Morrison (2025). MATH-500-uppercase [Dataset]. https://huggingface.co/datasets/jacobmorrison/MATH-500-uppercase
Explore at:
Dataset updated
Jun 14, 2025
Authors
Jacob Morrison
Description
jacobmorrison/MATH-500-uppercase dataset hosted on Hugging Face and contributed by the HF Datasets community
Math Index by Model
artificialanalysis.ai
Updated May 4, 2024
Cite
Artificial Analysis (2024). Math Index by Model [Dataset]. https://artificialanalysis.ai/models/comparisons/mistral-small-3-2-vs-gpt-4o-2024-05-13
Explore at:
Dataset updated
May 4, 2024
Dataset authored and provided by
Artificial Analysis
Description
Comparison by model of the Math Index, which represents the average of the math benchmarks in the Artificial Analysis Intelligence Index (AIME 2024 and Math-500).
Major AI models, by math and computational reasoning
statista.com
Updated Mar 14, 2025
Cite
Statista (2025). Major AI models, by math and computational reasoning [Dataset]. https://www.statista.com/statistics/1600812/ai-math-benchmarking-ranking/
Explore at:
Dataset updated
Mar 14, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2025
Area covered
Worldwide
Description
In 2024, the Artificial Analysis Math Index ranked AI models on their mathematical reasoning using benchmarks such as AIME 2024 and Math-500. o1, QwQ-32B, and DeepSeek R1 led the rankings, showing the highest proficiency in mathematical problem solving.
Trends in Math Proficiency (2011-2023): Princeton HSD 500 School District...
publicschoolreview.com
Updated Sep 9, 2025
Cite
Public School Review (2025). Trends in Math Proficiency (2011-2023): Princeton HSD 500 School District vs. Illinois [Dataset]. https://www.publicschoolreview.com/illinois/princeton-hsd-500-school-district/1732700-school-district
Explore at:
Dataset updated
Sep 9, 2025
Dataset authored and provided by
Public School Review
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset tracks annual math proficiency from 2011 to 2023 for Princeton HSD 500 School District vs. Illinois
MATH-500
huggingface.co
Updated Jul 1, 2025
Cite
Jacob Morrison (2025). MATH-500 [Dataset]. https://huggingface.co/datasets/jacobmorrison/MATH-500
Explore at:
Dataset updated
Jul 1, 2025
Authors
Jacob Morrison
Description
jacobmorrison/MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community
Trends in Math Proficiency (2010-2011): The 500 Role Model Academy vs....
publicschoolreview.com
Updated Sep 5, 2025
Cite
Public School Review (2025). Trends in Math Proficiency (2010-2011): The 500 Role Model Academy vs. Florida vs. Miami-Dade School District [Dataset]. https://www.publicschoolreview.com/the-500-role-model-academy-profile
Explore at:
Dataset updated
Sep 5, 2025
Dataset authored and provided by
Public School Review
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Dade County School District
Description
This dataset tracks annual math proficiency from 2010 to 2011 for The 500 Role Model Academy vs. Florida and Miami-Dade School District
Intelligence Index by Model
artificialanalysis.ai
Updated May 4, 2024
Cite
Artificial Analysis (2024). Intelligence Index by Model [Dataset]. https://artificialanalysis.ai/models/comparisons/mistral-small-3-2-vs-gpt-4o-2024-05-13
Explore at:
Dataset updated
May 4, 2024
Dataset authored and provided by
Artificial Analysis
Description
Comparison by model of the Artificial Analysis Intelligence Index, which incorporates 7 evaluations: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME, and MATH-500.
The table shows summary statistics of significant daily discontinuous...
figshare.com
plos.figshare.com
xls
Updated May 30, 2023
Cite
Yu-Min Yen (2023). The table shows summary statistics of significant daily discontinuous quadratic variation (sum of squared intradaily jumps) of SPC500 and DJIA on the common jump days by adopting separate and pool methods. [Dataset]. http://doi.org/10.1371/journal.pone.0058365.t004
Explore at:
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0058365.t004
Dataset updated
May 30, 2023
Dataset provided by
PLOS ONE
Authors
Yu-Min Yen
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The term common jump days used here only means that the two indices both have jumps on these days. The mean and standard deviation of are calculated conditional on The quantities of price variations shown are all scaled by 10000.
omni-MATH-500
huggingface.co
Updated Sep 1, 2025
Cite
Matteo Santelmo (2025). omni-MATH-500 [Dataset]. https://huggingface.co/datasets/matsant01/omni-MATH-500
Explore at:
Dataset updated
Sep 1, 2025
Authors
Matteo Santelmo
Description
matsant01/omni-MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community
Trends in Math Proficiency (2012-2023): Cape Elizabeth Middle School vs....
publicschoolreview.com
Updated Oct 15, 2011
Cite
Public School Review (2011). Trends in Math Proficiency (2012-2023): Cape Elizabeth Middle School vs. Maine vs. Cape Elizabeth School District [Dataset]. https://www.publicschoolreview.com/cape-elizabeth-middle-school-profile
Explore at:
Dataset updated
Oct 15, 2011
Dataset authored and provided by
Public School Review
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Cape Elizabeth, Maine
Description
This dataset tracks annual math proficiency from 2012 to 2023 for Cape Elizabeth Middle School vs. Maine and Cape Elizabeth School District
Trends in Math Proficiency (2012-2023): Yarmouth Elementary School vs. Maine...
publicschoolreview.com
Updated Sep 5, 2025
Cite
Public School Review (2025). Trends in Math Proficiency (2012-2023): Yarmouth Elementary School vs. Maine vs. Yarmouth Schools School District [Dataset]. https://www.publicschoolreview.com/yarmouth-elementary-school-profile
Explore at:
Dataset updated
Sep 5, 2025
Dataset authored and provided by
Public School Review
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Maine, Yarmouth, Yarmouth Schools
Description
This dataset tracks annual math proficiency from 2012 to 2023 for Yarmouth Elementary School vs. Maine and Yarmouth Schools School District
Trends in Math Proficiency (2012-2023): Frank H Harrison Middle School vs....
publicschoolreview.com
Updated Sep 5, 2025
Cite
Public School Review (2025). Trends in Math Proficiency (2012-2023): Frank H Harrison Middle School vs. Maine vs. Yarmouth Schools School District [Dataset]. https://www.publicschoolreview.com/frank-h-harrison-middle-school-profile
Explore at:
Dataset updated
Sep 5, 2025
Dataset authored and provided by
Public School Review
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Maine, Yarmouth, Yarmouth Schools
Description
This dataset tracks annual math proficiency from 2012 to 2023 for Frank H Harrison Middle School vs. Maine and Yarmouth Schools School District
The table shows summary statistics of significant daily discontinuous...
plos.figshare.com
xls
Updated May 30, 2023
Cite
Yu-Min Yen (2023). The table shows summary statistics of significant daily discontinuous quadratic variation (sum of squared intradaily jumps) of SPC500 and DJIA. [Dataset]. http://doi.org/10.1371/journal.pone.0058365.t003
Explore at:
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0058365.t003
Dataset updated
May 30, 2023
Dataset provided by
PLOS ONE
Authors
Yu-Min Yen
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
FDR is controlled at level and . The quantities of price variations shown are all scaled by 10000.
top-3-MATH-500-questions
huggingface.co
Updated Aug 31, 2025
Cite
Axel Zhang (2025). top-3-MATH-500-questions [Dataset]. https://huggingface.co/datasets/Youthquake123/top-3-MATH-500-questions
Explore at:
Dataset updated
Aug 31, 2025
Authors
Axel Zhang
Description
Youthquake123/top-3-MATH-500-questions dataset hosted on Hugging Face and contributed by the HF Datasets community
