100+ datasets found

h
MATH-500
huggingface.co
Updated Nov 18, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ricardo (2022). MATH-500 [Dataset]. https://huggingface.co/datasets/ricdomolm/MATH-500
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 18, 2022
Authors
Ricardo
Description
MATH-500 test set with the remaining 12000 examples in train. import datasets

https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

from math_utils import last_boxed_only_string, remove_boxed

math = datasets.load_dataset('DigitalLearningGmbH/MATH-lighteval', 'default') math500 = datasets.load_dataset('HuggingFaceH4/MATH-500')

convert math to math500 format

def map_to_500(example): return { 'problem':… See the full description on the dataset page: https://huggingface.co/datasets/ricdomolm/MATH-500.
h
math500
huggingface.co
Updated Apr 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zikang Shan (2025). math500 [Dataset]. https://huggingface.co/datasets/zkshan2002/math500
Explore at:
Dataset updated
Apr 10, 2025
Authors
Zikang Shan
Description
Source: HuggingFaceH4/MATH-500 Modification:

Remove redundent columns

Keys:

problem answer

Size

test: 500
a
Math 500 by Model
artificialanalysis.ai
Updated Jun 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artificial Analysis (2025). Math 500 by Model [Dataset]. https://artificialanalysis.ai/evaluations/math-500
Explore at:
Dataset updated
Jun 28, 2025
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Independently conducted by Artificial Analysis by Model
h
math500
huggingface.co
Updated May 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
VLM-Reasoner (2025). math500 [Dataset]. https://huggingface.co/datasets/VLM-Reasoner/math500
Explore at:
Dataset updated
May 11, 2025
Dataset authored and provided by
VLM-Reasoner
Description
VLM-Reasoner/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
MATH500
huggingface.co
Updated Jun 26, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yinjie Wang (2025). MATH500 [Dataset]. https://huggingface.co/datasets/yinjiewang/MATH500
Explore at:
Dataset updated
Jun 26, 2025
Authors
Yinjie Wang
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
yinjiewang/MATH500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500
huggingface.co
Updated May 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jiarui Yao (2025). math500 [Dataset]. https://huggingface.co/datasets/FlippyDora/math500
Explore at:
Dataset updated
May 1, 2025
Authors
Jiarui Yao
Description
FlippyDora/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500-cot-experiment
huggingface.co
Updated Apr 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Asankhaya Sharma (2025). math500-cot-experiment [Dataset]. https://huggingface.co/datasets/codelion/math500-cot-experiment
Explore at:
Dataset updated
Apr 26, 2025
Authors
Asankhaya Sharma
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
MATH-500 Chain of Thought Experiment Dataset

This dataset contains the results of an experiment testing different prompting strategies (standard, chain of thought, and gibberish chain of thought) on the MATH-500 benchmark using the Llama-3.2-1B-Instruct model.

Dataset Structure

The dataset is split into three parts:

standard: Direct prompting with no reasoning steps (500 examples) cot: Chain of thought prompting with structured reasoning (500 examples) gibberish:… See the full description on the dataset page: https://huggingface.co/datasets/codelion/math500-cot-experiment.
h
math500_best_of_n
huggingface.co
Updated Jan 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jakhongir Saydaliev (2025). math500_best_of_n [Dataset]. https://huggingface.co/datasets/Jakh0103/math500_best_of_n
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 12, 2025
Authors
Jakhongir Saydaliev
Description
Dataset Summary

This dataset contains mathematical problem-solving responses generated using two decoding methods: Greedy and Best-of-N for 20 problems spanning three difficulty levels (1, 2, 3) from the [MATH500 dataset](https://huggingface.co/datasets/HuggingFaceH4/MATH-500.

Languages

The dataset content is entirely in English.

Dataset Structure

Each instance contains the following fields:

problem: math problem given as an input answer: a ground truth answer… See the full description on the dataset page: https://huggingface.co/datasets/Jakh0103/math500_best_of_n.
h
MATH500
huggingface.co
Updated Mar 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yu Wang (2025). MATH500 [Dataset]. https://huggingface.co/datasets/Wloner0809/MATH500
Explore at:
Dataset updated
Mar 21, 2025
Authors
Yu Wang
Description
Wloner0809/MATH500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
MATH500
huggingface.co
Updated Aug 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Princeton-AI (2025). MATH500 [Dataset]. https://huggingface.co/datasets/Gen-Verse/MATH500
Explore at:
Dataset updated
Aug 27, 2025
Dataset authored and provided by
Princeton-AI
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Gen-Verse/MATH500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500-enhanced
huggingface.co
Updated Jun 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rachit Bansal (2025). math500-enhanced [Dataset]. https://huggingface.co/datasets/rachitbansal-harvard/math500-enhanced
Explore at:
Dataset updated
Jun 17, 2025
Authors
Rachit Bansal
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Math500 Enhanced Dataset

This dataset contains LLM-enhanced versions of mathematical problems with step-by-step reasoning solutions.

Dataset Statistics

Examples: 500 (500 enhanced with LLM) Enhancement Rate: 100.0%

Data Fields

question: The mathematical problem statement solution: LLM-enhanced step-by-step solution original_solution: Original solution text (for reference) answer: Final numerical answer level: Problem difficulty level type: Problem… See the full description on the dataset page: https://huggingface.co/datasets/rachitbansal-harvard/math500-enhanced.
h
MATH-500
huggingface.co
Updated Nov 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dongfu Jiang (2024). MATH-500 [Dataset]. https://huggingface.co/datasets/DongfuJiang/MATH-500
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 7, 2024
Authors
Dongfu Jiang
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
DongfuJiang/MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500-deepseek-r1-distill-qwen-1.5b
huggingface.co
Updated May 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sungmin Jo (2025). math500-deepseek-r1-distill-qwen-1.5b [Dataset]. https://huggingface.co/datasets/jsm0424/math500-deepseek-r1-distill-qwen-1.5b
Explore at:
Dataset updated
May 28, 2025
Authors
Sungmin Jo
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset Card for "math500-deepseek-r1-distill-qwen-1.5b"

Dataset Summary

This dataset is a distilled version of the MATH500 dataset, augmented with reasoning-based responses generated by the deepseek-r1-distill-qwen-1.5b language model. The dataset is designed to evaluate and improve the mathematical reasoning capabilities of LLMs through step-by-step solutions and final answers. Each example consists of:

The original problem statement from MATH500 The reference solution… See the full description on the dataset page: https://huggingface.co/datasets/jsm0424/math500-deepseek-r1-distill-qwen-1.5b.
h
math500
huggingface.co
Updated Jun 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Samuel Moor-Smith (2025). math500 [Dataset]. https://huggingface.co/datasets/smoorsmith/math500
Explore at:
Dataset updated
Jun 21, 2025
Authors
Samuel Moor-Smith
Description
Dataset Card for Dataset Name

Forked from HuggingFaceH4/MATH-500

Dataset Details Dataset Description

Curated by: [More Information Needed] Funded by [optional]: [More Information Needed] Shared by [optional]: [More Information Needed] Language(s) (NLP): [More Information Needed] License: [More Information Needed]

Dataset Sources [optional]

Repository: [More Information Needed] Paper [optional]: [More Information Needed] Demo [optional]:… See the full description on the dataset page: https://huggingface.co/datasets/smoorsmith/math500.
h
MATH-train-MATH500-test
huggingface.co
Updated Nov 18, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shuozhe Li (2022). MATH-train-MATH500-test [Dataset]. https://huggingface.co/datasets/ShuoZheLi/MATH-train-MATH500-test
Explore at:
Dataset updated
Nov 18, 2022
Authors
Shuozhe Li
Description
ShuoZheLi/MATH-train-MATH500-test dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500
huggingface.co
Updated Jul 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
sparklereasoning (2025). math500 [Dataset]. https://huggingface.co/datasets/sparkle-reasoning/math500
Explore at:
Dataset updated
Jul 5, 2025
Dataset authored and provided by
sparklereasoning
Description
sparkle-reasoning/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500-math-rubric
huggingface.co
Updated Aug 16, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mika Senghaas (2025). math500-math-rubric [Dataset]. https://huggingface.co/datasets/mikasenghaas/math500-math-rubric
Explore at:
Dataset updated
Aug 16, 2025
Authors
Mika Senghaas
Description
mikasenghaas/math500-math-rubric dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500
huggingface.co
Updated Mar 19, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wei Xiong (2025). math500 [Dataset]. https://huggingface.co/datasets/weqweasdas/math500
Explore at:
Dataset updated
Mar 19, 2025
Authors
Wei Xiong
Description
weqweasdas/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500
huggingface.co
Updated Feb 5, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
dsrtrain (2025). math500 [Dataset]. https://huggingface.co/datasets/dsrtrain/math500
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 5, 2025
Dataset authored and provided by
dsrtrain
Description
dsrtrain/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
math500
huggingface.co
Updated Jan 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Scale Frontier Data (2025). math500 [Dataset]. https://huggingface.co/datasets/ScaleFrontierData/math500
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 29, 2025
Dataset authored and provided by
Scale Frontier Data
Description
ScaleFrontierData/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community

Facebook

Twitter

Click to copy link

Link copied

Cite

Ricardo (2022). MATH-500 [Dataset]. https://huggingface.co/datasets/ricdomolm/MATH-500

MATH-500

ricdomolm/MATH-500

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Nov 18, 2022

Authors

Ricardo

Description

MATH-500 test set with the remaining 12000 examples in train. import datasets

https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

from math_utils import last_boxed_only_string, remove_boxed

math = datasets.load_dataset('DigitalLearningGmbH/MATH-lighteval', 'default') math500 = datasets.load_dataset('HuggingFaceH4/MATH-500')

convert math to math500 format

def map_to_500(example): return { 'problem':… See the full description on the dataset page: https://huggingface.co/datasets/ricdomolm/MATH-500.

Clear search

Close search

Google apps

Main menu

MATH-500

https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

convert math to math500 format

math500

Math 500 by Model

math500

MATH500

math500

math500-cot-experiment

math500_best_of_n

MATH500

MATH500

math500-enhanced

MATH-500

math500-deepseek-r1-distill-qwen-1.5b

math500

MATH-train-MATH500-test

math500

math500-math-rubric

math500

math500

math500

MATH-500

ricdomolm/MATH-500

https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

convert math to math500 format