8 datasets found

h
competition_math
huggingface.co
Updated Jan 29, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michael Vechtomov (2023). competition_math [Dataset]. https://huggingface.co/datasets/qwedsacf/competition_math
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 29, 2023
Authors
Michael Vechtomov
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset Card for Mathematics Aptitude Test of Heuristics (MATH) dataset

Dataset Summary

The Mathematics Aptitude Test of Heuristics (MATH) dataset consists of problems from mathematics competitions, including the AMC 10, AMC 12, AIME, and more. Each problem in MATH has a full step-by-step solution, which can be used to teach models to generate answer derivations and explanations.

Supported Tasks and Leaderboards

[More Information Needed]

Languages… See the full description on the dataset page: https://huggingface.co/datasets/qwedsacf/competition_math.
h
competition_math
huggingface.co
Updated Mar 20, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jorin Eggers (2024). competition_math [Dataset]. https://huggingface.co/datasets/jeggers/competition_math
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 20, 2024
Authors
Jorin Eggers
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset Card for "competition_math"

Added column with final solution extracted from \boxed{} tags. Added numeric congig that only contains questions with numeric answers.

Dataset Summary

MATH contains 12,500 challenging competition mathematics problems. Each problem in MATH has a full step-by-step solution which can be used to teach models to generate answer derivations and explanation This dataset card aims to be a base template for new datasets.

Languages… See the full description on the dataset page: https://huggingface.co/datasets/jeggers/competition_math.
h
competition_math
huggingface.co
Updated May 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
jullajak (2025). competition_math [Dataset]. https://huggingface.co/datasets/MickMick102/competition_math
Explore at:
Dataset updated
May 28, 2025
Authors
jullajak
Description
MickMick102/competition_math dataset hosted on Hugging Face and contributed by the HF Datasets community
h
competition_math
huggingface.co
Updated May 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
JTS AI Team (2025). competition_math [Dataset]. https://huggingface.co/datasets/jts-ai-team/competition_math
Explore at:
Dataset updated
May 28, 2025
Authors
JTS AI Team
Description
jts-ai-team/competition_math dataset hosted on Hugging Face and contributed by the HF Datasets community
h
MATH
huggingface.co
Updated Dec 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MATH [Dataset]. https://huggingface.co/datasets/Maxwell-Jia/MATH
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 3, 2024
Authors
Minghui Jia
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
MATH Dataset

The Mathematics Aptitude Test of Heuristics (MATH) dataset consists of problems from mathematics competitions, including the AMC 10, AMC 12, AIME, and more. Each problem in MATH has a full step-by-step solution, which can be used to teach models to generate answer derivations and explanations. This is a converted version of the hendrycks/competition_math originally created by Hendrycks et al. The dataset has been converted to parquet format for easier loading and usage.… See the full description on the dataset page: https://huggingface.co/datasets/Maxwell-Jia/MATH.
robust-finetuning
huggingface.co
Updated Aug 8, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Appier AI Research Team (2024). robust-finetuning [Dataset]. https://huggingface.co/datasets/appier-ai-research/robust-finetuning
Explore at:
Dataset updated
Aug 8, 2024
Dataset provided by
Appier Inc.https://www.appier.com/
Authors
Appier AI Research Team
Description
Please refer to the following source for the original datasets:

GSM8K: https://huggingface.co/datasets/openai/gsm8k MATH: https://huggingface.co/datasets/hendrycks/competition_math math-resample: In this section we subsample the 1,000 subsample only (yes it's balance)

HumanEval+: https://huggingface.co/datasets/evalplus/humanevalplus MBPP: https://huggingface.co/datasets/google-research-datasets/mbpp MBPP+: https://huggingface.co/datasets/evalplus/mbppplus ARC Challenge:… See the full description on the dataset page: https://huggingface.co/datasets/appier-ai-research/robust-finetuning.
h
optillmbench
huggingface.co
Updated Jun 15, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Asankhaya Sharma (2015). optillmbench [Dataset]. https://huggingface.co/datasets/codelion/optillmbench
Explore at:
Dataset updated
Jun 15, 2015
Authors
Asankhaya Sharma
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
OptiLLMBench Dataset

A benchmark dataset for evaluating test-time optimization and scaling capabilities of language models.

Dataset Description

OptiLLMBench contains 500 carefully selected challenging problems across multiple domains:

Mathematical reasoning (from competition_math) Code generation (from HumanEval) Word problems (from GSM8K) Multiple choice reasoning (from MMLU) Logical deduction (from BBH)

Each example is chosen to benefit from test-time optimization… See the full description on the dataset page: https://huggingface.co/datasets/codelion/optillmbench.
h
olympiad_task_translation
huggingface.co
Updated Aug 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nikita (2024). olympiad_task_translation [Dataset]. https://huggingface.co/datasets/NMashalov/olympiad_task_translation
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 11, 2024
Authors
Nikita
Description
Modified with Russiand Translation of hendrycks/competition_math via Llama3
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Michael Vechtomov (2023). competition_math [Dataset]. https://huggingface.co/datasets/qwedsacf/competition_math

competition_math

qwedsacf/competition_math

Mathematics Aptitude Test of Heuristics (MATH)

Explore at:

11 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jan 29, 2023

Authors

Michael Vechtomov

License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

Dataset Card for Mathematics Aptitude Test of Heuristics (MATH) dataset

  Dataset Summary

The Mathematics Aptitude Test of Heuristics (MATH) dataset consists of problems from mathematics competitions, including the AMC 10, AMC 12, AIME, and more. Each problem in MATH has a full step-by-step solution, which can be used to teach models to generate answer derivations and explanations.

  Supported Tasks and Leaderboards

[More Information Needed]

  Languages… See the full description on the dataset page: https://huggingface.co/datasets/qwedsacf/competition_math.

Clear search

Close search

Google apps

Main menu

competition_math

competition_math

competition_math

competition_math

MATH

robust-finetuning

optillmbench

olympiad_task_translation

competition_mathSee More Versions

qwedsacf/competition_math

Mathematics Aptitude Test of Heuristics (MATH)

competition_math