10 datasets found

AIME_2024
huggingface.co
tokenburn.ru
Updated Dec 6, 2024
Cite
Minghui Jia (2024). AIME_2024 [Dataset]. https://huggingface.co/datasets/Maxwell-Jia/AIME_2024
Explore at: Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
Dataset updated
Dec 6, 2024
Authors
Minghui Jia
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
AIME 2024 Dataset

Dataset Description

This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2024. AIME is a prestigious high school mathematics competition known for its challenging mathematical problems.

Dataset Details

Format: JSONL
Size: 30 records
Source: AIME 2024 I & II
Language: English

Data Fields

Each record contains the following fields:

ID: Problem identifier (e.g., "2024-I-1" represents Problem 1… See the full description on the dataset page: https://huggingface.co/datasets/Maxwell-Jia/AIME_2024.
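
The ID field above can be split into its parts with a short parser. This is a minimal sketch, assuming every ID follows the `<year>-<exam>-<problem number>` pattern suggested by the example "2024-I-1"; the dataset card is truncated here, so the pattern is an inference, and the names `ProblemId` and `parse_problem_id` are hypothetical.

```python
import re
from typing import NamedTuple


class ProblemId(NamedTuple):
    year: int    # e.g. 2024
    exam: str    # "I" or "II"
    number: int  # problem number within the exam


# Assumed pattern, inferred only from the example "2024-I-1".
_ID_PATTERN = re.compile(r"^(\d{4})-(I|II)-(\d{1,2})$")


def parse_problem_id(raw: str) -> ProblemId:
    """Split an ID such as "2024-I-1" into year, exam, and problem number."""
    match = _ID_PATTERN.match(raw.strip())
    if match is None:
        raise ValueError(f"unrecognized problem ID: {raw!r}")
    year, exam, number = match.groups()
    return ProblemId(int(year), exam, int(number))
```

For example, `parse_problem_id("2024-II-15")` yields a tuple whose `exam` is `"II"` and whose `number` is `15`. Raising on unrecognized input, rather than guessing, keeps malformed records visible.
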
aime_2024
huggingface.co
+ more versions
Cite
Hugging Face H4, aime_2024 [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/aime_2024
Explore at: Croissant
Dataset provided by
Hugging Face (https://huggingface.co/)
Authors
Hugging Face H4
Description
Dataset card for AIME 2024

This dataset consists of 30 problems from the 2024 AIME I and AIME II tests. The original source is AI-MO/aimo-validation-aime, which contains a larger set of 90 problems from AIME 2022-2024.
AIME_1983_2024
huggingface.co
Updated Jun 11, 2024
Cite
Di Zhang (2024). AIME_1983_2024 [Dataset]. http://doi.org/10.57967/hf/4687
Explore at: Croissant
Unique identifier
https://doi.org/10.57967/hf/4687
Dataset updated
Jun 11, 2024
Authors
Di Zhang
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Disclaimer: This is a benchmark dataset. Do not use it for training. It covers AIME problems from 1983 to 2023, plus 2024 (part 2). Original: https://artofproblemsolving.com/wiki/index.php/AIME_Problems_and_Solutions 2024 (part 1) can be found at https://huggingface.co/datasets/AI-MO/aimo-validation-aime.

Citation

@misc {di_zhang_2025, author = { {Di Zhang} }, title = { AIME_1983_2024 (Revision 6283828) }, year = 2025, url = {… See the full description on the dataset page: https://huggingface.co/datasets/di-zhang-fdu/AIME_1983_2024.
American Invitational Mathematics Examination 2024 Benchmark Results
lmmarketcap.com
aimodelsmap.com
Updated Mar 27, 2026
Cite
LM Market Cap (2026). American Invitational Mathematics Examination 2024 Benchmark Results [Dataset]. https://lmmarketcap.com/benchmarks/aime_2024
Dataset updated
Mar 27, 2026
Dataset authored and provided by
LM Market Cap
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0), https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Time period covered
2024 - Present
Variables measured
model rank, benchmark score
Measurement technique
American Invitational Mathematics Examination 2024 evaluation
Description
Olympiad-level mathematical problem solving from the real 2024 AIME competition. 30 problems testing advanced algebra, geometry, combinatorics, and number theory.
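
A benchmark score over the 30 problems can be computed by exact match, since every AIME answer is an integer from 0 to 999. The sketch below is an assumption about how such a score might be computed, not LM Market Cap's documented method; `normalize_answer` and `accuracy` are hypothetical names.

```python
from typing import Optional, Sequence


def normalize_answer(text: str) -> Optional[int]:
    """Return the answer as an int in [0, 999], or None if it is not a valid AIME answer."""
    cleaned = text.strip()
    # Accept plain ASCII digits only, so "073" is valid but "7.0" or "abc" is not.
    if not (cleaned.isascii() and cleaned.isdigit()):
        return None
    value = int(cleaned)
    return value if 0 <= value <= 999 else None


def accuracy(predictions: Sequence[str], references: Sequence[int]) -> float:
    """Fraction of predictions that exactly match the reference answers."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("at least one problem is required")
    correct = sum(
        1
        for predicted, expected in zip(predictions, references)
        if normalize_answer(predicted) == expected
    )
    return correct / len(references)
```

Leading zeros are tolerated because AIME answers are conventionally written as three digits, so "073" and 73 count as the same answer.
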
aime24
huggingface.co
Updated Feb 20, 2026
Cite
math-ai (2026). aime24 [Dataset]. https://huggingface.co/datasets/math-ai/aime24
Explore at: Croissant
Dataset updated
Feb 20, 2026
Dataset authored and provided by
math-ai
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
AIME 24

American Invitational Mathematics Examination (AIME) 2024

Citation

If you use the AIME24 dataset in your research, please consider citing it as follows:

@misc{aime24,
  title={American Invitational Mathematics Examination (AIME) 2024},
  author={Zhang, Yifan and Math-AI, Team},
  year={2024},
}
Major AI models, by math and computational reasoning
statista.com
Updated Nov 28, 2025
Cite
Statista (2025). Major AI models, by math and computational reasoning [Dataset]. https://www.statista.com/statistics/1600812/ai-math-benchmarking-ranking/
Dataset updated
Nov 28, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2025
Area covered
Worldwide
Description
In 2024, the Artificial Analysis math index ranked AI models on mathematical reasoning using benchmarks such as AIME 2024 and Math-500. o1, QwQ-32B, and DeepSeek R1 led the rankings, showing the highest proficiency in mathematical problem solving.
AIME2024-ko
huggingface.co
Updated Jul 29, 2024
Cite
allganize (2024). AIME2024-ko [Dataset]. https://huggingface.co/datasets/allganize/AIME2024-ko
Dataset updated
Jul 29, 2024
Dataset provided by
Allganize, Inc.
Authors
allganize
Description
AIME2024-ko: Korean Translation of AIME Mathematics Benchmark

This dataset originates from the AIME2024 benchmark in the rLLM repository.

Korean Version README

AIME2024-ko is a Korean adaptation of the AIME-2024 (American Invitational Mathematics Examination) benchmark used with the rLLM framework. It enables evaluation of the mathematical reasoning capabilities of large language models (LLMs) in Korean.

Dataset Details

Original Source: AIME2024… See the full description on the dataset page: https://huggingface.co/datasets/allganize/AIME2024-ko.
aime24-tr
huggingface.co
Updated Dec 21, 2025
+ more versions
Cite
Rendra Saputra (2025). aime24-tr [Dataset]. https://huggingface.co/datasets/Rendra8631/aime24-tr
Dataset updated
Dec 21, 2025
Authors
Rendra Saputra
License
Other (https://choosealicense.com/licenses/other/)
Description
AIME 2024 (Turkish) Dataset

This dataset contains the Turkish translations of problems from the 2024 American Invitational Mathematics Examination (AIME). It is intended to serve as a benchmark for evaluating the advanced mathematical reasoning capabilities of Large Language Models (LLMs) in the Turkish language. The questions were translated into Turkish using GPT-5 and subsequently manually verified and corrected. The AIME is an intermediate examination between the AMC 10/12 and… See the full description on the dataset page: https://huggingface.co/datasets/Rendra8631/aime24-tr.
math-correctness-classifier_64rollouts
huggingface.co
+ more versions
Cite
ralami, math-correctness-classifier_64rollouts [Dataset]. https://huggingface.co/datasets/RedaAlami/math-correctness-classifier_64rollouts
Authors
ralami
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
RedaAlami/math-correctness-classifier_64rollouts

Dataset Description

This dataset contains mathematical reasoning problems and model responses formatted for training correctness classifiers. Each record includes a problem statement, a model's solution attempt, and a binary label indicating correctness. The dataset spans three benchmarks:

AIME 2024: American Invitational Mathematics Examination 2024
AIME 2025: American Invitational Mathematics Examination 2025
AMO:… See the full description on the dataset page: https://huggingface.co/datasets/RedaAlami/math-correctness-classifier_64rollouts.
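
Records of the shape described above (a problem statement, a solution attempt, a binary correctness label) can be aggregated into a per-problem pass rate. This is a minimal sketch: the field names `problem` and `label` are assumptions, since the actual column names are not visible in the truncated description, and `pass_rate_by_problem` is a hypothetical helper.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping


def pass_rate_by_problem(records: Iterable[Mapping]) -> Dict[str, float]:
    """Fraction of attempts labeled correct, grouped by problem statement.

    Assumes each record has a "problem" string and a binary "label"
    (1 or True for correct, 0 or False for incorrect).
    """
    totals: Dict[str, int] = defaultdict(int)
    correct: Dict[str, int] = defaultdict(int)
    for record in records:
        key = record["problem"]
        totals[key] += 1
        correct[key] += int(record["label"])
    return {key: correct[key] / totals[key] for key in totals}
```

With several attempts per problem, as the "64rollouts" in the dataset name suggests, the resulting rates give a rough difficulty estimate for each problem.
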
OpenR1-Math-220k_decontaminated
huggingface.co
Updated Mar 30, 2025
Cite
Paul Martin (2025). OpenR1-Math-220k_decontaminated [Dataset]. https://huggingface.co/datasets/notpaulmartin/OpenR1-Math-220k_decontaminated
Dataset updated
Mar 30, 2025
Authors
Paul Martin
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
OpenR1-Math-220k_decontaminated

Decontaminated version of open-r1/OpenR1-Math-220k - default/train

Decontamination

Removed any questions that have an 8-gram overlap with common benchmarks: AIME 2024, AIME 2025, MATH500, GPQA Diamond, LiveCodeBench Code Generation Lite.

Used GitHub:huggingface/open-r1/scripts/decontaminate.py with all defaults, following https://github.com/huggingface/open-r1#data-decontamination

python scripts/decontaminate.py
--dataset… See the full description on the dataset page: https://huggingface.co/datasets/notpaulmartin/OpenR1-Math-220k_decontaminated.
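
The idea behind 8-gram decontamination can be illustrated in a few lines. This is a simplified sketch, not the open-r1 `decontaminate.py` script: lowercasing and whitespace tokenization are assumptions, and the function names are hypothetical.

```python
from typing import Iterable, List, Set, Tuple


def ngrams(text: str, n: int = 8) -> Set[Tuple[str, ...]]:
    """All word-level n-grams of a text (lowercased, whitespace-tokenized)."""
    tokens = text.lower().split()
    # Texts shorter than n tokens produce an empty set.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def build_benchmark_index(benchmark_questions: Iterable[str], n: int = 8) -> Set[Tuple[str, ...]]:
    """Union of the n-grams of every benchmark question."""
    index: Set[Tuple[str, ...]] = set()
    for question in benchmark_questions:
        index |= ngrams(question, n)
    return index


def decontaminate(train_questions: Iterable[str],
                  benchmark_questions: Iterable[str],
                  n: int = 8) -> List[str]:
    """Keep only training questions that share no n-gram with any benchmark question."""
    index = build_benchmark_index(benchmark_questions, n)
    return [q for q in train_questions if ngrams(q, n).isdisjoint(index)]
```

Building the benchmark index once and testing each training question with `isdisjoint` avoids comparing every pair of questions. A question shorter than 8 tokens can never be flagged under this scheme, which is one reason real pipelines may apply additional normalization.
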
Minghui Jia (2024). AIME_2024 [Dataset]. https://huggingface.co/datasets/Maxwell-Jia/AIME_2024

155 scholarly articles cite this dataset.
