Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
AIME 2024 Dataset
Dataset Description
This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2024. AIME is a prestigious high school mathematics competition known for its challenging mathematical problems.
Dataset Details
Format: JSONL Size: 30 records Source: AIME 2024 I & II Language: English
Data Fields
Each record contains the following fields:
ID: Problem identifier (e.g., "2024-I-1" represents Problem 1… See the full description on the dataset page: https://huggingface.co/datasets/Maxwell-Jia/AIME_2024.
Facebook
TwitterDataset card for AIME 2024
This dataset consists of 30 problems from the 2024 AIME I and AIME II tests. The original source is AI-MO/aimo-validation-aime, which contains a larger set of 90 problems from AIME 2022-2024.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Disclaimer: This is a Benchmark dataset! Do not using in training! This is the Benchmark of AIME from year 1983~2023, and 2024(part 2). Original: https://artofproblemsolving.com/wiki/index.php/AIME_Problems_and_Solutions 2024(part 1) can be find at https://huggingface.co/datasets/AI-MO/aimo-validation-aime.
Citation
@misc {di_zhang_2025, author = { {Di Zhang} }, title = { AIME_1983_2024 (Revision 6283828) }, year = 2025, url = {… See the full description on the dataset page: https://huggingface.co/datasets/di-zhang-fdu/AIME_1983_2024.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Olympiad-level mathematical problem solving from the real 2024 AIME competition. 30 problems testing advanced algebra, geometry, combinatorics, and number theory.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
AIME 24
American Invitational Mathematics Examination (AIME) 2024
Citation
If you use the AIME24 dataset in your research, please consider citing it as follows: @misc{aime24, title={American Invitational Mathematics Examination (AIME) 2024}, author={Zhang, Yifan and Math-AI, Team}, year={2024}, }
Facebook
TwitterIn 2024, the artificial analysis math index ranked AI models based on their mathematical reasoning using benchmarks like AIME 2024 and Math-500. o1, QwQ-32B, and DeepSeek R1, led the rankings, showing the highest proficiency in mathematical problem solving.
Facebook
TwitterAIME2024-ko: Korean Translation of AIME Mathematics Benchmark
This dataset is originated from AIME2024 benchmark in the rLLM repository.
Korean Version README AIME2024-ko is a Korean adaptation of the AIME-2024 (American Invitational Mathematics Examination) benchmark utilized with rLLM framework. It enables evaluation of large language models (LLMs) for their mathematical reasoning capabilities in the Korean language.
Dataset Details
Original Source: AIME2024… See the full description on the dataset page: https://huggingface.co/datasets/allganize/AIME2024-ko.
Facebook
Twitterhttps://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
AIME 2024 (Turkish) Dataset
This dataset contains the Turkish translations of problems from the 2024 American Invitational Mathematics Examination (AIME). It is intended to serve as a benchmark for evaluating the advanced mathematical reasoning capabilities of Large Language Models (LLMs) in the Turkish language. The questions were translated into Turkish using GPT-5 and subsequently manually verified and corrected. The AIME is an intermediate examination between the AMC 10/12 and… See the full description on the dataset page: https://huggingface.co/datasets/Rendra8631/aime24-tr.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
RedaAlami/math-correctness-classifier_64rollouts
Dataset Description
This dataset contains mathematical reasoning problems and model responses formatted for training correctness classifiers. Each record includes a problem statement, a model's solution attempt, and a binary label indicating correctness. The dataset spans three benchmarks:
AIME 2024: American Invitational Mathematics Examination 2024
AIME 2025: American Invitational Mathematics Examination 2025
AMO:… See the full description on the dataset page: https://huggingface.co/datasets/RedaAlami/math-correctness-classifier_64rollouts.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
OpenR1-Math-220k_decontaminated
Decontaminated version of open-r1/OpenR1-Math-220k - default/train
Decontamination
Removed any questions that have an 8-gram overlap with common benchmarks: AIME 2024, AIME 2025, MATH500, GPQA Diamond, LiveCodeBench Code Generation Lite Used GitHub:huggingface/open-r1/scripts/decontaminate.py with all defaults following https://github.com/huggingface/open-r1#data-decontamination
python scripts/decontaminate.py
--dataset… See the full description on the dataset page: https://huggingface.co/datasets/notpaulmartin/OpenR1-Math-220k_decontaminated.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
AIME 2024 Dataset
Dataset Description
This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2024. AIME is a prestigious high school mathematics competition known for its challenging mathematical problems.
Dataset Details
Format: JSONL Size: 30 records Source: AIME 2024 I & II Language: English
Data Fields
Each record contains the following fields:
ID: Problem identifier (e.g., "2024-I-1" represents Problem 1… See the full description on the dataset page: https://huggingface.co/datasets/Maxwell-Jia/AIME_2024.