10 datasets found
  1. h

    AIME_2024

    • huggingface.co
    • tokenburn.ru
    Updated Dec 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Minghui Jia (2024). AIME_2024 [Dataset]. https://huggingface.co/datasets/Maxwell-Jia/AIME_2024
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 6, 2024
    Authors
    Minghui Jia
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    AIME 2024 Dataset

      Dataset Description
    

    This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2024. AIME is a prestigious high school mathematics competition known for its challenging mathematical problems.

      Dataset Details
    

    Format: JSONL Size: 30 records Source: AIME 2024 I & II Language: English

      Data Fields
    

    Each record contains the following fields:

    ID: Problem identifier (e.g., "2024-I-1" represents Problem 1… See the full description on the dataset page: https://huggingface.co/datasets/Maxwell-Jia/AIME_2024.

  2. aime_2024

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face H4, aime_2024 [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/aime_2024
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face H4
    Description

    Dataset card for AIME 2024

    This dataset consists of 30 problems from the 2024 AIME I and AIME II tests. The original source is AI-MO/aimo-validation-aime, which contains a larger set of 90 problems from AIME 2022-2024.

  3. h

    AIME_1983_2024

    • huggingface.co
    Updated Jun 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Di Zhang (2024). AIME_1983_2024 [Dataset]. http://doi.org/10.57967/hf/4687
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 11, 2024
    Authors
    Di Zhang
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Disclaimer: This is a Benchmark dataset! Do not using in training! This is the Benchmark of AIME from year 1983~2023, and 2024(part 2). Original: https://artofproblemsolving.com/wiki/index.php/AIME_Problems_and_Solutions 2024(part 1) can be find at https://huggingface.co/datasets/AI-MO/aimo-validation-aime.

      Citation
    

    @misc {di_zhang_2025, author = { {Di Zhang} }, title = { AIME_1983_2024 (Revision 6283828) }, year = 2025, url = {… See the full description on the dataset page: https://huggingface.co/datasets/di-zhang-fdu/AIME_1983_2024.

  4. l

    American Invitational Mathematics Examination 2024 Benchmark Results

    • lmmarketcap.com
    • aimodelsmap.com
    Updated Mar 27, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LM Market Cap (2026). American Invitational Mathematics Examination 2024 Benchmark Results [Dataset]. https://lmmarketcap.com/benchmarks/aime_2024
    Explore at:
    Dataset updated
    Mar 27, 2026
    Dataset authored and provided by
    LM Market Cap
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Time period covered
    2024 - Present
    Variables measured
    model rank, benchmark score
    Measurement technique
    American Invitational Mathematics Examination 2024 evaluation
    Description

    Olympiad-level mathematical problem solving from the real 2024 AIME competition. 30 problems testing advanced algebra, geometry, combinatorics, and number theory.

  5. h

    aime24

    • huggingface.co
    Updated Feb 20, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    math-ai (2026). aime24 [Dataset]. https://huggingface.co/datasets/math-ai/aime24
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 20, 2026
    Dataset authored and provided by
    math-ai
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    AIME 24

      American Invitational Mathematics Examination (AIME) 2024
    
    
    
    
    
      Citation
    

    If you use the AIME24 dataset in your research, please consider citing it as follows: @misc{aime24, title={American Invitational Mathematics Examination (AIME) 2024}, author={Zhang, Yifan and Math-AI, Team}, year={2024}, }

  6. Major AI models, by math and computational reasoning

    • statista.com
    Updated Nov 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Major AI models, by math and computational reasoning [Dataset]. https://www.statista.com/statistics/1600812/ai-math-benchmarking-ranking/
    Explore at:
    Dataset updated
    Nov 28, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2025
    Area covered
    Worldwide
    Description

    In 2024, the artificial analysis math index ranked AI models based on their mathematical reasoning using benchmarks like AIME 2024 and Math-500. o1, QwQ-32B, and DeepSeek R1, led the rankings, showing the highest proficiency in mathematical problem solving.

  7. AIME2024-ko

    • huggingface.co
    Updated Jul 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    allganize (2024). AIME2024-ko [Dataset]. https://huggingface.co/datasets/allganize/AIME2024-ko
    Explore at:
    Dataset updated
    Jul 29, 2024
    Dataset provided by
    Allganize, Inc.
    Authors
    allganize
    Description

    AIME2024-ko: Korean Translation of AIME Mathematics Benchmark

    This dataset is originated from AIME2024 benchmark in the rLLM repository.

    Korean Version README AIME2024-ko is a Korean adaptation of the AIME-2024 (American Invitational Mathematics Examination) benchmark utilized with rLLM framework. It enables evaluation of large language models (LLMs) for their mathematical reasoning capabilities in the Korean language.

      Dataset Details
    

    Original Source: AIME2024… See the full description on the dataset page: https://huggingface.co/datasets/allganize/AIME2024-ko.

  8. h

    aime24-tr

    • huggingface.co
    Updated Dec 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rendra Saputra (2025). aime24-tr [Dataset]. https://huggingface.co/datasets/Rendra8631/aime24-tr
    Explore at:
    Dataset updated
    Dec 21, 2025
    Authors
    Rendra Saputra
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    AIME 2024 (Turkish) Dataset

    This dataset contains the Turkish translations of problems from the 2024 American Invitational Mathematics Examination (AIME). It is intended to serve as a benchmark for evaluating the advanced mathematical reasoning capabilities of Large Language Models (LLMs) in the Turkish language. The questions were translated into Turkish using GPT-5 and subsequently manually verified and corrected. The AIME is an intermediate examination between the AMC 10/12 and… See the full description on the dataset page: https://huggingface.co/datasets/Rendra8631/aime24-tr.

  9. h

    math-correctness-classifier_64rollouts

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ralami, math-correctness-classifier_64rollouts [Dataset]. https://huggingface.co/datasets/RedaAlami/math-correctness-classifier_64rollouts
    Explore at:
    Authors
    ralami
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    RedaAlami/math-correctness-classifier_64rollouts

      Dataset Description
    

    This dataset contains mathematical reasoning problems and model responses formatted for training correctness classifiers. Each record includes a problem statement, a model's solution attempt, and a binary label indicating correctness. The dataset spans three benchmarks:

    AIME 2024: American Invitational Mathematics Examination 2024 AIME 2025: American Invitational Mathematics Examination 2025
    AMO:… See the full description on the dataset page: https://huggingface.co/datasets/RedaAlami/math-correctness-classifier_64rollouts.

  10. h

    OpenR1-Math-220k_decontaminated

    • huggingface.co
    Updated Mar 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paul Martin (2025). OpenR1-Math-220k_decontaminated [Dataset]. https://huggingface.co/datasets/notpaulmartin/OpenR1-Math-220k_decontaminated
    Explore at:
    Dataset updated
    Mar 30, 2025
    Authors
    Paul Martin
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    OpenR1-Math-220k_decontaminated

    Decontaminated version of open-r1/OpenR1-Math-220k - default/train

      Decontamination
    

    Removed any questions that have an 8-gram overlap with common benchmarks: AIME 2024, AIME 2025, MATH500, GPQA Diamond, LiveCodeBench Code Generation Lite Used GitHub:huggingface/open-r1/scripts/decontaminate.py with all defaults following https://github.com/huggingface/open-r1#data-decontamination

    python scripts/decontaminate.py
    --dataset… See the full description on the dataset page: https://huggingface.co/datasets/notpaulmartin/OpenR1-Math-220k_decontaminated.

  11. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Minghui Jia (2024). AIME_2024 [Dataset]. https://huggingface.co/datasets/Maxwell-Jia/AIME_2024

AIME_2024

AIME 2024 Dataset

Maxwell-Jia/AIME_2024

Explore at:
155 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 6, 2024
Authors
Minghui Jia
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

AIME 2024 Dataset

  Dataset Description

This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2024. AIME is a prestigious high school mathematics competition known for its challenging mathematical problems.

  Dataset Details

Format: JSONL Size: 30 records Source: AIME 2024 I & II Language: English

  Data Fields

Each record contains the following fields:

ID: Problem identifier (e.g., "2024-I-1" represents Problem 1… See the full description on the dataset page: https://huggingface.co/datasets/Maxwell-Jia/AIME_2024.

Search
Clear search
Close search
Google apps
Main menu