31 datasets found
  1. AIME25

    • huggingface.co
    Updated Mar 21, 2025
    Cite
    Yu Wang (2025). AIME25 [Dataset]. https://huggingface.co/datasets/Wloner0809/AIME25
    Dataset updated
    Mar 21, 2025
    Authors
    Yu Wang
    Description

    Wloner0809/AIME25 dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. aime25

    • huggingface.co
    Updated Apr 18, 2025
    Cite
    Khiem Pham (2025). aime25 [Dataset]. https://huggingface.co/datasets/drproduck/aime25
    Dataset updated
    Apr 18, 2025
    Authors
    Khiem Pham
    Description

    drproduck/aime25 dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. aime25

    • huggingface.co
    Updated Apr 10, 2025
    Cite
    Zikang Shan (2025). aime25 [Dataset]. https://huggingface.co/datasets/zkshan2002/aime25
    Dataset updated
    Apr 10, 2025
    Authors
    Zikang Shan
    Description

    Source: math-ai/aime25
    Modification: remove redundant columns
    Keys: problem, answer
    Size: test: 30

  4. aime25

    • huggingface.co
    Updated Jun 1, 2025
    Cite
    Guanning Zeng (2025). aime25 [Dataset]. https://huggingface.co/datasets/guanning/aime25
    Dataset updated
    Jun 1, 2025
    Authors
    Guanning Zeng
    Description

    guanning/aime25 dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. aime25

    • huggingface.co
    Cite
    Loser Cheems, aime25 [Dataset]. https://huggingface.co/datasets/JingzeShi/aime25
    Authors
    Loser Cheems
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Dataset card for aime25

    This dataset was made with Curator.

      Dataset details
    

    A sample from the dataset: { "question": "Find the sum of all integer bases $b>9$ for which $17_{b}$ is a divisor of $97_{b}$.", "reasoning": "Okay, let's see. The problem is to find the sum of all integer bases b > 9 where 17_b divides 97_b. Hmm.

    First, I need to understand what 17_b and 97_b represent in base 10. In base b, the number 17_b would be 1*b + 7, right? Similarly, 97_b… See the full description on the dataset page: https://huggingface.co/datasets/JingzeShi/aime25.
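
The sample problem above becomes a plain divisibility check once both numerals are written in base 10: 17_b = b + 7 and 97_b = 9b + 7. The following is a minimal sketch, not part of the dataset, that confirms the answer by brute force. The function name `valid_bases` and the search limit of 1000 are assumptions made for illustration; the limit is safe because 9b + 7 = 9(b + 7) - 56, so b + 7 must divide 56 and no valid base exceeds 49.

```python
def valid_bases(limit: int = 1000) -> list[int]:
    """Return every base b with 9 < b <= limit for which 17_b divides 97_b."""
    bases = []
    for b in range(10, limit + 1):
        seventeen = 1 * b + 7      # value of 17_b in base 10
        ninety_seven = 9 * b + 7   # value of 97_b in base 10
        if ninety_seven % seventeen == 0:
            bases.append(b)
    return bases


if __name__ == "__main__":
    bases = valid_bases()
    print(bases, sum(bases))  # prints [21, 49] 70
```

The two bases found are 21 (28 divides 196) and 49 (56 divides 448), so the requested sum is 70.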

  6. aime25-o3mini

    • huggingface.co
    Updated May 27, 2025
    Cite
    Xinzhi Zhang (2025). aime25-o3mini [Dataset]. https://huggingface.co/datasets/flatlander1024/aime25-o3mini
    Dataset updated
    May 27, 2025
    Authors
    Xinzhi Zhang
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    flatlander1024/aime25-o3mini dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. AIME2025

    • huggingface.co
    Updated Feb 8, 2025
    Cite
    OpenCompass (2025). AIME2025 [Dataset]. https://huggingface.co/datasets/opencompass/AIME2025
    Dataset updated
    Feb 8, 2025
    Dataset authored and provided by
    OpenCompass
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    AIME 2025 Dataset

      Dataset Description
    

    This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2025-I & II.

  8. reasoning-aime25-nous

    • huggingface.co
    Updated Apr 3, 2025
    Cite
    Henry Broomfield (2025). reasoning-aime25-nous [Dataset]. https://huggingface.co/datasets/HennersBro98/reasoning-aime25-nous
    Dataset updated
    Apr 3, 2025
    Authors
    Henry Broomfield
    Description

    HennersBro98/reasoning-aime25-nous dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. AIME-25

    • huggingface.co
    Updated Jul 14, 2025
    Cite
    Prime Intellect (2025). AIME-25 [Dataset]. https://huggingface.co/datasets/PrimeIntellect/AIME-25
    Dataset updated
    Jul 14, 2025
    Authors
    Prime Intellect
    Description

    PrimeIntellect/AIME-25 dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. r1-qwen7b-aime25-n32

    • huggingface.co
    Cite
    Khiem Pham, r1-qwen7b-aime25-n32 [Dataset]. https://huggingface.co/datasets/drproduck/r1-qwen7b-aime25-n32
    Authors
    Khiem Pham
    Description

    Dataset Card for Dataset Name

    Generate 32 solutions for aime25 using deepseek-ai/DeepSeek-R1-Distill-Qwen-7B. This dataset card aims to be a base template for new datasets. It has been generated using this raw template.

      Dataset Details

      Dataset Description

    Columns:

    problem: str, original problem statement
    answer: str, ground truth answer
    completion: List[str], generations by the model
    prediction: List[str], prediction extracted from generation
    score: List[float], is… See the full description on the dataset page: https://huggingface.co/datasets/drproduck/r1-qwen7b-aime25-n32.
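
The column list above can be sketched as a typed record. This is a hedged illustration only: the description of `score` is truncated in the listing, so treating it as one correctness value per generation is an assumption, and the `Row` type, the `mean_score` helper, and the example row are hypothetical and not taken from the dataset.

```python
from typing import List, TypedDict


class Row(TypedDict):
    """One record, following the column names and types listed above."""
    problem: str            # original problem statement
    answer: str             # ground truth answer
    completion: List[str]   # generations by the model
    prediction: List[str]   # prediction extracted from each generation
    score: List[float]      # ASSUMPTION: one value per generation


def mean_score(row: Row) -> float:
    """Average the per-generation scores of a row (0.0 if there are none)."""
    scores = row["score"]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical row with two generations instead of the dataset's 32.
example: Row = {
    "problem": "What is 1 + 1?",
    "answer": "2",
    "completion": ["1 + 1 = 2, so the answer is 2.", "I think it is 3."],
    "prediction": ["2", "3"],
    "score": [1.0, 0.0],
}

if __name__ == "__main__":
    print(mean_score(example))
```

Under that assumption, averaging `mean_score` over all rows would give an estimate of single-sample accuracy for the model.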

  11. reasoning-aime25-deepscaler

    • huggingface.co
    Updated Apr 3, 2025
    Cite
    Henry Broomfield (2025). reasoning-aime25-deepscaler [Dataset]. https://huggingface.co/datasets/HennersBro98/reasoning-aime25-deepscaler
    Dataset updated
    Apr 3, 2025
    Authors
    Henry Broomfield
    Description

    HennersBro98/reasoning-aime25-deepscaler dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. reasoning-aime25-evaluation-system1

    • huggingface.co
    Updated Mar 19, 2025
    Cite
    Henry Broomfield (2025). reasoning-aime25-evaluation-system1 [Dataset]. https://huggingface.co/datasets/HennersBro98/reasoning-aime25-evaluation-system1
    Dataset updated
    Mar 19, 2025
    Authors
    Henry Broomfield
    Description

    HennersBro98/reasoning-aime25-evaluation-system1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. r1-qwen7b-awq-aime25-n32

    • huggingface.co
    Updated Mar 22, 2025
    Cite
    Khiem Pham (2025). r1-qwen7b-awq-aime25-n32 [Dataset]. https://huggingface.co/datasets/drproduck/r1-qwen7b-awq-aime25-n32
    Dataset updated
    Mar 22, 2025
    Authors
    Khiem Pham
    Description

    drproduck/r1-qwen7b-awq-aime25-n32 dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. AIME-22-25

    • huggingface.co
    Updated Apr 18, 2025
    Cite
    ChuGyouk (2025). AIME-22-25 [Dataset]. https://huggingface.co/datasets/ChuGyouk/AIME-22-25
    Dataset updated
    Apr 18, 2025
    Authors
    ChuGyouk
    Description

    AIME22-24 + AIME25-PartI + AIME25-PartII

  15. aime-25-genesys

    • huggingface.co
    Updated Jun 21, 2025
    Cite
    Justus Mattern (2025). aime-25-genesys [Dataset]. https://huggingface.co/datasets/justus27/aime-25-genesys
    Dataset updated
    Jun 21, 2025
    Authors
    Justus Mattern
    Description

    justus27/aime-25-genesys dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. distill-r1-qwen-1.5b-aime-25-budget

    • huggingface.co
    Updated Apr 29, 2025
    Cite
    Jonathan Chang (2025). distill-r1-qwen-1.5b-aime-25-budget [Dataset]. https://huggingface.co/datasets/jdchang/distill-r1-qwen-1.5b-aime-25-budget
    Dataset updated
    Apr 29, 2025
    Authors
    Jonathan Chang
    Description

    jdchang/distill-r1-qwen-1.5b-aime-25-budget dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. distill-r1-qwen-1.5b-aime-25-4096

    • huggingface.co
    Cite
    Jonathan Chang, distill-r1-qwen-1.5b-aime-25-4096 [Dataset]. https://huggingface.co/datasets/jdchang/distill-r1-qwen-1.5b-aime-25-4096
    Authors
    Jonathan Chang
    Description

    jdchang/distill-r1-qwen-1.5b-aime-25-4096 dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. distill-r1-qwen-1.5b-aime-25-4096-with-old-prm-indices_84480_92160

    • huggingface.co
    Updated May 11, 2025
    Cite
    Kaiwen Wang (2025). distill-r1-qwen-1.5b-aime-25-4096-with-old-prm-indices_84480_92160 [Dataset]. https://huggingface.co/datasets/kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-old-prm-indices_84480_92160
    Dataset updated
    May 11, 2025
    Authors
    Kaiwen Wang
    Description

    kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-old-prm-indices_84480_92160 dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. o3-2025-04-16_eval_5ed6

    • huggingface.co
    Updated Apr 16, 2025
    Cite
    ML Foundations Development (2025). o3-2025-04-16_eval_5ed6 [Dataset]. https://huggingface.co/datasets/mlfoundations-dev/o3-2025-04-16_eval_5ed6
    Dataset updated
    Apr 16, 2025
    Dataset authored and provided by
    ML Foundations Development
    Description

    o3-2025-04-16 Evaluation Results

    Precomputed model outputs for evaluation.

      Evaluation Results

      Summary

    Metric    AIME25  LiveCodeBenchv5  AMC23  MATH500  MMLUPro  JEEBench  GPQADiamond  LiveCodeBench  CodeElo  HLE
    Accuracy  70.3    66.8             97.5   86.0     38.2     86.2      80.0         79.2           35.2     22.8

      AIME25

    Average Accuracy: 70.3% ± 1.7%
    Number of Runs: 10

    Run  Accuracy  Questions Solved  Total Questions
    1    70.0%     21                30
    2    70.0%     21                30
    3    70.0%     21                30
    4    63.3%     19… See the full description on the dataset page: https://huggingface.co/datasets/mlfoundations-dev/o3-2025-04-16_eval_5ed6.
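
The per-run figures above follow directly from questions solved divided by total questions. A minimal sketch of that arithmetic, using only the four runs visible in this listing: the remaining six runs are truncated and are not reconstructed here, and the total of 30 questions for run 4 is an assumption carried over from runs 1 to 3. The names `solved_per_run` and `accuracy_percent` are illustrative.

```python
TOTAL_QUESTIONS = 30  # stated for runs 1-3; ASSUMED for run 4 (row is truncated)

# Questions solved in the runs visible in the listing (run number -> solved).
solved_per_run = {1: 21, 2: 21, 3: 21, 4: 19}


def accuracy_percent(solved: int, total: int = TOTAL_QUESTIONS) -> float:
    """Accuracy as a percentage, rounded to one decimal like the table."""
    return round(100.0 * solved / total, 1)


accuracies = {run: accuracy_percent(n) for run, n in solved_per_run.items()}

if __name__ == "__main__":
    for run, acc in accuracies.items():
        print(f"run {run}: {acc}%")
```

The reported 70.3% average covers all 10 runs, so it cannot be reproduced from these four rows alone.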

  20. gpt-4.1-2025-04-14_eval_5ed6

    • huggingface.co
    Updated Apr 14, 2025
    Cite
    ML Foundations Development (2025). gpt-4.1-2025-04-14_eval_5ed6 [Dataset]. https://huggingface.co/datasets/mlfoundations-dev/gpt-4.1-2025-04-14_eval_5ed6
    Dataset updated
    Apr 14, 2025
    Dataset authored and provided by
    ML Foundations Development
    Description

    gpt-4.1-2025-04-14 Evaluation Results

    Precomputed model outputs for evaluation.

      Evaluation Results

      Summary

    Metric    AIME25  LiveCodeBenchv5  AMC23  MATH500  MMLUPro  JEEBench  GPQADiamond  LiveCodeBench  CodeElo  HLE  AIME24
    Accuracy  33.0    46.6             83.2   83.6     30.8     78.3      34.5         65.6           31.1     7.2  0.0

      AIME25

    Average Accuracy: 33.0% ± 1.3%
    Number of Runs: 10

    Run  Accuracy  Questions Solved  Total Questions
    1    33.3%     10                30
    2    33.3%     10                30
    3    33.3%     10… See the full description on the dataset page: https://huggingface.co/datasets/mlfoundations-dev/gpt-4.1-2025-04-14_eval_5ed6.

Cite
Yu Wang (2025). AIME25 [Dataset]. https://huggingface.co/datasets/Wloner0809/AIME25

AIME25

Wloner0809/AIME25

136 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Mar 21, 2025
Authors
Yu Wang
Description

Wloner0809/AIME25 dataset hosted on Hugging Face and contributed by the HF Datasets community
