42 datasets found
  1. aime_2024

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face H4, aime_2024 [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/aime_2024
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face H4
    Description

    Dataset card for AIME 2024

    This dataset consists of 30 problems from the 2024 AIME I and AIME II tests. The original source is AI-MO/aimo-validation-aime, which contains a larger set of 90 problems from AIME 2022-2024.

  2. h

    AIME-2024

    • huggingface.co
    Updated May 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BytedTsinghua-SIA (2025). AIME-2024 [Dataset]. https://huggingface.co/datasets/BytedTsinghua-SIA/AIME-2024
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 30, 2025
    Dataset authored and provided by
    BytedTsinghua-SIA
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    [!IMPORTANT] Why this dataset is duplicated: This dataset actually repeats the AIME 2024 dataset for 32 times to help calculate metrics like Best-of-32. How we are trying to fix: verl is supporting specifying sampling times for validation and we will fix it asap.

  3. h

    DAPO-AIME-2024

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PE-NLP, DAPO-AIME-2024 [Dataset]. https://huggingface.co/datasets/pe-nlp/DAPO-AIME-2024
    Explore at:
    Dataset authored and provided by
    PE-NLP
    Description

    pe-nlp/DAPO-AIME-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    aime-2024-long-rl

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Perflow-Shuai, aime-2024-long-rl [Dataset]. https://huggingface.co/datasets/Perflow-Shuai/aime-2024-long-rl
    Explore at:
    Dataset authored and provided by
    Perflow-Shuai
    Description

    Perflow-Shuai/aime-2024-long-rl dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    AIME-2024

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Han, AIME-2024 [Dataset]. https://huggingface.co/datasets/fingertap/AIME-2024
    Explore at:
    Authors
    Han
    Description

    fingertap/AIME-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. Major AI models, by math and computational reasoning

    • statista.com
    • tokrwards.com
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Major AI models, by math and computational reasoning [Dataset]. https://www.statista.com/statistics/1600812/ai-math-benchmarking-ranking/
    Explore at:
    Dataset updated
    Mar 14, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2025
    Area covered
    Worldwide
    Description

    In 2024, the artificial analysis math index ranked AI models based on their mathematical reasoning using benchmarks like AIME 2024 and Math-500. o1, QwQ-32B, and DeepSeek R1, led the rankings, showing the highest proficiency in mathematical problem solving.

  7. h

    aime-2015-2024

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CMU Artificial Intelligence and Reinforcement Learning (AIRe) Lab, aime-2015-2024 [Dataset]. https://huggingface.co/datasets/CMU-AIRe/aime-2015-2024
    Explore at:
    Dataset authored and provided by
    CMU Artificial Intelligence and Reinforcement Learning (AIRe) Lab
    Description

    CMU-AIRe/aime-2015-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. a

    Math Index by GPT-4o Endpoint

    • artificialanalysis.ai
    Updated Dec 28, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artificial Analysis (2024). Math Index by GPT-4o Endpoint [Dataset]. https://artificialanalysis.ai/models/gpt-4o-mini-realtime-dec-2024
    Explore at:
    Dataset updated
    Dec 28, 2024
    Dataset authored and provided by
    Artificial Analysis
    Description

    Comparison of Represents the average of math benchmarks in the Artificial Analysis Intelligence Index (AIME) by Model

  9. h

    aime-2024-modified

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    R Abhiram, aime-2024-modified [Dataset]. https://huggingface.co/datasets/Abhiram1009/aime-2024-modified
    Explore at:
    Authors
    R Abhiram
    Description

    Abhiram1009/aime-2024-modified dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. a

    Intelligence Index by GPT-4o Endpoint

    • artificialanalysis.ai
    Updated Dec 28, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artificial Analysis (2024). Intelligence Index by GPT-4o Endpoint [Dataset]. https://artificialanalysis.ai/models/gpt-4o-mini-realtime-dec-2024
    Explore at:
    Dataset updated
    Dec 28, 2024
    Dataset authored and provided by
    Artificial Analysis
    Description

    Comparison of Artificial Analysis Intelligence Index v2.2 incorporates 8 evaluations: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME, IFBench, AA-LCR by Model

  11. h

    AIME-2024-2025

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ge Yi, AIME-2024-2025 [Dataset]. https://huggingface.co/datasets/GY2233/AIME-2024-2025
    Explore at:
    Authors
    Ge Yi
    Description

    GY2233/AIME-2024-2025 dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    AIME-2024-Ko-Translated

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dohyung Kim, AIME-2024-Ko-Translated [Dataset]. https://huggingface.co/datasets/werty1248/AIME-2024-Ko-Translated
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Dohyung Kim
    Description

    Original Dataset: HuggingFaceH4/aime_2024 Translator: gemini-2.0-flash

  13. a

    Lac Nairne, QC - Aug 3, 2024 - Drone Flight Paths

    • hub.arcgis.com
    • community-esrica-apps.hub.arcgis.com
    • +1more
    Updated Aug 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Western University (2024). Lac Nairne, QC - Aug 3, 2024 - Drone Flight Paths [Dataset]. https://hub.arcgis.com/datasets/e36112969b2e40f692b028503fdf0234
    Explore at:
    Dataset updated
    Aug 12, 2024
    Dataset authored and provided by
    Western University
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Area covered
    Description

    Flight paths of drone surveys used to capture imagery and video for the August 3, 2024, Lac Nairne, QC downburst. Ground survey conducted August 7, 2024. DJI Mavic 3E performed four flights. Please note drones are also used for scouting the initial area of interest using a live view on the controller, meaning that some flight paths may not be associated with any imagery. View survey summary map here

  14. a

    Lac Nairne, QC - Aug 3, 2024 - Drone Photos

    • elsalvador-westernu.opendata.arcgis.com
    • ntpopendata-westernu.opendata.arcgis.com
    • +1more
    Updated Aug 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Western University (2024). Lac Nairne, QC - Aug 3, 2024 - Drone Photos [Dataset]. https://elsalvador-westernu.opendata.arcgis.com/items/2e4c0af18f60400885d9dd68b3aed74b
    Explore at:
    Dataset updated
    Aug 8, 2024
    Dataset authored and provided by
    Western University
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Area covered
    Description

    Additional photos collected via drone for the August 3, 2024, Lac Nairne, QC downbust. Ground survey conducted August 7, 2024. DJI Mavic 3E used to capture 34 photos. Does not include videos or drone mapping photos [where applicable].

  15. h

    DAPO-AIME-2024

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hong Yi, DAPO-AIME-2024 [Dataset]. https://huggingface.co/datasets/LuyiCui/DAPO-AIME-2024
    Explore at:
    Authors
    Hong Yi
    Description

    LuyiCui/DAPO-AIME-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    aime_2024_II

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MathArena, aime_2024_II [Dataset]. https://huggingface.co/datasets/MathArena/aime_2024_II
    Explore at:
    Dataset authored and provided by
    MathArena
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Homepage and repository

    Homepage: https://matharena.ai/ Repository: https://github.com/eth-sri/matharena

      Dataset Summary
    

    This dataset contains the questions from AIME II 2024 used for the MathArena Leaderboard

      Data Fields
    

    Below one can find the description of each field in the dataset.

    problem_idx (int): Index of the problem in the competition problem (str): Full problem statement answer (str): Ground-truth answer to the question

      Source Data
    

    The… See the full description on the dataset page: https://huggingface.co/datasets/MathArena/aime_2024_II.

  17. a

    Pricing by GPT-4o Endpoint

    • artificialanalysis.ai
    Updated May 13, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artificial Analysis (2025). Pricing by GPT-4o Endpoint [Dataset]. https://artificialanalysis.ai/models/gpt-4o
    Explore at:
    Dataset updated
    May 13, 2024
    Dataset authored and provided by
    Artificial Analysis
    Description

    Comparison of Cost (USD) to run all evaluations in the Artificial Analysis Intelligence Index by Model

  18. Data from: A Blood Dataset from Camelyon 17

    • zenodo.org
    • produccioncientifica.ugr.es
    png, zip
    Updated Aug 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fernando Pérez-Bueno; Fernando Pérez-Bueno; Kjersti Engan; Kjersti Engan; Rafael Molina Soriano; Rafael Molina Soriano (2024). A Blood Dataset from Camelyon 17 [Dataset]. http://doi.org/10.5281/zenodo.11268269
    Explore at:
    png, zipAvailable download formats
    Dataset updated
    Aug 23, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Fernando Pérez-Bueno; Fernando Pérez-Bueno; Kjersti Engan; Kjersti Engan; Rafael Molina Soriano; Rafael Molina Soriano
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset is a subset of the Camelyon-17 Breast Cancer Challenge. It contains 224x224 H&E histological image patches where blood has been detected. It was originally sampled to validate the blood detection capabilities of the method presented in [1]. Blood was manually identified by a trained technician.

    If you use this dataset, please cite:

    Pérez-Bueno, F., Engan, K., Molina, R. (2024). Robust blind color deconvolution and blood detection on histological images using Bayesian K-SVD. In: Journal of Artificial Intelligence in Medicine. https://doi.org/10.1016/j.artmed.2024.102969 [bibtex]

    Pérez-Bueno, F., Engan, K., Molina, R. (2023). A Robust BKSVD Method for Blind Color Deconvolution and Blood Detection on H&E Histological Images. In: Artificial Intelligence in Medicine. AIME 2023, vol 13897. https://doi.org/10.1007/978-3-031-34344-5_25 [bibtex]

    and the original publication for the Camelyon-17 Challenge (see details on the challenge website)

    Summary:

    • 25 images from center_0
      • 7786 tissue patches
      • 527 blood patches
      • 104 patches of other artifacts (such as blur, folded tissue, image borders, cauterized, etc. Not labeled)

    The folder structure is as follows:

    center/image_id/pathology_label/patch_label/

    pathology_label can take the following values:

    • annotated: the patch comes from a tumor annotated region (see details in Camelyon-17 Challenge)
    • no_annotated: the patch comes from a non-tumor slide (negative stage label)
    • unknown: the patch comes from a slide with a tumor stage label which is not annotated.

    patch_label can take the following values:

    • tissue: no blood or less than ~25% of blood
    • blood: more than ~25% blood
    • other: the patch has a significant amount (>~25%) of pixels that are nor tissue nor blood.

    Patches are sampled at the maximum resolution available 40x, and the filename includes the starting pixel in the x and y dimension. For the original .tiff images at high quality, please refer to the Camelyon-17 Challenge.

    The license for this dataset is CC0 following the Camelyon-17 license.

  19. m

    Assistenti di riunioni alimentate dall'intelligenza ad AIME Dimensioni del...

    • marketresearchintellect.com
    Updated Jul 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2024). Assistenti di riunioni alimentate dall'intelligenza ad AIME Dimensioni del settore, Share & Growth Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/it/product/ai-powered-meeting-assistants-market/
    Explore at:
    Dataset updated
    Jul 29, 2024
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/it/privacy-policyhttps://www.marketresearchintellect.com/it/privacy-policy

    Area covered
    Global
    Description

    Ulteriori informazioni sulla relazione sul mercato degli assistenti alle riunioni alimentati dall'intelligenza artificiale da parte di un intelletto di ricerca di mercato, che si è attestato a 1,2 miliardi di dollari nel 2024 e si prevede che si espanda a 3,4 miliardi di USD entro il 2033, crescendo a un CAGR del 15,4%. Scopri come nuove strategie, in aumento degli investimenti e dei migliori giocatori stanno modellando il futuro.

  20. h

    AIME-1983-2024-Qwen3-8B

    • huggingface.co
    Updated Sep 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amir Mohseni (2025). AIME-1983-2024-Qwen3-8B [Dataset]. https://huggingface.co/datasets/AmirMohseni/AIME-1983-2024-Qwen3-8B
    Explore at:
    Dataset updated
    Sep 25, 2025
    Authors
    Amir Mohseni
    Description

    Qwen3-8B AIME Reasoning vs No-Reasoning Dataset (Router Edition)

    TL;DR – 933 American Invitational Mathematics Examination (AIME) problems (1983 – 2024) paired with answers generated by Qwen3-8B in two modes:Reasoning off (« no_think ») and Reasoning on (« think »).Each example is auto-verified and labelled with the winner policy used by our routing experiments.

      Dataset Summary
    

    This dataset was created for the Router Project, a line of research that investigates… See the full description on the dataset page: https://huggingface.co/datasets/AmirMohseni/AIME-1983-2024-Qwen3-8B.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hugging Face H4, aime_2024 [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/aime_2024
Organization logo

aime_2024

HuggingFaceH4/aime_2024

Explore at:
76 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset provided by
Hugging Facehttps://huggingface.co/
Authors
Hugging Face H4
Description

Dataset card for AIME 2024

This dataset consists of 30 problems from the 2024 AIME I and AIME II tests. The original source is AI-MO/aimo-validation-aime, which contains a larger set of 90 problems from AIME 2022-2024.

Search
Clear search
Close search
Google apps
Main menu