42 datasets found

aime_2024
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hugging Face H4, aime_2024 [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/aime_2024
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset provided by
Hugging Facehttps://huggingface.co/
Authors
Hugging Face H4
Description
Dataset card for AIME 2024

This dataset consists of 30 problems from the 2024 AIME I and AIME II tests. The original source is AI-MO/aimo-validation-aime, which contains a larger set of 90 problems from AIME 2022-2024.
h
AIME-2024
huggingface.co
Updated May 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
BytedTsinghua-SIA (2025). AIME-2024 [Dataset]. https://huggingface.co/datasets/BytedTsinghua-SIA/AIME-2024
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 30, 2025
Dataset authored and provided by
BytedTsinghua-SIA
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
[!IMPORTANT] Why this dataset is duplicated: This dataset actually repeats the AIME 2024 dataset for 32 times to help calculate metrics like Best-of-32. How we are trying to fix: verl is supporting specifying sampling times for validation and we will fix it asap.
h
DAPO-AIME-2024
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
PE-NLP, DAPO-AIME-2024 [Dataset]. https://huggingface.co/datasets/pe-nlp/DAPO-AIME-2024
Explore at:
Dataset authored and provided by
PE-NLP
Description
pe-nlp/DAPO-AIME-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
aime-2024-long-rl
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Perflow-Shuai, aime-2024-long-rl [Dataset]. https://huggingface.co/datasets/Perflow-Shuai/aime-2024-long-rl
Explore at:
Dataset authored and provided by
Perflow-Shuai
Description
Perflow-Shuai/aime-2024-long-rl dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AIME-2024
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Han, AIME-2024 [Dataset]. https://huggingface.co/datasets/fingertap/AIME-2024
Explore at:
Authors
Han
Description
fingertap/AIME-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community
Major AI models, by math and computational reasoning
statista.com
tokrwards.com
Updated Mar 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Major AI models, by math and computational reasoning [Dataset]. https://www.statista.com/statistics/1600812/ai-math-benchmarking-ranking/
Explore at:
Dataset updated
Mar 14, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2025
Area covered
Worldwide
Description
In 2024, the artificial analysis math index ranked AI models based on their mathematical reasoning using benchmarks like AIME 2024 and Math-500. o1, QwQ-32B, and DeepSeek R1, led the rankings, showing the highest proficiency in mathematical problem solving.
h
aime-2015-2024
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CMU Artificial Intelligence and Reinforcement Learning (AIRe) Lab, aime-2015-2024 [Dataset]. https://huggingface.co/datasets/CMU-AIRe/aime-2015-2024
Explore at:
Dataset authored and provided by
CMU Artificial Intelligence and Reinforcement Learning (AIRe) Lab
Description
CMU-AIRe/aime-2015-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community
a
Math Index by GPT-4o Endpoint
artificialanalysis.ai
Updated Dec 28, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artificial Analysis (2024). Math Index by GPT-4o Endpoint [Dataset]. https://artificialanalysis.ai/models/gpt-4o-mini-realtime-dec-2024
Explore at:
Dataset updated
Dec 28, 2024
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Represents the average of math benchmarks in the Artificial Analysis Intelligence Index (AIME) by Model
h
aime-2024-modified
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
R Abhiram, aime-2024-modified [Dataset]. https://huggingface.co/datasets/Abhiram1009/aime-2024-modified
Explore at:
Authors
R Abhiram
Description
Abhiram1009/aime-2024-modified dataset hosted on Hugging Face and contributed by the HF Datasets community
a
Intelligence Index by GPT-4o Endpoint
artificialanalysis.ai
Updated Dec 28, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artificial Analysis (2024). Intelligence Index by GPT-4o Endpoint [Dataset]. https://artificialanalysis.ai/models/gpt-4o-mini-realtime-dec-2024
Explore at:
Dataset updated
Dec 28, 2024
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Artificial Analysis Intelligence Index v2.2 incorporates 8 evaluations: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME, IFBench, AA-LCR by Model
h
AIME-2024-2025
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ge Yi, AIME-2024-2025 [Dataset]. https://huggingface.co/datasets/GY2233/AIME-2024-2025
Explore at:
Authors
Ge Yi
Description
GY2233/AIME-2024-2025 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AIME-2024-Ko-Translated
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dohyung Kim, AIME-2024-Ko-Translated [Dataset]. https://huggingface.co/datasets/werty1248/AIME-2024-Ko-Translated
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Dohyung Kim
Description
Original Dataset: HuggingFaceH4/aime_2024 Translator: gemini-2.0-flash
a
Lac Nairne, QC - Aug 3, 2024 - Drone Flight Paths
hub.arcgis.com
community-esrica-apps.hub.arcgis.com
+1more
Updated Aug 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Western University (2024). Lac Nairne, QC - Aug 3, 2024 - Drone Flight Paths [Dataset]. https://hub.arcgis.com/datasets/e36112969b2e40f692b028503fdf0234
Explore at:
Dataset updated
Aug 12, 2024
Dataset authored and provided by
Western University
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Area covered

Description
Flight paths of drone surveys used to capture imagery and video for the August 3, 2024, Lac Nairne, QC downburst. Ground survey conducted August 7, 2024. DJI Mavic 3E performed four flights. Please note drones are also used for scouting the initial area of interest using a live view on the controller, meaning that some flight paths may not be associated with any imagery. View survey summary map here
a
Lac Nairne, QC - Aug 3, 2024 - Drone Photos
elsalvador-westernu.opendata.arcgis.com
ntpopendata-westernu.opendata.arcgis.com
+1more
Updated Aug 8, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Western University (2024). Lac Nairne, QC - Aug 3, 2024 - Drone Photos [Dataset]. https://elsalvador-westernu.opendata.arcgis.com/items/2e4c0af18f60400885d9dd68b3aed74b
Explore at:
Dataset updated
Aug 8, 2024
Dataset authored and provided by
Western University
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Area covered

Description
Additional photos collected via drone for the August 3, 2024, Lac Nairne, QC downbust. Ground survey conducted August 7, 2024. DJI Mavic 3E used to capture 34 photos. Does not include videos or drone mapping photos [where applicable].
h
DAPO-AIME-2024
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hong Yi, DAPO-AIME-2024 [Dataset]. https://huggingface.co/datasets/LuyiCui/DAPO-AIME-2024
Explore at:
Authors
Hong Yi
Description
LuyiCui/DAPO-AIME-2024 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
aime_2024_II
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MathArena, aime_2024_II [Dataset]. https://huggingface.co/datasets/MathArena/aime_2024_II
Explore at:
Dataset authored and provided by
MathArena
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Homepage and repository

Homepage: https://matharena.ai/ Repository: https://github.com/eth-sri/matharena

Dataset Summary

This dataset contains the questions from AIME II 2024 used for the MathArena Leaderboard

Data Fields

Below one can find the description of each field in the dataset.

problem_idx (int): Index of the problem in the competition problem (str): Full problem statement answer (str): Ground-truth answer to the question

Source Data

The… See the full description on the dataset page: https://huggingface.co/datasets/MathArena/aime_2024_II.
a
Pricing by GPT-4o Endpoint
artificialanalysis.ai
Updated May 13, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artificial Analysis (2025). Pricing by GPT-4o Endpoint [Dataset]. https://artificialanalysis.ai/models/gpt-4o
Explore at:
Dataset updated
May 13, 2024
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Cost (USD) to run all evaluations in the Artificial Analysis Intelligence Index by Model
Data from: A Blood Dataset from Camelyon 17
zenodo.org
produccioncientifica.ugr.es
png, zip
Updated Aug 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fernando Pérez-Bueno; Fernando Pérez-Bueno; Kjersti Engan; Kjersti Engan; Rafael Molina Soriano; Rafael Molina Soriano (2024). A Blood Dataset from Camelyon 17 [Dataset]. http://doi.org/10.5281/zenodo.11268269
Explore at:
png, zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.11268269
Dataset updated
Aug 23, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Fernando Pérez-Bueno; Fernando Pérez-Bueno; Kjersti Engan; Kjersti Engan; Rafael Molina Soriano; Rafael Molina Soriano
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This dataset is a subset of the Camelyon-17 Breast Cancer Challenge. It contains 224x224 H&E histological image patches where blood has been detected. It was originally sampled to validate the blood detection capabilities of the method presented in [1]. Blood was manually identified by a trained technician.

If you use this dataset, please cite:

Pérez-Bueno, F., Engan, K., Molina, R. (2024). Robust blind color deconvolution and blood detection on histological images using Bayesian K-SVD. In: Journal of Artificial Intelligence in Medicine. https://doi.org/10.1016/j.artmed.2024.102969 [bibtex]

Pérez-Bueno, F., Engan, K., Molina, R. (2023). A Robust BKSVD Method for Blind Color Deconvolution and Blood Detection on H&E Histological Images. In: Artificial Intelligence in Medicine. AIME 2023, vol 13897. https://doi.org/10.1007/978-3-031-34344-5_25 [bibtex]

and the original publication for the Camelyon-17 Challenge (see details on the challenge website)

Summary:

25 images from center_0

7786 tissue patches

527 blood patches

104 patches of other artifacts (such as blur, folded tissue, image borders, cauterized, etc. Not labeled)

The folder structure is as follows:

center/image_id/pathology_label/patch_label/

pathology_label can take the following values:

annotated: the patch comes from a tumor annotated region (see details in Camelyon-17 Challenge)

no_annotated: the patch comes from a non-tumor slide (negative stage label)

unknown: the patch comes from a slide with a tumor stage label which is not annotated.

patch_label can take the following values:

tissue: no blood or less than ~25% of blood

blood: more than ~25% blood

other: the patch has a significant amount (>~25%) of pixels that are nor tissue nor blood.

Patches are sampled at the maximum resolution available 40x, and the filename includes the starting pixel in the x and y dimension. For the original .tiff images at high quality, please refer to the Camelyon-17 Challenge.

The license for this dataset is CC0 following the Camelyon-17 license.
m
Assistenti di riunioni alimentate dall'intelligenza ad AIME Dimensioni del...
marketresearchintellect.com
Updated Jul 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Intellect (2024). Assistenti di riunioni alimentate dall'intelligenza ad AIME Dimensioni del settore, Share & Growth Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/it/product/ai-powered-meeting-assistants-market/
Explore at:
Dataset updated
Jul 29, 2024
Dataset authored and provided by
Market Research Intellect
License
https://www.marketresearchintellect.com/it/privacy-policyhttps://www.marketresearchintellect.com/it/privacy-policy
Area covered
Global
Description
Ulteriori informazioni sulla relazione sul mercato degli assistenti alle riunioni alimentati dall'intelligenza artificiale da parte di un intelletto di ricerca di mercato, che si è attestato a 1,2 miliardi di dollari nel 2024 e si prevede che si espanda a 3,4 miliardi di USD entro il 2033, crescendo a un CAGR del 15,4%. Scopri come nuove strategie, in aumento degli investimenti e dei migliori giocatori stanno modellando il futuro.
h
AIME-1983-2024-Qwen3-8B
huggingface.co
Updated Sep 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Amir Mohseni (2025). AIME-1983-2024-Qwen3-8B [Dataset]. https://huggingface.co/datasets/AmirMohseni/AIME-1983-2024-Qwen3-8B
Explore at:
Dataset updated
Sep 25, 2025
Authors
Amir Mohseni
Description
Qwen3-8B AIME Reasoning vs No-Reasoning Dataset (Router Edition)

TL;DR – 933 American Invitational Mathematics Examination (AIME) problems (1983 – 2024) paired with answers generated by Qwen3-8B in two modes:Reasoning off (« no_think ») and Reasoning on (« think »).Each example is auto-verified and labelled with the winner policy used by our routing experiments.

Dataset Summary

This dataset was created for the Router Project, a line of research that investigates… See the full description on the dataset page: https://huggingface.co/datasets/AmirMohseni/AIME-1983-2024-Qwen3-8B.

Facebook

Twitter

Click to copy link

Link copied

Cite

Hugging Face H4, aime_2024 [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/aime_2024

aime_2024

HuggingFaceH4/aime_2024

Explore at:

76 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset provided by

Hugging Facehttps://huggingface.co/

Authors

Hugging Face H4

Description

Dataset card for AIME 2024

This dataset consists of 30 problems from the 2024 AIME I and AIME II tests. The original source is AI-MO/aimo-validation-aime, which contains a larger set of 90 problems from AIME 2022-2024.

Clear search

Close search

Google apps

Main menu

aime_2024

AIME-2024

DAPO-AIME-2024

aime-2024-long-rl

AIME-2024

Major AI models, by math and computational reasoning

aime-2015-2024

Math Index by GPT-4o Endpoint

aime-2024-modified

Intelligence Index by GPT-4o Endpoint

AIME-2024-2025

AIME-2024-Ko-Translated

Lac Nairne, QC - Aug 3, 2024 - Drone Flight Paths

Lac Nairne, QC - Aug 3, 2024 - Drone Photos

DAPO-AIME-2024

aime_2024_II

Pricing by GPT-4o Endpoint

Data from: A Blood Dataset from Camelyon 17

Summary:

Assistenti di riunioni alimentate dall'intelligenza ad AIME Dimensioni del...

AIME-1983-2024-Qwen3-8B

aime_2024

HuggingFaceH4/aime_2024