MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Mathematics Aptitude Test of Heuristics (MATH) dataset
Dataset Summary
The Mathematics Aptitude Test of Heuristics (MATH) dataset consists of problems from mathematics competitions, including the AMC 10, AMC 12, AIME, and more. Each problem in MATH has a full step-by-step solution, which can be used to teach models to generate answer derivations and explanations.
Supported Tasks and Leaderboards
[More Information Needed]
Languages… See the full description on the dataset page: https://huggingface.co/datasets/qwedsacf/competition_math.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for "competition_math"
Added column with final solution extracted from \boxed{} tags. Added numeric congig that only contains questions with numeric answers.
Dataset Summary
MATH contains 12,500 challenging competition mathematics problems. Each problem in MATH has a full step-by-step solution which can be used to teach models to generate answer derivations and explanation This dataset card aims to be a base template for new datasets.
Languages… See the full description on the dataset page: https://huggingface.co/datasets/jeggers/competition_math.
MickMick102/competition_math dataset hosted on Hugging Face and contributed by the HF Datasets community
jts-ai-team/competition_math dataset hosted on Hugging Face and contributed by the HF Datasets community
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
MATH Dataset
The Mathematics Aptitude Test of Heuristics (MATH) dataset consists of problems from mathematics competitions, including the AMC 10, AMC 12, AIME, and more. Each problem in MATH has a full step-by-step solution, which can be used to teach models to generate answer derivations and explanations. This is a converted version of the hendrycks/competition_math originally created by Hendrycks et al. The dataset has been converted to parquet format for easier loading and usage.… See the full description on the dataset page: https://huggingface.co/datasets/Maxwell-Jia/MATH.
Please refer to the following source for the original datasets:
GSM8K: https://huggingface.co/datasets/openai/gsm8k MATH: https://huggingface.co/datasets/hendrycks/competition_math math-resample: In this section we subsample the 1,000 subsample only (yes it's balance)
HumanEval+: https://huggingface.co/datasets/evalplus/humanevalplus MBPP: https://huggingface.co/datasets/google-research-datasets/mbpp MBPP+: https://huggingface.co/datasets/evalplus/mbppplus ARC Challenge:… See the full description on the dataset page: https://huggingface.co/datasets/appier-ai-research/robust-finetuning.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
OptiLLMBench Dataset
A benchmark dataset for evaluating test-time optimization and scaling capabilities of language models.
Dataset Description
OptiLLMBench contains 500 carefully selected challenging problems across multiple domains:
Mathematical reasoning (from competition_math) Code generation (from HumanEval) Word problems (from GSM8K) Multiple choice reasoning (from MMLU) Logical deduction (from BBH)
Each example is chosen to benefit from test-time optimization… See the full description on the dataset page: https://huggingface.co/datasets/codelion/optillmbench.
Modified with Russiand Translation of hendrycks/competition_math via Llama3
Not seeing a result you expected?
Learn how you can add new datasets to our index.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Mathematics Aptitude Test of Heuristics (MATH) dataset
Dataset Summary
The Mathematics Aptitude Test of Heuristics (MATH) dataset consists of problems from mathematics competitions, including the AMC 10, AMC 12, AIME, and more. Each problem in MATH has a full step-by-step solution, which can be used to teach models to generate answer derivations and explanations.
Supported Tasks and Leaderboards
[More Information Needed]
Languages… See the full description on the dataset page: https://huggingface.co/datasets/qwedsacf/competition_math.