MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for GSM8K
Dataset Summary
GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning.
These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the… See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Calc-gsm8k
Summary
This dataset is an instance of gsm8k dataset, converted to a simple html-like language that can be easily parsed (e.g. by BeautifulSoup). The data contains 3 types of tags:
gadget: A tag whose content is intended to be evaluated by calling an external tool (sympy-based calculator in this case) output: An output of the external tool result: The final answer to the mathematical problem (a number)
Supported Tasks
The… See the full description on the dataset page: https://huggingface.co/datasets/MU-NLPC/Calc-gsm8k.
GSM8K is a dataset of 8.5K high quality linguistically diverse grade school math word problems created by human problem writers. The dataset is segmented into 7.5K training problems and 1K test problems. These problems take between 2 and 8 steps to solve, and solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the final answer. A bright middle school student should be able to solve every problem. It can be used for multi-step mathematical reasoning.
https://choosealicense.com/licenses/llama3.1/https://choosealicense.com/licenses/llama3.1/
gretelai/gsm8k-synthetic-diverse-8b
This dataset is a synthetically generated version inspired by the GSM8K https://huggingface.co/datasets/openai/gsm8k dataset, created entirely using Gretel Navigator with meta-llama/Meta-Llama-3.1-8B as the agent LLM. It contains ~1500 Grade School-level math word problems with step-by-step solutions, focusing on age group, difficulty, and domain diversity.
Key Features:
Synthetically Generated: Math problems created using Gretel… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/gsm8k-synthetic-diverse-8b.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
GSM8K (Fixed)
Some erroneous labels exist in the GSM8K dataset. This dataset is fixed from https://github.com/openai/grade-school-math/blob/master/grade_school_math/data/train.jsonl with the code appended at the end. The errors are located by delving into unreasonably low pass rates by the strong DeepSeekMath-7B-RL and hopefully should be exhaustive. This dataset is used by the 🎯DART-Math project to synthesize data.
[!WARNING] ⚠️ Only the training set has been fixed so far.
for… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/gsm8k-fix.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Summary
This is the harder version of gsm8k math reasoning dataset (https://huggingface.co/datasets/gsm8k). We construct this dataset by replacing the numbers in the questions of GSM8K with larger numbers that are less common.
Supported Tasks and Leaderboards
This dataset is used to evaluate math reasoning
Languages
English - Numbers
Dataset Structure
dataset = load_dataset("reasoning-machines/gsm-hard") DatasetDict({ train: Dataset({… See the full description on the dataset page: https://huggingface.co/datasets/reasoning-machines/gsm-hard.
gohsyi/gsm8k dataset hosted on Hugging Face and contributed by the HF Datasets community
https://choosealicense.com/licenses/llama3.1/https://choosealicense.com/licenses/llama3.1/
Image generated by DALL-E. See prompt for more details
gretelai/synthetic-gsm8k-reflection-405b
This dataset is a synthetically generated version inspired by the GSM8K dataset, created entirely using Gretel Navigator with meta-llama/Meta-Llama-3.1-405B as the agent LLM. It contains Grade School-level reasoning tasks with step-by-step reflections and solutions, focusing on multi-step reasoning problems.
Key Features for AI Developers:
Synthetic Data Generation:… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/gretel-math-gsm8k-v1.
lilac/GSM8K-main
This dataset is a Lilac processed dataset. Original dataset: https://huggingface.co/datasets/gsm8k To download the dataset to a local directory: lilac download lilacai/lilac-GSM8K-main
or from python with: ll.download("lilacai/lilac-GSM8K-main")
DaertML/gsm8k-jsonl dataset hosted on Hugging Face and contributed by the HF Datasets community
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for GSM8K-Platinum
🏆 Homepage | 📣 Blog | 🖥️ Code | 📖 Paper | 🔍 Error Viewer
Dataset Summary
GSM8K-Platinum is a revised version of the full test set of GSM8K (Grade School Math 8K), a dataset of grade school math word problems, providing a more accurate assessment of mathematical reasoning capabilities To revise this dataset, we ran a variety of frontier models each individual example and manually examined any example for which at least one… See the full description on the dataset page: https://huggingface.co/datasets/madrylab/gsm8k-platinum.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
A dataset containing mathematical problem-solving traces with step-by-step solutions and improvement history. Each record includes a mathematical problem, its final solution, and the iterative improvement process.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for GSM8K-Prolog
Dataset Summary
This is the Prolog annotated version of the GSM8K math reasoning dataset. We used the same dataset splits and questions in GSM8K and prompted GPT-4 to generate the Prolog programs to solve the questions. We then manually corrected some malfunctioning samples.
Supported Tasks and Leaderboards
This dataset can be used to train language models to generate Prolog codes in order to solve math questions and evaluate the… See the full description on the dataset page: https://huggingface.co/datasets/Thomas-X-Yang/gsm8k-prolog.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
dtrejopizzo/gsm8k-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
Instella-GSM8K-synthetic
The Instella-GSM8K-synthetic dataset was used in the second stage pre-training of Instella-3B model, which was trained on top of the Instella-3B-Stage1 model. This synthetic dataset was generated using the training set of GSM8k dataset, where we first used Qwen2.5-72B-Instruct to
Abstract numerical values as function parameters and generate a Python program to solve the math question. Identify and replace numerical values in the existing question with… See the full description on the dataset page: https://huggingface.co/datasets/amd/Instella-GSM8K-synthetic.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
GSM8K-Consistency Benchmark
GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.
🚀 The dataset is available on 🤗 Hugging Face!
This is a math-problem-related semantics-preserving perturbation benchmark that can be very helpful for evaluating the consistency of arithmetic reasoning capability.
💻 Dataset Usage
Run the following command to load the data: from datasets import load_dataset
dataset =… See the full description on the dataset page: https://huggingface.co/datasets/shuyuej/GSM8K-Consistency.
GSM8K Dataset with IDs
This is an enhanced version of the GSM8K dataset. Unique deterministic IDs have been added for each example using a hash of the question and answer.
Features
ID: A deterministic 8-character hash generated from the question + answer. Ensures ID uniqueness within each split.
Usage
Load the dataset directly from the Hub: from datasets import load_dataset
datasetdict = load_dataset("epfl-dlab/gsm8k")
print(datasetdict["train"][0]) #… See the full description on the dataset page: https://huggingface.co/datasets/epfl-dlab/gsm8k.
RLHFlow/Mistral-GSM8K-Test dataset hosted on Hugging Face and contributed by the HF Datasets community
rawsh/mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1 dataset hosted on Hugging Face and contributed by the HF Datasets community
https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
OpenMath GSM8K Masked
We release a masked version of the GSM8K solutions. This data can be used to aid synthetic generation of additional solutions for GSM8K dataset as it is much less likely to lead to inconsistent reasoning compared to using the original solutions directly. This dataset was used to construct OpenMathInstruct-1: a math instruction tuning dataset with 1.8M problem-solution pairs generated using permissively licensed Mixtral-8x7B model. For details of how the masked… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/OpenMath-GSM8K-masked.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for GSM8K
Dataset Summary
GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning.
These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the… See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.