100+ datasets found

gsm8k
huggingface.co
Updated Aug 11, 2022
Cite
OpenAI (2022). gsm8k [Dataset]. https://huggingface.co/datasets/openai/gsm8k
Dataset updated
Aug 11, 2022
Dataset authored and provided by
OpenAI (https://openai.com/)
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Dataset Card for GSM8K

Dataset Summary

GSM8K (Grade School Math 8K) is a dataset of 8.5K high-quality, linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning.

These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − × ÷) to reach the… See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.
Calc-gsm8k
huggingface.co
Updated Oct 27, 2023
Cite
NLP Centre, Faculty of Informatics, Masaryk University (2023). Calc-gsm8k [Dataset]. https://huggingface.co/datasets/MU-NLPC/Calc-gsm8k
Dataset updated
Oct 27, 2023
Dataset authored and provided by
NLP Centre, Faculty of Informatics, Masaryk University
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Dataset Card for Calc-gsm8k

Summary

This dataset is an instance of the gsm8k dataset, converted to a simple HTML-like language that can be easily parsed (e.g. by BeautifulSoup). The data contains 3 types of tags:

gadget: A tag whose content is intended to be evaluated by calling an external tool (a sympy-based calculator in this case)
output: An output of the external tool
result: The final answer to the mathematical problem (a number)
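The three tags described above can be pulled out of a record with any HTML parser. The following is a minimal sketch that uses Python's standard-library `html.parser` instead of BeautifulSoup so that it runs without extra dependencies. The sample string, the `id="calculator"` attribute, and the helper names `CalcTagParser` and `parse_chain` are illustrative assumptions, not taken from the dataset card.

```python
from html.parser import HTMLParser


class CalcTagParser(HTMLParser):
    """Collect the text inside <gadget>, <output>, and <result> tags."""

    TAGS = {"gadget", "output", "result"}

    def __init__(self):
        super().__init__()
        self.current = None   # tag we are currently inside, if any
        self.found = []       # list of (tag, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag in self.TAGS:
            self.current = tag
            self.found.append((tag, ""))

    def handle_data(self, data):
        # Text outside the three known tags is ignored.
        if self.current is not None:
            tag, text = self.found[-1]
            self.found[-1] = (tag, text + data)

    def handle_endtag(self, tag):
        if tag == self.current:
            self.current = None


def parse_chain(chain):
    """Return the (tag, text) pairs found in one reasoning chain."""
    parser = CalcTagParser()
    parser.feed(chain)
    parser.close()
    return [(tag, text.strip()) for tag, text in parser.found]


# Hypothetical record, written by hand for illustration only.
example = (
    '<gadget id="calculator">48 / 2</gadget>'
    "<output>24</output>"
    "<result>24</result>"
)
print(parse_chain(example))
# [('gadget', '48 / 2'), ('output', '24'), ('result', '24')]
```

In this sketch the `gadget` text is only extracted, not evaluated; calling the external calculator is left out.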

Supported Tasks

The… See the full description on the dataset page: https://huggingface.co/datasets/MU-NLPC/Calc-gsm8k.
GSM8K Dataset
paperswithcode.com
tensorflow.org
+2 more
Updated Dec 31, 2024
Cite
Karl Cobbe; Vineet Kosaraju; Mohammad Bavarian; Mark Chen; Heewoo Jun; Lukasz Kaiser; Matthias Plappert; Jerry Tworek; Jacob Hilton; Reiichiro Nakano; Christopher Hesse; John Schulman (2024). GSM8K Dataset [Dataset]. https://paperswithcode.com/dataset/gsm8k
Dataset updated
Dec 31, 2024
Authors
Karl Cobbe; Vineet Kosaraju; Mohammad Bavarian; Mark Chen; Heewoo Jun; Lukasz Kaiser; Matthias Plappert; Jerry Tworek; Jacob Hilton; Reiichiro Nakano; Christopher Hesse; John Schulman
Description
GSM8K is a dataset of 8.5K high-quality, linguistically diverse grade school math word problems created by human problem writers. The dataset is segmented into 7.5K training problems and 1K test problems. These problems take between 2 and 8 steps to solve, and solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − × ÷) to reach the final answer. A bright middle school student should be able to solve every problem. It can be used for multi-step mathematical reasoning.
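Because each solution is a sequence of calculation steps that ends in a final answer, evaluation code usually needs to separate that answer from the reasoning. The sketch below assumes the answer-string convention of the original release, in which the final answer follows a `####` marker on the last line; that convention is not stated in this listing, and the sample solution and the helper name `extract_final_answer` are made up for illustration.

```python
import re


def extract_final_answer(solution):
    """Return the number after the '####' marker as a string, or None."""
    match = re.search(r"####\s*(-?[\d,]*\.?\d+)", solution)
    if match is None:
        return None
    # Drop thousands separators so "1,234" and "1234" compare equal.
    return match.group(1).replace(",", "")


# Hand-written sample in the assumed format, not a real dataset record.
sample = (
    "Half of 48 is 48 / 2 = 24.\n"
    "Together that is 48 + 24 = 72.\n"
    "#### 72"
)
print(extract_final_answer(sample))  # 72
```

Returning a string rather than a float avoids rounding surprises when comparing against reference answers; convert only when a numeric comparison is needed.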
gsm8k-synthetic-diverse-8b
huggingface.co
Updated Oct 29, 2024
Cite
Gretel.ai (2024). gsm8k-synthetic-diverse-8b [Dataset]. https://huggingface.co/datasets/gretelai/gsm8k-synthetic-diverse-8b
Dataset updated
Oct 29, 2024
Dataset provided by
Gretel.ai
License
https://choosealicense.com/licenses/llama3.1/
Description
gretelai/gsm8k-synthetic-diverse-8b

This dataset is a synthetically generated version inspired by the GSM8K dataset (https://huggingface.co/datasets/openai/gsm8k), created entirely using Gretel Navigator with meta-llama/Meta-Llama-3.1-8B as the agent LLM. It contains ~1500 grade-school-level math word problems with step-by-step solutions, focusing on age group, difficulty, and domain diversity.

Key Features:

Synthetically Generated: Math problems created using Gretel… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/gsm8k-synthetic-diverse-8b.
gsm8k-fix
huggingface.co
Updated Feb 19, 2025
Cite
HKUST NLP Group (2025). gsm8k-fix [Dataset]. https://huggingface.co/datasets/hkust-nlp/gsm8k-fix
Dataset updated
Feb 19, 2025
Dataset authored and provided by
HKUST NLP Group
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
GSM8K (Fixed)

Some erroneous labels exist in the GSM8K dataset. This dataset is a corrected version of https://github.com/openai/grade-school-math/blob/master/grade_school_math/data/train.jsonl, produced with the code appended at the end. The errors were located by examining problems with unreasonably low pass rates under the strong DeepSeekMath-7B-RL model, and the list of fixes should hopefully be exhaustive. This dataset is used by the 🎯DART-Math project to synthesize data.
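The idea of locating suspect labels through low pass rates can be sketched as follows. This is not the project's actual code: the input layout (a mapping from example ID to a list of per-sample correctness flags), the function name `low_pass_rate_ids`, and the threshold value are all assumptions chosen for illustration.

```python
def low_pass_rate_ids(samples, threshold=0.2):
    """Flag examples whose pass rate falls below `threshold`.

    samples: dict mapping an example ID to a list of booleans, one per
    sampled model solution (True means it matched the reference label).
    Returns (example_id, pass_rate) pairs, lowest pass rate first.
    """
    flagged = []
    for example_id, outcomes in samples.items():
        if not outcomes:
            continue  # no samples, so no pass rate to judge
        rate = sum(outcomes) / len(outcomes)
        if rate < threshold:
            flagged.append((example_id, rate))
    return sorted(flagged, key=lambda item: item[1])


# Made-up results for three examples, eight samples each.
results = {
    "a": [True] * 8,
    "b": [False] * 8,
    "c": [False] * 7 + [True],
}
print(low_pass_rate_ids(results))  # [('b', 0.0), ('c', 0.125)]
```

A low pass rate only marks an example as a candidate; a hard but correctly labeled problem would be flagged too, so flagged items still need manual review.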

Warning: Only the training set has been fixed so far.

for… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/gsm8k-fix.
gsm-hard
huggingface.co
Updated Apr 9, 2023
Cite
Reasoning Machines (2023). gsm-hard [Dataset]. https://huggingface.co/datasets/reasoning-machines/gsm-hard
Dataset updated
Apr 9, 2023
Dataset authored and provided by
Reasoning Machines
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Dataset Summary

This is a harder version of the gsm8k math reasoning dataset (https://huggingface.co/datasets/gsm8k). We construct this dataset by replacing the numbers in the questions of GSM8K with larger numbers that are less common.
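The number-replacement step can be illustrated with a short sketch. It is an assumption-laden illustration rather than the authors' procedure: the function name `enlarge_numbers`, the value range, and the seeded random generator are hypothetical, and the sketch does not recompute the answer, which a real pipeline would have to do.

```python
import random
import re


def enlarge_numbers(question, seed=0, low=100_000, high=9_999_999):
    """Replace every run of digits in `question` with a larger random integer."""
    rng = random.Random(seed)  # seeded so the rewrite is reproducible
    return re.sub(r"\d+", lambda m: str(rng.randint(low, high)), question)


# Hand-written question for illustration only.
question = "Tom has 3 apples and buys 12 more. How many apples does he have?"
print(enlarge_numbers(question, seed=1))
```

Note that the pattern treats the two sides of a decimal such as 2.5 as separate integers, so questions with decimals would need a more careful pattern.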

Supported Tasks and Leaderboards

This dataset is used to evaluate math reasoning.

Languages

English - Numbers

Dataset Structure

dataset = load_dataset("reasoning-machines/gsm-hard")
DatasetDict({ train: Dataset({… See the full description on the dataset page: https://huggingface.co/datasets/reasoning-machines/gsm-hard.
gsm8k
huggingface.co
Updated Sep 12, 2024
Cite
Hongyi Guo (2024). gsm8k [Dataset]. https://huggingface.co/datasets/gohsyi/gsm8k
Dataset updated
Sep 12, 2024
Authors
Hongyi Guo
Description
gohsyi/gsm8k dataset hosted on Hugging Face and contributed by the HF Datasets community
gretel-math-gsm8k-v1
huggingface.co
Updated Sep 30, 2024
Cite
Gretel.ai (2024). gretel-math-gsm8k-v1 [Dataset]. https://huggingface.co/datasets/gretelai/gretel-math-gsm8k-v1
Dataset updated
Sep 30, 2024
Dataset provided by
Gretel.ai
License
https://choosealicense.com/licenses/llama3.1/
Description
gretelai/synthetic-gsm8k-reflection-405b

This dataset is a synthetically generated version inspired by the GSM8K dataset, created entirely using Gretel Navigator with meta-llama/Meta-Llama-3.1-405B as the agent LLM. It contains Grade School-level reasoning tasks with step-by-step reflections and solutions, focusing on multi-step reasoning problems.

Key Features for AI Developers:

Synthetic Data Generation:… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/gretel-math-gsm8k-v1.
lilac-GSM8K-main
huggingface.co
Updated Jan 25, 2024
Cite
Lilac AI (2024). lilac-GSM8K-main [Dataset]. https://huggingface.co/datasets/lilacai/lilac-GSM8K-main
Dataset updated
Jan 25, 2024
Dataset authored and provided by
Lilac AI
Description
lilac/GSM8K-main

This dataset is a Lilac processed dataset. Original dataset: https://huggingface.co/datasets/gsm8k

To download the dataset to a local directory:

lilac download lilacai/lilac-GSM8K-main

or from Python with:

ll.download("lilacai/lilac-GSM8K-main")
gsm8k-jsonl
huggingface.co
Updated Aug 11, 2022
Cite
DaertML (2022). gsm8k-jsonl [Dataset]. https://huggingface.co/datasets/DaertML/gsm8k-jsonl
Dataset updated
Aug 11, 2022
Authors
DaertML
Description
DaertML/gsm8k-jsonl dataset hosted on Hugging Face and contributed by the HF Datasets community
gsm8k-platinum
huggingface.co
Updated Mar 6, 2025
Cite
Madry Lab (2025). gsm8k-platinum [Dataset]. https://huggingface.co/datasets/madrylab/gsm8k-platinum
Dataset updated
Mar 6, 2025
Dataset authored and provided by
Madry Lab
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Dataset Card for GSM8K-Platinum

Dataset Summary

GSM8K-Platinum is a revised version of the full test set of GSM8K (Grade School Math 8K), a dataset of grade school math word problems, providing a more accurate assessment of mathematical reasoning capabilities. To revise this dataset, we ran a variety of frontier models on each individual example and manually examined any example for which at least one… See the full description on the dataset page: https://huggingface.co/datasets/madrylab/gsm8k-platinum.
gsm8k_distilled
huggingface.co
Updated Feb 2, 2025
Cite
CAMEL-AI.org (2025). gsm8k_distilled [Dataset]. https://huggingface.co/datasets/camel-ai/gsm8k_distilled
Dataset updated
Feb 2, 2025
Dataset provided by
CAMEL-AI.org
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
A dataset containing mathematical problem-solving traces with step-by-step solutions and improvement history. Each record includes a mathematical problem, its final solution, and the iterative improvement process.
gsm8k-prolog
huggingface.co
Updated Sep 9, 2023
Cite
Xiaocheng Yang (2023). gsm8k-prolog [Dataset]. https://huggingface.co/datasets/Thomas-X-Yang/gsm8k-prolog
Dataset updated
Sep 9, 2023
Authors
Xiaocheng Yang
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Dataset Card for GSM8K-Prolog

Dataset Summary

This is the Prolog-annotated version of the GSM8K math reasoning dataset. We used the same dataset splits and questions as in GSM8K and prompted GPT-4 to generate Prolog programs that solve the questions. We then manually corrected some malfunctioning samples.

Supported Tasks and Leaderboards

This dataset can be used to train language models to generate Prolog code in order to solve math questions and evaluate the… See the full description on the dataset page: https://huggingface.co/datasets/Thomas-X-Yang/gsm8k-prolog.
gsm8k-dataset
huggingface.co
Updated Oct 19, 2023
Cite
David Alejandro Trejo Pizzo (2023). gsm8k-dataset [Dataset]. https://huggingface.co/datasets/dtrejopizzo/gsm8k-dataset
Dataset updated
Oct 19, 2023
Authors
David Alejandro Trejo Pizzo
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
dtrejopizzo/gsm8k-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
Instella-GSM8K-synthetic
huggingface.co
Updated Jun 16, 2025
Cite
AMD (2025). Instella-GSM8K-synthetic [Dataset]. https://huggingface.co/datasets/amd/Instella-GSM8K-synthetic
Dataset updated
Jun 16, 2025
Dataset authored and provided by
AMD
License
https://choosealicense.com/licenses/other/
Description
Instella-GSM8K-synthetic

The Instella-GSM8K-synthetic dataset was used in the second-stage pre-training of the Instella-3B model, which was trained on top of the Instella-3B-Stage1 model. This synthetic dataset was generated using the training set of the GSM8K dataset, where we first used Qwen2.5-72B-Instruct to:

Abstract numerical values as function parameters and generate a Python program to solve the math question.
Identify and replace numerical values in the existing question with… See the full description on the dataset page: https://huggingface.co/datasets/amd/Instella-GSM8K-synthetic.
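To make the first step concrete, here is a minimal sketch of what "abstracting numerical values as function parameters" could look like for a single problem. The question template, the parameter names, and the helpers `solve` and `make_variant` are hypothetical and hand-written; the dataset card's actual generated programs are not shown in this listing.

```python
# Hypothetical question with its numbers lifted out into named placeholders.
TEMPLATE = (
    "A box holds {pencils_per_box} pencils and each pencil costs "
    "{price_per_pencil} dollars. How much do {boxes} boxes cost?"
)


def solve(pencils_per_box, price_per_pencil, boxes):
    """Program form of the solution: the numbers are parameters, not constants."""
    return pencils_per_box * price_per_pencil * boxes


def make_variant(**values):
    """Fill the template with new numbers and compute the matching answer."""
    return TEMPLATE.format(**values), solve(**values)


question, answer = make_variant(pencils_per_box=12, price_per_pencil=2, boxes=3)
print(question)
print(answer)  # 72
```

Keeping the question text and the solver driven by the same parameter dictionary means every generated variant has an answer that is consistent with its numbers by construction.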
GSM8K-Consistency
huggingface.co
Updated Feb 8, 2024
Cite
Shuyue Jia (Bruce) (2024). GSM8K-Consistency [Dataset]. https://huggingface.co/datasets/shuyuej/GSM8K-Consistency
Dataset updated
Feb 8, 2024
Authors
Shuyue Jia (Bruce)
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
GSM8K-Consistency Benchmark

GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.

🚀 The dataset is available on 🤗 Hugging Face!

This is a math-problem-related semantics-preserving perturbation benchmark that can be very helpful for evaluating the consistency of arithmetic reasoning capability.

💻 Dataset Usage

Run the following command to load the data:

from datasets import load_dataset

dataset =… See the full description on the dataset page: https://huggingface.co/datasets/shuyuej/GSM8K-Consistency.
gsm8k
huggingface.co
Updated Aug 11, 2022
Cite
EPFL Data Science Lab (2022). gsm8k [Dataset]. https://huggingface.co/datasets/epfl-dlab/gsm8k
Dataset updated
Aug 11, 2022
Dataset authored and provided by
EPFL Data Science Lab
Description
GSM8K Dataset with IDs

This is an enhanced version of the GSM8K dataset. Unique deterministic IDs have been added for each example using a hash of the question and answer.

Features

ID: A deterministic 8-character hash generated from the question + answer. Ensures ID uniqueness within each split.
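A deterministic ID of this kind can be sketched with the standard library. The dataset card does not say which hash function or string encoding is used, so SHA-256, UTF-8, plain concatenation, and the helper name `make_id` below are assumptions; the IDs this sketch produces should not be expected to match the dataset's.

```python
import hashlib


def make_id(question, answer, length=8):
    """Derive a short deterministic ID from the question and answer text."""
    # Assumed scheme: SHA-256 over the concatenated UTF-8 text, truncated.
    digest = hashlib.sha256((question + answer).encode("utf-8")).hexdigest()
    return digest[:length]


def ids_are_unique(examples):
    """Check that no two (question, answer) pairs in a split share an ID."""
    ids = [make_id(q, a) for q, a in examples]
    return len(ids) == len(set(ids))


# Hand-written pair for illustration only.
example_id = make_id("What is 2 + 3?", "2 + 3 = 5\n#### 5")
print(example_id)
```

Truncating to 8 hexadecimal characters leaves 32 bits, so collisions are unlikely at this dataset size but not impossible, which is why a per-split uniqueness check like `ids_are_unique` is worth running.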

Usage

Load the dataset directly from the Hub:

from datasets import load_dataset

datasetdict = load_dataset("epfl-dlab/gsm8k")

print(datasetdict["train"][0]) #… See the full description on the dataset page: https://huggingface.co/datasets/epfl-dlab/gsm8k.
Mistral-GSM8K-Test
huggingface.co
Updated Nov 9, 2024
Cite
RLHFlow (2024). Mistral-GSM8K-Test [Dataset]. https://huggingface.co/datasets/RLHFlow/Mistral-GSM8K-Test
Dataset updated
Nov 9, 2024
Dataset authored and provided by
RLHFlow
Description
RLHFlow/Mistral-GSM8K-Test dataset hosted on Hugging Face and contributed by the HF Datasets community
mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1
huggingface.co
Updated Nov 20, 2024
Cite
robert washbourne (2024). mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1 [Dataset]. https://huggingface.co/datasets/rawsh/mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1
Dataset updated
Nov 20, 2024
Authors
robert washbourne
Description
rawsh/mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1 dataset hosted on Hugging Face and contributed by the HF Datasets community
OpenMath-GSM8K-masked
huggingface.co
Updated Jul 11, 2025
Cite
NVIDIA (2025). OpenMath-GSM8K-masked [Dataset]. https://huggingface.co/datasets/nvidia/OpenMath-GSM8K-masked
Dataset updated
Jul 11, 2025
Dataset provided by
Nvidia (http://nvidia.com/)
Authors
NVIDIA
License
https://choosealicense.com/licenses/other/
Description
OpenMath GSM8K Masked

We release a masked version of the GSM8K solutions. This data can be used to aid synthetic generation of additional solutions for the GSM8K dataset, as it is much less likely to lead to inconsistent reasoning compared to using the original solutions directly. This dataset was used to construct OpenMathInstruct-1: a math instruction tuning dataset with 1.8M problem-solution pairs generated using the permissively licensed Mixtral-8x7B model. For details of how the masked… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/OpenMath-GSM8K-masked.
