100+ datasets found
  1. gsm8k

    • huggingface.co
    Updated Aug 11, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenAI (2022). gsm8k [Dataset]. https://huggingface.co/datasets/openai/gsm8k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 11, 2022
    Dataset authored and provided by
    OpenAIhttps://openai.com/
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for GSM8K

      Dataset Summary
    

    GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning.

    These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the… See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.

  2. h

    Calc-gsm8k

    • huggingface.co
    Updated Oct 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NLP Centre, Faculty of Informatics, Masaryk University (2023). Calc-gsm8k [Dataset]. https://huggingface.co/datasets/MU-NLPC/Calc-gsm8k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 27, 2023
    Dataset authored and provided by
    NLP Centre, Faculty of Informatics, Masaryk University
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for Calc-gsm8k

      Summary
    

    This dataset is an instance of gsm8k dataset, converted to a simple html-like language that can be easily parsed (e.g. by BeautifulSoup). The data contains 3 types of tags:

    gadget: A tag whose content is intended to be evaluated by calling an external tool (sympy-based calculator in this case) output: An output of the external tool result: The final answer to the mathematical problem (a number)

      Supported Tasks
    

    The… See the full description on the dataset page: https://huggingface.co/datasets/MU-NLPC/Calc-gsm8k.

  3. P

    GSM8K Dataset

    • paperswithcode.com
    • tensorflow.org
    • +2more
    Updated Dec 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karl Cobbe; Vineet Kosaraju; Mohammad Bavarian; Mark Chen; Heewoo Jun; Lukasz Kaiser; Matthias Plappert; Jerry Tworek; Jacob Hilton; Reiichiro Nakano; Christopher Hesse; John Schulman (2024). GSM8K Dataset [Dataset]. https://paperswithcode.com/dataset/gsm8k
    Explore at:
    Dataset updated
    Dec 31, 2024
    Authors
    Karl Cobbe; Vineet Kosaraju; Mohammad Bavarian; Mark Chen; Heewoo Jun; Lukasz Kaiser; Matthias Plappert; Jerry Tworek; Jacob Hilton; Reiichiro Nakano; Christopher Hesse; John Schulman
    Description

    GSM8K is a dataset of 8.5K high quality linguistically diverse grade school math word problems created by human problem writers. The dataset is segmented into 7.5K training problems and 1K test problems. These problems take between 2 and 8 steps to solve, and solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the final answer. A bright middle school student should be able to solve every problem. It can be used for multi-step mathematical reasoning.

  4. h

    gsm8k-synthetic-diverse-8b

    • huggingface.co
    Updated Oct 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gretel.ai (2024). gsm8k-synthetic-diverse-8b [Dataset]. https://huggingface.co/datasets/gretelai/gsm8k-synthetic-diverse-8b
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 29, 2024
    Dataset provided by
    Gretel.ai
    License

    https://choosealicense.com/licenses/llama3.1/https://choosealicense.com/licenses/llama3.1/

    Description

    gretelai/gsm8k-synthetic-diverse-8b

    This dataset is a synthetically generated version inspired by the GSM8K https://huggingface.co/datasets/openai/gsm8k dataset, created entirely using Gretel Navigator with meta-llama/Meta-Llama-3.1-8B as the agent LLM. It contains ~1500 Grade School-level math word problems with step-by-step solutions, focusing on age group, difficulty, and domain diversity.

      Key Features:
    

    Synthetically Generated: Math problems created using Gretel… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/gsm8k-synthetic-diverse-8b.

  5. h

    gsm8k-fix

    • huggingface.co
    Updated Feb 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2025). gsm8k-fix [Dataset]. https://huggingface.co/datasets/hkust-nlp/gsm8k-fix
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2025
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    GSM8K (Fixed)

    Some erroneous labels exist in the GSM8K dataset. This dataset is fixed from https://github.com/openai/grade-school-math/blob/master/grade_school_math/data/train.jsonl with the code appended at the end. The errors are located by delving into unreasonably low pass rates by the strong DeepSeekMath-7B-RL and hopefully should be exhaustive. This dataset is used by the 🎯DART-Math project to synthesize data.

    [!WARNING] ⚠️ Only the training set has been fixed so far.

    for… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/gsm8k-fix.

  6. h

    gsm-hard

    • huggingface.co
    Updated Apr 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reasoning Machines (2023). gsm-hard [Dataset]. https://huggingface.co/datasets/reasoning-machines/gsm-hard
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 9, 2023
    Dataset authored and provided by
    Reasoning Machines
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Summary

    This is the harder version of gsm8k math reasoning dataset (https://huggingface.co/datasets/gsm8k). We construct this dataset by replacing the numbers in the questions of GSM8K with larger numbers that are less common.

      Supported Tasks and Leaderboards
    

    This dataset is used to evaluate math reasoning

      Languages
    

    English - Numbers

      Dataset Structure
    

    dataset = load_dataset("reasoning-machines/gsm-hard") DatasetDict({ train: Dataset({… See the full description on the dataset page: https://huggingface.co/datasets/reasoning-machines/gsm-hard.

  7. h

    gsm8k

    • huggingface.co
    Updated Sep 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hongyi Guo (2024). gsm8k [Dataset]. https://huggingface.co/datasets/gohsyi/gsm8k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Authors
    Hongyi Guo
    Description

    gohsyi/gsm8k dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    gretel-math-gsm8k-v1

    • huggingface.co
    Updated Sep 30, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gretel.ai (2024). gretel-math-gsm8k-v1 [Dataset]. https://huggingface.co/datasets/gretelai/gretel-math-gsm8k-v1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 30, 2024
    Dataset provided by
    Gretel.ai
    License

    https://choosealicense.com/licenses/llama3.1/https://choosealicense.com/licenses/llama3.1/

    Description

    Image generated by DALL-E. See prompt for more details

      gretelai/synthetic-gsm8k-reflection-405b
    

    This dataset is a synthetically generated version inspired by the GSM8K dataset, created entirely using Gretel Navigator with meta-llama/Meta-Llama-3.1-405B as the agent LLM. It contains Grade School-level reasoning tasks with step-by-step reflections and solutions, focusing on multi-step reasoning problems.

      Key Features for AI Developers:
    

    Synthetic Data Generation:… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/gretel-math-gsm8k-v1.

  9. h

    lilac-GSM8K-main

    • huggingface.co
    Updated Jan 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lilac AI (2024). lilac-GSM8K-main [Dataset]. https://huggingface.co/datasets/lilacai/lilac-GSM8K-main
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 25, 2024
    Dataset authored and provided by
    Lilac AI
    Description

    lilac/GSM8K-main

    This dataset is a Lilac processed dataset. Original dataset: https://huggingface.co/datasets/gsm8k To download the dataset to a local directory: lilac download lilacai/lilac-GSM8K-main

    or from python with: ll.download("lilacai/lilac-GSM8K-main")

  10. h

    gsm8k-jsonl

    • huggingface.co
    Updated Aug 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DaertML (2022). gsm8k-jsonl [Dataset]. https://huggingface.co/datasets/DaertML/gsm8k-jsonl
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 11, 2022
    Authors
    DaertML
    Description

    DaertML/gsm8k-jsonl dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. h

    gsm8k-platinum

    • huggingface.co
    Updated Mar 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Madry Lab (2025). gsm8k-platinum [Dataset]. https://huggingface.co/datasets/madrylab/gsm8k-platinum
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 6, 2025
    Dataset authored and provided by
    Madry Lab
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for GSM8K-Platinum

    🏆 Homepage | 📣 Blog | 🖥️ Code | 📖 Paper | 🔍 Error Viewer

      Dataset Summary
    

    GSM8K-Platinum is a revised version of the full test set of GSM8K (Grade School Math 8K), a dataset of grade school math word problems, providing a more accurate assessment of mathematical reasoning capabilities To revise this dataset, we ran a variety of frontier models each individual example and manually examined any example for which at least one… See the full description on the dataset page: https://huggingface.co/datasets/madrylab/gsm8k-platinum.

  12. h

    gsm8k_distilled

    • huggingface.co
    Updated Feb 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CAMEL-AI.org (2025). gsm8k_distilled [Dataset]. https://huggingface.co/datasets/camel-ai/gsm8k_distilled
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 2, 2025
    Dataset provided by
    CAMEL-AI.org
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    A dataset containing mathematical problem-solving traces with step-by-step solutions and improvement history. Each record includes a mathematical problem, its final solution, and the iterative improvement process.

  13. h

    gsm8k-prolog

    • huggingface.co
    Updated Sep 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaocheng Yang (2023). gsm8k-prolog [Dataset]. https://huggingface.co/datasets/Thomas-X-Yang/gsm8k-prolog
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 9, 2023
    Authors
    Xiaocheng Yang
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for GSM8K-Prolog

      Dataset Summary
    

    This is the Prolog annotated version of the GSM8K math reasoning dataset. We used the same dataset splits and questions in GSM8K and prompted GPT-4 to generate the Prolog programs to solve the questions. We then manually corrected some malfunctioning samples.

      Supported Tasks and Leaderboards
    

    This dataset can be used to train language models to generate Prolog codes in order to solve math questions and evaluate the… See the full description on the dataset page: https://huggingface.co/datasets/Thomas-X-Yang/gsm8k-prolog.

  14. h

    gsm8k-dataset

    • huggingface.co
    Updated Oct 19, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David Alejandro Trejo Pizzo (2023). gsm8k-dataset [Dataset]. https://huggingface.co/datasets/dtrejopizzo/gsm8k-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 19, 2023
    Authors
    David Alejandro Trejo Pizzo
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    dtrejopizzo/gsm8k-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    Instella-GSM8K-synthetic

    • huggingface.co
    Updated Jun 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AMD (2025). Instella-GSM8K-synthetic [Dataset]. https://huggingface.co/datasets/amd/Instella-GSM8K-synthetic
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 16, 2025
    Dataset authored and provided by
    AMD
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Instella-GSM8K-synthetic

    The Instella-GSM8K-synthetic dataset was used in the second stage pre-training of Instella-3B model, which was trained on top of the Instella-3B-Stage1 model. This synthetic dataset was generated using the training set of GSM8k dataset, where we first used Qwen2.5-72B-Instruct to

    Abstract numerical values as function parameters and generate a Python program to solve the math question. Identify and replace numerical values in the existing question with… See the full description on the dataset page: https://huggingface.co/datasets/amd/Instella-GSM8K-synthetic.

  16. h

    GSM8K-Consistency

    • huggingface.co
    Updated Feb 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shuyue Jia (Bruce) (2024). GSM8K-Consistency [Dataset]. https://huggingface.co/datasets/shuyuej/GSM8K-Consistency
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 8, 2024
    Authors
    Shuyue Jia (Bruce)
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    GSM8K-Consistency Benchmark

    GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.

      🚀 The dataset is available on 🤗 Hugging Face!
    

    This is a math-problem-related semantics-preserving perturbation benchmark that can be very helpful for evaluating the consistency of arithmetic reasoning capability.

      💻 Dataset Usage
    

    Run the following command to load the data: from datasets import load_dataset

    dataset =… See the full description on the dataset page: https://huggingface.co/datasets/shuyuej/GSM8K-Consistency.

  17. h

    gsm8k

    • huggingface.co
    Updated Aug 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    EPFL Data Science Lab (2022). gsm8k [Dataset]. https://huggingface.co/datasets/epfl-dlab/gsm8k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 11, 2022
    Dataset authored and provided by
    EPFL Data Science Lab
    Description

    GSM8K Dataset with IDs

    This is an enhanced version of the GSM8K dataset. Unique deterministic IDs have been added for each example using a hash of the question and answer.

      Features
    

    ID: A deterministic 8-character hash generated from the question + answer. Ensures ID uniqueness within each split.

      Usage
    

    Load the dataset directly from the Hub: from datasets import load_dataset

    datasetdict = load_dataset("epfl-dlab/gsm8k")

    print(datasetdict["train"][0]) #… See the full description on the dataset page: https://huggingface.co/datasets/epfl-dlab/gsm8k.

  18. h

    Mistral-GSM8K-Test

    • huggingface.co
    Updated Nov 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RLHFlow (2024). Mistral-GSM8K-Test [Dataset]. https://huggingface.co/datasets/RLHFlow/Mistral-GSM8K-Test
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 9, 2024
    Dataset authored and provided by
    RLHFlow
    Description

    RLHFlow/Mistral-GSM8K-Test dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1

    • huggingface.co
    Updated Nov 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    robert washbourne (2024). mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1 [Dataset]. https://huggingface.co/datasets/rawsh/mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 20, 2024
    Authors
    robert washbourne
    Description

    rawsh/mirrorqwen2.5-0.5B-gsm8k-policy-data-ST-1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. OpenMath-GSM8K-masked

    • huggingface.co
    Updated Jul 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NVIDIA (2025). OpenMath-GSM8K-masked [Dataset]. https://huggingface.co/datasets/nvidia/OpenMath-GSM8K-masked
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 11, 2025
    Dataset provided by
    Nvidiahttp://nvidia.com/
    Authors
    NVIDIA
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    OpenMath GSM8K Masked

    We release a masked version of the GSM8K solutions. This data can be used to aid synthetic generation of additional solutions for GSM8K dataset as it is much less likely to lead to inconsistent reasoning compared to using the original solutions directly. This dataset was used to construct OpenMathInstruct-1: a math instruction tuning dataset with 1.8M problem-solution pairs generated using permissively licensed Mixtral-8x7B model. For details of how the masked… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/OpenMath-GSM8K-masked.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
OpenAI (2022). gsm8k [Dataset]. https://huggingface.co/datasets/openai/gsm8k
Organization logo

gsm8k

openai/gsm8k

Grade School Math 8K

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 11, 2022
Dataset authored and provided by
OpenAIhttps://openai.com/
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

Dataset Card for GSM8K

  Dataset Summary

GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning.

These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the… See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.

Search
Clear search
Close search
Google apps
Main menu