8 datasets found
  1. aqua_rat

    • huggingface.co
    Updated Jan 23, 2022
    Cite
    Deepmind (2022). aqua_rat [Dataset]. https://huggingface.co/datasets/deepmind/aqua_rat
    Dataset updated
    Jan 23, 2022
    Dataset provided by
    DeepMind (http://deepmind.com/)
    Authors
    Deepmind
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset Card for AQUA-RAT

      Dataset Summary
    

    A large-scale dataset consisting of approximately 100,000 algebraic word problems. The solution to each question is explained step-by-step using natural language. This data is used to train a program generation model that learns to generate the explanation, while generating the program that solves the question.

      Supported Tasks and Leaderboards
    
    
    
    
    
      Languages
    

    en

      Dataset Structure
    
    
    
    
    
      Data Instances… See the full description on the dataset page: https://huggingface.co/datasets/deepmind/aqua_rat.
    
  2. Calc-aqua_rat

    • huggingface.co
    Updated Apr 20, 2023
    Cite
    NLP Centre, Faculty of Informatics, Masaryk University (2023). Calc-aqua_rat [Dataset]. https://huggingface.co/datasets/MU-NLPC/Calc-aqua_rat
    Dataset updated
    Apr 20, 2023
    Dataset authored and provided by
    NLP Centre, Faculty of Informatics, Masaryk University
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset Card for Calc-aqua_rat

      Summary
    

    This dataset is an instance of the AQuA-RAT dataset extended with in-context calls of a sympy calculator.

      Supported Tasks
    

    The dataset is intended for training Chain-of-Thought reasoning models able to use external tools to enhance the factuality of their responses. This dataset presents in-context scenarios where models can outsource the computations in the reasoning chain to a calculator.

      Construction Process
    

    The… See the full description on the dataset page: https://huggingface.co/datasets/MU-NLPC/Calc-aqua_rat.

  3. aqua-rat-mcqa

    • huggingface.co
    Updated Jun 12, 2025
    Cite
    Rico ibañez (2025). aqua-rat-mcqa [Dataset]. https://huggingface.co/datasets/RikoteMaster/aqua-rat-mcqa
    Dataset updated
    Jun 12, 2025
    Authors
    Rico ibañez
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    AQUA-RAT MCQA Dataset

    This dataset contains the AQUA-RAT dataset converted to Multiple Choice Question Answering (MCQA) format with modifications.

      Dataset Description
    

    AQUA-RAT is a dataset of algebraic word problems with rationales. This version has been processed to:

    • Remove all questions where the correct answer was option "E" (5th choice)
    • Remove the "E" option from all remaining questions (4 choices: A, B, C, D)
    • Merge validation and test splits into a single test split…

    See the full description on the dataset page: https://huggingface.co/datasets/RikoteMaster/aqua-rat-mcqa.
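
The three processing steps listed above can be sketched roughly as follows. This is only an illustration and not the dataset author's actual script: the field names `options` and `correct`, the assumption that option "E" is always the fifth entry of `options`, and the helper names `drop_option_e` and `merge_into_test` are all hypothetical.

```python
from typing import Dict, List

# Hypothetical record layout (field names are assumptions, not confirmed by
# the dataset card): {"question": str, "options": [5 strings], "correct": "A".."E"}
Record = Dict[str, object]


def drop_option_e(records: List[Record]) -> List[Record]:
    """Drop questions answered by "E", then remove the fifth option."""
    kept = []
    for rec in records:
        if rec["correct"] == "E":
            continue  # step 1: discard questions whose correct answer is "E"
        # step 2: keep only the first four options (assumed to be A-D)
        kept.append({**rec, "options": list(rec["options"])[:4]})
    return kept


def merge_into_test(validation: List[Record], test: List[Record]) -> List[Record]:
    """Step 3: merge the validation and test splits into one test split."""
    return list(validation) + list(test)


# Made-up sample records, for illustration only.
sample = [
    {"question": "q1", "options": ["A)1", "B)2", "C)3", "D)4", "E)5"], "correct": "B"},
    {"question": "q2", "options": ["A)1", "B)2", "C)3", "D)4", "E)5"], "correct": "E"},
]
filtered = drop_option_e(sample)
```

Here `filtered` keeps only the first sample record, now with four options; the order in which the published dataset applied filtering and merging is not stated in the truncated description.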

  4. vietnamese-aqua-rat

    • kaggle.com
    Updated Nov 26, 2023
    Cite
    Việt Hưng Nguyễn (2023). vietnamese-aqua-rat [Dataset]. https://www.kaggle.com/datasets/hungsvdut/vietnamese-aqua-rat/code
    Dataset updated
    Nov 26, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Việt Hưng Nguyễn
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Việt Hưng Nguyễn

    Released under Apache 2.0

    Contents

  5. aqua-rat

    • huggingface.co
    Updated Jun 23, 2024
    Cite
    Laurentiu Petrea (2024). aqua-rat [Dataset]. https://huggingface.co/datasets/laurentiubp/aqua-rat
    Dataset updated
    Jun 23, 2024
    Authors
    Laurentiu Petrea
    Description

    laurentiubp/aqua-rat dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. math_qa

    • huggingface.co
    • opendatalab.com
    Updated May 29, 2024
    Cite
    Ai2 (2024). math_qa [Dataset]. https://huggingface.co/datasets/allenai/math_qa
    Dataset updated
    May 29, 2024
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Our dataset is gathered by using a new representation language to annotate over the AQuA-RAT dataset. AQuA-RAT has provided the questions, options, rationale, and the correct options.

  7. MNLP_M2_mcqa_dataset

    • huggingface.co
    Updated May 31, 2025
    Cite
    Nicolas Gonzalez (2025). MNLP_M2_mcqa_dataset [Dataset]. https://huggingface.co/datasets/NicoHelemon/MNLP_M2_mcqa_dataset
    Dataset updated
    May 31, 2025
    Authors
    Nicolas Gonzalez
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    MNLP M2 MCQA Dataset

    A unified multiple-choice question answering (MCQA) benchmark on STEM subjects combining samples from OpenBookQA, SciQ, MMLU-auxiliary, AQUA-Rat, and MedMCQA.

      Dataset Summary
    

    This dataset merges five existing science and knowledge-based MCQA datasets into one standardized format:

    Source        Train samples
    OpenBookQA    4 900
    SciQ          10 000
    MMLU-aux      85 100
    AQUA-Rat      50 000
    MedMCQA       50 000
    Total         200 000

      Supported Tasks and… See the full description on the dataset page: https://huggingface.co/datasets/NicoHelemon/MNLP_M2_mcqa_dataset.
    
  8. Judgement-baseline

    • huggingface.co
    Updated May 9, 2025
    Cite
    Sleeping AI (2025). Judgement-baseline [Dataset]. https://huggingface.co/datasets/sleeping-ai/Judgement-baseline
    Dataset updated
    May 9, 2025
    Dataset authored and provided by
    Sleeping AI
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Model Name

    Params

    MMLU-Pro-Plus Baseline Drop MMLU-Pro Baseline Drop Added Exp MMLU Pro Plus Added MMLU-redux 2.0 Baseline Drop AQUA-RAT Baseline Drop

    CohereLabs/c4ai-command-a-03-2025 111B ✅ (single inference) ✅ done ✅ (HF naive batch) ✅ done ✅ done

    -

    -

    -

    google/gemma-3-12b-it 12B ✅ (HF naive batch) ✅ done ✅ (HF naive batch) ✅ done ✅ done

    -

    -

    -

    meta-llama/Llama-4-Scout-17B-16E 17B ✅ (HF naive batch) ✅ done ✅ (HF naive batch) ✅ done ✅ done

    -

    -

    -

    Qwen/Qwen3-4B 4B… See the full description on the dataset page: https://huggingface.co/datasets/sleeping-ai/Judgement-baseline.


aqua_rat

deepmind/aqua_rat

Algebra Question Answering with Rationales

39 scholarly articles cite this dataset (View in Google Scholar)