8 datasets found

aqua_rat
huggingface.co
Updated Jan 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deepmind (2022). aqua_rat [Dataset]. https://huggingface.co/datasets/deepmind/aqua_rat
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 23, 2022
Dataset provided by
DeepMindhttp://deepmind.com/
Authors
Deepmind
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset Card for AQUA-RAT

Dataset Summary

A large-scale dataset consisting of approximately 100,000 algebraic word problems. The solution to each question is explained step-by-step using natural language. This data is used to train a program generation model that learns to generate the explanation, while generating the program that solves the question.

Supported Tasks and Leaderboards Languages

en

Dataset Structure Data Instances… See the full description on the dataset page: https://huggingface.co/datasets/deepmind/aqua_rat.
h
Calc-aqua_rat
huggingface.co
Updated Apr 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NLP Centre, Faculty of Informatics, Masaryk University (2023). Calc-aqua_rat [Dataset]. https://huggingface.co/datasets/MU-NLPC/Calc-aqua_rat
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 20, 2023
Dataset authored and provided by
NLP Centre, Faculty of Informatics, Masaryk University
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset Card for Calc-aqua_rat

Summary

This dataset is an instance of AQuA-RAT dataset extended with in-context calls of a sympy calculator.

Supported Tasks

The dataset is intended for training Chain-of-Thought reasoning models able to use external tools to enhance the factuality of their responses. This dataset presents in-context scenarios where models can outsource the computations in the reasoning chain to a calculator.

Construction Process

The… See the full description on the dataset page: https://huggingface.co/datasets/MU-NLPC/Calc-aqua_rat.
h
aqua-rat-mcqa
huggingface.co
Updated Jun 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rico ibañez (2025). aqua-rat-mcqa [Dataset]. https://huggingface.co/datasets/RikoteMaster/aqua-rat-mcqa
Explore at:
Dataset updated
Jun 12, 2025
Authors
Rico ibañez
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
AQUA-RAT MCQA Dataset

This dataset contains the AQUA-RAT dataset converted to Multiple Choice Question Answering (MCQA) format with modifications.

Dataset Description

AQUA-RAT is a dataset of algebraic word problems with rationales. This version has been processed to:

Remove all questions where the correct answer was option "E" (5th choice) Remove the "E" option from all remaining questions (4 choices: A, B, C, D) Merge validation and test splits into a single test split… See the full description on the dataset page: https://huggingface.co/datasets/RikoteMaster/aqua-rat-mcqa.
vietnamese-aqua-rat
kaggle.com
Updated Nov 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Việt Hưng Nguyễn (2023). vietnamese-aqua-rat [Dataset]. https://www.kaggle.com/datasets/hungsvdut/vietnamese-aqua-rat/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 26, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Việt Hưng Nguyễn
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Việt Hưng Nguyễn

Released under Apache 2.0

Contents
h
aqua-rat
huggingface.co
Updated Jun 23, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Laurentiu Petrea (2024). aqua-rat [Dataset]. https://huggingface.co/datasets/laurentiubp/aqua-rat
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 23, 2024
Authors
Laurentiu Petrea
Description
laurentiubp/aqua-rat dataset hosted on Hugging Face and contributed by the HF Datasets community
math_qa
huggingface.co
opendatalab.com
Updated May 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ai2 (2024). math_qa [Dataset]. https://huggingface.co/datasets/allenai/math_qa
Explore at:
Dataset updated
May 29, 2024
Dataset provided by
Allen Institute for AIhttp://allenai.org/
Authors
Ai2
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Our dataset is gathered by using a new representation language to annotate over the AQuA-RAT dataset. AQuA-RAT has provided the questions, options, rationale, and the correct options.
h
MNLP_M2_mcqa_dataset
huggingface.co
Updated May 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nicolas Gonzalez (2025). MNLP_M2_mcqa_dataset [Dataset]. https://huggingface.co/datasets/NicoHelemon/MNLP_M2_mcqa_dataset
Explore at:
Dataset updated
May 31, 2025
Authors
Nicolas Gonzalez
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
MNLP M2 MCQA Dataset

A unified multiple-choice question answering (MCQA) benchmark on STEM subjects combining samples from OpenBookQA, SciQ, MMLU-auxiliary, AQUA-Rat, and MedMCQA.

Dataset Summary

This dataset merges five existing science and knowledge-based MCQA datasets into one standardized format:

Source Train samples

OpenBookQA 4 900

SciQ 10 000

MMLU-aux 85 100

AQUA-Rat 50 000

MedMCQA 50 000

Total 200 000

Supported Tasks and… See the full description on the dataset page: https://huggingface.co/datasets/NicoHelemon/MNLP_M2_mcqa_dataset.
h
Judgement-baseline
huggingface.co
Updated May 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sleeping AI (2025). Judgement-baseline [Dataset]. https://huggingface.co/datasets/sleeping-ai/Judgement-baseline
Explore at:
Dataset updated
May 9, 2025
Dataset authored and provided by
Sleeping AI
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Model Name

Params

MMLU-Pro-Plus Baseline Drop MMLU-Pro Baseline Drop Added Exp MMLU Pro Plus Added MMLU-redux 2.0 Baseline Drop AQUA-RAT Baseline Drop

CohereLabs/c4ai-command-a-03-2025 111B ✅ (single inference) ✅ done ✅ (HF naive batch) ✅ done ✅ done

✅

-

-

-

google/gemma-3-12b-it 12B ✅ (HF naive batch) ✅ done ✅ (HF naive batch) ✅ done ✅ done

✅

-

-

-

meta-llama/Llama-4-Scout-17B-16E 17B ✅ (HF naive batch) ✅ done ✅ (HF naive batch) ✅ done ✅ done

✅

-

-

-

Qwen/Qwen3-4B 4B… See the full description on the dataset page: https://huggingface.co/datasets/sleeping-ai/Judgement-baseline.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Deepmind (2022). aqua_rat [Dataset]. https://huggingface.co/datasets/deepmind/aqua_rat

aqua_rat

deepmind/aqua_rat

Algebra Question Answering with Rationales

Explore at:

39 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jan 23, 2022

Dataset provided by

DeepMindhttp://deepmind.com/

Authors

Deepmind

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Dataset Card for AQUA-RAT

  Dataset Summary

A large-scale dataset consisting of approximately 100,000 algebraic word problems. The solution to each question is explained step-by-step using natural language. This data is used to train a program generation model that learns to generate the explanation, while generating the program that solves the question.

  Supported Tasks and Leaderboards





  Languages

  Dataset Structure





  Data Instances… See the full description on the dataset page: https://huggingface.co/datasets/deepmind/aqua_rat.

Clear search

Close search

Google apps

Main menu

aqua_rat

Calc-aqua_rat

aqua-rat-mcqa

vietnamese-aqua-rat

Dataset

Contents

aqua-rat

math_qa

MNLP_M2_mcqa_dataset

Judgement-baseline

Params

✅

✅

✅

aqua_rat

deepmind/aqua_rat

Algebra Question Answering with Rationales