Dataset Card for "winogrande"
Dataset Summary
WinoGrande is a new collection of 44k problems, inspired by the Winograd Schema Challenge (Levesque, Davis, and Morgenstern 2011), but adjusted to improve the scale and robustness against dataset-specific bias. Formulated as a fill-in-the-blank task with binary options, the goal is to choose the right option for a given sentence, which requires commonsense reasoning.
Supported Tasks and Leaderboards
More Information… See the full description on the dataset page: https://huggingface.co/datasets/allenai/winogrande.
WinoGrande is a large-scale dataset of 44k problems, inspired by the original Winograd Schema Challenge design but adjusted to improve both the scale and the hardness of the dataset.
To use this dataset:
import tensorflow_datasets as tfds

# Load the training split and print a few examples.
ds = tfds.load('winogrande', split='train')
for ex in ds.take(4):
    print(ex)
See the guide for more information on tensorflow_datasets.
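If you prefer the Hugging Face datasets library, the card above points to allenai/winogrande; the minimal sketch below assumes the winogrande_xl configuration (other size variants such as winogrande_xs/s/m/l and winogrande_debiased exist) and the standard sentence/option1/option2/answer fields described on that page.

from datasets import load_dataset

# Load the XL training configuration from the Hugging Face Hub.
ds = load_dataset("allenai/winogrande", "winogrande_xl", split="train")
for ex in ds.select(range(4)):
    print(ex["sentence"], ex["option1"], ex["option2"], ex["answer"])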
License: CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
Version 1.1 (Sep 16th, 2020)
./data/
├── train_[xs,s,m,l,xl].jsonl # training set with different sizes
├── train_[xs,s,m,l,xl]-labels.lst # answer labels for training sets
├── train_debiased.jsonl # debiased training set
├── train_debiased-labels.lst # answer labels for debiased training set
├── dev.jsonl # development set
├── dev-labels.lst # answer labels for development set
├── test.jsonl # test set
├── sample-submissions-labels.lst # example submission file for leaderboard
└── eval.py # evaluation script
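As a quick illustration of the layout above, here is a minimal sketch that pairs a training .jsonl file with its labels file; the field names (sentence, option1, option2) are assumed from the public release and the paths are illustrative.

import json

# Pair the XL training examples with their gold labels (one label per line).
with open("data/train_xl.jsonl") as f:
    examples = [json.loads(line) for line in f]
with open("data/train_xl-labels.lst") as f:
    labels = [line.strip() for line in f]

assert len(examples) == len(labels)
ex = examples[0]
# Each example gives a sentence containing a blank ("_") and two candidate fillers.
print(ex["sentence"], "|", ex["option1"], "vs.", ex["option2"], "| gold:", labels[0])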
You can use train_*.jsonl for training models and dev for validation.
Please note that labels are not included in test.jsonl. To evaluate your models on the test set, make a submission to our leaderboard.
You can use eval.py for evaluation on the dev split, which yields metrics.json.
e.g., python eval.py --preds_file ./YOUR_PREDICTIONS.lst --labels_file ./dev-labels.lst
In the prediction file, each line consists of the predictions (1 or 2) by the 5 training sets (ordered by xs, s, m, l, xl, separated by commas) for each evaluation set question.
2,1,1,1,1
1,1,2,2,2
1,1,1,1,1
.........
.........
Namely, the first column contains the predictions of a model trained/fine-tuned on train_xs.jsonl, followed by the predictions of a model trained on train_s.jsonl, ..., and the last (fifth) column contains the predictions of a model trained on train_xl.jsonl.
Please check out the sample submission file (sample-submissions-labels.lst) for reference.
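Before submitting, you can sanity-check predictions in this format locally against dev-labels.lst. The sketch below is a minimal per-column accuracy check under that assumption; eval.py remains the reference script, and the file paths are illustrative.

# Quick local check of a 5-column predictions file against the dev labels.
with open("YOUR_PREDICTIONS.lst") as f:
    preds = [line.strip().split(",") for line in f]
with open("data/dev-labels.lst") as f:
    golds = [line.strip() for line in f]

for i, size in enumerate(["xs", "s", "m", "l", "xl"]):
    acc = sum(p[i] == g for p, g in zip(preds, golds)) / len(golds)
    print(f"train_{size}: dev accuracy = {acc:.4f}")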
You can submit your predictions on the test set to the leaderboard. The submission file must be named predictions.lst. The format is the same as above.
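A minimal sketch for assembling predictions.lst from five per-model prediction lists follows; the placeholder predictions are illustrative and should be replaced with your models' actual outputs.

# Write predictions.lst: one line per test question with five comma-separated
# predictions ordered xs, s, m, l, xl. The placeholder lists below are
# illustrative; replace them with your models' "1"/"2" predictions in test order.
sizes = ["xs", "s", "m", "l", "xl"]
preds_by_size = {size: ["1", "2", "1"] for size in sizes}

with open("predictions.lst", "w") as f:
    for row in zip(*(preds_by_size[size] for size in sizes)):
        f.write(",".join(row) + "\n")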
If you use this dataset, please cite the following paper:
@article{sakaguchi2019winogrande,
title={WinoGrande: An Adversarial Winograd Schema Challenge at Scale},
author={Sakaguchi, Keisuke and Bras, Ronan Le and Bhagavatula, Chandra and Choi, Yejin},
journal={arXiv preprint arXiv:1907.10641},
year={2019}
}
The WinoGrande dataset is licensed under CC BY 2.0.
You may ask us questions at our Google group.
Email: keisukes[at]allenai.org
License: CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
Note: Evaluation code for each benchmark dataset is under preparation and will be released soon to support standardized model assessment.
Dataset Card for Ko-WinoGrande
Dataset Summary
Ko-WinoGrande is a Korean adaptation of the WinoGrande dataset, which tests language models' commonsense reasoning through pronoun resolution tasks. Each item is a fill-in-the-blank sentence with two possible antecedents. Models must determine which choice best fits the blank given the… See the full description on the dataset page: https://huggingface.co/datasets/thunder-research-group/SNU_Ko-WinoGrande.
License: CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
WinoGrande v1.1
Dataset Summary
WinoGrande is a new collection of 44k problems, inspired by the Winograd Schema Challenge (Levesque, Davis, and Morgenstern 2011), but adjusted to improve the scale and robustness against dataset-specific bias. Formulated as a fill-in-the-blank task with binary options, the goal is to choose the right option for a given sentence, which requires commonsense reasoning.
Data Fields
The data fields are the same among all splits.… See the full description on the dataset page: https://huggingface.co/datasets/coref-data/winogrande_raw.
License: Apache License 2.0 (https://www.apache.org/licenses/LICENSE-2.0)
Rainbow is a multi-task benchmark for commonsense reasoning that draws on several existing QA datasets: aNLI, Cosmos QA, HellaSWAG, Physical IQa, Social IQa, and WinoGrande.
License: MIT (https://opensource.org/licenses/MIT)
This dataset is a deduplicated subset of the XL train split of WinoGrande, as used in the paper How Much Can We Forget about Data Contamination?. The deduplication was performed using this script. The data fields are the same as in https://huggingface.co/datasets/allenai/winogrande, with the additional "split-id" column that can be used to partition the benchmark questions into different subsets. The dataset can be used as a plug-in replacement if you want to work with the deduplicated… See the full description on the dataset page: https://huggingface.co/datasets/sbordt/forgetting-contamination-winogrande.
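A minimal sketch for loading this deduplicated subset and partitioning questions by the extra "split-id" column follows; the "train" split name is an assumption, so check the dataset page for the actual configuration.

from collections import defaultdict
from datasets import load_dataset

# Load the deduplicated WinoGrande XL subset; the "train" split name is assumed.
ds = load_dataset("sbordt/forgetting-contamination-winogrande", split="train")

# Partition questions by the additional "split-id" column described above.
by_split = defaultdict(list)
for ex in ds:
    by_split[ex["split-id"]].append(ex)
print({k: len(v) for k, v in by_split.items()})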
License: Apache License 2.0 (https://www.apache.org/licenses/LICENSE-2.0)
WinoGrande Human Translated Sample for Basque
A subset of 250 samples manually translated to Basque from the WinoGrande dataset (Sakaguchi et al., 2019).
Dataset Creation
Source Data
A subset of 250 samples manually translated to Basque from the WinoGrande dataset (Sakaguchi et al., 2019).
Annotations
Annotation process
A subset of 250 samples manually translated to Basque from the WinoGrande dataset (Sakaguchi et al., 2019). A cultural… See the full description on the dataset page: https://huggingface.co/datasets/orai-nlp/WinoGrande_HT_eu_sample.