36 datasets found
  1. winogrande

    • huggingface.co
    Updated Oct 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ai2 (2022). winogrande [Dataset]. https://huggingface.co/datasets/allenai/winogrande
    Explore at:
    Dataset updated
    Oct 28, 2022
    Dataset provided by
    Allen Institute for AIhttp://allenai.org/
    Authors
    Ai2
    Description

    Dataset Card for "winogrande"

      Dataset Summary
    

    WinoGrande is a new collection of 44k problems, inspired by Winograd Schema Challenge (Levesque, Davis, and Morgenstern 2011), but adjusted to improve the scale and robustness against the dataset-specific bias. Formulated as a fill-in-a-blank task with binary options, the goal is to choose the right option for a given sentence which requires commonsense reasoning.

      Supported Tasks and Leaderboards
    

    More Information… See the full description on the dataset page: https://huggingface.co/datasets/allenai/winogrande.

  2. T

    winogrande

    • tensorflow.org
    • opendatalab.com
    • +1more
    Updated Dec 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). winogrande [Dataset]. https://www.tensorflow.org/datasets/catalog/winogrande
    Explore at:
    Dataset updated
    Dec 11, 2024
    Description

    The WinoGrande, a large-scale dataset of 44k problems, inspired by the original Winograd Schema Challenge design, but adjusted to improve both the scale and the hardness of the dataset.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('winogrande', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  3. h

    winogrande

    • huggingface.co
    Updated Jul 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    yifei ming (2025). winogrande [Dataset]. https://huggingface.co/datasets/alvinming/winogrande
    Explore at:
    Dataset updated
    Jul 30, 2025
    Authors
    yifei ming
    Description

    alvinming/winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. WinoGrande Dataset

    • kaggle.com
    Updated Apr 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). WinoGrande Dataset [Dataset]. https://www.kaggle.com/tunguz/winogrande-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 27, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bojan Tunguz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    WinoGrande

    Version 1.1 (Sep 16th, 2020)

    Data

    ./data/
    ├── train_[xs,s,m,l,xl].jsonl     # training set with differnt sizes
    ├── train_[xs,s,m,l,xl]-labels.lst   # answer labels for training sets
    ├── train_debiased.jsonl        # debiased training set
    ├── train_debiased-labels.lst     # answer labels for debiased training set
    ├── dev.jsonl             # development set
    ├── dev-labels.lst           # answer labels for development set
    ├── test.jsonl             # test set
    ├── sample-submissions-labels.lst   # example submission file for leaderboard  
    └── eval.py              # evaluation script
    

    You can use train_*.jsonl for training models and dev for validation. Please note that labels are not included in test.jsonl. To evaluate your models on test set, make a submission to our leaderboard.

    Evaluation

    You can use eval.py for evaluation on the dev split, which yields metrics.json.

    e.g., python eval.py --preds_file ./YOUR_PREDICTIONS.lst --labels_file ./dev-labels.lst
    

    In the prediction file, each line consists of the predictions (1 or 2) by 5 training sets (ordered by xs, s, m, l, xl, separated by comma) for each evauation set question.

     2,1,1,1,1
     1,1,2,2,2
     1,1,1,1,1
     .........
     .........
    

    Namely, the first column is the predictions by a model trained/finetuned on train_xs.jsonl, followed by a model prediction by train_s.jsonl, ... , and the last (fifth) column is the predictions by a model from train_xl.jsonl. Please checkout a sample submission file (sample-submission-labels.lst) for reference.

    Submission to Leaderboard

    You can submit your predictions on test set to the leaderboard. The submission file must be named as predictions.lst. The format is the same as above.

    Reference

    If you use this dataset, please cite the following paper:

    @article{sakaguchi2019winogrande,
      title={WinoGrande: An Adversarial Winograd Schema Challenge at Scale},
      author={Sakaguchi, Keisuke and Bras, Ronan Le and Bhagavatula, Chandra and Choi, Yejin},
      journal={arXiv preprint arXiv:1907.10641},
      year={2019}
    }
    

    License

    Winogrande dataset is licensed under CC BY 2.0.

    Questions?

    You may ask us questions at our google group.

    Contact

    Email: keisukes[at]allenai.org

  5. h

    Winogrande

    • huggingface.co
    Updated Dec 24, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jaeyong Park (2023). Winogrande [Dataset]. https://huggingface.co/datasets/jaypyon/Winogrande
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 24, 2023
    Authors
    Jaeyong Park
    Description

    jaypyon/Winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    winogrande

    • huggingface.co
    Updated Oct 28, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vik Korrapati (2022). winogrande [Dataset]. https://huggingface.co/datasets/vikhyatk/winogrande
    Explore at:
    Dataset updated
    Oct 28, 2022
    Authors
    Vik Korrapati
    Description

    vikhyatk/winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. a

    Pre-computed-embedding-Mistral-7B-Instruct-v0.2-winogrande

    • aifasthub.com
    • huggingface.co
    Updated Jul 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unlimited Research Group of AI (2024). Pre-computed-embedding-Mistral-7B-Instruct-v0.2-winogrande [Dataset]. https://www.aifasthub.com/datasets/ura-hcmut/Pre-computed-embedding-Mistral-7B-Instruct-v0.2-winogrande
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 16, 2024
    Dataset authored and provided by
    Unlimited Research Group of AI
    Description

    ura-hcmut/Pre-computed-embedding-Mistral-7B-Instruct-v0.2-winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. SNU_Ko-WinoGrande

    • huggingface.co
    Updated Aug 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    THUNDER Research Group (2025). SNU_Ko-WinoGrande [Dataset]. https://huggingface.co/datasets/thunder-research-group/SNU_Ko-WinoGrande
    Explore at:
    Dataset updated
    Aug 20, 2025
    Dataset provided by
    Research group
    Authors
    THUNDER Research Group
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Note: Evaluation code for each benchmark dataset is under preparation and will be released soon to support standardized model assessment.

      Dataset Card for Ko-WinoGrande
    
    
    
    
    
      Dataset Summary
    

    Ko-WinoGrande is a Korean adaptation of the WinoGrande dataset, which tests language models' commonsense reasoning through pronoun resolution tasks. Each item is a fill-in-the-blank sentence with two possible antecedents. Models must determine which choice best fits the blank given the… See the full description on the dataset page: https://huggingface.co/datasets/thunder-research-group/SNU_Ko-WinoGrande.

  9. a

    Pre-computed-embedding-Llama-2-7b-hf-winogrande

    • aifasthub.com
    • huggingface.co
    Updated Jul 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unlimited Research Group of AI (2024). Pre-computed-embedding-Llama-2-7b-hf-winogrande [Dataset]. https://www.aifasthub.com/datasets/ura-hcmut/Pre-computed-embedding-Llama-2-7b-hf-winogrande
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 16, 2024
    Dataset authored and provided by
    Unlimited Research Group of AI
    Description

    ura-hcmut/Pre-computed-embedding-Llama-2-7b-hf-winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    winogrande_raw

    • huggingface.co
    Updated Jan 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    coref-data (2024). winogrande_raw [Dataset]. https://huggingface.co/datasets/coref-data/winogrande_raw
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 28, 2024
    Dataset authored and provided by
    coref-data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Wingrande v1.1

      Dataset Summary
    

    WinoGrande is a new collection of 44k problems, inspired by Winograd Schema Challenge (Levesque, Davis, and Morgenstern 2011), but adjusted to improve the scale and robustness against the dataset-specific bias. Formulated as a fill-in-a-blank task with binary options, the goal is to choose the right option for a given sentence which requires commonsense reasoning.

      Data Fields
    

    The data fields are the same among all splits.… See the full description on the dataset page: https://huggingface.co/datasets/coref-data/winogrande_raw.

  11. h

    winogrande

    • huggingface.co
    Updated Jul 23, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Róbert Belanec (2025). winogrande [Dataset]. https://huggingface.co/datasets/rbelanec/winogrande
    Explore at:
    Dataset updated
    Jul 23, 2025
    Authors
    Róbert Belanec
    Description

    rbelanec/winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    Pre-computed-embedding-Llama-2-13b-hf-winogrande

    • huggingface.co
    Updated Jul 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unlimited Research Group of AI (2024). Pre-computed-embedding-Llama-2-13b-hf-winogrande [Dataset]. https://huggingface.co/datasets/ura-hcmut/Pre-computed-embedding-Llama-2-13b-hf-winogrande
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 16, 2024
    Dataset authored and provided by
    Unlimited Research Group of AI
    Description

    ura-hcmut/Pre-computed-embedding-Llama-2-13b-hf-winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    Pre-computed-embedding-Meta-Llama-3-8B-Instruct-winogrande

    • huggingface.co
    Updated Jul 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unlimited Research Group of AI (2024). Pre-computed-embedding-Meta-Llama-3-8B-Instruct-winogrande [Dataset]. https://huggingface.co/datasets/ura-hcmut/Pre-computed-embedding-Meta-Llama-3-8B-Instruct-winogrande
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 16, 2024
    Dataset authored and provided by
    Unlimited Research Group of AI
    Description

    ura-hcmut/Pre-computed-embedding-Meta-Llama-3-8B-Instruct-winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    Pre-computed-embedding-gemma-7b-it-winogrande

    • huggingface.co
    Updated Jul 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unlimited Research Group of AI (2024). Pre-computed-embedding-gemma-7b-it-winogrande [Dataset]. https://huggingface.co/datasets/ura-hcmut/Pre-computed-embedding-gemma-7b-it-winogrande
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 16, 2024
    Dataset authored and provided by
    Unlimited Research Group of AI
    Description

    ura-hcmut/Pre-computed-embedding-gemma-7b-it-winogrande dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. Rainbow

    • opendatalab.com
    zip
    Updated Mar 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allen Institute for Artificial Intelligence (2020). Rainbow [Dataset]. https://opendatalab.com/OpenDataLab/Rainbow
    Explore at:
    zip(162849661 bytes)Available download formats
    Dataset updated
    Mar 12, 2020
    Dataset provided by
    艾伦人工智能研究院http://allenai.org/
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Rainbow is multi-task benchmark for common-sense reasoning that uses different existing QA datasets: aNLI, Cosmos QA, HellaSWAG. Physical IQa, Social IQa, WinoGrande.

  16. h

    forgetting-contamination-winogrande

    • huggingface.co
    Updated Sep 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sebastian Bordt (2025). forgetting-contamination-winogrande [Dataset]. https://huggingface.co/datasets/sbordt/forgetting-contamination-winogrande
    Explore at:
    Dataset updated
    Sep 5, 2025
    Authors
    Sebastian Bordt
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset is a deduplicated subset of the XL train split of WinoGrande, as used in the paper How Much Can We Forget about Data Contamination?. The deduplication was performed using this script. The data fields are the same as in https://huggingface.co/datasets/allenai/winogrande, with the additional "split-id" column that can be used to partition the benchmark questions into different subsets. The dataset can be used as a plug-in replacement if you want to work with the deduplicated… See the full description on the dataset page: https://huggingface.co/datasets/sbordt/forgetting-contamination-winogrande.

  17. h

    winogrande-text

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sergey Pankov, winogrande-text [Dataset]. https://huggingface.co/datasets/underactuated/winogrande-text
    Explore at:
    Authors
    Sergey Pankov
    Description

    underactuated/winogrande-text dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. h

    winogrande-routerbench-0shot-correct-choices-contrast-full-dense-2-shots-sd1...

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiayu (Mila) Wang, winogrande-routerbench-0shot-correct-choices-contrast-full-dense-2-shots-sd1 [Dataset]. https://huggingface.co/datasets/MilaWang/winogrande-routerbench-0shot-correct-choices-contrast-full-dense-2-shots-sd1
    Explore at:
    Authors
    Jiayu (Mila) Wang
    Description

    MilaWang/winogrande-routerbench-0shot-correct-choices-contrast-full-dense-2-shots-sd1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    WinoGrande_HT_eu_sample

    • huggingface.co
    Updated Jul 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Orai NLP technologies (2025). WinoGrande_HT_eu_sample [Dataset]. https://huggingface.co/datasets/orai-nlp/WinoGrande_HT_eu_sample
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 22, 2025
    Dataset authored and provided by
    Orai NLP technologies
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    WinoGrande Human Translated Sample for Basque

    A subset of 250 samples manually translated to Basque from the WinoGrande dataset (Sakaguchi et al., 2019).

      Dataset Creation
    
    
    
    
    
      Source Data
    

    A subset of 250 samples manually translated to Basque from the WinoGrande dataset (Sakaguchi et al., 2019).

      Annotations
    
    
    
    
    
      Annotation process
    

    A subset of 250 samples manually translated to Basque from the WinoGrande dataset (Sakaguchi et al., 2019). A cultural… See the full description on the dataset page: https://huggingface.co/datasets/orai-nlp/WinoGrande_HT_eu_sample.

  20. h

    E2H-Winogrande-Middle

    • huggingface.co
    Updated Apr 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang ZHANG (2025). E2H-Winogrande-Middle [Dataset]. https://huggingface.co/datasets/yangzhang33/E2H-Winogrande-Middle
    Explore at:
    Dataset updated
    Apr 9, 2025
    Authors
    Yang ZHANG
    Description

    yangzhang33/E2H-Winogrande-Middle dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ai2 (2022). winogrande [Dataset]. https://huggingface.co/datasets/allenai/winogrande
Organization logo

winogrande

WinoGrande

allenai/winogrande

Explore at:
Dataset updated
Oct 28, 2022
Dataset provided by
Allen Institute for AIhttp://allenai.org/
Authors
Ai2
Description

Dataset Card for "winogrande"

  Dataset Summary

WinoGrande is a new collection of 44k problems, inspired by Winograd Schema Challenge (Levesque, Davis, and Morgenstern 2011), but adjusted to improve the scale and robustness against the dataset-specific bias. Formulated as a fill-in-a-blank task with binary options, the goal is to choose the right option for a given sentence which requires commonsense reasoning.

  Supported Tasks and Leaderboards

More Information… See the full description on the dataset page: https://huggingface.co/datasets/allenai/winogrande.

Search
Clear search
Close search
Google apps
Main menu