14 datasets found
  1. h

    glue

    • huggingface.co
    • tensorflow.google.cn
    • +1more
    Updated Mar 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NYU Machine Learning for Language (2024). glue [Dataset]. https://huggingface.co/datasets/nyu-mll/glue
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 6, 2024
    Dataset authored and provided by
    NYU Machine Learning for Language
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Card for GLUE

      Dataset Summary
    

    GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/) is a collection of resources for training, evaluating, and analyzing natural language understanding systems.

      Supported Tasks and Leaderboards
    

    The leaderboard for the GLUE benchmark can be found at this address. It comprises the following tasks:

      ax
    

    A manually-curated evaluation dataset for fine-grained analysis of system… See the full description on the dataset page: https://huggingface.co/datasets/nyu-mll/glue.

  2. glue benchmark

    • kaggle.com
    zip
    Updated Jan 11, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Learn and Progress (2021). glue benchmark [Dataset]. https://www.kaggle.com/hanminqin/glue-benchmark
    Explore at:
    zip(150783398 bytes)Available download formats
    Dataset updated
    Jan 11, 2021
    Authors
    Learn and Progress
    Description

    Dataset

    This dataset was created by Learn and Progress

    Contents

    It contains the following files:

  3. T

    super_glue

    • tensorflow.org
    • opendatalab.com
    • +1more
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    super_glue [Dataset]. https://www.tensorflow.org/datasets/catalog/super_glue
    Explore at:
    Description

    SuperGLUE (https://super.gluebenchmark.com/) is a new benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a new public leaderboard.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('super_glue', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  4. h

    few_glue

    • huggingface.co
    Updated Mar 31, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Junyeob Kim (2017). few_glue [Dataset]. https://huggingface.co/datasets/juny116/few_glue
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 31, 2017
    Authors
    Junyeob Kim
    Description

    SuperGLUE (https://super.gluebenchmark.com/) is a new benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a new public leaderboard.

  5. h

    Data from: snap21

    • huggingface.co
    Updated Oct 12, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wang Song (2023). snap21 [Dataset]. https://huggingface.co/datasets/kuroneko5943/snap21
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 12, 2023
    Authors
    Wang Song
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/) is a collection of resources for training, evaluating, and analyzing natural language understanding systems.

  6. GLUE

    • opendatalab.com
    zip
    Updated Nov 1, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    New York University (2018). GLUE [Dataset]. https://opendatalab.com/OpenDataLab/glue
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 1, 2018
    Dataset provided by
    Paul G. Allen School of Computer Science and Engineering
    DeepMind
    New York University
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The General Language Understanding Evaluation (GLUE) benchmark is a collection of resources for training, evaluating, and analyzing natural language understanding systems. GLUE consists of: A benchmark of nine sentence- or sentence-pair language understanding tasks built on established existing datasets and selected to cover a diverse range of dataset sizes, text genres, and degrees of difficulty, A diagnostic dataset designed to evaluate and analyze model performance with respect to a wide range of linguistic phenomena found in natural language, and A public leaderboard for tracking performance on the benchmark and a dashboard for visualizing the performance of models on the diagnostic set. The format of the GLUE benchmark is model-agnostic, so any system capable of processing sentence and sentence pairs and producing corresponding predictions is eligible to participate. The benchmark tasks are selected so as to favor models that share information across tasks using parameter sharing or other transfer learning techniques. The ultimate goal of GLUE is to drive research in the development of general and robust natural language understanding systems.

  7. e

    Slovenian Word in Context dataset SloWiC 1.0 - Dataset - B2FIND

    • b2find.eudat.eu
    Updated Oct 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Slovenian Word in Context dataset SloWiC 1.0 - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/da28c3e1-3416-5b1c-827d-84bda5385275
    Explore at:
    Dataset updated
    Oct 10, 2024
    Description

    The SloWIC dataset is a Slovenian dataset for the Word in Context task. Each example in the dataset contains a target word with multiple meanings and two sentences that both contain the target word. Each example is also annotated with a label that shows if both sentences use the same meaning of the target word. The dataset contains 1808 manually annotated sentence pairs and additional 13150 automatically annotated pairs to help with training larger models. The dataset is stored in the JSON format following the format used in the SuperGLUE version of the Word in Context task (https://super.gluebenchmark.com/).

  8. h

    test_data

    • huggingface.co
    Updated Oct 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    zzzzhhh (2023). test_data [Dataset]. https://huggingface.co/datasets/zzzzhhh/test_data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 13, 2023
    Authors
    zzzzhhh
    License

    https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

    Description

    Dataset Card for "super_glue"

      Dataset Summary
    

    SuperGLUE (https://super.gluebenchmark.com/) is a new benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a new public leaderboard. BoolQ (Boolean Questions, Clark et al., 2019a) is a QA task where each example consists of a short passage and a yes/no question about the passage. The questions are provided anonymously and unsolicited by users of the Google search… See the full description on the dataset page: https://huggingface.co/datasets/zzzzhhh/test_data.

  9. h

    mnli

    • huggingface.co
    Updated Jun 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bias-amplified-splits (2025). mnli [Dataset]. https://huggingface.co/datasets/bias-amplified-splits/mnli
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 6, 2025
    Dataset authored and provided by
    bias-amplified-splits
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/) is a collection of resources for training, evaluating, and analyzing natural language understanding systems.

  10. h

    CoLA_2_Class-GLUE-Benchmark

    • huggingface.co
    Updated May 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lumina AI (2025). CoLA_2_Class-GLUE-Benchmark [Dataset]. https://huggingface.co/datasets/LuminaAI/CoLA_2_Class-GLUE-Benchmark
    Explore at:
    Dataset updated
    May 14, 2025
    Dataset authored and provided by
    Lumina AI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    CoLA Text Classification RCL Dataset

      Overview
    

    This dataset contains textual data structured explicitly for standard text classification tasks using Lumina AI's Random Contrast Learning (RCL) algorithm via the PrismRCL application. Unlike LLM datasets, standard text classification datasets contain individual .txt files organized by class.

      Dataset Structure
    

    The dataset structure for text classification training: CoLA.Classification/ train/ [class_1]/… See the full description on the dataset page: https://huggingface.co/datasets/LuminaAI/CoLA_2_Class-GLUE-Benchmark.

  11. Trained Models from "General Cross-Architecture Distillation of Pretrained...

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip, zip
    Updated May 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lukas Galke; Lukas Galke; Isabell Cuber; Christoph Meyer; Henrik Ferdinand Noelscher; Angelina Sonderecker; Ansgar Scherp; Isabell Cuber; Christoph Meyer; Henrik Ferdinand Noelscher; Angelina Sonderecker; Ansgar Scherp (2022). Trained Models from "General Cross-Architecture Distillation of Pretrained Language Models into Matrix Embeddings" [Dataset]. http://doi.org/10.5281/zenodo.6533889
    Explore at:
    zip, application/gzipAvailable download formats
    Dataset updated
    May 11, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Lukas Galke; Lukas Galke; Isabell Cuber; Christoph Meyer; Henrik Ferdinand Noelscher; Angelina Sonderecker; Ansgar Scherp; Isabell Cuber; Christoph Meyer; Henrik Ferdinand Noelscher; Angelina Sonderecker; Ansgar Scherp
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Trained models from the paper:

    Lukas Galke, Isabell Cuber, Christoph Meyer, Henrik Ferdinand Noelscher, Angelina Sonderecker, and Ansgar Scherp: General Cross-Architecture Distillation of Pretrained Language Models into Matrix Embeddings, in: International Joint Conference on Neural Networks (IJCNN), 2022.

    • File seq2mat_hybrid_bidirectional_sbertlike-100p-bsz512 holds the model from pretraining
    • File ws2020_transformer_final_models holds the fine-tuned models for each task of the GLUE benchmark
  12. f

    Model comparison for question pairs detection using 10-fold cross...

    • plos.figshare.com
    xls
    Updated Jan 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sifei Han; Lingyun Shi; Fuchiang (Rich) Tsui (2025). Model comparison for question pairs detection using 10-fold cross validation. [Dataset]. http://doi.org/10.1371/journal.pone.0317042.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jan 10, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Sifei Han; Lingyun Shi; Fuchiang (Rich) Tsui
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Model comparison for question pairs detection using 10-fold cross validation.

  13. h

    stock11

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wang Song, stock11 [Dataset]. https://huggingface.co/datasets/kuroneko5943/stock11
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Wang Song
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/) is a collection of resources for training, evaluating, and analyzing natural language understanding systems.

  14. h

    adv_glue

    • huggingface.co
    • opendatalab.com
    Updated Sep 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Secure Learning Lab (2023). adv_glue [Dataset]. https://huggingface.co/datasets/AI-Secure/adv_glue
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 1, 2023
    Dataset authored and provided by
    Secure Learning Lab
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for Adversarial GLUE

      Dataset Summary
    

    Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark that focuses on the adversarial robustness evaluation of language models. It covers five natural language understanding tasks from the famous GLUE tasks and is an adversarial version of GLUE benchmark. AdvGLUE considers textual adversarial attacks from different perspectives and hierarchies, including word-level transformations… See the full description on the dataset page: https://huggingface.co/datasets/AI-Secure/adv_glue.

  15. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
NYU Machine Learning for Language (2024). glue [Dataset]. https://huggingface.co/datasets/nyu-mll/glue

glue

nyu-mll/glue

GLUE (General Language Understanding Evaluation benchmark)

Explore at:
27 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 6, 2024
Dataset authored and provided by
NYU Machine Learning for Language
License

https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

Description

Dataset Card for GLUE

  Dataset Summary

GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/) is a collection of resources for training, evaluating, and analyzing natural language understanding systems.

  Supported Tasks and Leaderboards

The leaderboard for the GLUE benchmark can be found at this address. It comprises the following tasks:

  ax

A manually-curated evaluation dataset for fine-grained analysis of system… See the full description on the dataset page: https://huggingface.co/datasets/nyu-mll/glue.

Search
Clear search
Close search
Google apps
Main menu