https://choosealicense.com/licenses/other/
Dataset Card for GLUE
Dataset Summary
GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/), is a collection of resources for training, evaluating, and analyzing natural language understanding systems.
Supported Tasks and Leaderboards
The leaderboard for the GLUE benchmark can be found at https://gluebenchmark.com/leaderboard. It comprises the following tasks:
ax
A manually-curated evaluation dataset for fine-grained analysis of system… See the full description on the dataset page: https://huggingface.co/datasets/nyu-mll/glue.
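As a quick orientation, each GLUE task can be loaded by configuration name; a minimal sketch using the Hugging Face datasets library and the nyu-mll/glue repository referenced above (cola is picked purely as an example):

from datasets import load_dataset

# Each GLUE task is a separate configuration of the nyu-mll/glue repository.
cola = load_dataset("nyu-mll/glue", "cola", split="train")
print(cola[0])  # a CoLA example: {'sentence': ..., 'label': ..., 'idx': ...}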
This dataset was created by Learn and Progress
SuperGLUE (https://super.gluebenchmark.com/) is a new benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a new public leaderboard.
To use this dataset:
import tensorflow_datasets as tfds

# Load the SuperGLUE training split and print the first four examples.
ds = tfds.load('super_glue', split='train')
for ex in ds.take(4):
  print(ex)
See the guide for more information on tensorflow_datasets.
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/), is a collection of resources for training, evaluating, and analyzing natural language understanding systems.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The General Language Understanding Evaluation (GLUE) benchmark is a collection of resources for training, evaluating, and analyzing natural language understanding systems. GLUE consists of:
- A benchmark of nine sentence- or sentence-pair language understanding tasks built on established existing datasets and selected to cover a diverse range of dataset sizes, text genres, and degrees of difficulty,
- A diagnostic dataset designed to evaluate and analyze model performance with respect to a wide range of linguistic phenomena found in natural language, and
- A public leaderboard for tracking performance on the benchmark and a dashboard for visualizing the performance of models on the diagnostic set.
The format of the GLUE benchmark is model-agnostic, so any system capable of processing sentences and sentence pairs and producing corresponding predictions is eligible to participate. The benchmark tasks are selected so as to favor models that share information across tasks using parameter sharing or other transfer learning techniques. The ultimate goal of GLUE is to drive research in the development of general and robust natural language understanding systems.
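To make the model-agnostic point concrete: participating only requires producing one prediction per example, which is then scored with the task's own metric. A minimal sketch using the Hugging Face evaluate library with placeholder predictions:

import evaluate

# MRPC is scored with accuracy and F1; every GLUE task has its own metric.
metric = evaluate.load("glue", "mrpc")
predictions = [0, 1, 1, 0]  # placeholder system outputs
references = [0, 1, 0, 0]   # placeholder gold labels
print(metric.compute(predictions=predictions, references=references))
# prints a dict such as {'accuracy': ..., 'f1': ...}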
The SloWIC dataset is a Slovenian dataset for the Word in Context task. Each example in the dataset contains a target word with multiple meanings and two sentences that both contain the target word. Each example is also annotated with a label indicating whether both sentences use the same meaning of the target word. The dataset contains 1,808 manually annotated sentence pairs and an additional 13,150 automatically annotated pairs to help with training larger models. The dataset is stored in JSON format, following the format used in the SuperGLUE version of the Word in Context task (https://super.gluebenchmark.com/).
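For orientation, a record in this format carries the target word, both sentences, and a boolean same-sense label. A minimal reading sketch, assuming the data is stored as JSON Lines with the field names of the English SuperGLUE WiC release (both assumptions; the file name is hypothetical):

import json

with open("slowic_train.jsonl", encoding="utf-8") as f:
    for line in f:
        ex = json.loads(line)
        # 'label' is True when both sentences use the same sense of 'word'.
        print(ex["word"], ex["sentence1"], ex["sentence2"], ex["label"])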
https://choosealicense.com/licenses/unknown/
Dataset Card for "super_glue"
Dataset Summary
SuperGLUE (https://super.gluebenchmark.com/) is a new benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a new public leaderboard. BoolQ (Boolean Questions, Clark et al., 2019a) is a QA task where each example consists of a short passage and a yes/no question about the passage. The questions are provided anonymously and unsolicited by users of the Google search… See the full description on the dataset page: https://huggingface.co/datasets/zzzzhhh/test_data.
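For example, the BoolQ portion described above can be pulled out by configuration name; a minimal sketch with the Hugging Face datasets library, assuming the standard super_glue configuration names:

from datasets import load_dataset

# BoolQ pairs a passage with a yes/no question and a binary label
# (0 = no, 1 = yes).
boolq = load_dataset("super_glue", "boolq", split="validation")
ex = boolq[0]
print(ex["question"], "->", ex["label"])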
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/), is a collection of resources for training, evaluating, and analyzing natural language understanding systems.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
CoLA Text Classification RCL Dataset
Overview
This dataset contains textual data structured explicitly for standard text classification tasks using Lumina AI's Random Contrast Learning (RCL) algorithm via the PrismRCL application. Unlike LLM datasets, standard text classification datasets contain individual .txt files organized by class.
Dataset Structure
The dataset structure for text classification training:
CoLA.Classification/
  train/
    [class_1]/
      …
See the full description on the dataset page: https://huggingface.co/datasets/LuminaAI/CoLA_2_Class-GLUE-Benchmark.
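A short sketch of how such a class-per-folder layout is typically traversed; the root path follows the structure above, and this is generic Python for illustration rather than the PrismRCL application itself:

from pathlib import Path

# Count the .txt examples under each class directory (path hypothetical).
root = Path("CoLA.Classification/train")
for class_dir in sorted(p for p in root.iterdir() if p.is_dir()):
    texts = [f.read_text(encoding="utf-8") for f in class_dir.glob("*.txt")]
    print(f"{class_dir.name}: {len(texts)} examples")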
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Trained models from the paper:
Lukas Galke, Isabell Cuber, Christoph Meyer, Henrik Ferdinand Noelscher, Angelina Sonderecker, and Ansgar Scherp: General Cross-Architecture Distillation of Pretrained Language Models into Matrix Embeddings, in: International Joint Conference on Neural Networks (IJCNN), 2022.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Model comparison for question-pair detection using 10-fold cross-validation.
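As a sketch of what such a comparison involves, each candidate model is scored on the same stratified 10-fold splits; the data, features, and models below are placeholders, not the ones behind the reported comparison:

import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Placeholder question pairs (joined with '|||') and duplicate labels.
pairs = ["how do I learn python ||| how can I learn python",
         "what is glue ||| where is paris"] * 20
labels = np.array([1, 0] * 20)

cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
for name, clf in [("logreg", LogisticRegression(max_iter=1000)),
                  ("linear_svm", LinearSVC())]:
    model = make_pipeline(TfidfVectorizer(), clf)
    scores = cross_val_score(model, pairs, labels, cv=cv)
    print(f"{name}: mean accuracy {scores.mean():.3f} (+/- {scores.std():.3f})")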
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/), is a collection of resources for training, evaluating, and analyzing natural language understanding systems.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Dataset Card for Adversarial GLUE
Dataset Summary
Adversarial GLUE (AdvGLUE) is a comprehensive robustness evaluation benchmark that focuses on the adversarial robustness of language models. It covers five natural language understanding tasks from the GLUE benchmark, of which it is an adversarial version. AdvGLUE considers textual adversarial attacks from different perspectives and hierarchies, including word-level transformations… See the full description on the dataset page: https://huggingface.co/datasets/AI-Secure/adv_glue.
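A minimal loading sketch; the adv_sst2 configuration name and the validation-only split are assumptions to verify against the dataset page:

from datasets import load_dataset

# AdvGLUE exposes one configuration per attacked GLUE task.
adv_sst2 = load_dataset("AI-Secure/adv_glue", "adv_sst2", split="validation")
print(adv_sst2[0])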