29 datasets found
  1. h

    goemotions

    • huggingface.co
    Updated Aug 12, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuel Romero (2023). goemotions [Dataset]. https://huggingface.co/datasets/mrm8488/goemotions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 12, 2023
    Authors
    Manuel Romero
    Description

    GoEmotions

    GoEmotions is a corpus of 58k carefully curated comments extracted from Reddit, with human annotations to 27 emotion categories or Neutral.

    Number of examples: 58,009. Number of labels: 27 + Neutral. Maximum sequence length in training and evaluation datasets: 30.

    On top of the raw data, we also include a version filtered based on reter-agreement, which contains a train/test/validation split:

    Size of training dataset: 43,410. Size of test dataset: 5,427. Size of… See the full description on the dataset page: https://huggingface.co/datasets/mrm8488/goemotions.

  2. T

    goemotions

    • tensorflow.org
    • opendatalab.com
    • +3more
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). goemotions [Dataset]. https://www.tensorflow.org/datasets/catalog/goemotions
    Explore at:
    Dataset updated
    Dec 6, 2022
    Description

    The GoEmotions dataset contains 58k carefully curated Reddit comments labeled for 27 emotion categories or Neutral. The emotion categories are admiration, amusement, anger, annoyance, approval, caring, confusion, curiosity, desire, disappointment, disapproval, disgust, embarrassment, excitement, fear, gratitude, grief, joy, love, nervousness, optimism, pride, realization, relief, remorse, sadness, surprise.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('goemotions', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  3. E

    GoEmotions

    • live.european-language-grid.eu
    csv
    Updated Dec 30, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). GoEmotions [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/5011
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 30, 2020
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset contains 58K carefully curated Reddit comments labeled for 27 emotion categories: admiration, amusement, anger, annoyance, approval, caring, confusion, curiosity, desire, disappointment, disapproval, disgust, embarrassment, excitement, fear, gratitude, grief, joy, love, nervousness, optimism, pride, realization, relief, remorse, sadness, & surprise.

  4. h

    goemotion-ekman-emotions

    • huggingface.co
    Updated Aug 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    moodlogue (2025). goemotion-ekman-emotions [Dataset]. https://huggingface.co/datasets/Frankhihi/goemotion-ekman-emotions
    Explore at:
    Dataset updated
    Aug 2, 2025
    Authors
    moodlogue
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    GoEmotions Ekman Emotions Dataset

      Dataset Description
    

    This dataset contains 10,000 text samples from Reddit comments mapped to the 7 basic Ekman emotions. It's derived from the original GoEmotions dataset and processed specifically for emotion classification research using Paul Ekman's fundamental emotion model.

      Supported Tasks
    

    Text Classification: Multi-class emotion classification Sentiment Analysis: Fine-grained emotion detection Psychology Research:… See the full description on the dataset page: https://huggingface.co/datasets/Frankhihi/goemotion-ekman-emotions.

  5. h

    goemotions-5point-sentiment

    • huggingface.co
    Updated Mar 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jose (2025). goemotions-5point-sentiment [Dataset]. https://huggingface.co/datasets/spacesedan/goemotions-5point-sentiment
    Explore at:
    Dataset updated
    Mar 23, 2025
    Authors
    Jose
    Description

    GoEmotions 5-Point Sentiment Dataset

    This dataset is a modified version of the GoEmotions dataset created by Google. The original dataset consists of 58k carefully curated Reddit comments labeled with 27 fine-grained emotion categories plus a neutral label.

      📘 About This Version
    

    This version maps the original GoEmotions emotion labels into a 5-point sentiment scale, making it more suitable for traditional sentiment analysis tasks:

    Original Label(s) Mapped Sentiment… See the full description on the dataset page: https://huggingface.co/datasets/spacesedan/goemotions-5point-sentiment.

  6. GoEmotions Dataset2

    • kaggle.com
    Updated Jul 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enes Ozturk (2023). GoEmotions Dataset2 [Dataset]. https://www.kaggle.com/datasets/enesztrk/goemotions-dataset2/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 8, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Enes Ozturk
    Description

    Dataset

    This dataset was created by Enes Ozturk

    Contents

  7. Sentiment Analysis Dataset

    • kaggle.com
    Updated May 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mitesh (2025). Sentiment Analysis Dataset [Dataset]. https://www.kaggle.com/datasets/mgmitesh/sentiment-analysis-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 20, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mitesh
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    This dataset is designed for building and evaluating sentiment and emotion classification models in Natural Language Processing (NLP). It includes two well-known datasets:

    • GoEmotions: A fine-grained emotion dataset developed by Google, containing 58k English Reddit comments labeled with 27 emotion categories plus Neutral.
    • DailyDialog: A high-quality multi-turn dialog dataset with emotion and intent annotations, ideal for dialog modeling and conversational AI.

    Each dataset is provided in CSV format and includes text samples along with corresponding emotion or sentiment labels.

    This dataset is useful for:

    • Emotion classification and multi-label sentiment analysis.
    • Fine-tuning transformer models (e.g., BERT, RoBERTa).
    • Training empathetic conversational agents.
    • Research in affective computing and human-centered AI.
  8. h

    goemotions-5point-sentiment-refined

    • huggingface.co
    Updated Dec 15, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jose (2015). goemotions-5point-sentiment-refined [Dataset]. https://huggingface.co/datasets/spacesedan/goemotions-5point-sentiment-refined
    Explore at:
    Dataset updated
    Dec 15, 2015
    Authors
    Jose
    Description

    spacesedan/goemotions-5point-sentiment-refined dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    hinglish-goemotions

    • huggingface.co
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jagrit Chaudhry (2025). hinglish-goemotions [Dataset]. https://huggingface.co/datasets/Hostileic/hinglish-goemotions
    Explore at:
    Dataset updated
    Jul 30, 2025
    Authors
    Jagrit Chaudhry
    Description

    Hostileic/hinglish-goemotions dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    twitter-roberta-goemotions-binary-fear-classification

    • huggingface.co
    Updated Jul 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Garrett Baber (2023). twitter-roberta-goemotions-binary-fear-classification [Dataset]. https://huggingface.co/datasets/garrettbaber/twitter-roberta-goemotions-binary-fear-classification
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 15, 2023
    Authors
    Garrett Baber
    Description

    AutoTrain Dataset for project: twitter-goemotions-binary-fear-classification

      Dataset Description
    

    This dataset has been automatically processed by AutoTrain for project twitter-goemotions-binary-fear-classification.

      Languages
    

    The BCP-47 code for the dataset's language is unk.

      Dataset Structure
    
    
    
    
    
      Data Instances
    

    A sample from this dataset looks as follows: [ { "text": "Downvoting comments you don't like is your right.", "feat_id":… See the full description on the dataset page: https://huggingface.co/datasets/garrettbaber/twitter-roberta-goemotions-binary-fear-classification.

  11. h

    goemotions-binary

    • huggingface.co
    Updated Dec 15, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alisha Walunj (2015). goemotions-binary [Dataset]. https://huggingface.co/datasets/alisha4walunj/goemotions-binary
    Explore at:
    Dataset updated
    Dec 15, 2015
    Authors
    Alisha Walunj
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    alisha4walunj/goemotions-binary dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. e

    StudEmo - corpus of consumer reviews annotated with emotions - Dataset -...

    • b2find.eudat.eu
    Updated Oct 31, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). StudEmo - corpus of consumer reviews annotated with emotions - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/1fdeaea3-0a76-58b2-b0fe-09c2c1edb1b6
    Explore at:
    Dataset updated
    Oct 31, 2023
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Humans' emotional perception is subjective by nature, in which each individual could express different emotions regarding the same textual content. Existing datasets for emotion analysis commonly depend on a single ground truth per data sample, derived from majority voting or averaging the opinions of all annotators. We introduce a new non-aggregated dataset, namely StudEmo, that contains 5,182 customer reviews, each annotated by 25 people with intensities of eight emotions from Plutchik's model, extended with valence and arousal. We also propose three personalized models that use not only textual content but also the individual human perspective, providing the model with different approaches to learning human representations. The experiments were carried out as a multitask classification on two datasets: our StudEmo dataset and GoEmotions dataset, which contains 28 emotional categories. The proposed personalized methods significantly improve prediction results, especially for emotions that have low inter-annotator agreement.

  13. h

    go_emotions_wheel_unilabel

    • huggingface.co
    Updated Feb 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Javier Sevilla Salcedo (2025). go_emotions_wheel_unilabel [Dataset]. https://huggingface.co/datasets/Jsevisal/go_emotions_wheel_unilabel
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 16, 2025
    Authors
    Javier Sevilla Salcedo
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Original dataset: GoEmotions dataset Filtered using the following mapping based on the basic emotions found in Plutchik's Wheel of Emotions and filtered to use only the sentences with one label wheel_dict = { "joy": [ "joy", "amusement", "excitement", "gratitude", "pride", "relief", "admiration", "love", "optimism", ], "trust": ["approval", "caring"], "fear": ["fear", "nervousness"], "surprise":… See the full description on the dataset page: https://huggingface.co/datasets/Jsevisal/go_emotions_wheel_unilabel.

  14. h

    go_emotions_dutch

    • huggingface.co
    Updated Dec 31, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joost Verhaert (2022). go_emotions_dutch [Dataset]. https://huggingface.co/datasets/joost6196/go_emotions_dutch
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 31, 2022
    Authors
    Joost Verhaert
    Description

    The GoEmotions dataset contains 58k carefully curated Reddit comments labeled for 27 emotion categories or Neutral. The emotion categories are admiration, amusement, anger, annoyance, approval, caring, confusion, curiosity, desire, disappointment, disapproval, disgust, embarrassment, excitement, fear, gratitude, grief, joy, love, nervousness, optimism, pride, realization, relief, remorse, sadness, surprise.

  15. h

    go_emotions_ekman_unilabel

    • huggingface.co
    Updated Apr 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Javier Sevilla Salcedo (2025). go_emotions_ekman_unilabel [Dataset]. https://huggingface.co/datasets/Jsevisal/go_emotions_ekman_unilabel
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 7, 2025
    Authors
    Javier Sevilla Salcedo
    Description

    Original dataset: GoEmotions dataset Filtered using "ekman_mapping.json" from original dataset repo and filtered to use only the sentences with one label Dataset contains 7 emotion labels as per Dr. Ekman theory. Labels are as follows: 0: anger 1: disgust 2: fear 3: joy 4: sadness 5: surprise 6: neutral

  16. h

    GoEmotions

    • huggingface.co
    Updated Mar 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thijs Gelton (2024). GoEmotions [Dataset]. https://huggingface.co/datasets/tgelton/GoEmotions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 6, 2024
    Authors
    Thijs Gelton
    Description

    tgelton/GoEmotions dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    go_emotions-es-mt

    • huggingface.co
    Updated Dec 15, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuel Romero (2015). go_emotions-es-mt [Dataset]. https://huggingface.co/datasets/mrm8488/go_emotions-es-mt
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 15, 2015
    Authors
    Manuel Romero
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    GoEmotions Spanish

      A Spanish translation (using EasyNMT) of the GoEmotions dataset.
    
    
    
    
    
      For more information check the official Model Card
    
  18. h

    go-emotions-cleaned

    • huggingface.co
    Updated Dec 15, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sangkm (2015). go-emotions-cleaned [Dataset]. https://huggingface.co/datasets/sangkm/go-emotions-cleaned
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 15, 2015
    Authors
    sangkm
    Description

    sangkm/go-emotions-cleaned dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    ru-izard-emotions

    • huggingface.co
    Updated Nov 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel (2023). ru-izard-emotions [Dataset]. https://huggingface.co/datasets/Djacon/ru-izard-emotions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 26, 2023
    Authors
    Daniel
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for RuIzardEmotions

      Dataset Summary
    

    The RuIzardEmotions dataset is a high-quality translation of the go-emotions dataset and the other emotion-detection dataset. It contains 30k Reddit comments labeled for 10 emotion categories (joy, sadness, anger, enthusiasm, surprise, disgust, fear, guilt, shame and neutral). The datasets were translated using the accurate translator DeepL and additional processing. The idea for the dataset was inspired by the Izard's… See the full description on the dataset page: https://huggingface.co/datasets/Djacon/ru-izard-emotions.

  20. h

    ru_goemotions

    • huggingface.co
    Updated Sep 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel (2023). ru_goemotions [Dataset]. https://huggingface.co/datasets/Djacon/ru_goemotions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 11, 2023
    Authors
    Daniel
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for GoEmotions

      Dataset Summary
    

    The RuGoEmotions dataset contains 34k Reddit comments labeled for 9 emotion categories (joy, interest, surprice, sadness, anger, disgust, fear, guilt and neutral). The dataset already with predefined train/val/test splits

      Supported Tasks and Leaderboards
    

    This dataset is intended for multi-class, multi-label emotion classification.

      Languages
    

    The data is in Russian.

      Dataset Structure
    
    
    
    
    
      Data… See the full description on the dataset page: https://huggingface.co/datasets/Djacon/ru_goemotions.
    
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Manuel Romero (2023). goemotions [Dataset]. https://huggingface.co/datasets/mrm8488/goemotions

goemotions

mrm8488/goemotions

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 12, 2023
Authors
Manuel Romero
Description

GoEmotions

GoEmotions is a corpus of 58k carefully curated comments extracted from Reddit, with human annotations to 27 emotion categories or Neutral.

Number of examples: 58,009. Number of labels: 27 + Neutral. Maximum sequence length in training and evaluation datasets: 30.

On top of the raw data, we also include a version filtered based on reter-agreement, which contains a train/test/validation split:

Size of training dataset: 43,410. Size of test dataset: 5,427. Size of… See the full description on the dataset page: https://huggingface.co/datasets/mrm8488/goemotions.

Search
Clear search
Close search
Google apps
Main menu