96 datasets found
  1. Manipal Image Sentiment Analysis Dataset

    • figshare.com
    • search.datacite.org
    xlsx
    Updated Jan 20, 2016
    Cite
    Stuti Jindal; Sanjay Singh (2016). Manipal Image Sentiment Analysis Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.1496534.v2
    Explore at:
    Available download formats: xlsx
    Dataset updated
    Jan 20, 2016
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Stuti Jindal; Sanjay Singh
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Manipal
    Description

    This dataset was created through a survey in which 267 UG and PG students of Manipal Institute of Technology participated and annotated 1000 images with a sentiment score on a 7-point scale. Each image was presented to at least three annotators. After collecting all the annotations, we took the majority vote of the three scores for each image; that is, an image annotation is considered valid only when at least two of the three annotators agree on the exact label (out of 7 labels). The dataset uses the following sentiment label map: 1-Depressed, 2-Very Sad, 3-Sad, 4-Neutral, 5-Happy, 6-Very Happy, 7-Excited.
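    The majority-vote rule described above can be sketched as follows. This is a minimal illustration, not the authors' code: the function name, the `min_agree` parameter, and the sample annotations are assumptions made for the example. With exactly three scores per image a tie between two qualifying labels cannot occur; with more annotators a tie-breaking rule would also be needed.

```python
from collections import Counter
from typing import Optional, Sequence

# Label map quoted from the dataset description.
LABELS = {1: "Depressed", 2: "Very Sad", 3: "Sad", 4: "Neutral",
          5: "Happy", 6: "Very Happy", 7: "Excited"}


def majority_label(scores: Sequence[int], min_agree: int = 2) -> Optional[int]:
    """Return the most common score if at least `min_agree` annotators chose it, else None."""
    if not scores:
        return None
    label, count = Counter(scores).most_common(1)[0]
    return label if count >= min_agree else None


# Hypothetical annotations (image id -> three scores), for illustration only.
annotations = {"img_001": [5, 5, 6], "img_002": [1, 4, 7], "img_003": [3, 3, 3]}
results = {img: majority_label(scores) for img, scores in annotations.items()}
for img, label in results.items():
    # Images without two agreeing annotators have no valid annotation.
    print(img, LABELS.get(label, "no valid annotation"))
```

    Here `img_002` is rejected because all three annotators disagree.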

  2. multimodal-sentiment-data

    • kaggle.com
    zip
    Updated May 8, 2023
    Cite
    Suraj (2023). multimodal-sentiment-data [Dataset]. https://www.kaggle.com/datasets/suraj520/multimodal-sentiment-data
    Explore at:
    Available download formats: zip (1021992 bytes)
    Dataset updated
    May 8, 2023
    Authors
    Suraj
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset provides a collection of images and their corresponding texts and sentiment which makes it a multi-modal sentiment analysis dataset.

    The dataset contains images of 100 different classes of animals and objects, including sharks, birds, lizards, spiders, and more.

    This dataset can be used for various computer vision and natural language processing tasks, such as image classification, sentiment analysis, and image captioning.

  3. Image and text datasets for sentiment analysis

    • figshare.com
    zip
    Updated Jun 4, 2025
    Cite
    Chuang Dong (2025). Image and text datasets for sentiment analysis [Dataset]. http://doi.org/10.6084/m9.figshare.29234471.v1
    Explore at:
    Available download formats: zip
    Dataset updated
    Jun 4, 2025
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Chuang Dong
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is an image and text dataset for sentiment analysis.

  4. Multimodal Sentiment Analysis Dataset

    • gts.ai
    json
    Updated Jun 28, 2024
    Cite
    GTS (2024). Multimodal Sentiment Analysis Dataset [Dataset]. https://gts.ai/dataset-download/multimodal-sentiment-analysis-dataset/
    Explore at:
    Available download formats: json
    Dataset updated
    Jun 28, 2024
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Explore our unique Multimodal Sentiment Analysis Dataset, featuring high-quality images and corresponding text descriptions with sentiment labels.

  5. Twitter Sentiment Analysis using Roberta and Vader

    • kaggle.com
    zip
    Updated Oct 18, 2023
    Cite
    Jocelyn Dumlao (2023). Twitter Sentiment Analysis using Roberta and Vader [Dataset]. https://www.kaggle.com/datasets/jocelyndumlao/twitter-sentiment-analysis-using-roberta-and-vader
    Explore at:
    Available download formats: zip (32382 bytes)
    Dataset updated
    Oct 18, 2023
    Authors
    Jocelyn Dumlao
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description


    Our dataset comprises 1000 tweets, which were taken from Twitter using the Python programming language. The dataset was stored in a CSV file and generated using various modules. The random module was used to generate random IDs and text, while the faker module was used to generate random user names and dates. Additionally, the textblob module was used to assign a random sentiment to each tweet.

    This systematic approach ensures that the dataset is well-balanced and represents different types of tweets, user behavior, and sentiment. It is essential to have a balanced dataset to ensure that the analysis and visualization of the dataset are accurate and reliable. By generating tweets with a range of sentiments, we have created a diverse dataset that can be used to analyze and visualize sentiment trends and patterns.

    In addition to generating the tweets, we have also prepared a visual representation of the data sets. This visualization provides an overview of the key features of the dataset, such as the frequency distribution of the different sentiment categories, the distribution of tweets over time, and the user names associated with the tweets. This visualization will aid in the initial exploration of the dataset and enable us to identify any patterns or trends that may be present.

    Categories

    Natural Language Processing, Machine Learning Algorithm, Deep Learning

    Acknowledgements & Source

    Jannatul Ferdoshi

    Institutions: BRAC University

    Data Source

    Image Source: Twitter Sentiment Analysis Using Python GeeksforGeeks | lacienciadelcafe.com.ar

    Please don't forget to upvote if you find this useful.

  6. Datasets for Sentiment Analysis

    • zenodo.org
    csv
    Updated Dec 10, 2023
    Cite
    Julie R. Campos Arias (2023). Datasets for Sentiment Analysis [Dataset]. http://doi.org/10.5281/zenodo.10157504
    Explore at:
    Available download formats: csv
    Dataset updated
    Dec 10, 2023
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Julie R. Campos Arias (repository creator)
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository was created for my Master's thesis in Computational Intelligence and Internet of Things at the University of Córdoba, Spain. The purpose of this repository is to store the datasets found that were used in some of the studies that served as research material for this Master's thesis. Also, the datasets used in the experimental part of this work are included.

    Below are the datasets specified, along with the details of their references, authors, and download sources.

    ----------- STS-Gold Dataset ----------------

    The dataset consists of 2026 tweets. The file consists of 3 columns: id, polarity, and tweet. The three columns denote the unique id, polarity index of the text and the tweet text respectively.

    Reference: Saif, H., Fernandez, M., He, Y., & Alani, H. (2013). Evaluation datasets for Twitter sentiment analysis: a survey and a new dataset, the STS-Gold.

    File name: sts_gold_tweet.csv

    ----------- Amazon Sales Dataset ----------------

    This dataset contains ratings and reviews for 1K+ Amazon products, as listed on the official Amazon website. The data was scraped from the official Amazon website in January 2023.

    Owner: Karkavelraja J., Postgraduate student at Puducherry Technological University (Puducherry, Puducherry, India)

    Features:

    • product_id - Product ID
    • product_name - Name of the Product
    • category - Category of the Product
    • discounted_price - Discounted Price of the Product
    • actual_price - Actual Price of the Product
    • discount_percentage - Percentage of Discount for the Product
    • rating - Rating of the Product
    • rating_count - Number of people who voted for the Amazon rating
    • about_product - Description about the Product
    • user_id - ID of the user who wrote review for the Product
    • user_name - Name of the user who wrote review for the Product
    • review_id - ID of the user review
    • review_title - Short review
    • review_content - Long review
    • img_link - Image Link of the Product
    • product_link - Official Website Link of the Product

    License: CC BY-NC-SA 4.0

    File name: amazon.csv

    ----------- Rotten Tomatoes Reviews Dataset ----------------

    This rating inference dataset is a sentiment classification dataset containing 5,331 positive and 5,331 negative processed sentences from Rotten Tomatoes movie reviews. On average, these reviews consist of 21 words. The first 5,331 rows contain only negative samples and the last 5,331 rows contain only positive samples, so the data should be shuffled before use.

    The data was collected from https://www.cs.cornell.edu/people/pabo/movie-review-data/ as a txt file and converted into a CSV file. The file consists of 2 columns: reviews and labels (1 for fresh (good) and 0 for rotten (bad)).
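    Because the rows are sorted by label, a naive train/test split would put only one class in each part. A minimal sketch of a reproducible shuffle using only the Python standard library is shown below. The sample rows, the function name, and the seed are assumptions for illustration; only the column names `reviews` and `labels` come from the description above.

```python
import csv
import io
import random

# Tiny in-memory stand-in for data_rt.csv. These rows are hypothetical; the
# real file has 5,331 negative rows followed by 5,331 positive rows.
SAMPLE_CSV = """reviews,labels
dull and lifeless,0
a tedious mess,0
never finds its footing,0
warm and funny,1
a quiet triumph,1
beautifully acted,1
"""


def load_and_shuffle(handle, seed=42):
    """Read (reviews, labels) rows and return them in a reproducible random order."""
    rows = list(csv.DictReader(handle))
    random.Random(seed).shuffle(rows)  # seeded so the order is repeatable
    return rows


rows = load_and_shuffle(io.StringIO(SAMPLE_CSV))
labels = [int(row["labels"]) for row in rows]
print(labels)  # same six labels as the input, in shuffled order
```

    For the real file, the `io.StringIO` object would be replaced by an open file handle, for example `open("data_rt.csv", newline="", encoding="utf-8")`.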

    Reference: Bo Pang and Lillian Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05), pages 115–124, Ann Arbor, Michigan, June 2005. Association for Computational Linguistics

    File name: data_rt.csv

    ----------- Preprocessed Dataset Sentiment Analysis ----------------

    Preprocessed Amazon product review data for the Gen3EcoDot (Alexa), scraped entirely from amazon.in.
    Stemmed and lemmatized using nltk.
    Sentiment labels are generated using TextBlob polarity scores.

    The file consists of 4 columns: index, review (stemmed and lemmatized review using nltk), polarity (score) and division (categorical label generated using polarity score).
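    As a rough illustration of how a categorical label can be derived from a polarity score, the sketch below maps a score in the range [-1.0, 1.0] (the range TextBlob uses for polarity) to one of three labels. The threshold and the label names are assumptions; the description does not state how the `division` column was actually computed.

```python
def polarity_to_division(polarity: float, threshold: float = 0.0) -> str:
    """Map a polarity score in [-1.0, 1.0] to a categorical label.

    The threshold and the label names are assumptions for illustration;
    they are not taken from the dataset.
    """
    if not -1.0 <= polarity <= 1.0:
        raise ValueError(f"polarity out of range: {polarity}")
    if polarity > threshold:
        return "positive"
    if polarity < -threshold:
        return "negative"
    return "neutral"


for score in (0.8, 0.0, -0.35):
    print(score, polarity_to_division(score))
```

    Raising `threshold` (for example to 0.1) widens the neutral band around zero.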

    DOI: 10.34740/kaggle/dsv/3877817

    Citation: @misc{pradeesh arumadi_2022, title={Preprocessed Dataset Sentiment Analysis}, url={https://www.kaggle.com/dsv/3877817}, DOI={10.34740/KAGGLE/DSV/3877817}, publisher={Kaggle}, author={Pradeesh Arumadi}, year={2022} }

    This dataset was used in the experimental phase of my research.

    File name: EcoPreprocessed.csv

    ----------- Amazon Earphones Reviews ----------------

    This dataset consists of 9930 Amazon reviews with star ratings for the 10 latest (as of mid-2019) Bluetooth earphone devices, intended for learning how to train a machine for sentiment analysis.

    This dataset was employed in the experimental phase of my research. To align it with the objectives of my study, certain reviews were excluded from the original dataset, and an additional column was incorporated into this dataset.

    The file consists of 5 columns: ReviewTitle, ReviewBody, ReviewStar, Product and division (manually added - categorical label generated using ReviewStar score)

    License: U.S. Government Works

    Source: www.amazon.in

    File name (original): AllProductReviews.csv (contains 14337 reviews)

    File name (edited - used for my research) : AllProductReviews2.csv (contains 9930 reviews)

    ----------- Amazon Musical Instruments Reviews ----------------

    This dataset contains 7137 comments/reviews of different musical instruments coming from Amazon.

    This dataset was employed in the experimental phase of my research. To align it with the objectives of my study, certain reviews were excluded from the original dataset, and an additional column was incorporated into this dataset.

    The file consists of 10 columns: reviewerID, asin (ID of the product), reviewerName, helpful (helpfulness rating of the review), reviewText, overall (rating of the product), summary (summary of the review), unixReviewTime (time of the review - unix time), reviewTime (time of the review - raw) and division (manually added - categorical label generated using overall score).

    Source: http://jmcauley.ucsd.edu/data/amazon/

    File name (original): Musical_instruments_reviews.csv (contains 10261 reviews)

    File name (edited - used for my research) : Musical_instruments_reviews2.csv (contains 7137 reviews)

  7. 6992 Meme Images Dataset with Labels

    • kaggle.com
    zip
    Updated Jul 9, 2023
    Cite
    Hammad Javaid (2023). 6992 Meme Images Dataset with Labels [Dataset]. https://www.kaggle.com/datasets/hammadjavaid/6992-labeled-meme-images-dataset
    Explore at:
    Available download formats: zip (726545486 bytes)
    Dataset updated
    Jul 9, 2023
    Authors
    Hammad Javaid
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    Explore the sentiments behind internet memes with this diverse dataset of 6992 meme images. The set is aimed at tasks like sentiment classification and majority voting using any six classifiers of your choice (three for images, three for text) from the sklearn library.

    Outputs should include confusion matrix, accuracy, recall, precision, and F1-measure, providing a comprehensive overview of classifier performance. Ideal for those interested in multimodal data, social media analysis, NLP, image/text classification, text mining, machine learning, deep learning, and sentiment analysis.
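    The evaluation outputs listed above can be computed from a confusion matrix. The following is a minimal pure-Python sketch with hypothetical labels and predictions; the function name and sample data are assumptions. scikit-learn provides equivalent functionality in `sklearn.metrics` (for example `confusion_matrix` and `classification_report`), which would normally be preferred in practice.

```python
from collections import Counter


def evaluate(y_true, y_pred):
    """Return confusion counts, accuracy, and per-class precision/recall/F1."""
    labels = sorted(set(y_true) | set(y_pred))
    confusion = Counter(zip(y_true, y_pred))  # (true, predicted) -> count
    accuracy = sum(confusion[(c, c)] for c in labels) / len(y_true)
    per_class = {}
    for c in labels:
        tp = confusion[(c, c)]
        fp = sum(confusion[(t, c)] for t in labels if t != c)  # predicted c, truly other
        fn = sum(confusion[(c, p)] for p in labels if p != c)  # truly c, predicted other
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        per_class[c] = {"precision": precision, "recall": recall, "f1": f1}
    return confusion, accuracy, per_class


# Hypothetical ground truth and predictions for six memes.
y_true = ["pos", "pos", "neg", "neg", "neu", "neu"]
y_pred = ["pos", "neg", "neg", "neg", "neu", "pos"]
confusion, accuracy, per_class = evaluate(y_true, y_pred)
print(f"accuracy = {accuracy:.3f}")
for label, scores in per_class.items():
    print(label, {name: round(value, 3) for name, value in scores.items()})
```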

  8. Sentiments on COVID-19

    • figshare.com
    txt
    Updated May 18, 2022
    Cite
    Stephen Afrifa (2022). Sentiments on COVID-19 [Dataset]. http://doi.org/10.6084/m9.figshare.19337855.v1
    Explore at:
    Available download formats: txt
    Dataset updated
    May 18, 2022
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Stephen Afrifa
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The dataset consists of sentiments shared on COVID-19 between November 1, 2021 and January 31, 2022. It is used to analyze sentiments shared on COVID-19 and can be applied in other machine learning algorithms.

  9. ColorEmoNet

    • data.mendeley.com
    Updated Jun 26, 2025
    Cite
    SHANKAR MALI (2025). ColorEmoNet [Dataset]. http://doi.org/10.17632/zm46z6y597.1
    Explore at:
    Dataset updated
    Jun 26, 2025
    Authors
    SHANKAR MALI
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The ColorEmoNet dataset has been constructed using foundational concepts from colour theory to explore the relationship between colours and emotions.

  10. Movies reviews, ratings, Images Dataset

    • kaggle.com
    zip
    Updated Jan 2, 2024
    Cite
    Shivam Ardeshna (2024). Movies reviews, ratings, Images Dataset [Dataset]. https://www.kaggle.com/datasets/shivamardeshna/movies-dataset
    Explore at:
    Available download formats: zip (1910652105 bytes)
    Dataset updated
    Jan 2, 2024
    Authors
    Shivam Ardeshna
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Movie Sentiment and Rating Images Dataset is a comprehensive collection of images representing movie posters, accompanied by sentiment labels and user ratings. The dataset is designed to facilitate research and exploration in the domain of sentiment analysis and rating prediction based on visual content, particularly movie poster images.

    Key Features:

    Images:

    The dataset includes approx 33,000 high-resolution movie poster images. Each image provides a visual representation of a movie, typically derived from its promotional material.

    Sentiment Labels:

    Sentiment labels are assigned to each movie poster, reflecting the emotional tone or sentiment conveyed by the image. Sentiment labels may include categories such as positive, negative, neutral, or a more granular set of emotions.

    User Ratings:

    User ratings are associated with each movie in the dataset, indicating the numeric evaluation given by viewers. Ratings may follow a scale (e.g., 1 to 5 stars) and provide insights into the perceived quality or popularity of the movie.

    Potential Use Cases:

    Sentiment Analysis:

    Researchers and practitioners can leverage the dataset for sentiment analysis tasks, and training models to predict sentiment based on movie poster images.

    Rating Prediction:

    The dataset enables the development and evaluation of models for predicting user ratings from visual content, offering insights into viewer preferences.

    Content-Based Recommender Systems:

    The combination of sentiment labels and ratings makes the dataset suitable for exploring content-based recommender systems for movies.

    Deep Learning and Computer Vision Research:

    Researchers in deep learning and computer vision can use the dataset to investigate image-based sentiment analysis and rating prediction challenges.

    Acknowledgments: Include any acknowledgments or credits for the sources of the dataset, if applicable.

    Note: Provide any additional information or specific details about the dataset that may be relevant for users, such as data format, licensing, or preprocessing steps.

    By providing a clear and informative description, users can better understand the contents and potential applications of your Movie Sentiment and Rating Images Dataset.

  11. IFEED: Interactive Facial Expression and Emotion Detection Dataset

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated May 25, 2023
    Cite
    Tiago Dias; João Vitorino; Jorge Oliveira; Nuno Oliveira; Eva Maia; Isabel Praça (2023). IFEED: Interactive Facial Expression and Emotion Detection Dataset [Dataset]. http://doi.org/10.5281/zenodo.7963452
    Explore at:
    Available download formats: zip
    Dataset updated
    May 25, 2023
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Tiago Dias; João Vitorino; Jorge Oliveira; Nuno Oliveira; Eva Maia; Isabel Praça
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Interactive Facial Expression and Emotion Detection (IFEED) is an annotated dataset that can be used to train, validate, and test Deep Learning models for facial expression and emotion recognition. It contains pre-filtered and analysed images of the interactions between the six main characters of the Friends television series, obtained from the video recordings of the Multimodal EmotionLines Dataset (MELD).

    The images were obtained by decomposing the videos into multiple frames and extracting the facial expression of the correctly identified characters. A team composed of 14 researchers manually verified and annotated the processed data into several classes: Angry, Sad, Happy, Fearful, Disgusted, Surprised and Neutral.

    IFEED can be valuable for the development of intelligent facial expression recognition solutions and emotion detection software, enabling binary or multi-class classification, or even anomaly detection or clustering tasks. The images with ambiguous or very subtle facial expressions can be repurposed for adversarial learning. The dataset can be combined with additional data recordings to create more complete and extensive datasets and improve the generalization of robust deep learning models.

  12. BaSalam

    • kaggle.com
    zip
    Updated Dec 5, 2024
    Cite
    hana veranloo (2024). BaSalam [Dataset]. https://www.kaggle.com/datasets/hanaveranloo/basalam
    Explore at:
    Available download formats: zip (30177607 bytes)
    Dataset updated
    Dec 5, 2024
    Authors
    hana veranloo
    Description

    Dataset

    This dataset was created by hana veranloo

    Released under Other (specified in description)


  13. Multimodal Sentiment Analysis for Urdu Language

    • ieee-dataport.org
    Updated Dec 2, 2024
    Cite
    Ghulam Rabbani (2024). Multimodal Sentiment Analysis for Urdu Language [Dataset]. https://ieee-dataport.org/documents/multimodal-sentiment-analysis-urdu-language
    Explore at:
    Dataset updated
    Dec 2, 2024
    Authors
    Ghulam Rabbani
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    natural language processing

  14. Facial Emotion Recognition Image Dataset

    • kaggle.com
    zip
    Updated Dec 7, 2023
    Cite
    Sujay Kapadnis (2023). Facial Emotion Recognition Image Dataset [Dataset]. https://www.kaggle.com/datasets/sujaykapadnis/emotion-recognition-dataset
    Explore at:
    Available download formats: zip (2126237709 bytes)
    Dataset updated
    Dec 7, 2023
    Authors
    Sujay Kapadnis
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    The dataset consists of 6 distinct emotions: Happy, Angry, Sad, Neutral, Surprise and Ahegao. Images are RGB and presented as cropped faces with corresponding emotions. The images were collected by scraping social networks such as Facebook and Instagram, scraping YouTube videos, and drawing on existing datasets such as IMDB and AffectNet.

    1) dataset.zip contains folders with corresponding classes.
    2) data.csv contains paths to images and corresponding labels.

    Kovenko, Volodymyr; Shevchuk, Vitalii (2021), “OAHEGA : EMOTION RECOGNITION DATASET”, Mendeley Data, V2, doi: 10.17632/5ck5zz6f2c.2

  15. t4sa Dataset: Twitter data | Sentiment analysis

    • kaggle.com
    zip
    Updated May 24, 2022
    Cite
    Sandeep Kumar Kushwaha (2022). t4sa Dataset: Twitter data | Sentiment analysis [Dataset]. https://www.kaggle.com/datasets/sandeepkumarkushwaha/t4sa-dataset/code
    Explore at:
    Available download formats: zip (241656940 bytes)
    Dataset updated
    May 24, 2022
    Authors
    Sandeep Kumar Kushwaha
    Description

    Source: http://www.t4sa.it/

    Disclaimer: I do not own this dataset. If there is any privacy issue, please let me know and I will take it down.

    Description: The data collection process took place from July to December 2016, lasting around 6 months in total. During this time span, we exploited Twitter's Sample API to access a random 1% sample of the stream of all globally produced tweets, discarding:

    • tweets not containing any static image or containing other media (i.e., we also discarded tweets containing only videos and/or animated GIFs)
    • tweets not written in the English language
    • tweets whose text was less than 5 words long
    • retweets
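
    The discard rules listed above can be sketched as a single predicate. This is an illustration only, not the T4SA authors' pipeline: the record layout and the field names (`text`, `lang`, `is_retweet`, `media_types`) are hypothetical and do not correspond to the Twitter API.

```python
def keep_tweet(tweet: dict) -> bool:
    """Return True if a tweet record survives all of the discard rules.

    The field names are hypothetical, chosen only for this example.
    """
    media = tweet.get("media_types", [])
    has_static_image = "photo" in media
    has_other_media = any(kind != "photo" for kind in media)  # video, GIF, ...
    return (
        has_static_image
        and not has_other_media
        and tweet.get("lang") == "en"                  # English only
        and len(tweet.get("text", "").split()) >= 5    # at least 5 words
        and not tweet.get("is_retweet", False)         # no retweets
    )


# Hypothetical records, one per rule.
samples = [
    {"text": "what a lovely sunny day today", "lang": "en",
     "is_retweet": False, "media_types": ["photo"]},
    {"text": "too short", "lang": "en",
     "is_retweet": False, "media_types": ["photo"]},
    {"text": "que dia tan bonito hace hoy", "lang": "es",
     "is_retweet": False, "media_types": ["photo"]},
    {"text": "look at this funny clip here", "lang": "en",
     "is_retweet": False, "media_types": ["animated_gif"]},
    {"text": "what a lovely sunny day today", "lang": "en",
     "is_retweet": True, "media_types": ["photo"]},
]
kept = [tweet for tweet in samples if keep_tweet(tweet)]
print(len(kept), "of", len(samples), "tweets kept")
```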

    If you have used our data or trained models in a scientific publication, we would appreciate citations to the following paper:

    @InProceedings{Vadicamo_2017_ICCVW, author = {Vadicamo, Lucia and Carrara, Fabio and Cimino, Andrea and Cresci, Stefano and Dell'Orletta, Felice and Falchi, Fabrizio and Tesconi, Maurizio}, title = {Cross-Media Learning for Image Sentiment Analysis in the Wild}, booktitle = {2017 IEEE International Conference on Computer Vision Workshops (ICCVW)}, pages={308-317}, doi={10.1109/ICCVW.2017.45}, month = {Oct}, year = {2017} }

  16. Emoji Sentiment Ranking

    • figshare.com
    txt
    Updated May 30, 2023
    Cite
    Petra Kralj Novak; Jasmina Smailović; Borut Sluban; Igor Mozetic (2023). Emoji Sentiment Ranking [Dataset]. http://doi.org/10.6084/m9.figshare.1600931.v1
    Explore at:
    Available download formats: txt
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Petra Kralj Novak; Jasmina Smailović; Borut Sluban; Igor Mozetic
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A lexicon of 751 emoji characters with automatically assigned sentiment. The sentiment is computed from 70,000 tweets, labeled by 83 human annotators in 13 European languages. The Emoji Sentiment Ranking web page at http://kt.ijs.si/data/Emoji_sentiment_ranking/ is automatically generated from the data provided in this repository. The process and analysis of emoji sentiment ranking is described in the paper: P. Kralj Novak, J. Smailović, B. Sluban, I. Mozetič, Sentiment of Emojis, submitted; arXiv preprint, http://arxiv.org/abs/1509.07761, 2015.

  17. facial-emotion-recognition-dataset

    • huggingface.co
    Updated Jul 22, 2023
    Cite
    Unique Data (2023). facial-emotion-recognition-dataset [Dataset]. https://huggingface.co/datasets/UniqueData/facial-emotion-recognition-dataset
    Explore at:
    Dataset updated
    Jul 22, 2023
    Authors
    Unique Data
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0): https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    The dataset consists of images capturing people displaying 7 distinct emotions (anger, contempt, disgust, fear, happiness, sadness and surprise). Each image in the dataset represents one of these specific emotions, enabling researchers and machine learning practitioners to study and develop models for emotion recognition and analysis. The images encompass a diverse range of individuals, including different genders, ethnicities, and age groups. The dataset aims to provide a comprehensive representation of human emotions, allowing for a wide range of use cases.

  18. Czech image captioning, machine translation, and sentiment analysis (Neural...

    • live.european-language-grid.eu
    Updated Jul 12, 2018
    Cite
    (2018). Czech image captioning, machine translation, and sentiment analysis (Neural Monkey models) [Dataset]. https://live.european-language-grid.eu/catalogue/tool-service/18210
    Explore at:
    Dataset updated
    Jul 12, 2018
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This submission contains trained end-to-end models for the Neural Monkey toolkit for Czech and English, solving three NLP tasks: machine translation, image captioning, and sentiment analysis. The models are trained on standard datasets and achieve state-of-the-art or near state-of-the-art performance in the tasks. The models are described in the accompanying paper. The same models can also be invoked via the online demo: https://ufal.mff.cuni.cz/grants/lsd

    There are several separate ZIP archives here, each containing one model solving one of the tasks for one language.

    To use a model, you first need to install Neural Monkey: https://github.com/ufal/neuralmonkey

    To ensure correct functioning of the model, please use the exact version of Neural Monkey specified by the commit hash stored in the 'git_commit' file in the model directory.

    Each model directory contains a 'run.ini' Neural Monkey configuration file, to be used to run the model. See the Neural Monkey documentation to learn how to do that (you may need to update some paths to correspond to your filesystem organization). The 'experiment.ini' file, which was used to train the model, is also included. Then there are files containing the model itself, files containing the input and output vocabularies, etc.

    For the sentiment analyzers, you should tokenize your input data using the Moses tokenizer: https://pypi.org/project/mosestokenizer/

    For the machine translation, you do not need to tokenize the data, as this is done by the model.

    For image captioning, you need to:

    • download a trained ResNet: http://download.tensorflow.org/models/resnet_v2_50_2017_04_14.tar.gz
    • clone the git repository with TensorFlow models: https://github.com/tensorflow/models
    • preprocess the input images with the Neural Monkey 'scripts/imagenet_features.py' script (https://github.com/ufal/neuralmonkey/blob/master/scripts/imagenet_features.py); you need to specify the path to ResNet and to the TensorFlow models to this script

    Feel free to contact the authors of this submission in case you run into problems!

  19. img-emotion-classification

    • huggingface.co
    Updated Oct 22, 2025
    Cite
    Keyur Jotaniya (2025). img-emotion-classification [Dataset]. https://huggingface.co/datasets/Keyurjotaniya007/img-emotion-classification
    Explore at:
    Dataset updated
    Oct 22, 2025
    Authors
    Keyur Jotaniya
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Emotion Image Classification Dataset

    This dataset is designed for emotion recognition from facial images, enabling machine learning and deep learning models to classify human emotions based on facial expressions. It can be used for applications such as affective computing, sentiment analysis, and human-computer interaction.

    Dataset Overview

    DatasetDict({
        train: Dataset({
            features: ['image', 'label'],
            num_rows: 28709
        })
        test: Dataset({
            features:…

    See the full description on the dataset page: https://huggingface.co/datasets/Keyurjotaniya007/img-emotion-classification.
    
  20. South Asian Facial Expression Image Dataset

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
    Cite
    FutureBee AI (2022). South Asian Facial Expression Image Dataset [Dataset]. https://www.futurebeeai.com/dataset/image-dataset/facial-images-expression-south-asian
    Explore at:
    Available download formats: wav
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreement

    Area covered
    South Asia
    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    Welcome to the South Asian Facial Expression Image Dataset, curated to support the development of advanced facial expression recognition systems, biometric identification models, KYC verification processes, and a wide range of facial analysis applications. This dataset is ideal for training robust emotion-aware AI solutions.

    Facial Expression Data

    The dataset includes over 2000 high-quality facial expression images, grouped into participant-wise sets. Each participant contributes:

    Expression Images: 5 distinct facial images capturing common human emotions: Happy, Sad, Angry, Shocked, and Neutral

    Diversity & Representation

    Geographical Coverage: Individuals from South Asian countries including India, Pakistan, Bangladesh, Nepal, Sri Lanka, Bhutan, Maldives, and more
    Demographics: Participants aged 18 to 70 years, with a gender distribution of 60% male and 40% female
    File Formats: All images are available in JPEG and HEIC formats

    Image Quality & Capture Conditions

    To ensure generalizability and robustness in model training, images were captured under varied real-world conditions:

    Lighting Conditions: Natural and artificial lighting to represent diverse scenarios
    Background Variability: Indoor and outdoor backgrounds to enhance model adaptability
    Device Quality: Captured using modern smartphones to ensure clarity and consistency

    Metadata

    Each participant's image set is accompanied by detailed metadata, enabling precise filtering and training:

    Unique Participant ID
    File Name
    Age
    Gender
    Country
    Facial Expression Label
    Demographic Information
    File Format

    This metadata helps in building expression recognition models that are both accurate and inclusive.
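As a rough illustration of the metadata-based filtering described above, the following Python sketch selects image records by expression, age range, and country. The listing does not specify the metadata file format or column names, so the `ImageRecord` class, its field names, the `filter_records` helper, and the sample rows are all hypothetical stand-ins; the free-form "Demographic Information" field is left out for brevity.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class ImageRecord:
    """One metadata row; field names are assumed, not taken from the dataset."""
    participant_id: str
    file_name: str
    age: int
    gender: str
    country: str
    expression: str
    file_format: str


def filter_records(
    records: Iterable[ImageRecord],
    *,
    expression: Optional[str] = None,
    min_age: Optional[int] = None,
    max_age: Optional[int] = None,
    country: Optional[str] = None,
) -> List[ImageRecord]:
    """Return the records that satisfy every criterion that was supplied."""
    selected = []
    for rec in records:
        if expression is not None and rec.expression != expression:
            continue
        if min_age is not None and rec.age < min_age:
            continue
        if max_age is not None and rec.age > max_age:
            continue
        if country is not None and rec.country != country:
            continue
        selected.append(rec)
    return selected


# Made-up rows for demonstration only; they are not drawn from the dataset.
SAMPLE = [
    ImageRecord("P001", "P001_happy.jpg", 24, "female", "India", "Happy", "JPEG"),
    ImageRecord("P001", "P001_sad.jpg", 24, "female", "India", "Sad", "JPEG"),
    ImageRecord("P002", "P002_happy.heic", 51, "male", "Nepal", "Happy", "HEIC"),
]

if __name__ == "__main__":
    # Select "Happy" images from participants aged 30 or older.
    for rec in filter_records(SAMPLE, expression="Happy", min_age=30):
        print(rec.file_name)
```

Keeping every criterion optional lets the same helper build balanced training subsets (for example, one call per expression label) without a separate function per field.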

    Use Cases & Applications

    This dataset is ideal for a variety of AI and computer vision applications, including:

    Facial Expression Recognition: Improve accuracy in detecting emotions like happiness, anger, or surprise
    Biometric & Identity Systems: Enhance facial biometric authentication with expression variation handling
    KYC & Identity Verification: Validate facial consistency in ID documents and selfies despite varied expressions
    Generative AI Training: Support expression generation and animation in AI-generated facial images
    Emotion-Aware Systems: Power human-computer interaction, mental health assessment, and adaptive learning apps

    Secure & Ethical Collection

    Data Security: All data is securely processed and stored on FutureBeeAI’s proprietary platform
    Ethical Standards: Collection followed strict ethical guidelines ensuring participant privacy and informed consent
    Informed Consent: All participants were made aware of the data use and provided written consent

    Dataset Updates & Customization

    To support evolving AI development needs, this dataset is regularly updated and can be tailored to project-specific requirements. Custom options include:

