100+ datasets found
  1. P

    Sentiment Analysis for Social Media Monitoring Dataset

    • paperswithcode.com
    Updated Mar 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Sentiment Analysis for Social Media Monitoring Dataset [Dataset]. https://paperswithcode.com/dataset/sentiment-analysis-for-social-media
    Explore at:
    Dataset updated
    Mar 6, 2025
    Description

    Problem Statement

    👉 Download the case studies here

    A global consumer goods company struggled to understand customer sentiment across various social media platforms. With millions of posts, reviews, and comments generated daily, manually tracking and analyzing public opinion was inefficient. The company needed an automated solution to monitor brand perception, address negative feedback promptly, and leverage insights for marketing strategies.

    Challenge

    Analyzing social media sentiment posed the following challenges:

    Processing vast amounts of unstructured text data from multiple platforms like Twitter, Facebook, and Instagram.

    Accurately interpreting slang, emojis, and nuanced language used by social media users.

    Identifying trends and actionable insights in real-time to respond to potential crises or opportunities effectively.

    Solution Provided

    An advanced sentiment analysis system was developed using Natural Language Processing (NLP) and sentiment analysis algorithms. The solution was designed to:

    Classify social media posts into positive, negative, and neutral sentiments.

    Extract key topics and trends related to the brand and its products.

    Provide real-time dashboards for monitoring customer sentiment and identifying areas of improvement.

    Development Steps

    Data Collection

    Aggregated data from major social media platforms using APIs, focusing on brand mentions, hashtags, and product keywords.

    Preprocessing

    Cleaned and normalized text data, including handling slang, emojis, and misspellings, to prepare it for analysis.

    Model Training

    Trained NLP models for sentiment classification using supervised learning. Implemented topic modeling algorithms to identify recurring themes and discussions.

    Validation

    Tested the sentiment analysis models on labeled datasets to ensure high accuracy and relevance in classifying social media posts.

    Deployment

    Integrated the sentiment analysis system with a real-time analytics dashboard, enabling the marketing and customer support teams to track trends and respond proactively.

    Monitoring & Improvement

    Established a continuous feedback mechanism to refine models based on evolving language patterns and new social media trends.

    Results

    Gained Actionable Insights

    The system provided detailed insights into customer opinions, helping the company identify strengths and areas for improvement.

    Improved Brand Reputation Management

    Real-time monitoring enabled swift responses to negative feedback, mitigating potential reputation risks.

    Informed Marketing Strategies

    Insights from sentiment analysis guided targeted marketing campaigns, resulting in higher engagement and ROI.

    Enhanced Customer Relationships

    Proactive engagement with customers based on sentiment analysis improved customer satisfaction and loyalty.

    Scalable Monitoring Solution

    The system scaled efficiently to analyze data across multiple languages and platforms, broadening the company’s reach and understanding.

  2. g

    Sentiment Analysis for Social Media Monitoring

    • gts.ai
    json
    Updated Nov 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GTS (2023). Sentiment Analysis for Social Media Monitoring [Dataset]. https://gts.ai/case-study/sentiment-analysis-for-social-media-monitoring/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Nov 20, 2023
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Understand the mood of social media with sentiment analysis. Monitor brand mentions, analyze feedback, and tailor strategies.

  3. Social Media Posts in Arabic Dialect

    • kaggle.com
    Updated Jul 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UM6P Open Data (2024). Social Media Posts in Arabic Dialect [Dataset]. https://www.kaggle.com/datasets/um6popendata/sentiment-analysis-for-sm-posts-in-arabic-dialect
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 11, 2024
    Dataset provided by
    Kaggle
    Authors
    UM6P Open Data
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset: Sentiment Analysis for Social Media Posts in Arabic Dialect

    Overview

    This dataset contains a labeled collection of approximately 50,000 social media posts in various Arabic dialects. Each post has been manually annotated with sentiment labels, providing a rich resource for natural language processing and sentiment analysis research.

    Dataset Owner

    UM6P College of Computing

    Content

    • Posts: The dataset includes raw text data of social media posts written in different Arabic dialects.
    • Sentiment Labels: Each post is labeled with one of the following sentiment categories:
      • Positive
      • Negative
      • Neutral

    Features

    • Post ID: A unique identifier for each social media post.
    • Text: The content of the social media post in Arabic.
    • Sentiment: The sentiment label assigned to the post (Positive, Negative, Neutral).

    Format

    The dataset is provided in a CSV format with the following columns: - Post_ID: Integer - Text: String - Sentiment: String (Positive, Negative, Neutral)

    Usage

    This dataset is ideal for tasks such as: - Training sentiment analysis models - Studying sentiment trends in Arabic social media - Exploring the linguistic characteristics of Arabic dialects - Benchmarking sentiment analysis tools

    Example Data

    Post_IDTextSentiment
    1"هذا المنتج رائع جدًا وأحببته كثيرًا"Positive
    2"لم يعجبني هذا الفيلم، كان مملًا جدًا"Negative
    3"الطقس اليوم عادي، لا يوجد شيء مميز"Neutral

    Licensing

    Please refer to the dataset license included in the dataset files for information on usage rights and restrictions.

    Citation

    An open access NLP dataset for Arabic dialects: data collection, labeling, and model construction, Elmehdi Boujou, Hamza Chataoui, Abdellah El Mekki, Saad Benjelloun, Ikram Chairi and Ismail Berrada MENACIS 2020 conference, In press.

  4. Original Twitter Data Set - Supplemental Material for "Does Negative Social...

    • figshare.com
    txt
    Updated May 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Diego Corry (2024). Original Twitter Data Set - Supplemental Material for "Does Negative Social Media Sentiment Affect Stock Prices? Evidence from the Canadian Energy Sector" [Dataset]. http://doi.org/10.6084/m9.figshare.25761849.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 6, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Diego Corry
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Canada
    Description

    “Supplemental material, Original Twitter Dataset, for "Does Negative Social Media Sentiment Affect Stock Prices? Evidence from the Canadian Energy Sector" by Diego Corry and Hamish van der Ven in Business and Society.

  5. Data from: Social Media Engagement Dataset

    • kaggle.com
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Subash Shanmugam (2025). Social Media Engagement Dataset [Dataset]. https://www.kaggle.com/datasets/subashmaster0411/social-media-engagement-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Subash Shanmugam
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This machine-generated dataset simulates social media engagement data across various metrics, including likes, shares, comments, impressions, sentiment scores, toxicity, and engagement growth. It is designed for analysis and visualization of trends, buzz frequency, public sentiment, and user behavior on digital platforms.

    The dataset can be used to:

    Identify spikes or drops in engagement

    Analyze changes in sentiment over time

    Build dashboards for digital trend tracking

    Test algorithms for sentiment analysis or trend prediction

  6. SENTIMENT ANALYSIS OF SOCIAL MEDIA PLATFORMS

    • kaggle.com
    Updated Sep 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jigyashu Singh Lodhi (2023). SENTIMENT ANALYSIS OF SOCIAL MEDIA PLATFORMS [Dataset]. http://doi.org/10.34740/kaggle/dsv/6473513
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 14, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Jigyashu Singh Lodhi
    Description

    Dataset

    This dataset was created by Jigyashu Singh Lodhi

    Released under Other (specified in description)

    Contents

  7. Towards Generalization of Machine Learning Models: An Arabic Sentiment...

    • zenodo.org
    • data.niaid.nih.gov
    csv
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samir Abdaljalil; Shaimaa Hassanein; Hamdy Mubarak; Ahmed Abdelali; Samir Abdaljalil; Shaimaa Hassanein; Hamdy Mubarak; Ahmed Abdelali (2023). Towards Generalization of Machine Learning Models: An Arabic Sentiment Analysis Dataset [Dataset]. http://doi.org/10.5281/zenodo.7801450
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Samir Abdaljalil; Shaimaa Hassanein; Hamdy Mubarak; Ahmed Abdelali; Samir Abdaljalil; Shaimaa Hassanein; Hamdy Mubarak; Ahmed Abdelali
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This data set consists of approximately 1.64 Million Arabic tweets (shared by their IDs) posted from 2009 to 2020, and their corresponding sentiment using a three-point classification system of Positive, Negative and Neutral/Mixed. No specific locations and/or keywords were specified throughout the data collection to obtain variation in the dialects and topics represented within the dataset. It is important to note that any biases in the proposed dataset in relation to the dialects and/or topics discussed were unintentional.

    Please use the following citation if you use this data in a paper:

    Abdaljalil, S., Hassanein, S., Mubarak, H., & Abdelali, A. (2023). Towards Generalization of Machine Learning Models: A Case Study of Arabic Sentiment Analysis. Proceedings of the International AAAI Conference on Web and Social Media, 17(1), 971-980.

  8. f

    Data from: THE APPLICATION OF THE SENTIMENT ANALYSIS TECHNIQUE IN SOCIAL...

    • scielo.figshare.com
    jpeg
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel José Silva Oliveira; Paulo Henrique de Souza Bermejo; José Roberto Pereira; Daniely Aparecida Barbosa (2023). THE APPLICATION OF THE SENTIMENT ANALYSIS TECHNIQUE IN SOCIAL MEDIA AS A TOOL FOR SOCIAL MANAGEMENT PRACTICES AT THE GOVERNMENTAL LEVEL [Dataset]. http://doi.org/10.6084/m9.figshare.7865825.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    SciELO journals
    Authors
    Daniel José Silva Oliveira; Paulo Henrique de Souza Bermejo; José Roberto Pereira; Daniely Aparecida Barbosa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract Sentiment analysis is a knowledge discovery technique developed from data mining; its purpose is to reveal people’s opinions on specific topics. This is an appropriate technique to apply to unstructured data sources, such as social media, that cover information on a variety of topics (such as politics and public administration). In this context, the objective of this study was to identify whether sentiment analysis can reflect public opinion and, thus, contribute to practices of social management. Therefore, the sentiment analysis technique was applied to reveal citizens’ opinions, which were expressed on Twitter and concerned some of the main social programs in force during Brazil’s Rousseff government. The study consisted of a comparison between the results of the sentiment analysis and the concepts and applications involving four strategies of social media used by governments from the point of view of social management. The results revealed that the sentiment analysis technique could contribute to social management practices in the context of the network strategy.

  9. E

    Sentiment analysis of tech media articles using VADER package and...

    • live.european-language-grid.eu
    csv
    Updated Aug 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Sentiment analysis of tech media articles using VADER package and co-occurrence analysis [Dataset]. https://live.european-language-grid.eu/catalogue/lcr/1351
    Explore at:
    csvAvailable download formats
    Dataset updated
    Aug 16, 2023
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sentiment analysis of tech media articles using VADER package and co-occurrence analysis

    Sources: Above 140k articles (01.2016-03.2019):

    Gigaom 0.5%

    Euractiv 0.9%

    The Conversation 1.3%

    Politico Europe 1.3%

    IEEE Spectrum 1.8%

    Techforge 4.3%

    Fastcompany 4.5%

    The Guardian (Tech) 9.2%

    Arstechnica 10.0%

    Reuters 11%

    Gizmodo 17.5%

    ZDNet 18.3%

    The Register 19.5%

    Methodology

    The sentiment analysis has been prepared using VADER*, an open-source lexicon and rule-based sentiment analysis tool. VADER is specifically designed for social media analysis, but can be also applied for other text sources. The sentiment lexicon was compiled using various sources (other sentiment data sets, Twitter etc.) and was validated by human input. The advantage of VADER is that the rule-based engine includes word-order sensitive relations and degree modifiers.

    As VADER is more robust in the case of shorter social media texts, the analysed articles have been divided into paragraphs. The analysis have been carried out for the social issues presented in the co-occurrence exercise.

    The process included the following main steps:

    The 100 most frequently co-occurring terms are identified for every social issue (using the co-occurrence methodology)

    The articles containing the given social issue and co-occurring term are identified

    The identified articles are divided into paragraphs

    Social issue and co-occurring words are removed from the paragraph

    The VADER sentiment analysis is carried out for every identified and modified paragraph

    The average for the given word pair is calculated for the final result

    Therefore, the procedure has been repeated for 100 words for all identified social issues.

    The sentiment analysis resulted in a compound score for every paragraph. The score is calculated from the sum of the valence scores of each word in the paragraph, and normalised between the values -1 (most extreme negative) and +1 (most extreme positive). Finally, the average is calculated from the paragraph results. Removal of terms is meant to exclude sentiment of the co-occurring word itself, because the word may be misleading, e.g. when some technologies or companies attempt to solve a negative issue. The neighbourhood's scores would be positive, but the negative term would bring the paragraph's score down.

    The presented tables include the most extreme co-occurring terms for the analysed social issue. The examples are chosen from the list of words with 30 most positive and 30 most negative sentiment. The presented graphs show the evolution of sentiments for social issues. The analysed paragraphs are selected the following way:

    The articles containing the given social issue are identified

    The paragraphs containing the social issue are selected for sentiment analysis

    *Hutto, C.J. & Gilbert, E.E. (2014). VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text. Eighth International Conference on Weblogs and Social Media (ICWSM-14). Ann Arbor, MI, June 2014.

    Files

    sentiments_mod11.csv sentiment score based on chosen unigrams

    sentiments_mod22.csv sentiment score based on chosen bigrams

    sentiments_cooc_mod11.csv, sentiments_cooc_mod12.csv, sentiments_cooc_mod21.csv, sentiments_cooc_mod22.csv combinations of co-occurrences: unigrams-unigrams, unigrams-bigrams, bigrams-unigrams, bigrams-bigrams

  10. Social Media Datasets

    • brightdata.com
    .json, .csv, .xlsx
    Updated Sep 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2022). Social Media Datasets [Dataset]. https://brightdata.com/products/datasets/social-media
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Sep 7, 2022
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Gain valuable insights with our comprehensive Social Media Dataset, designed to help businesses, marketers, and analysts track trends, monitor engagement, and optimize strategies. This dataset provides structured and reliable social media data from multiple platforms.

    Dataset Features

    User Profiles: Access public social media profiles, including usernames, bios, follower counts, engagement metrics, and more. Ideal for audience analysis, influencer marketing, and competitive research. Posts & Content: Extract posts, captions, hashtags, media (images/videos), timestamps, and engagement metrics such as likes, shares, and comments. Useful for trend analysis, sentiment tracking, and content strategy optimization. Comments & Interactions: Analyze user interactions, including replies, mentions, and discussions. This data helps brands understand audience sentiment and engagement patterns. Hashtag & Trend Tracking: Monitor trending hashtags, topics, and viral content across platforms to stay ahead of industry trends and consumer interests.

    Customizable Subsets for Specific Needs Our Social Media Dataset is fully customizable, allowing you to filter data based on platform, region, keywords, engagement levels, or specific user profiles. Whether you need a broad dataset for market research or a focused subset for brand monitoring, we tailor the dataset to your needs.

    Popular Use Cases

    Brand Monitoring & Reputation Management: Track brand mentions, customer feedback, and sentiment analysis to manage online reputation effectively. Influencer Marketing & Audience Analysis: Identify key influencers, analyze engagement metrics, and optimize influencer partnerships. Competitive Intelligence: Monitor competitor activity, content performance, and audience engagement to refine marketing strategies. Market Research & Consumer Insights: Analyze social media trends, customer preferences, and emerging topics to inform business decisions. AI & Predictive Analytics: Leverage structured social media data for AI-driven trend forecasting, sentiment analysis, and automated content recommendations.

    Whether you're tracking brand sentiment, analyzing audience engagement, or monitoring industry trends, our Social Media Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.

  11. Twitter Sentiment Analysis Datasets

    • brightdata.com
    .json, .csv, .xlsx
    Updated Dec 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2024). Twitter Sentiment Analysis Datasets [Dataset]. https://brightdata.com/products/datasets/twitter/sentiment-analysis
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Dec 23, 2024
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Our Twitter Sentiment Analysis Dataset provides a comprehensive collection of tweets, enabling businesses, researchers, and analysts to assess public sentiment, track trends, and monitor brand perception in real time. This dataset includes detailed metadata for each tweet, allowing for in-depth analysis of user engagement, sentiment trends, and social media impact.

    Key Features:
    
      Tweet Content & Metadata: Includes tweet text, hashtags, mentions, media attachments, and engagement metrics such as likes, retweets, and replies.
      Sentiment Classification: Analyze sentiment polarity (positive, negative, neutral) to gauge public opinion on brands, events, and trending topics.
      Author & User Insights: Access user details such as username, profile information, follower count, and account verification status.
      Hashtag & Topic Tracking: Identify trending hashtags and keywords to monitor conversations and sentiment shifts over time.
      Engagement Metrics: Measure tweet performance based on likes, shares, and comments to evaluate audience interaction.
      Historical & Real-Time Data: Choose from historical datasets for trend analysis or real-time data for up-to-date sentiment tracking.
    
    
    Use Cases:
    
      Brand Monitoring & Reputation Management: Track public sentiment around brands, products, and services to manage reputation and customer perception.
      Market Research & Consumer Insights: Analyze consumer opinions on industry trends, competitor performance, and emerging market opportunities.
      Political & Social Sentiment Analysis: Evaluate public opinion on political events, social movements, and global issues.
      AI & Machine Learning Applications: Train sentiment analysis models for natural language processing (NLP) and predictive analytics.
      Advertising & Campaign Performance: Measure the effectiveness of marketing campaigns by analyzing audience engagement and sentiment.
    
    
    
      Our dataset is available in multiple formats (JSON, CSV, Excel) and can be delivered via API, cloud storage (AWS, Google Cloud, Azure), or direct download. 
      Gain valuable insights into social media sentiment and enhance your decision-making with high-quality, structured Twitter data.
    
  12. Social Media Sentiment Analysis

    • kaggle.com
    Updated Dec 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bisma Sajjad (2024). Social Media Sentiment Analysis [Dataset]. https://www.kaggle.com/datasets/bismasajjad/social-media-sentiment-analysis/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 15, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bisma Sajjad
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    A dataset of social media posts (tweets, Facebook posts, etc.) along with sentiment scores (positive, negative, neutral). The data covers a variety of topics such as politics, entertainment, and health. Columns: Post ID, Date, Platform, Topic (e.g., Politics, Entertainment), Sentiment Score (1 = Positive, -1 = Negative, 0 = Neutral), Text Content.

  13. m

    Graph-Based Social Media Data on Mental Health Topics

    • data.mendeley.com
    Updated Nov 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Ady Sanjaya (2024). Graph-Based Social Media Data on Mental Health Topics [Dataset]. http://doi.org/10.17632/z45txpdp7f.2
    Explore at:
    Dataset updated
    Nov 4, 2024
    Authors
    Samuel Ady Sanjaya
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is structured as a graph, where nodes represent users and edges capture their interactions, including tweets, retweets, replies, and mentions. Each node provides detailed user attributes, such as unique ID, follower and following counts, and verification status, offering insights into each user's identity, role, and influence in the mental health discourse. The edges illustrate user interactions, highlighting engagement patterns and types of content that drive responses, such as tweet impressions. This interconnected structure enables sentiment analysis and public reaction studies, allowing researchers to explore engagement trends and identify the mental health topics that resonate most with users.

    The dataset consists of three files: 1. Edges Data: Contains graph data essential for social network analysis, including fields for UserID (Source), UserID (Destination), Post/Tweet ID, and Date of Relationship. This file enables analysis of user connections without including tweet content, maintaining compliance with Twitter/X’s data-sharing policies. 2. Nodes Data: Offers user-specific details relevant to network analysis, including UserID, Account Creation Date, Follower and Following counts, Verified Status, and Date Joined Twitter. This file allows researchers to examine user behavior (e.g., identifying influential users or spam-like accounts) without direct reference to tweet content. 3. Twitter/X Content Data: This file contains only the raw tweet text as a single-column dataset, without associated user identifiers or metadata. By isolating the text, we ensure alignment with anonymization standards observed in similar published datasets, safeguarding user privacy in compliance with Twitter/X's data guidelines. This content is crucial for addressing the research focus on mental health discourse in social media. (References to prior Data in Brief publications involving Twitter/X data informed the dataset's structure.)

  14. h

    myanmar-social-media-sentiment-analysis-dataset

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chuu Htet Naing, myanmar-social-media-sentiment-analysis-dataset [Dataset]. https://huggingface.co/datasets/chuuhtetnaing/myanmar-social-media-sentiment-analysis-dataset
    Explore at:
    Authors
    Chuu Htet Naing
    Area covered
    မြန်မာ
    Description

    Myanmar Social Media Sentiment Analysis Dataset

    A Myanmar language dataset for sentiment analysis of social media content, translated from an English source dataset.

      Dataset Description
    

    This dataset contains social media text with sentiment annotations translated into Myanmar language. It is derived from the original Social Media Sentiments Analysis Dataset on Kaggle, with texts professionally translated to Myanmar language while preserving the sentiment labels.… See the full description on the dataset page: https://huggingface.co/datasets/chuuhtetnaing/myanmar-social-media-sentiment-analysis-dataset.

  15. CCUS Sentiment Analysis - Tweets Dataset

    • zenodo.org
    • data.niaid.nih.gov
    Updated May 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alberto Sánchez; Marielisa Padilla; Alberto Sánchez; Marielisa Padilla (2024). CCUS Sentiment Analysis - Tweets Dataset [Dataset]. http://doi.org/10.5281/zenodo.11202683
    Explore at:
    Dataset updated
    May 16, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Alberto Sánchez; Marielisa Padilla; Alberto Sánchez; Marielisa Padilla
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The present dataset contains Tweets in any language supported by Twitter obtained during the months January to March 2023, with any mention to the topic CCS/CCUS. The scraping process were done in Python, using the official Twitter API. All tweets were manually annotated after being machine translated into English.

    - Structure
    Every row contains:
    1st cell (A): Language
    2nd cell (B): Tweet-text
    3rd cell (Cc: Benefit
    4th cell (D): Concern
    5th cell (E): Perception – Fight climate change
    6th cell (F): Perception – Climate-friendly technology
    7th cell (G): Perception – Extensive R&D needed
    8th cell (H): Perception – Better options than CCS
    9th cell (I): Sentiment
    10th cell (J): Relatedness
    11th cell (K): Comments

    - Annotations
    Benefit
    Preventing c. change
    Reducing c. change risks
    Safeguarding jobs
    Creating new jobs
    Fossil energy production envir. friendly
    Products envir. friendly
    Reducing envir. impact
    Other
    None
    Concern
    Accidents
    Leakages
    Environmental
    Earthquake-related
    Increased local traffic
    Investment
    Greenwashing
    Lock-in effects for fossil energy
    Increase cost
    Other
    None
    Perception (Yes / No / None)
    Fight climate change
    Climate-friendly technology
    Extensive R&D needed
    Better options than CCS
    Sentiment
    Positive
    Negative
    Neutral

  16. f

    Summary of the Twitter dataset used in this study.

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hoh Kim; Jaram Park; Meeyoung Cha; Jaeseung Jeong (2023). Summary of the Twitter dataset used in this study. [Dataset]. http://doi.org/10.1371/journal.pone.0126358.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Hoh Kim; Jaram Park; Meeyoung Cha; Jaeseung Jeong
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summary of the Twitter dataset used in this study.

  17. Z

    A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and...

    • data.niaid.nih.gov
    • figshare.com
    • +2more
    Updated Jul 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shao, Mingchen (2024). A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and other sources about the 2024 outbreak of Measles [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11711229
    Explore at:
    Dataset updated
    Jul 20, 2024
    Dataset provided by
    Thakur, Nirmalya
    Patel, Kesha A.
    Su, Vanessa
    Jeong, Hongseok
    Shao, Mingchen
    Bian, Andrew
    Knieling, Victoria
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    Please cite the following paper when using this dataset:

    N. Thakur, V. Su, M. Shao, K. Patel, H. Jeong, V. Knieling, and A. Bian “A labelled dataset for sentiment analysis of videos on YouTube, TikTok, and other sources about the 2024 outbreak of measles,” Proceedings of the 26th International Conference on Human-Computer Interaction (HCII 2024), Washington, USA, 29 June - 4 July 2024. (Accepted as a Late Breaking Paper, Preprint Available at: https://doi.org/10.48550/arXiv.2406.07693)

    Abstract

    This dataset contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder of the websites include Instagram and Facebook as well as the websites of various global and local news organizations. For each of these videos, the URL of the video, title of the post, description of the post, and the date of publication of the video are presented as separate attributes in the dataset. After developing this dataset, sentiment analysis (using VADER), subjectivity analysis (using TextBlob), and fine-grain sentiment analysis (using DistilRoBERTa-base) of the video titles and video descriptions were performed. This included classifying each video title and video description into (i) one of the sentiment classes i.e. positive, negative, or neutral, (ii) one of the subjectivity classes i.e. highly opinionated, neutral opinionated, or least opinionated, and (iii) one of the fine-grain sentiment classes i.e. fear, surprise, joy, sadness, anger, disgust, or neutral. These results are presented as separate attributes in the dataset for the training and testing of machine learning algorithms for performing sentiment analysis or subjectivity analysis in this field as well as for other applications. The paper associated with this dataset (please see the above-mentioned citation) also presents a list of open research questions that may be investigated using this dataset.

  18. Dataset - Virtual Influencers and Public Perception: Social Media Sentiment...

    • zenodo.org
    Updated Nov 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefanus Rumangkit; Stefanus Rumangkit (2024). Dataset - Virtual Influencers and Public Perception: Social Media Sentiment Analysis and A Comprehensive Bibliometric [Dataset]. http://doi.org/10.5281/zenodo.14043172
    Explore at:
    Dataset updated
    Nov 6, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Stefanus Rumangkit; Stefanus Rumangkit
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains a collection of articles related to Virtual Influencers and Public Perception: Social Media Sentiment Analysis and A Comprehensive Bibliometric in the Scopus database.

  19. d

    Grepsr | Sentiment Analysis of Facebook/Twitter/Instagram posts, News,...

    • datarade.ai
    Updated Mar 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Grepsr (2023). Grepsr | Sentiment Analysis of Facebook/Twitter/Instagram posts, News, Product Reviews | Custom and On-demand Sentiment Analysis [Dataset]. https://datarade.ai/data-products/sentiment-analysis-of-facebook-twitter-instagram-posts-news-grepsr
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Mar 20, 2023
    Dataset authored and provided by
    Grepsr
    Area covered
    Bahrain, Israel, Sint Eustatius and Saba, Gabon, Comoros, Kenya, Mayotte, Senegal, Saint Vincent and the Grenadines, Colombia
    Description

    Usecase/Applications possible with the data:

    Customer feedback analysis: Analyzing customer feedback can be helpful for businesses to keep customers happy, stay loyal to the brand, and identify any areas to improve.

    Social media monitoring: With sentiment analysis, companies can monitor what's being said about them on social media and use that to figure out how people feel about their products and services and track any new trends.

    Market research: Sentiment analysis can be used to analyze market trends and consumer preferences, which can help companies make informed business decisions and develop effective marketing strategies.

    Financial analysis: You can use sentiment analysis to determine what people say about the stock market through news and social media, which can help you make investing decisions.

    For e-commerce (amazon/Bestbuy/home depot and much more) following data fields can be included: Title Price Vendor Name Ratings Reviews Brand ASIN URL Sentiment analysis for each review And other fields, as per request

  20. Social Media Analytic Market Size, Share, Trends, & Insights Report, 2035

    • rootsanalysis.com
    Updated Dec 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roots Analysis (2024). Social Media Analytic Market Size, Share, Trends, & Insights Report, 2035 [Dataset]. https://www.rootsanalysis.com/social-media-analytics-market
    Explore at:
    Dataset updated
    Dec 25, 2024
    Dataset provided by
    Authors
    Roots Analysis
    License

    https://www.rootsanalysis.com/privacy.htmlhttps://www.rootsanalysis.com/privacy.html

    Time period covered
    2021 - 2031
    Area covered
    Global
    Description

    The social media analytic market size is projected to grow from USD 11.38 billion in 2025 to USD107.3 billion by 2035, representing a CAGR of 25.16% during the forecast period till 2035.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2025). Sentiment Analysis for Social Media Monitoring Dataset [Dataset]. https://paperswithcode.com/dataset/sentiment-analysis-for-social-media

Sentiment Analysis for Social Media Monitoring Dataset

Explore at:
13 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Mar 6, 2025
Description

Problem Statement

👉 Download the case studies here

A global consumer goods company struggled to understand customer sentiment across various social media platforms. With millions of posts, reviews, and comments generated daily, manually tracking and analyzing public opinion was inefficient. The company needed an automated solution to monitor brand perception, address negative feedback promptly, and leverage insights for marketing strategies.

Challenge

Analyzing social media sentiment posed the following challenges:

Processing vast amounts of unstructured text data from multiple platforms like Twitter, Facebook, and Instagram.

Accurately interpreting slang, emojis, and nuanced language used by social media users.

Identifying trends and actionable insights in real-time to respond to potential crises or opportunities effectively.

Solution Provided

An advanced sentiment analysis system was developed using Natural Language Processing (NLP) and sentiment analysis algorithms. The solution was designed to:

Classify social media posts into positive, negative, and neutral sentiments.

Extract key topics and trends related to the brand and its products.

Provide real-time dashboards for monitoring customer sentiment and identifying areas of improvement.

Development Steps

Data Collection

Aggregated data from major social media platforms using APIs, focusing on brand mentions, hashtags, and product keywords.

Preprocessing

Cleaned and normalized text data, including handling slang, emojis, and misspellings, to prepare it for analysis.

Model Training

Trained NLP models for sentiment classification using supervised learning. Implemented topic modeling algorithms to identify recurring themes and discussions.

Validation

Tested the sentiment analysis models on labeled datasets to ensure high accuracy and relevance in classifying social media posts.

Deployment

Integrated the sentiment analysis system with a real-time analytics dashboard, enabling the marketing and customer support teams to track trends and respond proactively.

Monitoring & Improvement

Established a continuous feedback mechanism to refine models based on evolving language patterns and new social media trends.

Results

Gained Actionable Insights

The system provided detailed insights into customer opinions, helping the company identify strengths and areas for improvement.

Improved Brand Reputation Management

Real-time monitoring enabled swift responses to negative feedback, mitigating potential reputation risks.

Informed Marketing Strategies

Insights from sentiment analysis guided targeted marketing campaigns, resulting in higher engagement and ROI.

Enhanced Customer Relationships

Proactive engagement with customers based on sentiment analysis improved customer satisfaction and loyalty.

Scalable Monitoring Solution

The system scaled efficiently to analyze data across multiple languages and platforms, broadening the company’s reach and understanding.

Search
Clear search
Close search
Google apps
Main menu