13 datasets found
  1. Tweets Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Nov 13, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2024). Tweets Dataset [Dataset]. https://brightdata.com/products/datasets/twitter/tweets
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Nov 13, 2024
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Utilize our Tweets dataset for a range of applications to enhance business strategies and market insights. Analyzing this dataset offers a comprehensive view of social media dynamics, empowering organizations to optimize their communication and marketing strategies. Access the full dataset or select specific data points tailored to your needs. Popular use cases include sentiment analysis to gauge public opinion and brand perception, competitor analysis by examining engagement and sentiment around rival brands, and crisis management through real-time tracking of tweet sentiment and influential voices during critical events.

  2. Z

    IA Tweets Analysis Dataset (Spanish)

    • data.niaid.nih.gov
    • explore.openaire.eu
    • +2more
    Updated Aug 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Serrano-Fernández, Alejandro (2024). IA Tweets Analysis Dataset (Spanish) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10821484
    Explore at:
    Dataset updated
    Aug 3, 2024
    Dataset provided by
    Guerrero-Contreras, Gabriel
    Balderas-Díaz, Sara
    Muñoz, Andrés
    Serrano-Fernández, Alejandro
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    General Description

    This dataset comprises 4,038 tweets in Spanish, related to discussions about artificial intelligence (AI), and was created and utilized in the publication "Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights," (10.1109/IE61493.2024.10599899) presented at the 20th International Conference on Intelligent Environments. It is designed to support research on public perception, sentiment, and engagement with AI topics on social media from a Spanish-speaking perspective. Each entry includes detailed annotations covering sentiment analysis, user engagement metrics, and user profile characteristics, among others.

    Data Collection Method

    Tweets were gathered through the Twitter API v1.1 by targeting keywords and hashtags associated with artificial intelligence, focusing specifically on content in Spanish. The dataset captures a wide array of discussions, offering a holistic view of the Spanish-speaking public's sentiment towards AI.

    Dataset Content

    ID: A unique identifier for each tweet.

    text: The textual content of the tweet. It is a string with a maximum allowed length of 280 characters.

    polarity: The tweet's sentiment polarity (e.g., Positive, Negative, Neutral).

    favorite_count: Indicates how many times the tweet has been liked by Twitter users. It is a non-negative integer.

    retweet_count: The number of times this tweet has been retweeted. It is a non-negative integer.

    user_verified: When true, indicates that the user has a verified account, which helps the public recognize the authenticity of accounts of public interest. It is a boolean data type with two allowed values: True or False.

    user_default_profile: When true, indicates that the user has not altered the theme or background of their user profile. It is a boolean data type with two allowed values: True or False.

    user_has_extended_profile: When true, indicates that the user has an extended profile. An extended profile on Twitter allows users to provide more detailed information about themselves, such as an extended biography, a header image, details about their location, website, and other additional data. It is a boolean data type with two allowed values: True or False.

    user_followers_count: The current number of followers the account has. It is a non-negative integer.

    user_friends_count: The number of users that the account is following. It is a non-negative integer.

    user_favourites_count: The number of tweets this user has liked since the account was created. It is a non-negative integer.

    user_statuses_count: The number of tweets (including retweets) posted by the user. It is a non-negative integer.

    user_protected: When true, indicates that this user has chosen to protect their tweets, meaning their tweets are not publicly visible without their permission. It is a boolean data type with two allowed values: True or False.

    user_is_translator: When true, indicates that the user posting the tweet is a verified translator on Twitter. This means they have been recognized and validated by the platform as translators of content in different languages. It is a boolean data type with two allowed values: True or False.

    Cite as

    Guerrero-Contreras, G., Balderas-Díaz, S., Serrano-Fernández, A., & Muñoz, A. (2024, June). Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights. In 2024 International Conference on Intelligent Environments (IE) (pp. 62-69). IEEE.

    Potential Use Cases

    This dataset is aimed at academic researchers and practitioners with interests in:

    Sentiment analysis and natural language processing (NLP) with a focus on AI discussions in the Spanish language.

    Social media analysis on public engagement and perception of artificial intelligence among Spanish speakers.

    Exploring correlations between user engagement metrics and sentiment in discussions about AI.

    Data Format and File Type

    The dataset is provided in CSV format, ensuring compatibility with a wide range of data analysis tools and programming environments.

    License

    The dataset is available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, permitting sharing, copying, distribution, transmission, and adaptation of the work for any purpose, including commercial, provided proper attribution is given.

  3. b

    Twitter Sentiment Analysis Datasets

    • brightdata.com
    .json, .csv, .xlsx
    Updated Dec 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2024). Twitter Sentiment Analysis Datasets [Dataset]. https://brightdata.com/products/datasets/twitter/sentiment-analysis
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Dec 24, 2024
    Dataset authored and provided by
    Bright Data
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Our Twitter Sentiment Analysis Dataset provides a comprehensive collection of tweets, enabling businesses, researchers, and analysts to assess public sentiment, track trends, and monitor brand perception in real time. This dataset includes detailed metadata for each tweet, allowing for in-depth analysis of user engagement, sentiment trends, and social media impact.

    Key Features:
    
      Tweet Content & Metadata: Includes tweet text, hashtags, mentions, media attachments, and engagement metrics such as likes, retweets, and replies.
      Sentiment Classification: Analyze sentiment polarity (positive, negative, neutral) to gauge public opinion on brands, events, and trending topics.
      Author & User Insights: Access user details such as username, profile information, follower count, and account verification status.
      Hashtag & Topic Tracking: Identify trending hashtags and keywords to monitor conversations and sentiment shifts over time.
      Engagement Metrics: Measure tweet performance based on likes, shares, and comments to evaluate audience interaction.
      Historical & Real-Time Data: Choose from historical datasets for trend analysis or real-time data for up-to-date sentiment tracking.
    
    
    Use Cases:
    
      Brand Monitoring & Reputation Management: Track public sentiment around brands, products, and services to manage reputation and customer perception.
      Market Research & Consumer Insights: Analyze consumer opinions on industry trends, competitor performance, and emerging market opportunities.
      Political & Social Sentiment Analysis: Evaluate public opinion on political events, social movements, and global issues.
      AI & Machine Learning Applications: Train sentiment analysis models for natural language processing (NLP) and predictive analytics.
      Advertising & Campaign Performance: Measure the effectiveness of marketing campaigns by analyzing audience engagement and sentiment.
    
    
    
      Our dataset is available in multiple formats (JSON, CSV, Excel) and can be delivered via API, cloud storage (AWS, Google Cloud, Azure), or direct download. 
      Gain valuable insights into social media sentiment and enhance your decision-making with high-quality, structured Twitter data.
    
  4. P

    Famous Keyword Twitter Replies Dataset

    • paperswithcode.com
    Updated Jun 16, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Famous Keyword Twitter Replies Dataset [Dataset]. https://paperswithcode.com/dataset/famous-keyword-twitter-replies
    Explore at:
    Dataset updated
    Jun 16, 2023
    Description

    The "Famous Keyword Twitter Replies Dataset" is a comprehensive collection of Twitter data that focuses on popular keywords and their associated replies. This dataset contains five essential columns that provide valuable insights into the Twitter conversation dynamics:

    Keyword: This column represents the specific keyword or topic of interest that generated the original tweet. It helps identify the context or subject matter around which the conversation revolves.

    Main_tweet: The main_tweet column contains the original tweet related to the keyword. It serves as the starting point or focal point of the conversation and often provides essential information or opinions on the given topic.

    Main_likes: This column provides the number of likes received by the main_tweet. Likes serve as a measure of engagement and indicate the level of popularity or resonance of the original tweet within the Twitter community.

    Reply: The reply column consists of the replies or responses to the main_tweet. These replies may include comments, opinions, additional information, or discussions related to the keyword or the original tweet itself. The replies help capture the diverse perspectives and conversations that emerge in response to the main_tweet.

    Reply_likes: This column records the number of likes received by each reply. Similar to the main_likes column, the reply_likes column measures the level of engagement and popularity of individual replies. It enables the identification of particularly noteworthy or well-received replies within the dataset.

    By analyzing this "Famous Keyword Twitter Replies Dataset," researchers, analysts, and data scientists can gain valuable insights into how popular keywords spark discussions on Twitter and how these discussions evolve through replies.

    The dataset's information on likes allows for the evaluation of tweet and reply popularity, helping to identify influential or impactful content.

    This dataset serves as a valuable resource for various applications, including sentiment analysis, trend identification, opinion mining, and understanding social media dynamics.

    Number of tweets for each pairs of tweet and reply

    Total has 17255 pairs of tweet/reply

  5. Z

    Brussel mobility Twitter sentiment analysis CSV Dataset

    • data.niaid.nih.gov
    Updated May 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    van Vessem, Charlotte (2024). Brussel mobility Twitter sentiment analysis CSV Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11401123
    Explore at:
    Dataset updated
    May 31, 2024
    Dataset provided by
    van Vessem, Charlotte
    Tori, Floriano
    Ginis, Vincent
    Betancur Arenas, Juliana
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Brussels
    Description

    SSH CENTRE (Social Sciences and Humanities for Climate, Energy aNd Transport Research Excellence) is a Horizon Europe project, engaging directly with stakeholders across research, policy, and business (including citizens) to strengthen social innovation, SSH-STEM collaboration, transdisciplinary policy advice, inclusive engagement, and SSH communities across Europe, accelerating the EU’s transition to carbon neutrality. SSH CENTRE is based in a range of activities related to Open Science, inclusivity and diversity – especially with regards Southern and Eastern Europe and different career stages – including: development of novel SSH-STEM collaborations to facilitate the delivery of the EU Green Deal; SSH knowledge brokerage to support regions in transition; and the effective design of strategies for citizen engagement in EU R&I activities. Outputs include action-led agendas and building stakeholder synergies through regular Policy Insight events.This is captured in a high-profile virtual SSH CENTRE generating and sharing best practice for SSH policy advice, overcoming fragmentation to accelerate the EU’s journey to a sustainable future.The documents uploaded here are part of WP2 whereby novel, interdisciplinary teams were provided funding to undertake activities to develop a policy recommendation related to EU Green Deal policy. Each of these policy recommendations, and the activities that inform them, will be written-up as a chapter in an edited book collection. Three books will make up this edited collection - one on climate, one on energy and one on mobility. As part of writing a chapter for the SSH CENTRE book on ‘Mobility’, we set out to analyse the sentiment of users on Twitter regarding shared and active mobility modes in Brussels. This involved us collecting tweets between 2017-2022. A tweet was collected if it contained a previously defined mobility keyword (for example: metro) and either the name of a (local) politician, a neighbourhood or municipality, or a (shared) mobility provider. The files attached to this Zenodo webpage is a csv files containing the tweets collected.”.

  6. Greta Thunberg's Twitter data mining & Frame Analysis

    • figshare.com
    txt
    Updated Jul 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sílvia Díaz Pérez (2022). Greta Thunberg's Twitter data mining & Frame Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.20311524.v6
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 22, 2022
    Dataset provided by
    figshare
    Authors
    Sílvia Díaz Pérez
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Bigrams, words frequency, frame and engagement analysis of the tweets published by Greta Thunberg between 2019 and 2022.

  7. Twitter Trends|| PPP &PTI || Pakistan Elections

    • kaggle.com
    Updated Feb 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aqeelkh (2024). Twitter Trends|| PPP &PTI || Pakistan Elections [Dataset]. https://www.kaggle.com/datasets/aqeelkh/twitter-trends-ppp-and-pti-pakistan-elections/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 16, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aqeelkh
    Area covered
    Pakistan
    Description

    Dataset Title: PPP and PTI Twitter Trend Analysis

    Overview This dataset encompasses a collection of 1184 tweets from the Twitter trend "PPP and PTI," capturing a snapshot of public discourse and sentiment regarding Pakistan's prominent political entities: the Pakistan Peoples Party (PPP) and Pakistan Tehreek-e-Insaf (PTI). It provides a diverse range of perspectives and reactions from Twitter users, making it an invaluable resource for political analysts, data scientists, and researchers interested in political sentiment analysis, social media analytics, and digital humanities.

    Dataset Description The dataset is structured into seven columns, each offering distinct insights into the tweets collected:

    • UserTag: The Twitter handle of the user who posted the tweet.
    • TimeStamp: The date and time when the tweet was posted, providing temporal context to the data.
    • Current_Date: The date when the data was collected, ensuring traceability and relevance. -**Tweet Body**: The actual content of the tweet, encapsulating the message, sentiment, and topics discussed by the user. This column is central to text analysis, sentiment detection, and thematic studies.
    • Reply: The number of replies to the tweet, indicating engagement and conversational depth.
    • Retweet: The number of retweets, reflecting the tweet's reach and virality within the Twitter community.
    • Likes: The number of likes, serving as a proxy for the tweet's popularity and user agreement.
    • Views: An estimate of how many times the tweet was viewed, offering insights into its impact and visibility.

    Potential Uses This dataset can serve a wide range of purposes, including but not limited to: 1. Sentiment analysis to gauge public opinion regarding PPP and PTI. 2. Temporal analysis to identify trends and shifts in public sentiment over time. 3. Network analysis to explore interactions and the spread of information among users. 4. Comparative analysis between the engagement and popularity of tweets related to PPP vs. PTI.

    Methodology The tweets were collected using Selenium WebDriver, ensuring a comprehensive and unbiased selection of tweets related to the "PPP and PTI" trend. Care was taken to include tweets from various times of the day to capture a broad spectrum of user engagement and opinions.

    Ethical Considerations All data was collected and presented in accordance with Twitter's data use policies and ethical guidelines for research.

    Acknowledgments This dataset was created by Aqeel Khan, a student of BS Mathematics at Namal University Mianwali, with a keen interest in data science and machine learning. The dataset compilation was aimed at facilitating research and analysis in the domains of political science, social media analytics, and data science.

    License This dataset is shared for educational and research purposes. Users of the dataset are encouraged to cite the source and adhere to Twitter's terms of service regarding the use of shared data.

  8. m

    Graph-Based Social Media Data on Mental Health Topics

    • data.mendeley.com
    Updated Nov 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Ady Sanjaya (2024). Graph-Based Social Media Data on Mental Health Topics [Dataset]. http://doi.org/10.17632/z45txpdp7f.2
    Explore at:
    Dataset updated
    Nov 4, 2024
    Authors
    Samuel Ady Sanjaya
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is structured as a graph, where nodes represent users and edges capture their interactions, including tweets, retweets, replies, and mentions. Each node provides detailed user attributes, such as unique ID, follower and following counts, and verification status, offering insights into each user's identity, role, and influence in the mental health discourse. The edges illustrate user interactions, highlighting engagement patterns and types of content that drive responses, such as tweet impressions. This interconnected structure enables sentiment analysis and public reaction studies, allowing researchers to explore engagement trends and identify the mental health topics that resonate most with users.

    The dataset consists of three files: 1. Edges Data: Contains graph data essential for social network analysis, including fields for UserID (Source), UserID (Destination), Post/Tweet ID, and Date of Relationship. This file enables analysis of user connections without including tweet content, maintaining compliance with Twitter/X’s data-sharing policies. 2. Nodes Data: Offers user-specific details relevant to network analysis, including UserID, Account Creation Date, Follower and Following counts, Verified Status, and Date Joined Twitter. This file allows researchers to examine user behavior (e.g., identifying influential users or spam-like accounts) without direct reference to tweet content. 3. Twitter/X Content Data: This file contains only the raw tweet text as a single-column dataset, without associated user identifiers or metadata. By isolating the text, we ensure alignment with anonymization standards observed in similar published datasets, safeguarding user privacy in compliance with Twitter/X's data guidelines. This content is crucial for addressing the research focus on mental health discourse in social media. (References to prior Data in Brief publications involving Twitter/X data informed the dataset's structure.)

  9. o

    500k ChatGPT-related Tweets Jan-Mar 2023

    • opendatabay.com
    .csv
    Updated Jun 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). 500k ChatGPT-related Tweets Jan-Mar 2023 [Dataset]. https://www.opendatabay.com/data/ai-ml/4f84df82-9852-490b-b41a-00a4a4191f47
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Jun 16, 2025
    Dataset authored and provided by
    Datasimple
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Social Media and Networking
    Description

    This dataset contains a CSV file related to ChatGPT including keywords(chatgpt, chat gpt) #hashtags and @mentions about ChatGPT. OpenAI's conversational AI model. The file includes information on 500,000 tweets. The dataset aims to help understand public opinion, trends, and potential applications of ChatGPT by analyzing tweet volume, sentiment, user engagement, and the influence of key AI events. The dataset offers valuable insights for companies, researchers, and policymakers, allowing them to make informed decisions and shape the future of AI-powered conversational technologies.

    Check out my Comprehensive Analysis on this dataset: Medium article "Cracking the ChatGPT Code: A Deep Dive into 500,000 Tweets using Advanced NLP Techniques"

    Learn about the collection process in Medium article "Effortlessly Scraping Massive Twitter Data"

    License

    CC0

    Original Data Source: 500k ChatGPT-related Tweets Jan-Mar 2023

  10. m

    Data from: ID-SMSA: Indonesian Stock Market Dataset for Sentiment Analysis

    • data.mendeley.com
    Updated Jan 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jason Hartanto (2025). ID-SMSA: Indonesian Stock Market Dataset for Sentiment Analysis [Dataset]. http://doi.org/10.17632/tn4vzs8tdw.3
    Explore at:
    Dataset updated
    Jan 20, 2025
    Authors
    Jason Hartanto
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Indonesia
    Description

    The ID-SMSA Dataset is a collection of stock market-related Indonesian tweets that were collected via X (formerly known as Twitter). The dataset contains tweets in the Indonesian language, each labeled with sentiment categories: positive, negative, or neutral. A team of annotators completes the annotations using annotation guidelines that a clinical psychology specialist has reviewed. To facilitate future studies in sentiment analysis and financial market studies, other variables are also incorporated, such as the tweet's date and user engagement metrics (Quote Count, Reply Count, Retweet Count, and Favorite Count).

  11. f

    Analyzing geospatial election prediction: The influence of COVID-19 on...

    • figshare.com
    html
    Updated Oct 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asif Khan (2023). Analyzing geospatial election prediction: The influence of COVID-19 on social media discourse [Dataset]. http://doi.org/10.6084/m9.figshare.24289102.v1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Oct 11, 2023
    Dataset provided by
    figshare
    Authors
    Asif Khan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    CodeThis figshare repository hosts a collection of tools and scripts for Twitter data analysis, focusing on Election Prediction using sentiment analysis and tweet processing. The repository includes four key files:twitter_data_collection.py: This Python script is designed for collecting tweets from Twitter in JSON format. It provides a robust method for gathering data from the Twitter platform.EP.ipynb: EP.ipynb" is designed for sentiment analysis and tweet processing. It features three sentiment analysis methods: VADER, BERT, and BERTweet. It includes a US states dictionary for geolocating and categorizing tweets by state, providing sentiment analysis results in both volumetric and percentage formats. Furthermore, it offers time-series analysis options, particularly on a monthly basis. It also includes a feature for filtering COVID-19-related tweets. Additionally, it conducts election analysis at both state and country levels, giving insights into public sentiment and engagement regarding political elections.Datasetbiden and trump.csv Files:The "biden.csv" and "trump.csv" files together constitute an extensive dataset of tweets related to two prominent U.S. political figures, Joe Biden and Donald Trump. These files contain detailed information about each tweet, including the following key attributes:create_date: The date the tweet was created.id: A unique identifier for each tweet.tweet_text: The actual text content of the tweet.user_id: The unique identifier for the Twitter user who posted the tweet.user_name: The name of the Twitter user.user_screen_name: The Twitter handle of the user.user_location: The location provided by the user in their Twitter profile.state (location): The U.S. state associated with the user's provided location.text_clean: The tweet text after preprocessing, making it suitable for analysis.Additionally, sentiment analysis has been applied to these tweets using two different methods:VADER Sentiment Analysis: Each tweet has been assigned a sentiment score and a sentiment category (positive, negative, or neutral) using VADER sentiment analysis. The sentiment scores are provided in the "Vader_score" column, and the sentiment categories are in the "Vader_sentiment" column.BERTweet Sentiment Analysis: The files also feature sentiment labels assigned using the BERTweet sentiment analysis method, along with associated sentiment scores. The sentiment labels can be found in the "Sentiment" column, and the cleaned sentiment labels are available in the "Sentiment_clean" column.This combined dataset offers a valuable resource for exploring sentiment trends, conducting research on public sentiment, and analyzing Twitter users' opinions related to Joe Biden and Donald Trump. Researchers, data analysts, and sentiment analysis practitioners can utilize this data for a wide range of studies and projects.This repository serves as a resource for collecting, processing, and analyzing Twitter data with a focus on sentiment analysis. It offers a range of tools and datasets to support research and experimentation in this area.

  12. H

    Replication Data for: "Kingdom of Trolls? Influence Operations in the Saudi...

    • dataverse.harvard.edu
    Updated Apr 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christopher Barrie; Alexandra A. Siegel (2021). Replication Data for: "Kingdom of Trolls? Influence Operations in the Saudi Twittersphere," Journal of Quantitative Description: Digital Media. [Dataset]. http://doi.org/10.7910/DVN/T97FMH
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 12, 2021
    Dataset provided by
    Harvard Dataverse
    Authors
    Christopher Barrie; Alexandra A. Siegel
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Saudi Arabia
    Description

    Saudi Arabia has one of the highest rates of Twitter penetration in the world. Despite high levels of repression, the platform is frequently used to discuss political topics. Recent disclosures from Twitter have revealed state-backed attempts at distorting the online information environment through influence operations (IOs). A growing body of research has investigated online disinformation and foreign-sponsored IOs in the English-speaking world; but comparatively little is known about online disinformation outside these contexts or about the domestic use of IOs. Using public releases of IO tweets, we investigate the extent of such activity in Saudi Arabia. Benchmarking these tweets to four samples of Saudi Twitter users, we find that engagement with IO accounts was lower than engagement with the average user, but equal to engagement with news accounts. Network analysis reveals that engagement with IO accounts was largely driven by a small number of influential accounts.

  13. f

    Data from: Let’s (re)tweet about racism and sexism: responses to cyber...

    • tandf.figshare.com
    docx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paulina d. C. Inara Rodis (2023). Let’s (re)tweet about racism and sexism: responses to cyber aggression toward Black and Asian women [Dataset]. http://doi.org/10.6084/m9.figshare.15156554.v1
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Taylor & Francis
    Authors
    Paulina d. C. Inara Rodis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Online, anyone’s words can easily be amplified – and on Twitter, the platform’s algorithm highlights tweets that gain attention from other users, which can exponentially reinforce a tweet’s popularity. Moreover, retweets can help spread a message well beyond the reach of its original poster. Thus, users’ interactions with posts containing or making reference to racism or sexism both illuminate the ways individuals accept, challenge, or engage with racism and sexism online, and shape how those messages spread. Using an original dataset of 59.5 million tweets, I test how particular features of messages referencing Black and Asian women predict user engagement (retweets, likes, and replies). This analysis further focuses on messages including terms that express racist or sexist content. Generally, messages including covert racist or sexist insults have a modest positive effect on all measures of user engagement (retweets, likes, and replies), which may suggest that social media environments allow individuals the time and opportunity to contend with topics that can be more difficult in-person. Additionally, variations in engagement with tweets that include references to women, Black or Asian individuals implies that users respond differently to messages involving references to and normative images of different racial, ethnic, and gendered identities. This research illuminates how specific manifestations of racialized and gendered language referencing women, Black and Asian people can not only encourage more engagement, but also share, accept, or challenge messages about marginalized identities.

  14. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bright Data (2024). Tweets Dataset [Dataset]. https://brightdata.com/products/datasets/twitter/tweets
Organization logo

Tweets Dataset

Explore at:
.json, .csv, .xlsxAvailable download formats
Dataset updated
Nov 13, 2024
Dataset authored and provided by
Bright Datahttps://brightdata.com/
License

https://brightdata.com/licensehttps://brightdata.com/license

Area covered
Worldwide
Description

Utilize our Tweets dataset for a range of applications to enhance business strategies and market insights. Analyzing this dataset offers a comprehensive view of social media dynamics, empowering organizations to optimize their communication and marketing strategies. Access the full dataset or select specific data points tailored to your needs. Popular use cases include sentiment analysis to gauge public opinion and brand perception, competitor analysis by examining engagement and sentiment around rival brands, and crisis management through real-time tracking of tweet sentiment and influential voices during critical events.

Search
Clear search
Close search
Google apps
Main menu