23 datasets found

Indian Political Tweet Engagement Dataset
kaggle.com
zip
Updated Jan 23, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mannat Trivedi (2026). Indian Political Tweet Engagement Dataset [Dataset]. https://www.kaggle.com/datasets/mannattrivedi/indian-political-tweet-engagement-dataset
Explore at:
zip(10064890 bytes)Available download formats
Dataset updated
Jan 23, 2026
Authors
Mannat Trivedi
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
India
Description
Dataset Description and Authenticity Audit

This dataset contains 85,154 Twitter posts related to Indian political discourse, collected between October 2022 and March 2023. It includes tweet text, user identifiers, temporal metadata, and engagement metrics such as likes and retweets, enabling analysis of interaction patterns and engagement behavior in high-activity public discussions.

Dataset Integrity

The dataset consists of 9 variables and is fully cleaned, with no missing values, duplicate records, or invalid timestamps. Derived temporal features (Year, Month, Day) are perfectly consistent with the original timestamp, ensuring reliability for time-based analysis.

Authenticity Validation

Multiple forensic checks were performed to evaluate whether engagement metrics reflect real-world social media behavior:

Temporal Consistency Test confirmed exact alignment between timestamps and derived date components.

Benford’s Law Analysis showed close correspondence between observed and expected digit distributions, indicating naturally occurring numerical patterns.

Social Power Law (Pareto Principle) validation revealed that the top 1% of users contribute approximately 12.25% of all tweets, consistent with organic human-driven participation rather than automated activity.

Statistical Characteristics

Engagement metrics display realistic long-tail distributions, with a small fraction of highly engaged tweets and minimal zero inflation. The dataset contains over 58,000 distinct users and more than 98% unique tweet content, further supporting data authenticity.

Interaction Network Component

In addition to tweet-level data, the dataset includes a user interaction network represented as directed edges. Each edge denotes an interaction between two users, derived from observable Twitter actions such as replies, mentions, or retweets. This network structure enables graph-based analysis of information flow, influence patterns, and community behavior within political discussions. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22766094%2Fea12c83af697260cda6a5dfcdd6c544b%2Fgraph.jpg?generation=1769145568146421&alt=media" alt="">

Intended Use

The dataset is suitable for machine learning and analytical tasks such as engagement prediction, content analysis, user behavior modeling, and temporal interaction studies. Political content is treated solely as a high-engagement discussion domain and does not imply ideological inference or endorsement.
donald-trump-truths-dataset
kaggle.com
zip
Updated May 12, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Purevsuren Erdene (2026). donald-trump-truths-dataset [Dataset]. https://www.kaggle.com/datasets/epurevsuren/donald-trump-truths-dataset
Explore at:
zip(1092847 bytes)Available download formats
Dataset updated
May 12, 2026
Authors
Purevsuren Erdene
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
Trump Truth Social Posts Dataset (Second Presidency)

Overview

This dataset contains posts retrieved from President Donald J. Trump’s official Truth Social account during his second presidency period (2025–2026). The collection includes original Truth Social posts focused on political events, economic policy, geopolitics, trade relations, elections, national security, and public communications.

The dataset was compiled for research and analytical purposes, including:

natural language processing (NLP)

political communication analysis

sentiment analysis

financial market research

geopolitical risk studies

event-driven forecasting

Dataset Contents

The dataset includes:

post text/content

posting timestamps/dates

cleaned textual data

metadata fields (depending on extraction source)

Posts cover a wide range of topics including:

tariffs and trade negotiations

Federal Reserve commentary

sanctions and foreign policy

military conflicts and ceasefires

immigration and border policy

energy and oil policy

elections and campaign messaging

international diplomacy

Source

All posts were retrieved from Truth Social, the social media platform used by Donald J. Trump for official public communication.

Potential Applications

This dataset can be used for:

NLP model training

political sentiment analysis

financial market event studies

geopolitical risk analysis

text classification

time-series analysis

policy communication research

transformer and LLM fine-tuning

Disclaimer

This dataset is provided for research and educational purposes only. The dataset reflects publicly available social media posts and does not imply factual verification of statements contained within the posts.
Political Social Media Posts
kaggle.com
zip
Updated Nov 20, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Figure Eight (2016). Political Social Media Posts [Dataset]. https://www.kaggle.com/crowdflower/political-social-media-posts
Explore at:
zip(818736 bytes)Available download formats
Dataset updated
Nov 20, 2016
Dataset authored and provided by
Figure Eight
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset, from Crowdflower's Data For Everyone Library, provides text of 5000 messages from politicians' social media accounts, along with human judgments about the purpose, partisanship, and audience of the messages.

How was it collected?

Contributors looked at thousands of social media messages from US Senators and other American politicians to classify their content. Messages were broken down into audience (national or the tweeter’s constituency), bias (neutral/bipartisan, or biased/partisan), and finally tagged as the actual substance of the message itself (options ranged from informational, announcement of a media appearance, an attack on another candidate, etc.)

Acknowledgments

Data was provided by the Data For Everyone Library on Crowdflower.

Our Data for Everyone library is a collection of our favorite open data jobs that have come through our platform. They're available free of charge for the community, forever.

Inspiration

Here are a couple of questions you can explore with this dataset:

what words predict partisan v. neutral messages?

what words predict support messages v. attack messages?

do politicians use Twitter and Facebook for different purposes? (e.g., Twitter for attack messages, Facebook for policy messages)?

The Data

The dataset contains one file, with the following fields:

_unit_id: a unique id for the message

_golden: always FALSE; (presumably whether the message was in Crowdflower's gold standard)

_unit_state: always "finalized"

_trusted_judgments: the number of trusted human judgments that were entered for this message; an integer between 1 and 3

_last_judgment_at: when the final judgment was collected

audience: one of national or constituency

audience:confidence: a measure of confidence in the audience judgment; a float between 0.5 and 1

bias: one of neutral or partisan

bias:confidence: a measure of confidence in the bias judgment; a float between 0.5 and 1

message: the aim of the message. one of: -- attack: the message attacks another politician
-- constituency: the message discusses the politician's constituency
-- information: an informational message about news in government or the wider U.S.
-- media: a message about interaction with the media
-- mobilization: a message intended to mobilize supporters
-- other: a catch-all category for messages that don't fit into the other
-- personal: a personal message, usually expressing sympathy, support or condolences, or other personal opinions
-- policy: a message about political policy
-- support: a message of political support

message:confidence: a measure of confidence in the message judgment; a float between 0.5 and 1

orig_golden: always empty; presumably whether some portion of the message was in the gold standard

audience_gold: always empty; presumably whether the audience response was in the gold standard

bias_gold: always empty; presumably whether the bias response was in the gold standard

bioid: a unique id for the politician

embed: HTML code to embed this message

id: unique id for the message WITHIN whichever social media site it was pulled from

label: a string of the form "From: firstname lastname (position from state)"

message_gold: always blank; presumably whether the message response was in the gold standard

source: where the message was posted; one of "facebook" or "twitter"

text: the text of the message
Global Political tweets
kaggle.com
zip
Updated Aug 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kash (2022). Global Political tweets [Dataset]. https://www.kaggle.com/kaushiksuresh147/political-tweets
Explore at:
zip(39306532 bytes)Available download formats
Dataset updated
Aug 23, 2022
Authors
Kash
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
https://techcrunch.com/wp-content/uploads/2015/10/twitter-politics.png" alt="">

Social media is becoming a key medium through which we communicate with each other: it is at the center of the very structures of our daily interactions. Yet this infiltration is not unique to interpersonal relations. Political leaders, governments, and states operate within this social media environment, wherein they continually address crises and institute damage control through platforms such as Twitter.

With the proliferation of the internet into mass masses, social media is emerging as a potential way of communication. It provides a direct channel to politicians for communicating, connecting, and engaging with the public. The power of social media, especially Twitter and Facebook has been proved by its successful application during recent US presidential elections and Arabian countries' revolts. In India too, as the general election is about to knock at the door during early 2014, political parties and leaders are trying to harness the power of social media.

Content

The tweets have the #Politics hashtag. The collection started on 24/7/2021, and will be updated on a daily basis.

Information regarding the data

The data totally consists of 1 lakh+ records with 13 columns. The description of the features is given below | No |Columns | Descriptions | | -- | -- | -- | | 1 | user_name | The name of the user, as they’ve defined it. | | 2 | user_location | The user-defined location for this account’s profile. | | 3 | user_description | The user-defined UTF-8 string describing their account. | | 4 | user_created | Time and date, when the account was created. | | 5 | user_followers | The number of followers an account currently has. | | 6 | user_friends | The number of friends an account currently has. | | 7 | user_favourites | The number of favorites an account currently has | | 8 | user_verified | When true, indicates that the user has a verified account | | 9 | date | UTC time and date when the Tweet was created | | 10 | text | The actual UTF-8 text of the Tweet | | 11 | hashtags | All the other hashtags posted in the tweet along with #Politics | | 12 | source | Utility used to post the Tweet, Tweets from the Twitter website have a source value - web | | 13 | is_retweet | Indicates whether this Tweet has been Retweeted by the authenticating user. |

Inspiration

You can use this data to dive into the subjects that use this hashtag, look to the geographical distribution, evaluate sentiments, and look at trends.
Twitter Political Sentiment Dataset: India
kaggle.com
zip
Updated Jan 26, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kaushal Kumar (2026). Twitter Political Sentiment Dataset: India [Dataset]. https://www.kaggle.com/datasets/prokaushal05/twitter-political-sentiment-dataset-india
Explore at:
zip(140777 bytes)Available download formats
Dataset updated
Jan 26, 2026
Authors
Kaushal Kumar
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
India
Description
About Dataset Context This dataset captures the digital footprint of political influence and discourse within the Indian landscape on Twitter (X). As social media becomes the primary battleground for political narratives, understanding engagement patterns, sentiment shifts, and influential actors is crucial for political scientists, journalists, and data researchers.

Content The data consists of records scraped from Twitter focusing on key political figures, trending hashtags, and public interactions regarding Indian politics. It includes:

User Metadata: Followers, verified status, and location.

Engagement Metrics: Likes, retweets, and comment counts.

Textual Data: The content of the tweets (useful for NLP and Sentiment Analysis).

Timestamps: To track how influence peaks during specific events.

Data Profiling Report I have included a comprehensive EDA (Exploratory Data Analysis) report generated via ydata-profiling. You can find the interactive version in the Twitter_profiling.html file attached to this dataset. It provides an instant overview of correlations, missing values, and distribution plots.

Potential Use Cases Sentiment Analysis: Classify public opinion towards different political parties or policies.

Network Analysis: Map out how information spreads through retweets and mentions.

Bot Detection: Identify suspicious patterns of high-frequency posting or artificial engagement.

Time-Series Analysis: Correlate spikes in Twitter activity with real-world news events in India.

Inspiration Can we predict the popularity of a political narrative based on early engagement?

How does the "Verified" status impact the reach of a political message in the Indian context?

Which hashtags dominated the discourse during the study period?

Collection Methodology The data was collected using [Mention your tool, e.g., Snscrape, Tweepy, or a custom scraper] targeting keywords such as #IndiaPolitics, #Elections2024, and major party names.
Indian Political Sentiment on Twitter
kaggle.com
zip
Updated Mar 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
PyroTech (2024). Indian Political Sentiment on Twitter [Dataset]. https://www.kaggle.com/datasets/pyrotech/twitterdata
Explore at:
zip(7966522 bytes)Available download formats
Dataset updated
Mar 29, 2024
Authors
PyroTech
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Area covered
India
Description
This dataset provides a comprehensive collection of public sentiment and discourse related to Indian politics. The entries cover a wide range of opinions, news, social media posts, and other forms of public communication. Each entry is meticulously labeled with a sentiment score, capturing the polarity of the opinion from strongly negative to strongly positive.

This dataset is structured to facilitate detailed sentiment analysis and examination of political sentiments in India.

Use Cases

This dataset is ideal for:

Sentiment Analysis: Researchers can use this dataset to train and evaluate sentiment analysis models specifically tailored to the political context in India. Trend Analysis: Analysts can track the evolution of public sentiment over time, identifying key events that influenced public opinion. Political Studies: Scholars can investigate the relationship between public sentiment and political events, figures, and policies in India. Natural Language Processing (NLP): NLP practitioners can leverage this dataset for various tasks such as text classification, opinion mining, and more.
Misinformation and Metaphor Use Dataset
kaggle.com
zip
Updated May 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Python Developer (2025). Misinformation and Metaphor Use Dataset [Dataset]. https://www.kaggle.com/programmer3/misinformation-and-metaphor-use-dataset
Explore at:
zip(7871795 bytes)Available download formats
Dataset updated
May 24, 2025
Authors
Python Developer
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset contains 1,500 of political misinformation analyzed for the presence and effect of metaphor types on belief formation and emotional response. The data were collected across various media sources including social media posts, political advertisements, and news articles. Each record includes information on metaphor type (e.g., fear-based, nationalistic, artistic, cognitive), the political topic, emotional tone, and user engagement metrics. The dataset is complemented by responses from 547 participants, providing scores for belief acceptance and emotional intensity.
Political Inclination Classification Nepali Tweets
kaggle.com
zip
Updated Nov 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shashank Shree Neupane (2025). Political Inclination Classification Nepali Tweets [Dataset]. https://www.kaggle.com/datasets/shashankshreeneupane/political-inclination-classification-nepali-tweets
Explore at:
zip(620803 bytes)Available download formats
Dataset updated
Nov 23, 2025
Authors
Shashank Shree Neupane
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
To use this dataset on your research paper use the following reference.

@artical{s13102024ijcatr13101005, Title = "Comparing Political Inclination Classification on Twitter Posts using Naive Bayes, SVM, and XGBoost", Journal ="International Journal of Computer Applications Technology and Research(IJCATR)", Volume = "13", Issue ="10", Pages ="62 - 65", Year = "2024", Authors ="Shashank Shree Neupane, Atish Shakya, Bishan Rokka, Sagar Acharya"}

The details of the article is:

International Journal of Computer Applications Technology and Research Volume 13–Issue 10, 62 – 65, 2024, ISSN:-2319–8656 DOI:10.7753/IJCATR1310.1005

The link to article: https://ijcat.com/archieve/volume13/issue10/ijcatr13101005

The dataset contains the twitter post of nepali political leader who are on political parties. The dataset can be used to know the inclination of people towards a political party with their post on the social media such as X (formerly twitter).
Trump 2024 Campaign Truth Social Truths (Tweets)
kaggle.com
zip
Updated Dec 15, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammet Akkurt (2024). Trump 2024 Campaign Truth Social Truths (Tweets) [Dataset]. https://www.kaggle.com/datasets/muhammetakkurt/trump-2024-campaign-truthsocial-truths-tweets
Explore at:
zip(1274509 bytes)Available download formats
Dataset updated
Dec 15, 2024
Authors
Muhammet Akkurt
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Overview:

This dataset contains posts and interactions from Donald J. Trump's Truth Social account, specifically during his 2024 U.S. Presidential election campaign. Each post entry provides detailed information, including the post content, number of replies, shares, likes, and metadata such as post date, media URLs (if available), and account details. The data offers a rich source for analyzing political messaging, engagement metrics, and audience reactions during the campaign period.

Use Cases:

Sentiment Analysis: The dataset can be used to analyze public sentiment toward Trump's campaign, identifying patterns in positive or negative reactions to different posts.

Political Messaging Analysis: Researchers can study the nature of Trump's political communication strategies, including the themes and issues emphasized during the 2024 campaign.

Engagement Metrics: By analyzing the number of likes, replies, and shares, this dataset allows for a detailed understanding of public engagement with Trump's posts over time.

Media Influence Study: With data on video and image URLs, this dataset could be used to assess the impact of multimedia on audience reactions and interaction.

Source:

The posts are sourced directly from Trump's official Truth Social profile, capturing interactions that are publicly available.

Limitations:

The dataset may not include every post or interaction due to scraping limitations, and some interactions might lack context or additional details that could affect interpretability.

License:

This dataset is intended for research and analysis purposes. Please ensure that any use of the data complies with Truth Social's terms of service and applicable copyright laws.
Egypt - Arabic Political 600k Tweets
kaggle.com
zip
Updated Sep 17, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hazem Sayed (2019). Egypt - Arabic Political 600k Tweets [Dataset]. https://www.kaggle.com/hazemshokry/egypt-arabic-politics-tweets
Explore at:
zip(33171915 bytes)Available download formats
Dataset updated
Sep 17, 2019
Authors
Hazem Sayed
Area covered
Egypt
Description
Context

The hashtag كفايه_بقى_ياسيسى# (That’s enough Sisi) was trending in Egypt at number one on Monday hours after Egyptian actor Mohamed Ali posted his video online calling on Egyptians to post on every social media asking Abdel Fattah el-Sisi to resign. In the opposite way hashtag #هنكمل_مشوارنا_معاك_ياسيسي was also trending for couple of hours the same day.

This data set should help you to understand Egyptians behavior in a political trend in either supporting or opposition situation.

Content

Data collection date (from Twitter API): September 16, 2019

Dimension of the data set; 600K rows and 8 columns

Sentiment analysis library used: Stanford CoreNLP

Hashtag filters include:

كفاية_بقي_ياسيسي

كفايه_بقي_ياسيسي

هنكمل_مشوارنا_معاك_ياسيسي

ارحل_يا_سيسي

وائل غنيم

عدي المليون

Data format: Json Lines Data are splitted into small files ~30mb for each split.

Acknowledgements

Tweets links' and owners are hidden to keep everything anonymous. Please get in touch with me if you have a use case requires using them.

Inspiration

Compare sentiment analysis result extracted by Stanford CoreNLP with other NLP library.

Extract all links and collect all photos used for this trend.

Understand Egyptians behavior in a political trend in either supporting or opposition situation.

Compare people statistics for different hashtags.

See trend line chart and compare with other trends.

And much more...
Political and Off-Topic posts from TigerDroppings
kaggle.com
zip
Updated Dec 9, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
William Clarke Casey (2022). Political and Off-Topic posts from TigerDroppings [Dataset]. https://www.kaggle.com/datasets/williamclarkecasey/tigerdroppings
Explore at:
zip(1349718918 bytes)Available download formats
Dataset updated
Dec 9, 2022
Authors
William Clarke Casey
Description
TigerDroppings.com, founded in 2001 by LSU alumnus Brian Fiegel, is among the most notable and active of any college sports forums on the internet, and its popularity hasn’t declined even as major social media platforms have come to dominate online spaces for discussion. The site’s userbase consists primarily of Louisiana residents and LSU graduates, though fans of other schools in the Southeastern Conference also frequent the site. The users on these sites fit a very specific demographic and they have little diversity. In a survey of users from numerous college sports forums, including TigerDroppings, it was found that 87% of users were male and 90% were white. Additionally, 76% had at least an undergraduate degree and 42% of users had a household income of $100,000 or greater. In November 2015, TigerDroppings had 129,244 users and now, seven years later, has 256,692 registered users; the site is still growing as fast as it did in the 2000s. There is a reason for this — these very specific demographics of the userbase are able to communicate in a way they otherwise couldn’t on Facebook or Twitter. Given these demographics, the forum takes on an overwhelmingly conservative tone in the opinions and sentiments regularly expressed. To put it simply, I couldn’t imagine a dataset that better encapsulates the psyche and mindset of white conservative men in Louisiana. Comprising almost 14 million political posts from 2014 to the present, it profiles the rise of Trumpism and the cataclysmic shifts seen in American politics in recent years.

Early in the site’s history, their off-topic board the “OT Lounge” was created and is the most popular board on the site, followed closely by their “Politics” board. Unlike many other similar forums, TigerDroppings relies solely on advertising to generate revenue, and all boards are free to view and create posts on. Only an email is required to sign up and all posts are anonymous; users are only outwardly identifiable by their chosen screennames. The functionality of the site has largely gone unchanged since its founding. Users can start a thread on a particular board and replies by other users are appended to the thread; there is no visible hierarchy to replies on threads, unlike platforms like Reddit, and it is very rudimentary by current standards. On every single reply in a thread there is an upvote and a downvote button; next to each button, their respective values are displayed, publicly showing the popularity of a user’s post. Users have been informally voting on political opinions and sentiments constantly, which I believe is rich for analyzing the rise of specific attitudes and rhetoric used among this demographic.

Attached to each post in the dataset are several pieces of metadata: upvotes, downvotes, username, date of post, date of thread creation, URLs from links contained in the post, URLs to images in the post, text from blockquotes, and the position of the post in its respective thread. Additionally, I was able to gather emails and phone numbers for approx. 3,000 users of the site through the Ticket Marketplace Board, as many users had posted contact info to interact with other users externally. Data from the OT-Lounge was able to be scrapped in its entirety from 2014 to present, though among data from the Politics Board there were some gaps. All data from 2015 was not publicly accessible for unclear reasons. But more interestingly, all threads from November 2, 2020 to January 7, 2021 — the day before the presidential election until the day after the insurrection at the U.S. Capitol — is not publicly accessible at all. I hypothesize there was significant activity talking about election fraud during that period, along with potentially incriminating information about posters who may have participated in the events on January 6th.

In terms of where I want to go from here with this dataset, I am interested in exploring if a model to isolate and predict political trends among this demographic could be feasible, along with exploring what potential uses it has as a tool for electoral politics in Louisiana. If anything, I want to do some anthropological research about this demographic that so clearly to me describes the social, cultural, and political environment I was raised in.
US Election 2024 Social Media Sentiment Dataset
kaggle.com
zip
Updated Sep 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Imaad Mahmood (2025). US Election 2024 Social Media Sentiment Dataset [Dataset]. https://www.kaggle.com/datasets/imaadmahmood/us-election-2024-social-media-sentiment-dataset/data
Explore at:
zip(8063 bytes)Available download formats
Dataset updated
Sep 15, 2025
Authors
Imaad Mahmood
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
United States
Description
US Election 2024 Social Media Sentiment Dataset

The US Election 2024 Social Media Sentiment Dataset captures 100 authentic, anonymized posts from X (formerly Twitter) collected during November 5-6, 2024, coinciding with the US Presidential Election's critical period. This dataset reflects real-time public opinions, emotions, and discussions surrounding the election, focusing on candidates (Donald Trump, Kamala Harris), voting processes, and media narratives. Sourced via X's official API, the data ensures compliance with platform policies and prioritizes ethical considerations by anonymizing user identities.

Dataset Features

Size: 100 unique posts (excluding replies and quoted posts to avoid redundancy).

Attributes:

Post ID: Unique identifier for each post.

Author: Anonymized author name (e.g., Author_#ID).

Username: Anonymized handle (e.g., User_#ID).

Timestamp: Post creation time (GMT).

Text: Full post content, preserved verbatim (HTML-escaped for compatibility).

Engagement Metrics: Likes, reposts, replies, quotes, bookmarks, and views.

Hashtags: Comma-separated list of extracted hashtags (e.g., #USElection2024, #VotedForTrump).

Media: Indicator for presence of images/videos (Yes/No).

Timeframe: November 5-6, 2024, covering election night and early result announcements.

Language: Primarily English, with potential for multilingual expansion.

Potential Applications

Sentiment Analysis: Develop machine learning models to classify sentiments (e.g., pro-Trump, pro-Harris, neutral) using NLP tools like VADER or BERT.

Topic Modeling: Identify key election themes (e.g., voter turnout, media bias) via techniques like LDA.

Network Analysis: Analyze user interactions through engagement metrics to map influence networks.

Time-Series Analysis: Track sentiment or hashtag trends over the election period.

Collection Methodology

Posts were collected using X's API with targeted queries (e.g., "#USElection2024", "Trump", "Harris" -filter:replies) and a minimum engagement filter (min_faves:1) to ensure relevance. The dataset was cleaned to remove sensitive information (e.g., full URLs where non-essential) while retaining original text for analysis. The collection focused on the latest posts to capture real-time reactions.

Ethical Considerations

Adheres to X’s developer terms, ensuring ethical data use.

User identities anonymized to comply with privacy standards.

Licensed under CC0 (public domain) for open access on Kaggle.

Potential biases include a focus on high-engagement posts and English-language content; users are encouraged to expand with broader queries.

Recommendations for Kaggle

Include 2-3 sample Jupyter notebooks (e.g., exploratory data analysis with Pandas, sentiment visualization with Matplotlib) to enhance usability.

Expand the dataset using similar API queries for larger scale (e.g., 10k+ posts).

Add derived features like sentiment scores or topic labels for enriched analysis.

This dataset is a valuable resource for data scientists, political researchers, and students studying social media’s impact on the 2024 US Presidential Election. It provides a snapshot of public discourse, ideal for NLP, social network analysis, and trend detection.
Indonesian Political Discourse 2025
kaggle.com
zip
Updated May 19, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Assegaf Insani (2026). Indonesian Political Discourse 2025 [Dataset]. https://www.kaggle.com/datasets/insaniassegaf/indonesian-political-discourse-2025
Explore at:
zip(2767280 bytes)Available download formats
Dataset updated
May 19, 2026
Authors
Assegaf Insani
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains approximately 10,000 Indonesian-language tweets related to major political and social issues that became viral in Indonesia during 2025. The dataset includes discussions from several public hashtags and topics, such as: - #IndonesiaGelap - #RUUTNI (Indonesian National Armed Forces Bill) - #DPR (The Indonesian House of Representatives) - #KUHP (The Indonesian Criminal Code) - Other trending socio-political discussions

The dataset was collected from publicly available posts on X (Twitter) and is intended for research purposes in: - Sentiment Analysis - Natural Language Processing (NLP) - Political Communication Analysis - Social Media Mining - Machine Learning and Deep Learning Research - Large Language Model (LLM) Evaluation

Dataset Contents The dataset may include: - Raw tweet text - Cleaned tweet text - Normalized tweet text - Preprocessed tweet text - Sentiment scores - Sentiment labels

Language - Indonesian

Potential Use Cases - Indonesian sentiment analysis - Political opinion mining - Hate speech detection - Sarcasm analysis - LLM benchmarking - Comparative NLP research

Disclaimer This dataset contains publicly available social media posts collected for educational and research purposes only. The dataset is shared in accordance with fair research usage principles. Any personally identifiable information was not intentionally collected for profiling purposes.
Joe Biden's Tweets
kaggle.com
zip
Updated Dec 19, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2022). Joe Biden's Tweets [Dataset]. https://www.kaggle.com/datasets/thedevastator/uncovering-joe-biden-s-message-through-social-me
Explore at:
zip(1175635 bytes)Available download formats
Dataset updated
Dec 19, 2022
Authors
The Devastator
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Joe Biden's Tweets

Likes, Retweets, Shares, and Conversation Dynamics

By Twitter [source]

About this dataset

At the heart of understanding Joe Biden's successful election campaign were his effective and engaged use of social media. This dataset provides unparalleled insights into how Biden harnessed the power of Twitter to create engaging conversations, share his views on policy issues, and build positive relationships with his followers. Researchers can use this data to observe the likes, retweets, shares, and replies that Biden's posts generated over time to better understand how he connected with people. Explore this dataset to track hourly, daily and weekly activity in order to gain unique insights into how Joe Biden crafted his message using social media platforms. Analyze outlinks for discussion topics relevant for elections or even pull quoted tweets from Twitter users who engage in conversations with him. You'll be able to see first -hand just how influential Joe Biden was with regards to engaging in meaningful dialogue with individuals across America while gaining valuable insight into the powerful impact that digital communication had on this particular political race

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset offers researchers, journalists and political analysts a comprehensive understanding of how former Vice President Joe Biden’s social media activity provides insight into his views and opinions on policy, foreign relationships and election dynamics.

Through this dataset, users can identify trends in the number of likes, retweets and replies that are generated by the posts from Joe Biden’s Twitter account. Along with this data users can also observe changes in the quoted Tweets, outlinks mentioned in posts as well as the URLs associated with them.

To make full use of this dataset follow these steps: 1. Begin by exploring the key columns such as content (tweet text), created_at (date/time posted), likeCount (number of likes on tweet), retweetCount (number of retweets on tweet) and replyCount (number of replies to tweet).
2. Using analytical tools explore correlations between variables such as between created_at column and other columns like quoteCount or outlinks to see if certain insights can be drawn depending upon when the post is made or not made by Joe Biden himself or a campaign staff member against variables like type & length of post, medium used etc..
3. Explore which tweets have more reach with higher engagement rates within lesser time frames using variables like retweetedTweet & quotedTweet along side other fields for more interesting insights about what kind messages work better than others for specific times & situations during campaigns. 4. Engage further with observed patterns to identify further links leading to interesting conclusions about outreach related activity during campaigning periods using analysis methods like data visualisations across time lines linking multiple tweets together + finding geographic regions where Joe Biden has most followers etc..
Finally never forget that proper application (& comparison) through hypothesis testing is essential when dealing with large datasets while correlating facts across multiple channels - especially dealing with topics related to politics involving a public figure being analyzed through their own tweets!

Research Ideas

Analyzing the sentiment of Joe Biden's tweet text and how it changes over time.

Tracking engagement with different topics to understand which issues are most important to him and his followers.

Comparing tweet engagement dynamics between Joe Biden and other prominent political figures for research comparison studies

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: JoeBiden.csv | Column name | Description | |:-------------------|:-----------------------------------------------------------------------------------------------------------------------| ...
Sound and Audio Data in Sri Lanka
kaggle.com
zip
Updated Apr 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Techsalerator (2025). Sound and Audio Data in Sri Lanka [Dataset]. https://www.kaggle.com/datasets/techsalerator/sound-and-audio-data-in-sri-lanka
Explore at:
zip(12171329 bytes)Available download formats
Dataset updated
Apr 1, 2025
Authors
Techsalerator
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Area covered
Sri Lanka
Description
Techsalerator’s Location Sentiment Data for Sri Lanka

Techsalerator’s Location Sentiment Data for Sri Lanka provides valuable insights into public sentiment across various regions of the country. This dataset is essential for businesses, researchers, and policymakers aiming to understand regional emotions, opinions, and attitudes. By analyzing location-based sentiment trends, organizations can enhance decision-making, marketing strategies, and customer engagement.

For access to the full dataset, contact us at info@techsalerator.com or visit Techsalerator Contact Us.

Top 5 Key Data Fields

Geographic Location – Identifies the specific region, city, or district in Sri Lanka where sentiment data was collected.

Sentiment Score – Measures public sentiment as positive, neutral, or negative based on social media posts, reviews, and surveys.

Emotion Classification – Categorizes emotions such as happiness, anger, sadness, surprise, and fear for deeper sentiment analysis.

Source of Sentiment – Identifies platforms such as social media, news articles, or customer feedback contributing to sentiment trends.

Timeframe of Data Collection – Tracks sentiment changes over time to identify seasonal or event-driven shifts in public perception.

Top 5 Location Sentiment Trends in Sri Lanka

Tourism Sentiment Fluctuations – Public sentiment in tourist hotspots like Colombo, Kandy, and Galle shifts based on economic conditions, travel restrictions, and visitor experiences.

Economic and Business Perceptions – Sentiments toward local businesses and industries, including tea plantations and IT hubs, impact market confidence and investment decisions.

Political Sentiment Analysis – Tracking sentiment trends around elections, government policies, and public protests provides key insights into political engagement.

Disaster and Crisis Response Sentiments – Public reactions to events like floods, landslides, and economic crises help assess emergency response effectiveness.

Cultural and Social Trends – Sentiment analysis of festivals, religious events, and social movements reflects cultural and societal shifts across different regions.

Top 5 Applications of Location Sentiment Data in Sri Lanka

Market Research & Consumer Behavior – Businesses can tailor products and services by understanding regional sentiment patterns.

Political and Social Analysis – Governments and organizations can monitor public opinion on policies, elections, and social issues.

Crisis Management & Risk Assessment – Real-time sentiment tracking helps in disaster response and mitigation strategies.

Tourism and Hospitality Industry – Hotels, airlines, and tourism boards can adapt marketing strategies based on traveler sentiment.

AI and Machine Learning Models – Enhancing NLP models with location-based sentiment insights for improved predictive analytics.

Accessing Techsalerator’s Location Sentiment Data

To obtain Techsalerator’s Location Sentiment Data for Sri Lanka, contact info@techsalerator.com with your specific requirements. Techsalerator offers customized datasets based on requested fields, with delivery available within 24 hours. Ongoing access options can also be discussed.

Included Data Fields

Geographic Location

Sentiment Score

Emotion Classification

Source of Sentiment

Timeframe of Data Collection

Industry-Specific Sentiment Analysis

Political & Social Sentiment Trends

Crisis & Disaster Sentiment Tracking

Consumer Feedback & Brand Perception

Contact Information

For comprehensive insights into public sentiment and regional opinions across Sri Lanka, Techsalerator’s dataset is an essential tool for businesses, policymakers, and researchers.
TruthSocial - 2024 Election Integrity Initiative
kaggle.com
zip
Updated Nov 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kashish Shah (2024). TruthSocial - 2024 Election Integrity Initiative [Dataset]. https://www.kaggle.com/datasets/kashishashah/truthsocial-2024-election-integrity-initiative/code
Explore at:
zip(224896471 bytes)Available download formats
Dataset updated
Nov 1, 2024
Authors
Kashish Shah
Description
Dataset Overview

This dataset captures election-related discussions on TruthSocial in the lead-up to the 2024 U.S. presidential election. With 1.5 million posts spanning from February 2022 to October 2024, this dataset provides insights into political discourse, community formation, and the spread of information on a prominent alt-tech platform.

Context

TruthSocial, a platform focused on free speech, has attracted users with diverse political views, often leaning conservative. This dataset is ideal for researchers, data scientists, and political analysts interested in studying communication patterns, engagement trends, and sentiment on election-related topics in a less-moderated social media environment.

Usage Notes

This dataset can be utilized for:

Trend Analysis: Study how certain election-related keywords and hashtags gained traction over time.

Sentiment and Engagement Analysis: Measure public sentiment and engagement metrics (likes, replies, re-shares) across various posts.

Community Analysis: Explore patterns in user engagement and community formation.

License

This dataset is intended for research purposes and should be cited appropriately if used in published work.
Word Frequency In Political and Non-Pol. Subreddit
kaggle.com
zip
Updated Feb 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anjay23 (2021). Word Frequency In Political and Non-Pol. Subreddit [Dataset]. https://www.kaggle.com/anjay23/word-frequency-in-political-and-nonpol-subreddit
Explore at:
zip(689948 bytes)Available download formats
Dataset updated
Feb 16, 2021
Authors
Anjay23
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by Anjay23

Released under CC0: Public Domain

Contents
Sound and Audio Data in Namibia
kaggle.com
zip
Updated Mar 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Techsalerator (2025). Sound and Audio Data in Namibia [Dataset]. https://www.kaggle.com/datasets/techsalerator/sound-and-audio-data-in-namibia
Explore at:
zip(12171329 bytes)Available download formats
Dataset updated
Mar 31, 2025
Authors
Techsalerator
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Techsalerator’s Location Sentiment Data for Namibia

Techsalerator’s Location Sentiment Data for Namibia provides in-depth insights into the emotions, opinions, and sentiment trends across different regions of the country. This dataset is essential for businesses, researchers, and policymakers looking to understand public perception, consumer behavior, and regional sentiment variations.

For access to the full dataset, contact us at info@techsalerator.com or visit Techsalerator Contact Us.

Techsalerator’s Location Sentiment Data for Namibia

Techsalerator’s Location Sentiment Data for Namibia delivers structured sentiment analysis derived from social media, news sources, and consumer reviews. This dataset is valuable for market research, social analytics, and economic development strategies.

Top 5 Key Data Fields

Geographic Sentiment Mapping – Identifies sentiment variations across Namibia’s cities, towns, and rural areas.

Sentiment Score (Positive/Neutral/Negative) – Measures public perception through AI-driven analysis of text and speech data.

Source of Sentiment Data – Categorizes sentiment sources such as social media posts, online reviews, and news articles.

Time-Based Sentiment Trends – Tracks changes in sentiment over time, providing insights into seasonal and event-driven fluctuations.

Industry-Specific Sentiment Insights – Analyzes sentiment within key sectors like tourism, agriculture, retail, and finance.

Top 5 Location Sentiment Trends in Namibia

Tourism Sentiment – Positive sentiment spikes in major tourist destinations like Etosha National Park and Swakopmund, influenced by travel reviews and social media discussions.

Urban vs. Rural Sentiment Divide – Urban areas like Windhoek exhibit more dynamic sentiment shifts due to economic and social factors, while rural areas maintain more stable sentiment trends.

Economic Sentiment Fluctuations – Public perception of Namibia’s economic conditions varies based on inflation rates, employment trends, and financial sector developments.

Social and Political Discussions – News and online discussions shape sentiment around governance, policy changes, and major national events.

Retail and Consumer Behavior – Customer sentiment in retail and e-commerce shifts based on pricing, availability, and service quality perceptions.

Top 5 Applications of Location Sentiment Data in Namibia

Market Research and Consumer Insights – Businesses use sentiment data to understand regional customer preferences and tailor marketing strategies.

Tourism and Hospitality Industry – Travel agencies and hotels analyze sentiment to improve visitor experiences and optimize promotions.

Government and Policy Decision-Making – Authorities track public sentiment on policies and social issues for better governance.

Financial and Investment Analysis – Investors and financial analysts use sentiment data to assess market confidence and economic trends.

Brand Reputation Management – Companies monitor sentiment to manage public perception and respond proactively to feedback.

Accessing Techsalerator’s Location Sentiment Data

To obtain Techsalerator’s Location Sentiment Data for Namibia, contact info@techsalerator.com with your specific requirements. Techsalerator provides customized datasets based on requested fields, with delivery available within 24 hours. Ongoing access options can also be discussed.

Included Data Fields

Geographic Sentiment Mapping

Sentiment Score (Positive/Neutral/Negative)

Source of Sentiment Data

Time-Based Sentiment Trends

Industry-Specific Sentiment Insights

Event-Driven Sentiment Analysis

Social Media and News Sentiment Breakdown

Consumer Review Sentiment

Competitor Sentiment Benchmarking

Contact Information

For comprehensive sentiment analysis and location-based insights in Namibia, Techsalerator’s dataset serves as a valuable resource for businesses, researchers, and policymakers.
Sound and Audio Data in United States of America
kaggle.com
zip
Updated Apr 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Techsalerator (2025). Sound and Audio Data in United States of America [Dataset]. https://www.kaggle.com/datasets/techsalerator/sound-and-audio-data-in-united-states-of-america/code
Explore at:
zip(12171329 bytes)Available download formats
Dataset updated
Apr 3, 2025
Authors
Techsalerator
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Area covered
United States
Description
Techsalerator’s Location Sentiment Data for the United States of America

Techsalerator’s Location Sentiment Data for the United States offers a comprehensive dataset crucial for businesses, researchers, and technology developers. This dataset provides deep insights into location-based sentiment patterns, helping users understand regional and local variations in public opinion across different areas in the U.S.

For access to the full dataset, contact us at info@techsalerator.com or visit Techsalerator Contact Us.

Techsalerator’s Location Sentiment Data for the United States

Techsalerator’s Location Sentiment Data for the United States provides structured sentiment analysis across urban, suburban, and rural areas. This dataset is essential for AI development, market research, political analysis, and social studies.

Top 5 Key Data Fields

Location of Sentiment – Identifies the geographic location where the sentiment was recorded, helping researchers analyze sentiment variations across regions.

Sentiment Score – Measures the positive or negative sentiment expressed in a specific location, useful for gauging public opinion on various topics.

Time of Sentiment – Records the exact time and date when the sentiment was captured, helping to track trends over time, such as during elections or major events.

Sentiment Source – Categorizes the source of sentiment data, including social media posts, customer reviews, surveys, and news articles.

Sentiment Context – Provides insights into the context surrounding the sentiment, helping to understand the cause behind positive or negative responses (e.g., political events, economic factors).

Top 5 Location Sentiment Trends in the United States

Political Sentiment Shifts – Election years show significant changes in sentiment, with variations across states, influencing campaign strategies and policy decisions.

Economic Influence on Sentiment – Economic downturns or booms can significantly affect sentiment, particularly in regions reliant on specific industries like agriculture or manufacturing.

Urban vs. Rural Sentiment Differences – Sentiment trends often differ between urban and rural areas, with urban centers focusing on issues like housing, healthcare, and public services, while rural areas tend to emphasize agricultural policies and infrastructure.

Impact of Social Movements – Events like protests, social justice movements, and activism impact sentiment, with regional differences reflecting local engagement and issues.

Disaster-Related Sentiment – Natural disasters and their aftermath, such as hurricanes, wildfires, and floods, lead to changes in public sentiment, influencing recovery and support strategies.

Top 5 Applications of Location Sentiment Data in the United States

Market Research – Businesses use sentiment data to assess regional customer perceptions, helping to tailor marketing strategies for different areas.

Political Campaigns – Political analysts and candidates use sentiment data to gauge public opinion, identify key issues, and shape campaign messages.

Crisis Management – Sentiment analysis helps organizations understand public sentiment during crises (e.g., natural disasters, pandemics) and tailor responses accordingly.

Urban Planning – Local governments use sentiment data to identify public concerns and priorities, guiding urban development and policy-making.

Consumer Behavior Analytics – Retailers and service providers leverage sentiment data to track customer feedback and adjust products or services based on regional preferences.

Accessing Techsalerator’s Location Sentiment Data

To obtain Techsalerator’s Location Sentiment Data for the United States, contact info@techsalerator.com with your specific requirements. Techsalerator provides customized datasets based on requested fields, with delivery available within 24 hours. Ongoing access options can also be discussed.

Included Data Fields

Location of Sentiment

Sentiment Score

Time of Sentiment

Sentiment Source

Sentiment Context

Sentiment Type (Positive, Negative, Neutral)

Demographic Information (Age, Gender, etc.)

Topic Categorization (Political, Economic, Social, etc.)

Sentiment Trends Over Time

Contact Information

For detailed insights into location-based sentiment patterns across the United States, Techsalerator’s dataset is an invaluable resource for researchers, marketers, political analysts, and urban planners.
Truth Social Reactions to Trump’s Iran War Posts
kaggle.com
zip
Updated Apr 22, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammet Akkurt (2026). Truth Social Reactions to Trump’s Iran War Posts [Dataset]. https://www.kaggle.com/datasets/muhammetakkurt/truth-social-reactions-to-trumps-iran-war-posts-s/discussion
Explore at:
zip(65283451 bytes)Available download formats
Dataset updated
Apr 22, 2026
Authors
Muhammet Akkurt
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Area covered
Iran
Description
This dataset contains a curated collection of Donald Trump’s Truth Social posts related to Iran, military escalation, and Middle East conflict rhetoric, along with the associated comment data collected for those posts.

The dataset is structured in two linked tables:

posts.csv: source posts, engagement metrics, media flags, and link-card metadata

comments.csv: replies to those posts, including text, engagement metrics, account-level metadata, and media/link-card indicators

The main purpose of this dataset is to support research and exploratory analysis of:

public reactions to conflict-related political messaging

pro-war vs anti-war response patterns

religious, conspiratorial, celebratory, and media-amplifying rhetoric in replies

engagement dynamics across posts and comments

multimodal reactions such as text, image, video, and link-card replies

Files

posts.csv Contains post-level metadata such as post ID, timestamp, cleaned text, engagement counts, media indicators, account fields, and link-card information.

comments.csv Contains comment-level metadata such as comment ID, parent post ID, cleaned text, engagement counts, commenter account fields, media indicators, and link-card information.

Notes

This dataset is event-centered and focuses specifically on war/conflict-related posts rather than all Truth Social activity.

The current release is primarily a structured raw dataset and does not include manual stance annotations.

Some comments may be text-only, while others may contain only media, videos, or external news/link cards.

Suggested use cases

exploratory data analysis

sentiment and stance analysis

conflict discourse analysis

media/reaction type classification

political communication research

multimodal social media response analysis

Facebook

Twitter

Click to copy link

Link copied

Cite

Mannat Trivedi (2026). Indian Political Tweet Engagement Dataset [Dataset]. https://www.kaggle.com/datasets/mannattrivedi/indian-political-tweet-engagement-dataset

Indian Political Tweet Engagement Dataset

Twitter Political Engagement and User Interaction Analysis

Explore at:

zip(10064890 bytes)Available download formats

Dataset updated

Jan 23, 2026

Authors

Mannat Trivedi

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Area covered

India

Description

Dataset Description and Authenticity Audit

This dataset contains 85,154 Twitter posts related to Indian political discourse, collected between October 2022 and March 2023. It includes tweet text, user identifiers, temporal metadata, and engagement metrics such as likes and retweets, enabling analysis of interaction patterns and engagement behavior in high-activity public discussions.

Dataset Integrity

The dataset consists of 9 variables and is fully cleaned, with no missing values, duplicate records, or invalid timestamps. Derived temporal features (Year, Month, Day) are perfectly consistent with the original timestamp, ensuring reliability for time-based analysis.

Authenticity Validation

Multiple forensic checks were performed to evaluate whether engagement metrics reflect real-world social media behavior:

Temporal Consistency Test confirmed exact alignment between timestamps and derived date components.
Benford’s Law Analysis showed close correspondence between observed and expected digit distributions, indicating naturally occurring numerical patterns.
Social Power Law (Pareto Principle) validation revealed that the top 1% of users contribute approximately 12.25% of all tweets, consistent with organic human-driven participation rather than automated activity.

Statistical Characteristics

Engagement metrics display realistic long-tail distributions, with a small fraction of highly engaged tweets and minimal zero inflation. The dataset contains over 58,000 distinct users and more than 98% unique tweet content, further supporting data authenticity.

Interaction Network Component

In addition to tweet-level data, the dataset includes a user interaction network represented as directed edges. Each edge denotes an interaction between two users, derived from observable Twitter actions such as replies, mentions, or retweets. This network structure enables graph-based analysis of information flow, influence patterns, and community behavior within political discussions. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22766094%2Fea12c83af697260cda6a5dfcdd6c544b%2Fgraph.jpg?generation=1769145568146421&alt=media" alt="">

Intended Use

The dataset is suitable for machine learning and analytical tasks such as engagement prediction, content analysis, user behavior modeling, and temporal interaction studies. Political content is treated solely as a high-engagement discussion domain and does not imply ideological inference or endorsement.

Clear search

Close search

Google apps

Main menu

Indian Political Tweet Engagement Dataset

Dataset Description and Authenticity Audit

Dataset Integrity

Authenticity Validation

Statistical Characteristics

Interaction Network Component

Intended Use

donald-trump-truths-dataset

Trump Truth Social Posts Dataset (Second Presidency)

Overview

Dataset Contents

Source

Potential Applications

Disclaimer

Political Social Media Posts

How was it collected?

Acknowledgments

Inspiration

The Data

Global Political tweets

Content

Information regarding the data

Inspiration

Twitter Political Sentiment Dataset: India

Indian Political Sentiment on Twitter

Misinformation and Metaphor Use Dataset

Political Inclination Classification Nepali Tweets

Trump 2024 Campaign Truth Social Truths (Tweets)

Overview:

Use Cases:

Source:

Limitations:

License:

Egypt - Arabic Political 600k Tweets

Context

Content

Acknowledgements

Inspiration

Political and Off-Topic posts from TigerDroppings

US Election 2024 Social Media Sentiment Dataset

US Election 2024 Social Media Sentiment Dataset

Dataset Features

Potential Applications

Collection Methodology

Ethical Considerations

Recommendations for Kaggle

Indonesian Political Discourse 2025

Joe Biden's Tweets

Joe Biden's Tweets

Likes, Retweets, Shares, and Conversation Dynamics

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Research Ideas

Acknowledgements

License

Columns

Sound and Audio Data in Sri Lanka

Top 5 Key Data Fields

Top 5 Location Sentiment Trends in Sri Lanka

Top 5 Applications of Location Sentiment Data in Sri Lanka

Accessing Techsalerator’s Location Sentiment Data

Included Data Fields

TruthSocial - 2024 Election Integrity Initiative

Dataset Overview

Context

Usage Notes

License

Word Frequency In Political and Non-Pol. Subreddit

Dataset

Contents

Sound and Audio Data in Namibia

Techsalerator’s Location Sentiment Data for Namibia

Top 5 Key Data Fields

Top 5 Location Sentiment Trends in Namibia

Top 5 Applications of Location Sentiment Data in Namibia

Accessing Techsalerator’s Location Sentiment Data

Included Data Fields

Sound and Audio Data in United States of America