Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains the county-wise vote share of the United States presidential election of 2020, and in the future 2024, the main advantage of the dataset is that it contains various important county statistics such as the counties racial composition, median and mean income, income inequality, population density, education level, population and the counties occupational distribution.
_Imp: this dataset will be updated as the 2024 results come in, I will also be adding more county demographic data, if you have any queries or suggestions please feel free to comment _
The reasons for constructing this dataset are many, however the prime reason was to aggregate all the data on counties along with the election result data for easy analysis in one place. I noticed that Kaggle contains no datasets with detailed county information, and that using the US census bureau site is pretty difficult and time consuming to extract data so it would be better to have a pre-prepared table of data
Facebook
TwitterThis data comes from the Associated Press - the AP has been tracking vote counts in US elections since 1848 and their data is widely considered to be accurate.
The variables in this dataset are:
- state: State to which the vote count corresponds
- state_abr: Two-letter abbreviation of state name
- trump_pct: Percentage of the vote won by Donald Trump
- biden_pct: Percentage of the vote won by Joe Biden
- trump_win: Binary variable denoting whether Donald Trump won the vote in a state
- biden_win: Binary variable denoting whether Joe Biden won the vote in a state
Facebook
TwitterThis dataset was created by Hue Dinh
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Please, If you enjoyed this dataset, don't forget to upvote it.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3244747%2F5173f21bc8eaea8335539cc942338b4d%2Fheader_win.png?generation=1605608056355359&alt=media" alt="">
For this year, was the 59th quadrennial presidential election held on Tuesday, November 3, 2020. To win the election, the candidate needs 270 out of 538 electoral votes. A good sign, that show if a candidate is doing well, is if they win states that aren't expcted to go their way.
This dataset contains county-level data from 2020 US Election.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This file shows all election related data by state and county (i.e. total votes, Republican votes, Democratic votes, Republican voting percentage, Democratic voting percentage) for both the 2020 and 2024 U.S. Presidential Elections.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I like history, so figured I would compile a dataset on all US Presidential elections throughout history, and how they went.
Inside are datasets for the winners, candidates, and turnout for each presidential election.
Banner Image By Gilbert Stuart https://www.clarkart.edu/artpiece/detail/george-washington, Public Domain, https://commons.wikimedia.org/w/index.php?curid=591229
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Data includes tweets leading up to the U.S. 2020 election with hashtags as follows: #trump,. #biden, #2020election, #presidentialelection, #electionissues.
Over 600k tweets from Oct. 14 - Nov. 4 2020.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The County Presidential Election Returns 2000-2020 dataset describes the results of the various United States Presidential elections, broken out on a per county basis. It provides the details on the votes cast per candidate, the political party to whom the candidate belongs, and the mode by which the votes were cast. The data was collected by the MIT Election Data and Science Lab, and is housed on the Harvard Dataverse.
References: 1. https://electionlab.mit.edu/ 2. https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/VOQCHQ
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset is collected from 1824 to 2020: 1. Year: Description: The year in which the U.S. election took place. Type: Numeric (Integer) Example: 1824, 1860, 1920, 2020
Candidate: Description: The name of the candidate participating in the election. Type: String (Candidate's name) Example: John Adams, Abraham Lincoln, Franklin D. Roosevelt, Joe Biden
Party: Description: The political party affiliation of the candidate. Type: String (Party name or abbreviation) Example: Democratic, Republican, Whig, Libertarian
Popular Vote: Description: The total number of votes that the candidate received in the popular vote. Type: Numeric (Integer) Example: 500,000, 5,000,000, 70,000,000
Result: Description: The outcome of the election for the specified candidate. Type: String (e.g., "Winner," "Runner-up," "Withdrew") Example: Winner, Runner-up, Withdrew, Conceded
Percentage: Description: The percentage of the total popular vote received by the candidate. Type: Numeric (Float) Example: 25.3%, 49.8%, 60.5%
This dataset appears to capture essential information about U.S. elections over time, including details about the candidates, their political party affiliations, the number of popular votes they received, the outcome of the election, and the percentage of the total popular vote they secured. This comprehensive dataset allows for the analysis of historical U.S. election trends and outcomes.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This data file contains constituency (state-level) returns for elections to the U.S. presidency from 1976 to 2020.
For more information please refer : https://guides.lib.jjay.cuny.edu/c.php?g=992251&p=7179356
More - Find More Exciting🙀 Datasets Here - An Upvote👍 A Dayᕙ(`▿´)ᕗ , Keeps Aman Hurray Hurray..... ٩(˘◡˘)۶Haha
Facebook
Twitterhttps://www.usa.gov/government-works/https://www.usa.gov/government-works/
This dataset was created by Mohamad
Released under U.S. Government Works
Facebook
Twitterhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html
The goal of this dataset is to provide a tidy way to access to the transcripts of speeches given by various US politicians in the context of the 2020 US Presidential Election. Transcripts have been scraped from rev.com. Some other information, such as location and type of speech, have been manually added to the dataset.
The dataset has the following columns:
speaker: Who gave the speech
title: a title or a description of speech
text: the transcript of the speech
location: the location or the platform where the speech was give
type: type of speech (e.g., campaign speech, interview or debate)
We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.
Your data will be in front of the world's largest data science community. What questions do you want to see answered?
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Comprehensive and standardized election results of winning and losing candidates for the offices of President, U.S. House, U.S. Senate, and Governor. Between 2006 and 2020 alone, there were over 4,000 races for Congress and Governor in the general election, in which 7,710 Democratic and Republican candidates ran for office. This is a candidate-level dataset that is comprehensive for Presidential, Congressional and gubernatorial contests between 2006 and 2020 with standardized names for each unique candidate, party, incumbency, vote totals, and election results.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
All tweets are available on twitter ofc.Tweets scrapped using twint.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2390050%2F23b273a5060da43d5bd5cc5a36a0e9b2%2Ftrump-tweets.jpeg?generation=1606222822791162&alt=media" alt="">
It'd be interesting to visualize what the public is talking about the election.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset was created by Jason G Yoon
Released under CC0: Public Domain
Facebook
TwitterThis is the dataset I used to figure out which sociodemographic factor including the current pandemic status of each state has the most significan impace on the result of the US Presidential election last year. I also included sentiment scores of tweets created from 2020-10-15 to 2020-11-02 as well, in order to figure out the effect of positive/negative emotion for each candidate - Donald Trump and Joe Biden - on the result of the election.
Details for each variable are as below: - state: name of each state in the United States, including District of Columbia - elec16, elec20: dummy variable indicating whether Trump gained the electoral votes of each state or not. If the electors casted their votes for Trump, the value is 1; otherwise the value is 0 - elecchange: dummy variable indicating whether each party flipped the result in 2020 compared to that of the 2016 - demvote16: the rate of votes that the Democrats, i.e. Hillary Clinton earned in the 2016 Presidential election - repvote16: the rate of votes that the Republicans , i.e. Donald Trump earned in the 2016 Presidential election - demvote20: the rate of votes that the Democrats, i.e. Joe Biden earned in the 2020 Presidential election - repvote20: the rate of votes that the Republicans , i.e. Donald Trump earned in the 2020 Presidential election - demvotedif: the difference between demvote20 and demvote16 - repvotedif: the difference between repvote20 and repvote16 - pop: the population of each state - cumulcases: the cumulative COVID-19 cases on the Election day - caseMar ~ caseOct: the cumulative COVID-19 cases during each month - Marper10k ~ Octper10k: the cumulative COVID-19 cases during each month per 10 thousands - unemp20: the unemployment rate of each state this year before the election - unempdif: the difference between the unemployment rate of the last year and that of this year - jan20unemp ~ oct20unemp: the unemployment rate of each month - cumulper10k: the cumulative COVID-19 cases on the Election day per 10 thousands - b_str_poscount_total: the total number of positive tweets on Biden measured by the SentiStrength - b_str_negcount_total: the total number of negative tweets on Biden measured by the SentiStrength - t_str_poscount_total: the total number of positive tweets on Trump measured by the SentiStrength - t_str_poscount_total: the total number of negative tweets on Trump measured by the SentiStrength - b_str_posprop_total: the proportion of positive tweets on Biden measured by the SentiStrength - b_str_negprop_total: the proportion of negative tweets on Biden measured by the SentiStrength - t_str_posprop_total: the proportion of positive tweets on Trump measured by the SentiStrength - t_str_negprop_total: the proportion of negative tweets on Trump measured by the SentiStrength - white: the proportion of white people - colored: the proportion of colored people - secondary: the proportion of people who has attained the secondary education - tertiary: the proportion of people who has attained the tertiary education - q3gdp20: GDP of the 3rd quarter 2020 - q3gdprate: the growth rate of the 3rd quarter 2020, compared to that of the same quarter last year - 3qsgdp20: GDP of 3 quarters 2020 - 3qsrate20: the growth rate of GDP compared to that of the 3 quarters last year - q3gdpdif: the difference in the level of GDP of the 3rd quarter compared to the last quarter - q3rate: the growth rate of the 3rd quarter compared to the last quarter - access: the proportion of households having the Internet access
Facebook
TwitterThis dataset was created by AllenEdgarPoe
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
[READ THIS FIRST! DATASETS FOR Academic/Learning/Non-commercial purpose]
US Election 2020 is very interesting to look into as it is an election in the middle of a pandemic. Me and my teammate created a twitter crawler using Twitter API and Tweepy for my Artificial Intelligence coursework. We chose Donald Trump as a subject of interest as President Trump was known for his twitter interaction.
I decided to deploy my crawler on post-voting day to conduct a sentiment analysis.
Tweet text in this datasets is suitable for Sentiment Analysis usage.
This raw datasets is crawled using Tweepy library and Twitter API. 2500 tweets were gathered per 15 minutes. There are total of 247,500 row of entries and 13 columns, with the total of 3,217,500 cells of data. Data cleaning is needed to perform before doing any analysis.
Datasets date range: 4th November 2020 - 11th November 2020 Tweets with "Trump", "DonalTrump", "realDonalTrump" were capture.
(The User = user of the particular row) username: Twitter User handle accDesc: Description of the user on profile location: Location of the tweet following: Total number of account the user is following followers: Total number of followers of the user totaltweets: Total tweets created of the user usercreated: Date of the user registered his/her Twitter account tweetcreated: Date of the tweet created favouritecount: tweet <3 count (equivalent to like on Facebook) retweetcount: Total tweet's retweet (equivalent to share on Facebook) text: Text body of the tweet tweetsource: Device used to create this tweet hashtags: hashtag of the tweet in JSON format
Banner and thumbnail courtesy of > visuals < from unsplash.com
Much thanks to my teammate Jiacheng Loh and ChenZhen Li for the efforts.
Please do not use this datasets for any malicious attempts, any damage done is not under the responsible of me.
This datasets were gathered for the purpose of learning and not for commercial purposes.
Data were public in the public domain, therefore i assume these data is open for all.
Datasets are gathered with at least 15 minutes interval, therefore datecreated distribution is not equal and may not include all tweets created within the date range.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
As you all know, America's Presidential election 2020 will going to happen in November. The result of which will going to be a big thing not just only for American citizens but other countries people too and this will ultimately effect the relations with other countries. So, I have scraped tweet replies of Twitter handle of both the final runners i.e., Donald Trump and Joe Biden. So, the idea is to predict result of this upcoming election and get some insights on how to make predictions with better accuracy.
This dataset is collected through scraping Twitter handles of Donald Trump and Joe Biden separately for a week using python script and retrieving replies on tweets done by both of them. The idea is to look for the sentiments of people towards their representative and how they are reacting to the tweets done by both the leaders. So that one can predict results in advance by analysing people sentiments using data science skills. Below I have attached two csv files, one is for Donald Trump and another is for Joe Biden. Both of the csv files are having two common columns:
1st column user contains user name who have replied to the tweets.
2nd column text contains text of replies on each tweet.
To get more insights on it and how to start with the project, you can refer to my article below. This was a small fun project to get insights thorugh this dataset where I am not training any model as such. I am simply analysing people's sentiment using NLP library TextBlob. So, you can refer to this article to get started with the dataset on how to clean it, analyze it and visualize it. Feel free to use some other algorithms to achieve greater accuracy. Predicting US Presidential Election Result Using Twitter Sentiment Analysis with Python
This dataset actually gives immense opportunities to revise your data science skills from fundamentals. - Data Cleaning as this dataset includes lots of noise which will be going to be a barrier in making good model. - Exploratory Data Analysis - Data Analysis - Data Visualization
Facebook
Twitter18th_June_2020_speech : https://factba.se/transcript/donald-trump-speech-maga-rally-reelection-orlando-june-18-2019 20th_June_2020_speech : https://factba.se/transcript/donald-trump-speech-kag-rally-tulsa-oklahoma-june-20-2020 23_June_2020_Arizona_speech : https://factba.se/transcript/donald-trump-speech-american-youth-phoenix-arizona-june-23-2020
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains the county-wise vote share of the United States presidential election of 2020, and in the future 2024, the main advantage of the dataset is that it contains various important county statistics such as the counties racial composition, median and mean income, income inequality, population density, education level, population and the counties occupational distribution.
_Imp: this dataset will be updated as the 2024 results come in, I will also be adding more county demographic data, if you have any queries or suggestions please feel free to comment _
The reasons for constructing this dataset are many, however the prime reason was to aggregate all the data on counties along with the election result data for easy analysis in one place. I noticed that Kaggle contains no datasets with detailed county information, and that using the US census bureau site is pretty difficult and time consuming to extract data so it would be better to have a pre-prepared table of data