Facebook
TwitterThis dataset was created by Mahendra Ch
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
We collect recent tweets about Pfizer & BioNTech vaccine.
The data is collected using tweepy Python package to access Twitter API.
Study the subjects of recent tweets about the vaccine made in collaboration by Pfizer and BioNTech, perform various NLP tasks on this data source.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I collect recent tweets about the COVID-19 vaccines used in entire world on large scale, as following:
* Pfizer/BioNTech;
* Sinopharm;
* Sinovac;
* Moderna;
* Oxford/AstraZeneca;
* Covaxin;
* Sputnik V.
The data is collected using tweepy Python package to access Twitter API. For each of the vaccine I use relevant search term (most frequently used in Twitter to refer to the respective vaccine)
Initial data was merged from tweets about Pfizer/BioNTech vaccine. I added then tweets from Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/Astra-Zeneca, Covaxin and Sputnik V vaccines. The collection was in the first days twice a day, until I identified approximatively the new tweets quota and then collection (for all vaccines) stabilized at once a day, during morning hours (GMT).
You can perform multiple operations on the vaccines tweets. Here are few possible suggestions:
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
this is the dataset which contain four attribute Username Description Text Hashtags for Pfizer vaccination
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Pfizer Vaccine Tweets is the motivation of gathering this datasets.
This datasets include posts that contains Pfizer hashtag on Instagram. 'id' column equal to Instagram post id. 'text' column is caption of posts. 'accessibility_caption' column is generated automatic lay by Instagram's AI. 'edge_media_preview_like' number of likes. 'edge_media_to_comment_count' number of comments. zero for 'comments_disabled' column means that user allow others for commenting. 'taken_at_timestamp' = timestamp
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
A collection of tweets related to Covid-19 vaccines with manually annotated sentiments (negative, neutral, positive). Negative sentiment is labeled as 1, neutral as 2, and positive as 3.
Tweet IDs are gathered from a dataset by Gabriel Preda and hydrated to get the full tweet text. The initial dataset included tweets about Pfizer/BioNTech, Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/Astra-Zeneca, Covaxin, and Sputnik V vaccines.
Dataset is based on scraped Tweet IDs by Gabriel Preda.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
**Dates: **Nov 15 to Dec 16, 2021 Total Records: estimated 3,000,000 **Search Query: **vaccine OR vaccinemandate OR "vaccine mandate" OR pfizer OR moderna OR mRNA Data source: Twitter public API **Notes: **Data downloaded in CSV format and unedited for research use. Collection method: Netlytic
Please cite data: Li, E Rosalie. Dec 2021. Vaccine tweets 1-30. Hoaxlines disinformation database from Novel Science. https://www.kaggle.com/hoaxlines/vaccine-tweets
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This dataset contains Twitter posts containing daily updates of location-based COVID–19 vaccine-related tweets from January 2021 to August 2021.
With an existing Twitter account, we applied for Developer Access and were granted access to Twitter Academic Researcher API which allows for over 10 million tweets per month. Then, we created an application to generate the API credentials (access tokens) from Twitter. The access token was used in Python (v3.6) script to authenticate and establish a connection to the Twitter database. To get goe-tagged vaccine-related tweets, we used the python script we developed to perform a historical search (archive search) of vaccine-related keywords with place country South Africa (ZA). By goe-tagged tweets, we refer to Twitter posts with a know location. These vaccine-related keywords include but are not limited to the vaccine, anti-vaxxer, vaccination, AstraZeneca, Oxford-AstraZeneca, IChooseVaccination, VaccineToSaveSouthAfrica, JohnsonJohnson, and Pfizer. The keywords were selected from the trending topic during the period of discussion. A complete list of the keywords is shown below:
Oxford-AstraZeneca, AstraZeneca, JohnsonJohnson, Vaccine, BioNTech, anti-vaccine, jab, Vaccination, Covax, Vaccine Rollout, Sputnik, VaccineToSaveSouthAfrica, IChooseVaccination, TeachersVaccine, AstraZeneca vaccine, Pfizer, J & J, Johonson & Johnson, Moderna, VaccinesWork, VacciNation, Vaccine, Steriod, COVIDvaccine, covax, VaccineEquity, VaccineReady, Jab OR PfizerGang, Scamdemic, Plandemic, Scaredemic, COVID-19, coronavirus, SARS-CoV-2, anti-vaxxers, jab, Pfizer, BioNTech, JJ, Vaccine, JohnsonJohnson Vaccine, Vaccine Rollout, J & J, Sputnik, COVAX, CoronaVac
The preferred language of the tweet is English.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
We searched Twitter to retrieve tweets containing specific keywords related to COVID-19 from a set of Twitter accounts and news sources. We limited the retrieved tweets to those in English, excluding retweets, and posted between February, 2020, and November, 2021.
query = '(@GOVUK OR @CMO_England OR @ASTRAZENECAUK OR @UKHSA OR @DHSCgovuk OR @BBCNews OR @moderna_tx OR @NHSuk OR @BorisJohnson OR @pfizer)'
query += ' (#CovidVaccine OR #COVID19Vaccine OR vaccine OR vaccination OR vax OR moderna OR AstraZenca OR Biontech OR JNJ)'
query += ' lang:en -is:retweet -is:verified until:2021-11-30 since:2022-02-01'
tweets_list2 = []
for i,tweet in enumerate(sntwitter.TwitterSearchScraper(query).get_items()):
if i>50000:
break
tweets_list2.append([tweet.date, tweet.id, tweet.content,
tweet.user.username,tweet.user.location,
tweet.likeCount, tweet.retweetCount,tweet.replyCount])
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Winter Storm Uri in February 2021 caused havoc across the United States and specifically to Texas involving mass power outages, water and food shortages, and dangerous weather conditions.
This dataset consists of 23K+ tweets during the crisis week. Data is filtered to mostly include the tweets from influencers (users having more than 5000 followers) however there is a small subset of tweets from other users as well.
My notebook - https://www.kaggle.com/rajsengo/eda-texas-winterstrom-2021-tweets
Apply NLP techniques to undestand user sentiments about the crisis management
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterThis dataset was created by Mahendra Ch