Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
We collect recent tweets about Pfizer & BioNTech vaccine.
The data is collected using tweepy Python package to access Twitter API.
Study the subjects of recent tweets about the vaccine made in collaboration by Pfizer and BioNTech, perform various NLP tasks on this data source.
Facebook
TwitterThis dataset was created by Pro
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I collect recent tweets about the COVID-19 vaccines used in entire world on large scale, as following:
* Pfizer/BioNTech;
* Sinopharm;
* Sinovac;
* Moderna;
* Oxford/AstraZeneca;
* Covaxin;
* Sputnik V.
The data is collected using tweepy Python package to access Twitter API. For each of the vaccine I use relevant search term (most frequently used in Twitter to refer to the respective vaccine)
Initial data was merged from tweets about Pfizer/BioNTech vaccine. I added then tweets from Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/Astra-Zeneca, Covaxin and Sputnik V vaccines. The collection was in the first days twice a day, until I identified approximatively the new tweets quota and then collection (for all vaccines) stabilized at once a day, during morning hours (GMT).
You can perform multiple operations on the vaccines tweets. Here are few possible suggestions:
Facebook
TwitterDataset published by Kaggle user Gabriel Preda. Collection using the Python package Tweepy on COVID-19 Vaccine related tweets from 2020. The dataset was updated daily (twice a day) up until January 2022. The initial dataset only scraped tweets relating to the Pfizer/BioNTech vaccine. The dataset was later updated to include tweets relating to additional vaccines such as Sinopharm, Sinovac, Moderna, Oxford/AstraZeneca, Covaxin, and Sputnik V vaccines.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
A collection of tweets related to Covid-19 vaccines with manually annotated sentiments (negative, neutral, positive). Negative sentiment is labeled as 1, neutral as 2, and positive as 3.
Tweet IDs are gathered from a dataset by Gabriel Preda and hydrated to get the full tweet text. The initial dataset included tweets about Pfizer/BioNTech, Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/Astra-Zeneca, Covaxin, and Sputnik V vaccines.
Dataset is based on scraped Tweet IDs by Gabriel Preda.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Pfizer Vaccine Tweets is the motivation of gathering this datasets.
This datasets include posts that contains Pfizer hashtag on Instagram. 'id' column equal to Instagram post id. 'text' column is caption of posts. 'accessibility_caption' column is generated automatic lay by Instagram's AI. 'edge_media_preview_like' number of likes. 'edge_media_to_comment_count' number of comments. zero for 'comments_disabled' column means that user allow others for commenting. 'taken_at_timestamp' = timestamp
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
**Dates: **Nov 15 to Dec 16, 2021 Total Records: estimated 3,000,000 **Search Query: **vaccine OR vaccinemandate OR "vaccine mandate" OR pfizer OR moderna OR mRNA Data source: Twitter public API **Notes: **Data downloaded in CSV format and unedited for research use. Collection method: Netlytic
Please cite data: Li, E Rosalie. Dec 2021. Vaccine tweets 1-30. Hoaxlines disinformation database from Novel Science. https://www.kaggle.com/hoaxlines/vaccine-tweets
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Collected recent tweets about the COVID-19 vaccines used in the entire world on large scale, as follows:
Starting with the step of loading the data using pandas, some basic data frame operations allow us to see that, for each tweet, all of the following information is available:
The data is collected using tweepy Python package to access Twitter API. For each of the vaccine I use a relevant search term (most frequently used in Twitter to refer to the respective vaccine).
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
We searched Twitter to retrieve tweets containing specific keywords related to COVID-19 from a set of Twitter accounts and news sources. We limited the retrieved tweets to those in English, excluding retweets, and posted between February, 2020, and November, 2021.
query = '(@GOVUK OR @CMO_England OR @ASTRAZENECAUK OR @UKHSA OR @DHSCgovuk OR @BBCNews OR @moderna_tx OR @NHSuk OR @BorisJohnson OR @pfizer)'
query += ' (#CovidVaccine OR #COVID19Vaccine OR vaccine OR vaccination OR vax OR moderna OR AstraZenca OR Biontech OR JNJ)'
query += ' lang:en -is:retweet -is:verified until:2021-11-30 since:2022-02-01'
tweets_list2 = []
for i,tweet in enumerate(sntwitter.TwitterSearchScraper(query).get_items()):
if i>50000:
break
tweets_list2.append([tweet.date, tweet.id, tweet.content,
tweet.user.username,tweet.user.location,
tweet.likeCount, tweet.retweetCount,tweet.replyCount])
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Winter Storm Uri in February 2021 caused havoc across the United States and specifically to Texas involving mass power outages, water and food shortages, and dangerous weather conditions.
This dataset consists of 23K+ tweets during the crisis week. Data is filtered to mostly include the tweets from influencers (users having more than 5000 followers) however there is a small subset of tweets from other users as well.
My notebook - https://www.kaggle.com/rajsengo/eda-texas-winterstrom-2021-tweets
Apply NLP techniques to undestand user sentiments about the crisis management
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
We collect recent tweets about Pfizer & BioNTech vaccine.
The data is collected using tweepy Python package to access Twitter API.
Study the subjects of recent tweets about the vaccine made in collaboration by Pfizer and BioNTech, perform various NLP tasks on this data source.