10 datasets found

Pfizer Vaccine Tweets
kaggle.com
zip
Updated Nov 23, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gabriel Preda (2021). Pfizer Vaccine Tweets [Dataset]. https://www.kaggle.com/gpreda/pfizer-vaccine-tweets
Explore at:
zip(1845037 bytes)Available download formats
Dataset updated
Nov 23, 2021
Authors
Gabriel Preda
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

We collect recent tweets about Pfizer & BioNTech vaccine.

The data is collected using tweepy Python package to access Twitter API.

Inspiration

Study the subjects of recent tweets about the vaccine made in collaboration by Pfizer and BioNTech, perform various NLP tasks on this data source.
Pfizer Vaccination Tweets
kaggle.com
zip
Updated May 2, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pro (2021). Pfizer Vaccination Tweets [Dataset]. https://www.kaggle.com/vishalpatil123456/pfizer-vaccination-tweets
Explore at:
zip(380501 bytes)Available download formats
Dataset updated
May 2, 2021
Authors
Pro
Description
Dataset

This dataset was created by Pro

Contents
COVID-19 All Vaccines Tweets
kaggle.com
zip
Updated Nov 23, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gabriel Preda (2021). COVID-19 All Vaccines Tweets [Dataset]. https://www.kaggle.com/datasets/gpreda/all-COVID19-vaccines-tweets
Explore at:
zip(31300213 bytes)Available download formats
Dataset updated
Nov 23, 2021
Authors
Gabriel Preda
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

I collect recent tweets about the COVID-19 vaccines used in entire world on large scale, as following: * Pfizer/BioNTech;
* Sinopharm;
* Sinovac;
* Moderna;
* Oxford/AstraZeneca;
* Covaxin;
* Sputnik V.

Data collection

The data is collected using tweepy Python package to access Twitter API. For each of the vaccine I use relevant search term (most frequently used in Twitter to refer to the respective vaccine)

Data collection frequency

Initial data was merged from tweets about Pfizer/BioNTech vaccine. I added then tweets from Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/Astra-Zeneca, Covaxin and Sputnik V vaccines. The collection was in the first days twice a day, until I identified approximatively the new tweets quota and then collection (for all vaccines) stabilized at once a day, during morning hours (GMT).

Inspiration

You can perform multiple operations on the vaccines tweets. Here are few possible suggestions:

Study the subjects of recent tweets about the vaccine made by various producers;

Perform various NLP tasks on this data source (topic modelling, sentiment analysis);

Using the COVID-19 World Vaccination Progress (where we can see the progress of the vaccinations and the countries where the vaccines are administered), you can study the relationship between the vaccination progress and the discussions in social media (from the tweets) about the vaccines.
u
Covid Tweets - Dataset - BSOS Data Repository
bsos-data.umd.edu
Updated Aug 20, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Covid Tweets - Dataset - BSOS Data Repository [Dataset]. https://bsos-data.umd.edu/dataset/covid-tweets-2020
Explore at:
Dataset updated
Aug 20, 2024
Description
Dataset published by Kaggle user Gabriel Preda. Collection using the Python package Tweepy on COVID-19 Vaccine related tweets from 2020. The dataset was updated daily (twice a day) up until January 2022. The initial dataset only scraped tweets relating to the Pfizer/BioNTech vaccine. The dataset was later updated to include tweets relating to additional vaccines such as Sinopharm, Sinovac, Moderna, Oxford/AstraZeneca, Covaxin, and Sputnik V vaccines.
Covid-19 Vaccine Tweets with Sentiment Annotation
kaggle.com
zip
Updated Jun 14, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FullMoonDataScience (2021). Covid-19 Vaccine Tweets with Sentiment Annotation [Dataset]. https://www.kaggle.com/datasciencetool/covid19-vaccine-tweets-with-sentiment-annotation
Explore at:
zip(581692 bytes)Available download formats
Dataset updated
Jun 14, 2021
Authors
FullMoonDataScience
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Context

A collection of tweets related to Covid-19 vaccines with manually annotated sentiments (negative, neutral, positive). Negative sentiment is labeled as 1, neutral as 2, and positive as 3.

Data Collection

Tweet IDs are gathered from a dataset by Gabriel Preda and hydrated to get the full tweet text. The initial dataset included tweets about Pfizer/BioNTech, Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/Astra-Zeneca, Covaxin, and Sputnik V vaccines.

Acknowledgements

Dataset is based on scraped Tweet IDs by Gabriel Preda.
Pfizer Instagram posts
kaggle.com
zip
Updated Jan 19, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
reza jafari (2021). Pfizer Instagram posts [Dataset]. https://kaggle.com/rezaunderfit/what-people-are-talking-about-pfizer-on-instagram
Explore at:
zip(250793 bytes)Available download formats
Dataset updated
Jan 19, 2021
Authors
reza jafari
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

Pfizer Vaccine Tweets is the motivation of gathering this datasets.

Content

This datasets include posts that contains Pfizer hashtag on Instagram. 'id' column equal to Instagram post id. 'text' column is caption of posts. 'accessibility_caption' column is generated automatic lay by Instagram's AI. 'edge_media_preview_like' number of likes. 'edge_media_to_comment_count' number of comments. zero for 'comments_disabled' column means that user allow others for commenting. 'taken_at_timestamp' = timestamp
Vaccine tweets
kaggle.com
zip
Updated Dec 21, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hoaxlines disinformation database (2021). Vaccine tweets [Dataset]. https://www.kaggle.com/hoaxlines/vaccine-tweets
Explore at:
zip(856641574 bytes)Available download formats
Dataset updated
Dec 21, 2021
Authors
Hoaxlines disinformation database
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
**Dates: **Nov 15 to Dec 16, 2021 Total Records: estimated 3,000,000 **Search Query: **vaccine OR vaccinemandate OR "vaccine mandate" OR pfizer OR moderna OR mRNA Data source: Twitter public API **Notes: **Data downloaded in CSV format and unedited for research use. Collection method: Netlytic

Please cite data: Li, E Rosalie. Dec 2021. Vaccine tweets 1-30. Hoaxlines disinformation database from Novel Science. https://www.kaggle.com/hoaxlines/vaccine-tweets
Covid-19 Vaccination Tweets
kaggle.com
zip
Updated Jul 3, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ishan Kotian (2021). Covid-19 Vaccination Tweets [Dataset]. https://www.kaggle.com/lykin22/vaccination-tweets
Explore at:
zip(967927 bytes)Available download formats
Dataset updated
Jul 3, 2021
Authors
Ishan Kotian
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

Collected recent tweets about the COVID-19 vaccines used in the entire world on large scale, as follows:

Pfizer/BioNTech;

Sinopharm;

Sinovac;

Moderna;

Oxford/AstraZeneca;

Covaxin;

Sputnik V.

Content

Starting with the step of loading the data using pandas, some basic data frame operations allow us to see that, for each tweet, all of the following information is available:

information about the user who tweeted

user_name: Twitter handle

user_location: where in the world the person tweets from (NOTE: there is no validation here… “your bed” is technically acceptable)

user_description: user-written biography

user_created: when they created their Twitter account

user_followers: number of followers

user_friends: number of accounts the user is following

user_favourites: number of tweets the user has liked

user_verified: indicates if the user is a well-known figure (boolean)

information about the tweet itself

id: indexing value for Twitter API

date: a DateTime object in the form of YYYY-MM-DD HH:MM:SS

text: the tweet itself (MOST IMPORTANT)

hashtags: list of hashtags used in the tweet (without ‘#’ character)

source: which device was used for the tweet

retweets: number of retweets received at the time the data was collected

favorites: number of likes received at the time the data was collected

is_retweet: indicates if the tweet is original or a retweet (boolean)

Data collection

The data is collected using tweepy Python package to access Twitter API. For each of the vaccine I use a relevant search term (most frequently used in Twitter to refer to the respective vaccine).

If you find this dataset useful, please consider upvoting ❤️
COVID-19 Twitter Data
kaggle.com
zip
Updated Jan 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Thushara T. (2024). COVID-19 Twitter Data [Dataset]. https://www.kaggle.com/datasets/thusharanair/sma-pipeline
Explore at:
zip(169113331 bytes)Available download formats
Dataset updated
Jan 18, 2024
Authors
Thushara T.
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
A twitter dataset curated for Social Media Analysis employing NLP techniques.

Description

We searched Twitter to retrieve tweets containing specific keywords related to COVID-19 from a set of Twitter accounts and news sources. We limited the retrieved tweets to those in English, excluding retweets, and posted between February, 2020, and November, 2021.

Sample code to extract tweets

query = '(@GOVUK OR @CMO_England OR @ASTRAZENECAUK OR @UKHSA OR @DHSCgovuk OR @BBCNews OR @moderna_tx OR @NHSuk OR @BorisJohnson OR @pfizer)' query += ' (#CovidVaccine OR #COVID19Vaccine OR vaccine OR vaccination OR vax OR moderna OR AstraZenca OR Biontech OR JNJ)' query += ' lang:en -is:retweet -is:verified until:2021-11-30 since:2022-02-01' tweets_list2 = [] for i,tweet in enumerate(sntwitter.TwitterSearchScraper(query).get_items()): if i>50000: break tweets_list2.append([tweet.date, tweet.id, tweet.content, tweet.user.username,tweet.user.location, tweet.likeCount, tweet.retweetCount,tweet.replyCount])

Notebooks for reference

Covid-19 Tweets: NLP Social Media Analysis
Texas Winter Storm 2021 Tweets
kaggle.com
zip
Updated Feb 22, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rajkumar Sengottuvel (2021). Texas Winter Storm 2021 Tweets [Dataset]. https://www.kaggle.com/rajsengo/texas-winter-strom-2021-tweets
Explore at:
zip(4105258 bytes)Available download formats
Dataset updated
Feb 22, 2021
Authors
Rajkumar Sengottuvel
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
Texas
Description
Context

Winter Storm Uri in February 2021 caused havoc across the United States and specifically to Texas involving mass power outages, water and food shortages, and dangerous weather conditions.

This dataset consists of 23K+ tweets during the crisis week. Data is filtered to mostly include the tweets from influencers (users having more than 5000 followers) however there is a small subset of tweets from other users as well.

My notebook - https://www.kaggle.com/rajsengo/eda-texas-winterstrom-2021-tweets

Acknowledgements

https://www.kaggle.com/gpreda/pfizer-vaccine-tweets - For the inspiration

https://github.com/dataquestio/twitter-scrape - Reference utility to scrape twitter

Inspiration

Apply NLP techniques to undestand user sentiments about the crisis management
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Gabriel Preda (2021). Pfizer Vaccine Tweets [Dataset]. https://www.kaggle.com/gpreda/pfizer-vaccine-tweets

Pfizer Vaccine Tweets

Pfizer and BioNTech Vaccine Tweets

Explore at:

zip(1845037 bytes)Available download formats

Dataset updated

Nov 23, 2021

Authors

Gabriel Preda

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

We collect recent tweets about Pfizer & BioNTech vaccine.

The data is collected using tweepy Python package to access Twitter API.

Inspiration

Study the subjects of recent tweets about the vaccine made in collaboration by Pfizer and BioNTech, perform various NLP tasks on this data source.

Clear search

Close search

Google apps

Main menu

Pfizer Vaccine Tweets

Context

Inspiration

Pfizer Vaccination Tweets

Dataset

Contents

COVID-19 All Vaccines Tweets

Context

Data collection

Data collection frequency

Inspiration

Covid Tweets - Dataset - BSOS Data Repository

Covid-19 Vaccine Tweets with Sentiment Annotation

Context

Data Collection

Acknowledgements

Pfizer Instagram posts

Context

Content

Vaccine tweets

Covid-19 Vaccination Tweets

Context

Content

information about the user who tweeted

information about the tweet itself

Data collection

If you find this dataset useful, please consider upvoting ❤️

COVID-19 Twitter Data

A twitter dataset curated for Social Media Analysis employing NLP techniques.

Description

Sample code to extract tweets

Notebooks for reference

Texas Winter Storm 2021 Tweets

Context

Acknowledgements

Inspiration

Pfizer Vaccine Tweets

Pfizer and BioNTech Vaccine Tweets

Context

Inspiration