10 datasets found
  1. Pfizer Vaccine Tweets

    • kaggle.com
    zip
    Updated Nov 23, 2021
    Cite
    Gabriel Preda (2021). Pfizer Vaccine Tweets [Dataset]. https://www.kaggle.com/gpreda/pfizer-vaccine-tweets
    Available download formats: zip (1845037 bytes)
    Dataset updated
    Nov 23, 2021
    Authors
    Gabriel Preda
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    We collect recent tweets about the Pfizer & BioNTech vaccine.

    The data is collected using the tweepy Python package to access the Twitter API.

    Inspiration

    Study the subjects of recent tweets about the vaccine made in collaboration by Pfizer and BioNTech, and perform various NLP tasks on this data source.

  2. Pfizer Vaccination Tweets

    • kaggle.com
    zip
    Updated May 2, 2021
    Cite
    Pro (2021). Pfizer Vaccination Tweets [Dataset]. https://www.kaggle.com/vishalpatil123456/pfizer-vaccination-tweets
    Available download formats: zip (380501 bytes)
    Dataset updated
    May 2, 2021
    Authors
    Pro
    Description

    Dataset

    This dataset was created by Pro

    Contents

  3. COVID-19 All Vaccines Tweets

    • kaggle.com
    zip
    Updated Nov 23, 2021
    Cite
    Gabriel Preda (2021). COVID-19 All Vaccines Tweets [Dataset]. https://www.kaggle.com/datasets/gpreda/all-COVID19-vaccines-tweets
    Available download formats: zip (31300213 bytes)
    Dataset updated
    Nov 23, 2021
    Authors
    Gabriel Preda
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    I collect recent tweets about the COVID-19 vaccines used on a large scale throughout the world, as follows:

    • Pfizer/BioNTech;
    • Sinopharm;
    • Sinovac;
    • Moderna;
    • Oxford/AstraZeneca;
    • Covaxin;
    • Sputnik V.

    Data collection

    The data is collected using the tweepy Python package to access the Twitter API. For each vaccine, I use a relevant search term (the one most frequently used on Twitter to refer to that vaccine).

    Data collection frequency

    The initial data was merged from tweets about the Pfizer/BioNTech vaccine. I then added tweets about the Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/AstraZeneca, Covaxin and Sputnik V vaccines. During the first days, collection ran twice a day until I had identified the approximate quota of new tweets; collection (for all vaccines) then stabilized at once a day, during morning hours (GMT).

    Inspiration

    You can perform multiple operations on the vaccine tweets. Here are a few suggestions:

    • Study the subjects of recent tweets about the vaccine made by various producers;
    • Perform various NLP tasks on this data source (topic modelling, sentiment analysis);
    • Using the COVID-19 World Vaccination Progress (where we can see the progress of the vaccinations and the countries where the vaccines are administered), you can study the relationship between the vaccination progress and the discussions in social media (from the tweets) about the vaccines.
  4. Covid Tweets - Dataset - BSOS Data Repository

    • bsos-data.umd.edu
    Updated Aug 20, 2024
    Cite
    (2024). Covid Tweets - Dataset - BSOS Data Repository [Dataset]. https://bsos-data.umd.edu/dataset/covid-tweets-2020
    Dataset updated
    Aug 20, 2024
    Description

    Dataset published by Kaggle user Gabriel Preda. COVID-19 vaccine-related tweets from 2020 were collected using the Python package Tweepy. The dataset was updated daily (twice a day) up until January 2022. The initial dataset only included tweets relating to the Pfizer/BioNTech vaccine. It was later extended to include tweets relating to additional vaccines: Sinopharm, Sinovac, Moderna, Oxford/AstraZeneca, Covaxin, and Sputnik V.

  5. Covid-19 Vaccine Tweets with Sentiment Annotation

    • kaggle.com
    zip
    Updated Jun 14, 2021
    Cite
    FullMoonDataScience (2021). Covid-19 Vaccine Tweets with Sentiment Annotation [Dataset]. https://www.kaggle.com/datasciencetool/covid19-vaccine-tweets-with-sentiment-annotation
    Available download formats: zip (581692 bytes)
    Dataset updated
    Jun 14, 2021
    Authors
    FullMoonDataScience
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Context

    A collection of tweets related to Covid-19 vaccines with manually annotated sentiments (negative, neutral, positive). Negative sentiment is labeled as 1, neutral as 2, and positive as 3.
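
    The label scheme above lends itself to a small lookup table. The following is a minimal sketch, not the dataset author's code: the names SENTIMENT_NAMES, decode_label, and label_counts are hypothetical, and the example labels are made up for illustration.

```python
# Label scheme as stated in the dataset description:
# 1 = negative, 2 = neutral, 3 = positive.
SENTIMENT_NAMES = {1: "negative", 2: "neutral", 3: "positive"}


def decode_label(raw):
    """Convert a raw label (int or numeric string) into its sentiment name."""
    return SENTIMENT_NAMES[int(raw)]


def label_counts(raw_labels):
    """Count how many tweets fall into each sentiment class."""
    counts = {name: 0 for name in SENTIMENT_NAMES.values()}
    for raw in raw_labels:
        counts[decode_label(raw)] += 1
    return counts


# Hypothetical label values, for illustration only.
example_labels = ["1", "3", "2", "3", "3"]
print(label_counts(example_labels))
```

    A label outside 1 to 3 raises a KeyError here, which makes unexpected values visible instead of silently dropping them.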

    Data Collection

    Tweet IDs are gathered from a dataset by Gabriel Preda and hydrated to get the full tweet text. The initial dataset included tweets about the Pfizer/BioNTech, Sinopharm, Sinovac (both Chinese-produced vaccines), Moderna, Oxford/AstraZeneca, Covaxin, and Sputnik V vaccines.

    Acknowledgements

    Dataset is based on scraped Tweet IDs by Gabriel Preda.

  6. Pfizer Instagram posts

    • kaggle.com
    zip
    Updated Jan 19, 2021
    Cite
    reza jafari (2021). Pfizer Instagram posts [Dataset]. https://kaggle.com/rezaunderfit/what-people-are-talking-about-pfizer-on-instagram
    Available download formats: zip (250793 bytes)
    Dataset updated
    Jan 19, 2021
    Authors
    reza jafari
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    The Pfizer Vaccine Tweets dataset was the motivation for gathering this dataset.

    Content

    This dataset includes Instagram posts that contain the Pfizer hashtag. The columns are:

    • id: Instagram post id
    • text: caption of the post
    • accessibility_caption: caption generated automatically by Instagram's AI
    • edge_media_preview_like: number of likes
    • edge_media_to_comment_count: number of comments
    • comments_disabled: zero means that the user allows others to comment
    • taken_at_timestamp: timestamp

  7. Vaccine tweets

    • kaggle.com
    zip
    Updated Dec 21, 2021
    Cite
    Hoaxlines disinformation database (2021). Vaccine tweets [Dataset]. https://www.kaggle.com/hoaxlines/vaccine-tweets
    Available download formats: zip (856641574 bytes)
    Dataset updated
    Dec 21, 2021
    Authors
    Hoaxlines disinformation database
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Dates: Nov 15 to Dec 16, 2021
    Total Records: estimated 3,000,000
    Search Query: vaccine OR vaccinemandate OR "vaccine mandate" OR pfizer OR moderna OR mRNA
    Data source: Twitter public API
    Notes: Data downloaded in CSV format and unedited for research use.
    Collection method: Netlytic

    Please cite data: Li, E Rosalie. Dec 2021. Vaccine tweets 1-30. Hoaxlines disinformation database from Novel Science. https://www.kaggle.com/hoaxlines/vaccine-tweets

  8. Covid-19 Vaccination Tweets

    • kaggle.com
    zip
    Updated Jul 3, 2021
    Cite
    Ishan Kotian (2021). Covid-19 Vaccination Tweets [Dataset]. https://www.kaggle.com/lykin22/vaccination-tweets
    Available download formats: zip (967927 bytes)
    Dataset updated
    Jul 3, 2021
    Authors
    Ishan Kotian
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset collects recent tweets about the COVID-19 vaccines used on a large scale throughout the world, as follows:

    • Pfizer/BioNTech;
    • Sinopharm;
    • Sinovac;
    • Moderna;
    • Oxford/AstraZeneca;
    • Covaxin;
    • Sputnik V.

    Content

    Starting with the step of loading the data using pandas, some basic data frame operations allow us to see that, for each tweet, all of the following information is available:

    information about the user who tweeted

    • user_name: Twitter handle
    • user_location: where in the world the person tweets from (NOTE: there is no validation here… “your bed” is technically acceptable)
    • user_description: user-written biography
    • user_created: when they created their Twitter account
    • user_followers: number of followers
    • user_friends: number of accounts the user is following
    • user_favourites: number of tweets the user has liked
    • user_verified: indicates if the user is a well-known figure (boolean)

    information about the tweet itself

    • id: indexing value for Twitter API
    • date: a DateTime object in the form of YYYY-MM-DD HH:MM:SS
    • text: the tweet itself (MOST IMPORTANT)
    • hashtags: list of hashtags used in the tweet (without ‘#’ character)
    • source: which device was used for the tweet
    • retweets: number of retweets received at the time the data was collected
    • favorites: number of likes received at the time the data was collected
    • is_retweet: indicates if the tweet is original or a retweet (boolean)
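
    As a rough illustration of the fields listed above, the sketch below parses two made-up rows with Python's standard library. It assumes that the date column uses the YYYY-MM-DD HH:MM:SS form given above, that booleans are stored as the strings True/False, and that hashtags are stored as a Python-style list literal; none of these storage details are confirmed by the description, and parse_row is a hypothetical helper.

```python
import ast
import csv
import io
from datetime import datetime

# Two made-up rows using a subset of the columns listed above (illustrative values only).
SAMPLE_CSV = """user_name,user_followers,user_verified,date,text,hashtags,is_retweet
alice,120,False,2021-01-05 09:30:00,First dose today,"['vaccine', 'Pfizer']",False
bob,8000,True,2021-01-06 18:45:10,Waiting for my appointment,,False
"""


def parse_row(row):
    """Convert the string fields of one CSV row into typed Python values."""
    return {
        "user_name": row["user_name"],
        "user_followers": int(row["user_followers"]),
        "user_verified": row["user_verified"] == "True",
        "date": datetime.strptime(row["date"], "%Y-%m-%d %H:%M:%S"),
        "text": row["text"],
        # Assumption: hashtags are a Python-style list literal; an empty field means none.
        "hashtags": ast.literal_eval(row["hashtags"]) if row["hashtags"] else [],
        "is_retweet": row["is_retweet"] == "True",
    }


tweets = [parse_row(r) for r in csv.DictReader(io.StringIO(SAMPLE_CSV))]
for t in tweets:
    print(t["date"].date(), t["user_name"], t["hashtags"])
```

    With the real file, the same parse_row could be applied to rows read from the downloaded CSV, provided the assumptions above hold for it.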

    Data collection

    The data is collected using the tweepy Python package to access the Twitter API. For each vaccine, I use a relevant search term (the one most frequently used on Twitter to refer to that vaccine).

    If you find this dataset useful, please consider upvoting ❤️

  9. COVID-19 Twitter Data

    • kaggle.com
    zip
    Updated Jan 18, 2024
    Cite
    Thushara T. (2024). COVID-19 Twitter Data [Dataset]. https://www.kaggle.com/datasets/thusharanair/sma-pipeline
    Available download formats: zip (169113331 bytes)
    Dataset updated
    Jan 18, 2024
    Authors
    Thushara T.
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    A Twitter dataset curated for Social Media Analysis employing NLP techniques.

    Description

    We searched Twitter to retrieve tweets containing specific keywords related to COVID-19 from a set of Twitter accounts and news sources. We limited the retrieved tweets to those in English, excluding retweets, and posted between February 2020 and November 2021.

    Sample code to extract tweets

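    import snscrape.modules.twitter as sntwitter  # third-party package: snscrape

    # Accounts to search (government bodies, vaccine makers, news sources)
    query = '(@GOVUK OR @CMO_England OR @ASTRAZENECAUK OR @UKHSA OR @DHSCgovuk OR @BBCNews OR @moderna_tx OR @NHSuk OR @BorisJohnson OR @pfizer)'
    # Vaccine-related hashtags and keywords (spelling kept as in the original query)
    query += ' (#CovidVaccine OR #COVID19Vaccine OR vaccine OR vaccination OR vax OR moderna OR AstraZenca OR Biontech OR JNJ)'
    # English only, no retweets; the original had since:2022-02-01, which falls after
    # the until date, so the window here follows the description (Feb 2020 to Nov 2021)
    query += ' lang:en -is:retweet -is:verified since:2020-02-01 until:2021-11-30'
    tweets_list2 = []
    for i, tweet in enumerate(sntwitter.TwitterSearchScraper(query).get_items()):
        if i > 50000:  # stop after roughly 50,000 tweets
            break
        tweets_list2.append([tweet.date, tweet.id, tweet.content,
                             tweet.user.username, tweet.user.location,
                             tweet.likeCount, tweet.retweetCount, tweet.replyCount])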
    

    Notebooks for reference

    Covid-19 Tweets: NLP Social Media Analysis

  10. Texas Winter Storm 2021 Tweets

    • kaggle.com
    zip
    Updated Feb 22, 2021
    Cite
    Rajkumar Sengottuvel (2021). Texas Winter Storm 2021 Tweets [Dataset]. https://www.kaggle.com/rajsengo/texas-winter-strom-2021-tweets
    Available download formats: zip (4105258 bytes)
    Dataset updated
    Feb 22, 2021
    Authors
    Rajkumar Sengottuvel
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Texas
    Description

    Context

    Winter Storm Uri in February 2021 caused havoc across the United States, and in Texas in particular, with mass power outages, water and food shortages, and dangerous weather conditions.

    This dataset consists of 23K+ tweets posted during the crisis week. The data is filtered to mostly include tweets from influencers (users with more than 5000 followers); however, there is a small subset of tweets from other users as well.
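
    The influencer criterion described above (more than 5000 followers) can be sketched as a simple filter. This is an illustration only, not the author's collection code: the field name user_followers, the helper names, and the sample records are all assumptions.

```python
INFLUENCER_MIN_FOLLOWERS = 5000  # threshold stated in the dataset description


def is_influencer(tweet):
    """Return True when the tweet's author has more than 5000 followers."""
    return tweet["user_followers"] > INFLUENCER_MIN_FOLLOWERS


def split_by_influence(tweets):
    """Partition tweets into (influencer_tweets, other_tweets)."""
    influencers = [t for t in tweets if is_influencer(t)]
    others = [t for t in tweets if not is_influencer(t)]
    return influencers, others


# Made-up records; the "user_followers" field name is an assumption.
sample_tweets = [
    {"user_name": "local_news", "user_followers": 52000, "text": "Power outages reported"},
    {"user_name": "resident", "user_followers": 310, "text": "No water since morning"},
    {"user_name": "edge_case", "user_followers": 5000, "text": "Exactly at the threshold"},
]

influencer_tweets, other_tweets = split_by_influence(sample_tweets)
print(len(influencer_tweets), len(other_tweets))
```

    The comparison is strict, so an account with exactly 5000 followers is not counted as an influencer, matching the "more than 5000" wording.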

    My notebook - https://www.kaggle.com/rajsengo/eda-texas-winterstrom-2021-tweets

    Acknowledgements

    Inspiration

    Apply NLP techniques to understand user sentiment about the crisis management.


