Search
Clear search
Close search
Main menu
Google apps
100+ datasets found
  1. g

    Coronavirus (Covid-19) Data in the United States

    • github.com
    • openicpsr.org
    • +1more
    csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    New York Times, Coronavirus (Covid-19) Data in the United States [Dataset]. https://github.com/nytimes/covid-19-data
    Explore at:
    csvAvailable download formats
    Dataset provided by
    New York Times
    License

    https://github.com/nytimes/covid-19-data/blob/master/LICENSEhttps://github.com/nytimes/covid-19-data/blob/master/LICENSE

    Description

    The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.

    Since the first reported coronavirus case in Washington State on Jan. 21, 2020, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.

    We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.

    The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.

  2. H

    Novel Coronavirus (COVID-19) Cases Data

    • data.humdata.org
    csv
    Updated Feb 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Johns Hopkins University Center for Systems Science and Engineering (2025). Novel Coronavirus (COVID-19) Cases Data [Dataset]. https://data.humdata.org/dataset/novel-coronavirus-2019-ncov-cases
    Explore at:
    csvAvailable download formats
    Dataset updated
    Feb 4, 2025
    Dataset provided by
    Johns Hopkins University Center for Systems Science and Engineering
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description
    JHU Has Stopped Collecting Data As Of 03/10/2023
    After three years of around-the-clock tracking of COVID-19 data from around the world, Johns Hopkins has discontinued the Coronavirus Resource Center’s operations.
    The site’s two raw data repositories will remain accessible for information collected from 1/22/20 to 3/10/23 on cases, deaths, vaccines, testing and demographics.

    Novel Corona Virus (COVID-19) epidemiological data since 22 January 2020. The data is compiled by the Johns Hopkins University Center for Systems Science and Engineering (JHU CCSE) from various sources including the World Health Organization (WHO), DXY.cn, BNO News, National Health Commission of the People’s Republic of China (NHC), China CDC (CCDC), Hong Kong Department of Health, Macau Government, Taiwan CDC, US CDC, Government of Canada, Australia Government Department of Health, European Centre for Disease Prevention and Control (ECDC), Ministry of Health Singapore (MOH), and others. JHU CCSE maintains the data on the 2019 Novel Coronavirus COVID-19 (2019-nCoV) Data Repository on Github.

    Fields available in the data include Province/State, Country/Region, Last Update, Confirmed, Suspected, Recovered, Deaths.

    On 23/03/2020, a new data structure was released. The current resources for the latest time series data are:

    • time_series_covid19_confirmed_global.csv
    • time_series_covid19_deaths_global.csv
    • time_series_covid19_recovered_global.csv

    ---DEPRECATION WARNING---
    The resources below ceased being updated on 22/03/2020 and were removed on 26/03/2020:

    • time_series_19-covid-Confirmed.csv
    • time_series_19-covid-Deaths.csv
    • time_series_19-covid-Recovered.csv
  3. i

    Coronavirus (COVID-19) Tweets Dataset

    • ieee-dataport.org
    • search.datacite.org
    • +1more
    Updated Oct 26, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rabindra Lamsal (2020). Coronavirus (COVID-19) Tweets Dataset [Dataset]. http://doi.org/10.21227/781w-ef42
    Explore at:
    Dataset updated
    Oct 26, 2020
    Dataset provided by
    IEEE Dataport
    Authors
    Rabindra Lamsal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset (COV19Tweets) includes CSV files that contain IDs and sentiment scores of the tweets related to the COVID-19 pandemic. The real-time Twitter feed is monitored for coronavirus-related tweets using 90+ different keywords and hashtags that are commonly used while referencing the pandemic. The oldest tweets in this dataset date back to October 01, 2019. This dataset has been wholly re-designed on March 20, 2020, to comply with the content redistribution policy set by Twitter. Twitter's policy restricts the sharing of Twitter data other than IDs; therefore, only the tweet IDs are released through this dataset. You need to hydrate the tweet IDs in order to get complete data. For detailed instructions on the hydration of tweet IDs, please read this article.Announcements: We release CrisisTransformers (https://huggingface.co/crisistransformers), a family of pre-trained language models and sentence encoders introduced in the paper "CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts". The models were trained based on the RoBERTa pre-training procedure on a massive corpus of over 15 billion word tokens sourced from tweets associated with 30+ crisis events such as disease outbreaks, natural disasters, conflicts, etc. CrisisTransformers were evaluated on 18 public crisis-specific datasets against strong baselines such as BERT, RoBERTa, BERTweet, etc. Our pre-trained models outperform the baselines across all 18 datasets in classification tasks, and our best-performing sentence-encoder outperforms the state-of-the-art by more than 17% in sentence encoding tasks. Please refer to the associated paper for more details.MegaGeoCOV Extended — an extended version of MegaGeoCOV has been released. The dataset is introduced in the paper "A Twitter narrative of the COVID-19 pandemic in Australia".We have released BillionCOV — a billion-scale COVID-19 tweets dataset for efficient hydration. Hydration takes time due to limits placed by Twitter on its tweet lookup endpoint. We re-hydrated the tweets present in this dataset (COV19Tweets) and found that more than 500 million tweet identifiers point to either deleted or protected tweets. If we avoid hydrating those tweet identifiers alone, it saves almost two months in a single hydration task. BillionCOV will receive quarterly updates, while this dataset (COV19Tweets) will continue to receive updates every day. Learn more about BillionCOV on its page: https://dx.doi.org/10.21227/871g-yp65. Related publications:Rabindra Lamsal. (2021). Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence, 51(5), 2790-2804.Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Socially Enhanced Situation Awareness from Microblogs using Artificial Intelligence: A Survey. ACM Computing Surveys, 55(4), 1-38. (arXiv)Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Twitter conversations predict the daily confirmed COVID-19 cases. Applied Soft Computing, 129, 109603. (arXiv)Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Addressing the location A/B problem on Twitter: the next generation location inference research. In 2022 ACM SIGSPATIAL LocalRec (pp. 1-4).Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Where did you tweet from? Inferring the origin locations of tweets based on contextual information. In 2022 IEEE International Conference on Big Data (pp. 3935-3944). (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2023). BillionCOV: An Enriched Billion-scale Collection of COVID-19 tweets for Efficient Hydration. Data in Brief, 48, 109229. (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2023). A Twitter narrative of the COVID-19 pandemic in Australia. In 20th International ISCRAM Conference (pp. 353-370). (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2024). CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts. Knowledge-Based Systems, 296, 111916. (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2024). Semantically Enriched Cross-Lingual Sentence Embeddings for Crisis-related Social Media Texts. In 21st International ISCRAM Conference (in press). (arXiv)An Open access Billion-scale COVID-19 Tweets Dataset (COV19Tweets)— Dataset name: COV19Tweets Dataset— Number of tweets : 2,263,729,117 tweets— Coverage : Global— Language : English (EN)— Dataset usage terms : By using this dataset, you agree to (i) use the content of this dataset and the data generated from the content of this dataset for non-commercial research only, (ii) remain in compliance with Twitter's Policy and (iii) cite the following paper:Lamsal, R. (2021). Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence, 51, 2790-2804. https://doi.org/10.1007/s10489-020-02029-zBibTeX entry:@article{lamsal2021design, title={Design and analysis of a large-scale COVID-19 tweets dataset}, author={Lamsal, Rabindra}, journal={Applied Intelligence}, volume={51}, number={5}, pages={2790--2804}, year={2021}, publisher={Springer} }— Geo-tagged Version: Coronavirus (COVID-19) Geo-tagged Tweets Dataset (GeoCOV19Tweets Dataset)— Dataset updates : Everyday— Active keywords and hashtags (archive: keywords.tsv) : corona, #corona, coronavirus, #coronavirus, covid, #covid, covid19, #covid19, covid-19, #covid-19, sarscov2, #sarscov2, sars cov2, sars cov 2, covid_19, #covid_19, #ncov, ncov, #ncov2019, ncov2019, 2019-ncov, #2019-ncov, pandemic, #pandemic #2019ncov, 2019ncov, quarantine, #quarantine, flatten the curve, flattening the curve, #flatteningthecurve, #flattenthecurve, hand sanitizer, #handsanitizer, #lockdown, lockdown, social distancing, #socialdistancing, work from home, #workfromhome, working from home, #workingfromhome, ppe, n95, #ppe, #n95, #covidiots, covidiots, herd immunity, #herdimmunity, pneumonia, #pneumonia, chinese virus, #chinesevirus, wuhan virus, #wuhanvirus, kung flu, #kungflu, wearamask, #wearamask, wear a mask, vaccine, vaccines, #vaccine, #vaccines, corona vaccine, corona vaccines, #coronavaccine, #coronavaccines, face shield, #faceshield, face shields, #faceshields, health worker, #healthworker, health workers, #healthworkers, #stayhomestaysafe, #coronaupdate, #frontlineheroes, #coronawarriors, #homeschool, #homeschooling, #hometasking, #masks4all, #wfh, wash ur hands, wash your hands, #washurhands, #washyourhands, #stayathome, #stayhome, #selfisolating, self isolating Important Notes:> Dataset files are published in chronological order.> Twitter's content redistribution policy restricts the sharing of tweet information other than tweet IDs and/or user IDs. Twitter wants researchers to always pull fresh data. It is because a user might delete a tweet or make his/her profile protected.> Retweets are excluded in the files corona_tweets_chi.csv and earlier.> Only the tweet IDs are available (sentiment scores are not available) for the tweets present in the files: corona_tweets_11b.csv, corona_tweets_223.csv, corona_tweets_297.csv, corona_tweets_395.csv and the files containing tweets from before March 20, 2020.> March 29, 2020 04:02 PM - March 30, 2020 02:00 PM -- Some technical fault has occurred. Preventive measures have been taken. Tweets for this session won't be available. [update: the tweets for this session are now available in the corona_tweets_11b.csv file; retweets are excluded though]> Please go through the Dataset Files section for specific notes.> There's a Combined_Files section (at the bottom of the dataset files list) if you want to download dataset files in bulk.> The naming convention for the later added CSVs (tweets from before March 20, 2020) will have a greek alphabet name instead of a numeric counter. I'll start with the last greek alphabet name "omega" and proceed up towards "alpha".> If you want access to tweets older than October 01, 2019, feel free to reach out to me at rlamsal [at] student.unimelb.edu.au using your academic/research institution email.Dataset Files (GMT+5:45)--------- tweets from before March 20, 2020 ---------corona_tweets_theta.csv: 418,625 tweets (October 01, 2019 12:00 AM - October 18, 2019, 07:51 AM)corona_tweets_iota.csv: 1,000,000 tweets (October 18, 2019, 07:51 AM - December 01, 2019 01:25 AM)corona_tweets_kappa.csv: 1,000,000 tweets (December 01, 2019 01:25 AM - January 09, 2020, 10:20 PM)corona_tweets_lambda.csv: 1,000,000 tweets (January 09, 2020, 10:20 PM - January 26, 2020, 05:14 PM)corona_tweets_mu.csv: 1,000,000 tweets (January 26, 2020, 05:14 PM - January 31, 2020, 07:18 AM)corona_tweets_nu.csv: 1,000,000 tweets (January 31, 2020, 07:18 AM - February 05, 2020 03:38 PM)corona_tweets_xi.csv: 4,003,032 tweets (February 05, 2020 03:38 PM - February 28, 2020 04:27 AM)corona_tweets_omicron.csv: 3,000,000 tweets (February 28, 2020 04:27 AM - March 04, 2020 03:36 PM)corona_tweets_pi.csv: 3,000,000 tweets (March 04, 2020 03:36 PM - March 09, 2020 07:58 AM)corona_tweets_rho.csv: 3,990,232 tweets (March 09, 2020 07:58 AM - March 12, 2020 12:01 PM)corona_tweets_sigma.csv: 3,000,000 tweets (March 12, 2020 12:01 PM - March 13, 2020 07:13 PM)corona_tweets_tau.csv: 3,000,000 tweets (March 13, 2020 07:13 PM - March 15, 2020 04:03 AM)corona_tweets_upsilon.csv: 3,999,408 tweets (March 15, 2020 04:03 AM - March 17, 2020 03:25 AM)corona_tweets_phi.csv: 3,000,000 tweets (March 17, 2020 03:25 AM - March 18, 2020 06:51 AM)corona_tweets_chi.csv: 3,000,000 tweets (March 18, 2020 06:51 AM - March 19, 2020 10:57 AM)corona_tweets_psi.csv: 3,878,586 tweets (March 19, 2020 10:57 AM - March 19, 2020 08:04 PM)corona_tweets_omega.csv: 4,000,000 tweets (March 19, 2020 08:04 PM - March 20, 2020 01:37 AM)----------------------------------corona_tweets_01.csv + corona_tweets_02.csv + corona_tweets_03.csv: 2,475,980 tweets (March 20, 2020 01:37 AM - March 21, 2020 09:25 AM)corona_tweets_04.csv: 1,233,340

  4. g

    Coronavirus COVID-19 Global Cases by the Center for Systems Science and...

    • github.com
    • systems.jhu.edu
    • +1more
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE), Coronavirus COVID-19 Global Cases by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU) [Dataset]. https://github.com/CSSEGISandData/COVID-19
    Explore at:
    Dataset provided by
    Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE)
    Area covered
    Global
    Description

    2019 Novel Coronavirus COVID-19 (2019-nCoV) Visual Dashboard and Map:
    https://www.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6

    • Confirmed Cases by Country/Region/Sovereignty
    • Confirmed Cases by Province/State/Dependency
    • Deaths
    • Recovered

    Downloadable data:
    https://github.com/CSSEGISandData/COVID-19

    Additional Information about the Visual Dashboard:
    https://systems.jhu.edu/research/public-health/ncov

  5. S

    Coronavirus COVID-19 Global Cases

    • redivis.com
    application/jsonl +7
    Updated Jul 13, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2020). Coronavirus COVID-19 Global Cases [Dataset]. http://doi.org/10.57761/pyf5-4e40
    Explore at:
    application/jsonl, parquet, csv, stata, avro, spss, sas, arrowAvailable download formats
    Dataset updated
    Jul 13, 2020
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford Center for Population Health Sciences
    Time period covered
    Jan 22, 2020 - Jul 12, 2020
    Description

    Abstract

    JHU Coronavirus COVID-19 Global Cases, by country

    Documentation

    PHS is updating the Coronavirus Global Cases dataset weekly, Monday, Wednesday and Friday from Cloud Marketplace.

    This data comes from the data repository for the 2019 Novel Coronavirus Visual Dashboard operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). This database was created in response to the Coronavirus public health emergency to track reported cases in real-time. The data include the location and number of confirmed COVID-19 cases, deaths, and recoveries for all affected countries, aggregated at the appropriate province or state. It was developed to enable researchers, public health authorities and the general public to track the outbreak as it unfolds. Additional information is available in the blog post.

    Visual Dashboard (desktop): https://www.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6

    Section 2

    Included Data Sources are:

    %3C!-- --%3E

    Section 3

    **Terms of Use: **

    This GitHub repo and its contents herein, including all data, mapping, and analysis, copyright 2020 Johns Hopkins University, all rights reserved, is provided to the public strictly for educational and academic research purposes. The Website relies upon publicly available data from multiple sources, that do not always agree. The Johns Hopkins University hereby disclaims any and all representations and warranties with respect to the Website, including accuracy, fitness for use, and merchantability. Reliance on the Website for medical guidance or use of the Website in commerce is strictly prohibited.

    Section 4

    **U.S. county-level characteristics relevant to COVID-19 **

    Chin, Kahn, Krieger, Buckee, Balsari and Kiang (forthcoming) show that counties differ significantly in biological, demographic and socioeconomic factors that are associated with COVID-19 vulnerability. A range of publicly available county-specific data identifying these key factors, guided by international experiences and consideration of epidemiological parameters of importance, have been combined by the authors and are available for use:

    https://github.com/mkiang/county_preparedness/

  6. d

    Fighting Coronavirus/COVID-19 with Public Health Data

    • catalog.data.gov
    • data.tempe.gov
    • +1more
    Updated Mar 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Tempe (2023). Fighting Coronavirus/COVID-19 with Public Health Data [Dataset]. https://catalog.data.gov/dataset/fighting-coronavirus-covid-19-with-public-health-data-1cf8a
    Explore at:
    Dataset updated
    Mar 18, 2023
    Dataset provided by
    City of Tempe
    Description

    This story map explores the partnership between the City of Tempe and Arizona State University to study city wastewater for Coronavirus/COVID-19. Featured sections include:What is Coronavirus/COVID-19Analyzing Wastewater DataData-Driven Decision MakingWhat You Can DoFrequently Asked Questions Important ContactsPlease also see the Spanish language version.

  7. e

    Data from: Coronavirus COVID-19 Cases

    • coronavirus-resources.esri.com
    • hub.arcgis.com
    • +1more
    Updated Feb 6, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CSSE_covid19 (2020). Coronavirus COVID-19 Cases [Dataset]. https://coronavirus-resources.esri.com/maps/bbb2e4f589ba40d692fab712ae37b9ac
    Explore at:
    Dataset updated
    Feb 6, 2020
    Dataset authored and provided by
    CSSE_covid19
    Area covered
    Pacific Ocean, North Pacific Ocean
    Description

    On March 10, 2023, the Johns Hopkins Coronavirus Resource Center ceased collecting and reporting of global COVID-19 data. For updated cases, deaths, and vaccine data please visit the following sources:Global: World Health Organization (WHO)U.S.: U.S. Centers for Disease Control and Prevention (CDC)For more information, visit the Johns Hopkins Coronavirus Resource Center.This feature layer contains the most up-to-date COVID-19 cases and the latest trend plot. It covers the US (county or state level), China, Canada, Australia (province/state level), and the rest of the world (country/region level, represented by either the country centroids or their capitals). Data sources are WHO, CDC, ECDC, NHC, DXY, 1point3acres, Worldometers.info, BNO, the COVID Tracking Project (testing and hospitalizations), state and national government health departments, and local media reports. This layer is created and maintained by the Center for Systems Science and Engineering (CSSE) at the Johns Hopkins University. This feature layer is supported by Esri Living Atlas team, JHU APL and JHU Data Services. This layer is opened to the public and free to share. Contact us.

  8. d

    Data from: Novel Coronavirus 2019

    • datahub.io
    Updated Mar 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Novel Coronavirus 2019 [Dataset]. https://datahub.io/core/covid-19
    Explore at:
    Dataset updated
    Mar 23, 2020
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Description

    Coronavirus disease 2019 (COVID-19) time series listing confirmed cases, reported deaths and reported recoveries. Data is disaggregated by country (and sometimes subregion). Coronavirus disease (COV...

  9. o

    Novel Coronavirus (COVID-19) Cases Data - Dataset - Data Catalog Armenia

    • data.opendata.am
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Novel Coronavirus (COVID-19) Cases Data - Dataset - Data Catalog Armenia [Dataset]. https://data.opendata.am/dataset/novel-coronavirus-covid-19-cases-data
    Explore at:
    Dataset updated
    May 31, 2023
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Novel Corona Virus (COVID-19) epidemiological data since 22 January 2020. The data is compiled by the Johns Hopkins University Center for Systems Science and Engineering (JHU CCSE) from various sources including the World Health Organization (WHO), DXY.cn, BNO News, National Health Commission of the People’s Republic of China (NHC), China CDC (CCDC), Hong Kong Department of Health, Macau Government, Taiwan CDC, US CDC, Government of Canada, Australia Government Department of Health, European Centre for Disease Prevention and Control (ECDC), Ministry of Health Singapore (MOH), and others. JHU has Stopped collecting data as of 10 March 2023.

  10. i

    Coronavirus (COVID-19) Geo-tagged Tweets Dataset

    • ieee-dataport.org
    Updated Jan 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rabindra Lamsal (2023). Coronavirus (COVID-19) Geo-tagged Tweets Dataset [Dataset]. http://doi.org/10.21227/fpsb-jz61
    Explore at:
    Dataset updated
    Jan 22, 2023
    Dataset provided by
    IEEE Dataport
    Authors
    Rabindra Lamsal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset (GeoCOV19Tweets) contains IDs and sentiment scores of geo-tagged tweets related to the COVID-19 pandemic. The real-time Twitter feed is monitored for coronavirus-related tweets using 90+ different keywords and hashtags that are commonly used while referencing the pandemic. Complying with Twitter's content redistribution policy, only the tweet IDs are shared. The tweet IDs in this dataset belong to the tweets created providing an exact location. You can reconstruct the dataset by hydrating these IDs. For detailed instructions on the hydration of tweet IDs, please read this article.Announcements: We release CrisisTransformers (https://huggingface.co/crisistransformers), a family of pre-trained language models and sentence encoders introduced in the paper "CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts". The models were trained based on the RoBERTa pre-training procedure on a massive corpus of over 15 billion word tokens sourced from tweets associated with 30+ crisis events such as disease outbreaks, natural disasters, conflicts, etc. CrisisTransformers were evaluated on 18 public crisis-specific datasets against strong baselines such as BERT, RoBERTa, BERTweet, etc. Our pre-trained models outperform the baselines across all 18 datasets in classification tasks, and our best-performing sentence-encoder outperforms the state-of-the-art by more than 17% in sentence encoding tasks. Please refer to the associated paper for more details.MegaGeoCOV Extended — an extended version of MegaGeoCOV has been released. The dataset is introduced in the paper "A Twitter narrative of the COVID-19 pandemic in Australia".We have released BillionCOV — a billion-scale COVID-19 tweets dataset for efficient hydration. Hydration takes time due to limits placed by Twitter on its tweet lookup endpoint. We re-hydrated the tweets present in COV19Tweets and found that more than 500 million tweet identifiers point to either deleted or protected tweets. If we avoid hydrating those tweet identifiers alone, it saves almost two months in a single hydration task. BillionCOV will receive quarterly updates, while COV19Tweets will continue to receive updates every day. Learn more about BillionCOV on its page: https://dx.doi.org/10.21227/871g-yp65We also release a million-scale COVID-19-specific geotagged tweets dataset — MegaGeoCOV (on GitHub). The dataset is introduced in the paper "Twitter conversations predict the daily confirmed COVID-19 cases". Related publications:Rabindra Lamsal. (2021). Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence, 51(5), 2790-2804.Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Socially Enhanced Situation Awareness from Microblogs using Artificial Intelligence: A Survey. ACM Computing Surveys, 55(4), 1-38. (arXiv)Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Twitter conversations predict the daily confirmed COVID-19 cases. Applied Soft Computing, 129, 109603. (arXiv)Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Addressing the location A/B problem on Twitter: the next generation location inference research. In 2022 ACM SIGSPATIAL LocalRec (pp. 1-4).Rabindra Lamsal, Aaron Harwood, Maria Rodriguez Read. (2022). Where did you tweet from? Inferring the origin locations of tweets based on contextual information. In 2022 IEEE International Conference on Big Data (pp. 3935-3944). (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2023). BillionCOV: An Enriched Billion-scale Collection of COVID-19 tweets for Efficient Hydration. Data in Brief, 48, 109229. (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2023). A Twitter narrative of the COVID-19 pandemic in Australia. In 20th International ISCRAM Conference (pp. 353-370). (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2024). CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts. Knowledge-Based Systems, 296, 111916. (arXiv)Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera. (2024). Semantically Enriched Cross-Lingual Sentence Embeddings for Crisis-related Social Media Texts. In 21st International ISCRAM Conference (in press). (arXiv)Below is a quick overview of this dataset.— Dataset name: GeoCOV19Tweets Dataset— Number of tweets : 502,067 tweets— Coverage : Global— Language : English (EN)— Dataset usage terms : By using this dataset, you agree to (i) use the content of this dataset and the data generated from the content of this dataset for non-commercial research only, (ii) remain in compliance with Twitter's Policy and (iii) cite the following paper:Lamsal, R. (2021). Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence, 51, 2790-2804. https://doi.org/10.1007/s10489-020-02029-zBibTeX entry:@article{lamsal2021design, title={Design and analysis of a large-scale COVID-19 tweets dataset}, author={Lamsal, Rabindra}, journal={Applied Intelligence}, volume={51}, number={5}, pages={2790--2804}, year={2021}, publisher={Springer} }— Primary dataset : Coronavirus (COVID-19) Tweets Dataset (COV19Tweets Dataset)— Dataset updates : Everyday— Keywords and hashtags: keywords.tsvPlease visit this page (primary dataset) for more details.Collection date & Number of tweets(2020) March 20 - March 21: 1290 tweets(2020) March 21 - March 22: 1020 tweets(2020) March 22 - March 23: 1069 tweets(2020) March 23 - March 24: 1072 tweets(2020) March 24 - March 25: 949 tweets(2020) March 25 - March 26: 913 tweets(2020) March 26 - March 27: 810 tweets(2020) March 27 - March 28: 855 tweets(2020) March 28 - March 29: 828 tweets(2020) March 29 - March 30: 5318 tweets (this file was added on June 29, 2021; its primary file corona_tweets_11b.csv was created while excluding retweets right at the API level; compared to other days the geo-tagged tweets are significantly higher for this day; Reason: Twitter's full-search endpoint was asked to create a corpus while excluding retweets; retweets have NULL geo and place objects, and since they were excluded I was able to come up with 5318 geo-tagged tweets out of 1,677,362 tweets collected for this day; this was quite an interesting observation to note)(2020) March 30 - March 31: 538 tweets(2020) March 31 - April 1: 636 tweets(2020) April 1 - April 2: 608 tweets(2020) April 2 - April 3: 661 tweets(2020) April 3 - April 4: 592 tweets(2020) April 4 - April 5: 661 tweets(2020) April 5 - April 6: 709 tweets(2020) April 6 - April 7: 549 tweets(2020) April 7 - April 8: 593 tweets(2020) April 8 - April 9: 491 tweets(2020) April 9 - April 10: 507 tweets(2020) April 10 - April 11: 534 tweets(2020) April 11 - April 12: 539 tweets(2020) April 12- April 13: 543 tweets(2020) April 13 - April 14: 510 tweets(2020) April 14 - April 15: 387 tweets(2020) April 15 - April 16: 321 tweets(2020) April 16 - April 17: 443 tweets(2020) April 17 - April 18: 373 tweets(2020) April 18 - April 19: 1020 tweets(2020) April 19 - April 20: 884 tweets(2020) April 20 - April 21: 869 tweets(2020) April 21 - April 22: 878 tweets(2020) April 22 - April 23: 831 tweets(2020) April 23 - April 24: 818 tweets(2020) April 24 - April 25: 747 tweets(2020) April 25- April 26: 693 tweets(2020) April 26 - April 27: 939 tweets(2020) April 27 - April 28: 744 tweets(2020) April 28 - April 29: 1408 tweets(2020) April 29 - April 30: 1751 tweets(2020) April 30 - May 1: 1637 tweets(2020) May 1 - May 2: 1866 tweets(2020) May 2 - May 3: 1839 tweets(2020) May 3 - May 4: 1566 tweets(2020) May 4 - May 5: 1615 tweets(2020) May 5 - May 6: 1635 tweets(2020) May 6 - May 7: 1571 tweets(2020) May 7 - May 8: 1621 tweets(2020) May 8 - May 9: 1684 tweets(2020) May 9 - May 10: 1474 tweets(2020) May 10 - May 11: 1130 tweets(2020) May 11 - May 12: 1281 tweets(2020) May 12- May 13: 1630 tweets(2020) May 13 - May 14: 1480 tweets(2020) May 14 - May 15: 1652 tweets(2020) May 15 - May 16: 1583 tweets(2020) May 16 - May 17: 1487 tweets(2020) May 17 - May 18: 1341 tweets(2020) May 18 - May 19: 1398 tweets(2020) May 19 - May 20: 1389 tweets(2020) May 20 - May 21: 1397 tweets(2020) May 21 - May 22: 1562 tweets(2020) May 22 - May 23: 1558 tweets(2020) May 23 - May 24: 1299 tweets(2020) May 24 - May 25: 1297 tweets(2020) May 25- May 26: 1190 tweets(2020) May 26 - May 27: 1184 tweets(2020) May 27 - May 28: 1257 tweets(2020) May 28 - May 29: 1277 tweets(2020) May 29 - May 30: 1202 tweets(2020) May 30 - May 31: 1209 tweets(2020) May 31 - June 1: 1080 tweets(2020) June 1 - June 2: 1233 tweets(2020) June 2 - June 3: 917 tweets(2020) June 3 - June 4: 1055 tweets(2020) June 4 - June 5: 1117 tweets(2020) June 5 - June 6: 1184 tweets(2020) June 6 - June 7: 1093 tweets(2020) June 7 - June 8: 1054 tweets(2020) June 8 - June 9: 1180 tweets(2020) June 9 - June 10: 1155 tweets(2020) June 10 - June 11: 1131 tweets(2020) June 11 - June 12: 1148 tweets(2020) June 12- June 13: 1189 tweets(2020) June 13 - June 14: 1045 tweets(2020) June 14 - June 15: 1024 tweets(2020) June 15 - June 16: 1663 tweets(2020) June 16 - June 17: 1692 tweets(2020) June 17 - June 18: 1634 tweets(2020) June 18 - June 19: 1610 tweets(2020) June 19 - June 20: 1698 tweets(2020) June 20 - June 21: 1613 tweets(2020) June 21 - June 22: 1419 tweets(2020) June 22 - June 23: 1524 tweets(2020) June 23 - June 24: 1431 tweets(2020) June 24 - June 25: 1454 tweets(2020) June 25- June 26: 1539 tweets(2020) June 26 - June 27: 1403 tweets(2020) June 27 - June 28: 1766 tweets(2020) June 28 - June 29: 1405 tweets(2020) June 29 - June 30: 1534 tweets(2020) June 30 - June 31: 1519 tweets(2020) July 1 - July 2: 1841 tweets(2020) July 2 - July 3: 1434 tweets(2020) July 3 - July 4: 1475 tweets(2020) July 4 - July 5: 2028 tweets(2020) July 5 - July 6: 1491 tweets(2020) July 6 - July 7: 1275 tweets(2020) July 7 - July 8: 1336 tweets(2020) July 8 - July 9: 1428 tweets(2020) July 9 - July 10: 1831

  11. Data from: coronavirus-covid-19-cases

    • kaggle.com
    zip
    Updated Aug 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vmahawar (2024). coronavirus-covid-19-cases [Dataset]. https://www.kaggle.com/datasets/vmahawar/coronavirus-covid-19-cases
    Explore at:
    zip(13268503 bytes)Available download formats
    Dataset updated
    Aug 13, 2024
    Authors
    vmahawar
    Description

    Dataset

    This dataset was created by vmahawar

    Released under Other (specified in description)

    Contents

  12. o

    COVID-19 Pandemic - USA counties

    • public.opendatasoft.com
    • public.aws-ec2-eu-1.opendatasoft.com
    • +1more
    csv, excel, geojson +1
    Updated Feb 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). COVID-19 Pandemic - USA counties [Dataset]. https://public.opendatasoft.com/explore/dataset/coronavirus-covid-19-pandemic-usa-counties/
    Explore at:
    excel, geojson, csv, jsonAvailable download formats
    Dataset updated
    Feb 17, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    This is the USA counties data extracted from the 2019 Coronavirus data hub operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). Also, Supported by ESRI Living Atlas Team and the Johns Hopkins University Applied Physics Lab (JHU APL).Sources:1Point3Arces: https://coronavirus.1point3acres.com/enUS CDC: https://www.cdc.gov/coronavirus/2019-ncov/index.html Enrichmentthe official FIPS codes are available and should be used for joins or geojoins needs.Terms of Use:This data set is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) by the Johns Hopkins University on behalf of its Center for Systems Science in Engineering. Copyright Johns Hopkins University 2020.Attribute the data as the "COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University" or "JHU CSSE COVID-19 Data" for short, and the url: https://github.com/CSSEGISandData/COVID-19.For publications that use the data, please cite the following publication: "Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Inf Dis. 20(5):533-534. doi: 10.1016/S1473-3099(20)30120-1"

  13. COVID-19 cases worldwide as of May 2, 2023, by country or territory

    • statista.com
    • innovatkmarketing.store
    Updated Aug 29, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2023). COVID-19 cases worldwide as of May 2, 2023, by country or territory [Dataset]. https://www.statista.com/statistics/1043366/novel-coronavirus-2019ncov-cases-worldwide-by-country/
    Explore at:
    Dataset updated
    Aug 29, 2023
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    As of May 2, 2023, the outbreak of the coronavirus disease (COVID-19) had been confirmed in almost every country in the world. The virus had infected over 687 million people worldwide, and the number of deaths had reached almost 6.87 million. The most severely affected countries include the U.S., India, and Brazil.

    COVID-19: background information COVID-19 is a novel coronavirus that had not previously been identified in humans. The first case was detected in the Hubei province of China at the end of December 2019. The virus is highly transmissible and coughing and sneezing are the most common forms of transmission, which is similar to the outbreak of the SARS coronavirus that began in 2002 and was thought to have spread via cough and sneeze droplets expelled into the air by infected persons.

    Naming the coronavirus disease Coronaviruses are a group of viruses that can be transmitted between animals and people, causing illnesses that may range from the common cold to more severe respiratory syndromes. In February 2020, the International Committee on Taxonomy of Viruses and the World Health Organization announced official names for both the virus and the disease it causes: SARS-CoV-2 and COVID-19, respectively. The name of the disease is derived from the words corona, virus, and disease, while the number 19 represents the year that it emerged.

  14. Coronavirus (COVID-19) Infection Survey: technical data

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Mar 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2023). Coronavirus (COVID-19) Infection Survey: technical data [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/datasets/covid19infectionsurveytechnicaldata
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 10, 2023
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    Technical and methodological data from the Coronavirus (COVID-19) Infection Survey, England, Wales, Northern Ireland and Scotland.

  15. Total number of U.S. COVID-19 cases and deaths April 26, 2023

    • statista.com
    Updated May 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Total number of U.S. COVID-19 cases and deaths April 26, 2023 [Dataset]. https://www.statista.com/statistics/1101932/coronavirus-covid19-cases-and-deaths-number-us-americans/
    Explore at:
    Dataset updated
    May 15, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    As of April 26, 2023, the number of both confirmed and presumptive positive cases of the COVID-19 disease reported in the United States had reached over 104 million with over 1.1 million deaths reported among these cases.

    Coronavirus deaths by age in the U.S. Daily new cases of COVID-19 hit record highs in the United States at the beginning of 2022. Underlying health conditions can worsen cases of coronavirus, and case fatality rates among confirmed COVID-19 patients increase with age. The highest number of deaths from COVID-19 have been among those aged 85 years and older, with this age group accounting for over 300 thousand deaths.

    Where has this coronavirus come from? Coronaviruses are a large group of viruses transmitted between animals and people that cause illnesses ranging from the common cold to more severe diseases. The novel coronavirus that is currently infecting humans was already circulating among certain animal species. The first human case of this new coronavirus strain was reported in China at the end of December 2019. The coronavirus was named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and its associated disease is known as COVID-19.

  16. Coronavirus daily data

    • data.world
    csv, zip
    Updated Jan 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mark Di Marco (2025). Coronavirus daily data [Dataset]. https://data.world/markmarkoh/coronavirus-data
    Explore at:
    csv, zipAvailable download formats
    Dataset updated
    Jan 9, 2025
    Dataset provided by
    data.world, Inc.
    Authors
    Mark Di Marco
    Time period covered
    Jan 22, 2020 - Mar 9, 2023
    Description

    Originally sourced from https://ourworldindata.org/coronavirus-source-data

    Synced daily

    Update 12/04/2020

    The data sources have been updated to use JHU data:

    From OWID:

    ​> On 30 November 2020, we changed our source for confirmed cases and deaths to the COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University. Our previous source for confirmed cases and deaths, the European Centre for Disease Prevention and Control (ECDC), had announced in November 2020 that it would switch from a daily to a weekly reporting schedule from December. Our World in Data therefore had to transition away from the ECDC as a source to continue to provide daily updates of confirmed cases and deaths. The data last sourced from the ECDC remains available as an archive in the ecdc folder. The format (variable names and types) of our complete COVID-19 dataset remains the same.

  17. COVID-19 dataset by Our World in Data

    • kaggle.com
    Updated Sep 20, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    YuryBolkonsky (2020). COVID-19 dataset by Our World in Data [Dataset]. https://www.kaggle.com/datasets/bolkonsky/covid19
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 20, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    YuryBolkonsky
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data on COVID-19 (coronavirus) by Our World in Data

    Our complete COVID-19 dataset is a collection of the COVID-19 data maintained by Our World in Data. It is updated daily and includes data on confirmed cases, deaths, and testing, as well as other variables of potential interest.

    🗂️ Download our complete COVID-19 dataset : CSV | XLSX | JSON

    We will continue to publish up-to-date data on confirmed cases, deaths, and testing, throughout the duration of the COVID-19 pandemic.

    Our data sources

    • Confirmed cases and deaths: our data comes from the European Centre for Disease Prevention and Control (ECDC). We discuss how and when the ECDC collects and publishes this data here. The cases & deaths dataset is updated daily. *Note: the number of cases or deaths reported by any institution—including the ECDC, the WHO, Johns Hopkins and others—on a given day does not necessarily represent the actual number on that date. This is because of the long reporting chain that exists between a new case/death and its inclusion in statistics. This also means that negative values in cases and deaths can sometimes appear when a country sends a correction to the ECDC, because it had previously overestimated the number of cases/deaths. Alternatively, large changes can sometimes (although rarely) be made to a country's entire time series if the ECDC decides (and has access to the necessary data) to correct values retrospectively.*
    • Testing for COVID-19: this data is collected by the Our World in Data team from official reports; you can find further details in our post on COVID-19 testing, including our checklist of questions to understand testing data, information on geographical and temporal coverage, and detailed country-by-country source information. The testing dataset is updated around twice a week.
    • Other variables: this data is collected from a variety of sources (United Nations, World Bank, Global Burden of Disease, Blavatnik School of Government, etc.). More information is available in our codebook.

    The complete Our World in Data COVID-19 dataset

    Our complete COVID-19 dataset is available in CSV, XLSX, and JSON formats, and includes all of our historical data on the pandemic up to the date of publication.

    The CSV and XLSX files follow a format of 1 row per location and date. The JSON version is split by country ISO code, with static variables and an array of daily records.

    The variables represent all of our main data related to confirmed cases, deaths, and testing, as well as other variables of potential interest.

    As of 10 September 2020, the columns are: iso_code, continent, location, date, total_cases, new_cases, new_cases_smoothed, total_deaths, new_deaths, new_deaths_smoothed, total_cases_per_million, new_cases_per_million, new_cases_smoothed_per_million, total_deaths_per_million, new_deaths_per_million, new_deaths_smoothed_per_million, total_tests, new_tests, new_tests_smoothed, total_tests_per_thousand, new_tests_per_thousand, new_tests_smoothed_per_thousand, tests_per_case, positive_rate, tests_units, stringency_index, population, population_density, median_age, aged_65_older, aged_70_older, gdp_per_capita, extreme_poverty, cardiovasc_death_rate, diabetes_prevalence, female_smokers, male_smokers, handwashing_facilities, hospital_beds_per_thousand, life_expectancy, human_development_index

    A full codebook is made available, with a description and source for each variable in the dataset.

    Additional files and information

    If you are interested in the individual files that make up the complete dataset, or more detailed information, other files can be found in the subfolders:

    • ecdc: data from the European Centre for Disease Prevention and Control, related to confirmed cases and deaths;
    • testing: data from various official sources, related to COVID-19 tests performed in each country. This folder contains two files with more detailed information:
    • who: data from the World Health Organization, related to confirmed cases and deaths—_we have stopped using and updating this data since 18 March 2020_.

    Changelog

    • Up until 17 March 2020, we were using WHO data manually extracted from their daily situation report PDFs.
    • From 19 March 2020, we started relying on data published by the European CDC. We wrote about why we decided to switch sources.
    • On 3 April 2020, we added country-level time series on COVID-19 tests.
    • On 16 April 2020, we made available a complete dataset of all of our main variables related to confirmed cases, deaths, and tests.
    • On 25 April 2020, we added rows for "World" and "International" to our complete dataset. The iso_code column for "International" is blank, and for "World" we use OWID_WRL.
    • On 9 May 2020, we added new variables related to demographic, economic, and public health data to our complete dataset.
    • On 19 May 2020, we added 2 variables related to testing: new_tests_smoothed and new_tests_smoothed_per_thousand. To generate them we assume that testing changed equally on a daily basis over any periods in which no data was reported (as not all countries report testing data on a daily basis). This produces a complete series of daily figures, which is then averaged over a rolling 7-day window.
    • On 23 May 2020, we added a JSON version of our complete dataset.
    • On 4 June 2020, we added a continent column to our complete dataset.
    • On 1 July 2020, we changed the format of the JSON version of our complete dataset to normalize the data and reduce file size.
    • On 4 August 2020, we added the positive_rate and tests_per_case columns to our complete dataset.
    • On 7 August 2020, we transformed our markdown codebook to a CSV file to allow easier merging with the complete dataset.
    • On 17 August 2020, we added 4 variables related to cases and deaths: new_cases_smoothed, new_deaths_smoothed, new_cases_smoothed_per_million, and new_deaths_smoothed_per_million. These metrics are averaged versions (over a rolling 7-day window) of the daily variables.
    • On 10 September 2020, we added the human_development_index column to our complete dataset.

    Data alterations

    • We standardize names of countries and regions. Since the names of countries and regions are different in different data sources, we standardize all names to the Our World in Data standard entity names.
    • We may correct or discard inconsistencies that we detect in the original data.
    • Testing data is collected from many different sources. A detailed documentation for each country is available in our post on COVID-19 testing.
    • Where we collect multiple time series for a given country in our testing data (for example: for the United States, we collect data from both the CDC, and the COVID Tracking Project), our complete COVID-19 dataset only includes the most complete, or, if equally complete, data on the number of people tested rather than the number of tests/samples/swabs processed. The list of 'secondary' test series (those removed) is located in scripts/input/owid/secondary_testing_series.csv.

    Stable URLs

    The /public path of this repository is hosted at https://covid.ourworldindata.org/. For example, you can access the CSV for the complete dataset at https://covid.ourworldindata.org/data/owid-covid-data.csv.

    We have the goal to keep all stable URLs working, even when we have to restructure this repository. If you need regular updates, please consider using

  18. Cumulative cases of COVID-19 in the U.S. from Jan. 20, 2020 - Nov. 11, 2022,...

    • statista.com
    Updated Nov 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2022). Cumulative cases of COVID-19 in the U.S. from Jan. 20, 2020 - Nov. 11, 2022, by week [Dataset]. https://www.statista.com/statistics/1103185/cumulative-coronavirus-covid19-cases-number-us-by-day/
    Explore at:
    Dataset updated
    Nov 17, 2022
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jan 20, 2020 - Nov 11, 2022
    Area covered
    United States
    Description

    As of November 11, 2022, almost 96.8 million confirmed cases of COVID-19 had been reported by the World Health Organization (WHO) for the United States. The pandemic has impacted all 50 states, with vast numbers of cases recorded in California, Texas, and Florida.

    The coronavirus in the U.S. The coronavirus hit the United States in mid-March 2020, and cases started to soar at an alarming rate. The country has performed a high number of COVID-19 tests, which is a necessary step to manage the outbreak, but new coronavirus cases in the U.S. have spiked several times since the pandemic began, most notably at the end of 2022. However, restrictions in many states have been eased as new cases have declined.

    The origin of the coronavirus In December 2019, officials in Wuhan, China, were the first to report cases of pneumonia with an unknown cause. A new human coronavirus – SARS-CoV-2 – has since been discovered, and COVID-19 is the infectious disease it causes. All available evidence to date suggests that COVID-19 is a zoonotic disease, which means it can spread from animals to humans. The WHO says transmission is likely to have happened through an animal that is handled by humans. Researchers do not support the theory that the virus was developed in a laboratory.

  19. Weekly United States COVID-19 Cases and Deaths by State - ARCHIVED

    • data.cdc.gov
    • data.virginia.gov
    • +1more
    application/rdfxml +5
    Updated Jun 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CDC COVID-19 Response (2023). Weekly United States COVID-19 Cases and Deaths by State - ARCHIVED [Dataset]. https://data.cdc.gov/Case-Surveillance/Weekly-United-States-COVID-19-Cases-and-Deaths-by-/pwn4-m3yp
    Explore at:
    csv, application/rdfxml, xml, tsv, json, application/rssxmlAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Authors
    CDC COVID-19 Response
    License

    https://www.usa.gov/government-workshttps://www.usa.gov/government-works

    Area covered
    United States
    Description

    Reporting of new Aggregate Case and Death Count data was discontinued May 11, 2023, with the expiration of the COVID-19 public health emergency declaration. This dataset will receive a final update on June 1, 2023, to reconcile historical data through May 10, 2023, and will remain publicly available.

    Aggregate Data Collection Process Since the start of the COVID-19 pandemic, data have been gathered through a robust process with the following steps:

    • A CDC data team reviews and validates the information obtained from jurisdictions’ state and local websites via an overnight data review process.
    • If more than one official county data source exists, CDC uses a comprehensive data selection process comparing each official county data source, and takes the highest case and death counts respectively, unless otherwise specified by the state.
    • CDC compiles these data and posts the finalized information on COVID Data Tracker.
    • County level data is aggregated to obtain state and territory specific totals.
    This process is collaborative, with CDC and jurisdictions working together to ensure the accuracy of COVID-19 case and death numbers. County counts provide the most up-to-date numbers on cases and deaths by report date. CDC may retrospectively update counts to correct data quality issues.

    Methodology Changes Several differences exist between the current, weekly-updated dataset and the archived version:

    • Source: The current Weekly-Updated Version is based on county-level aggregate count data, while the Archived Version is based on State-level aggregate count data.
    • Confirmed/Probable Cases/Death breakdown:  While the probable cases and deaths are included in the total case and total death counts in both versions (if applicable), they were reported separately from the confirmed cases and deaths by jurisdiction in the Archived Version.  In the current Weekly-Updated Version, the counts by jurisdiction are not reported by confirmed or probable status (See Confirmed and Probable Counts section for more detail).
    • Time Series Frequency: The current Weekly-Updated Version contains weekly time series data (i.e., one record per week per jurisdiction), while the Archived Version contains daily time series data (i.e., one record per day per jurisdiction).
    • Update Frequency: The current Weekly-Updated Version is updated weekly, while the Archived Version was updated twice daily up to October 20, 2022.
    Important note: The counts reflected during a given time period in this dataset may not match the counts reflected for the same time period in the archived dataset noted above. Discrepancies may exist due to differences between county and state COVID-19 case surveillance and reconciliation efforts.

    Confirmed and Probable Counts In this dataset, counts by jurisdiction are not displayed by confirmed or probable status. Instead, confirmed and probable cases and deaths are included in the Total Cases and Total Deaths columns, when available. Not all jurisdictions report probable cases and deaths to CDC.* Confirmed and probable case definition criteria are described here:

    Council of State and Territorial Epidemiologists (ymaws.com).

    Deaths CDC reports death data on other sections of the website: CDC COVID Data Tracker: Home, CDC COVID Data Tracker: Cases, Deaths, and Testing, and NCHS Provisional Death Counts. Information presented on the COVID Data Tracker pages is based on the same source (total case counts) as the present dataset; however, NCHS Death Counts are based on death certificates that use information reported by physicians, medical examiners, or coroners in the cause-of-death section of each certificate. Data from each of these pages are considered provisional (not complete and pending verification) and are therefore subject to change. Counts from previous weeks are continually revised as more records are received and processed.

    Number of Jurisdictions Reporting There are currently 60 public health jurisdictions reporting cases of COVID-19. This includes the 50 states, the District of Columbia, New York City, the U.S. territories of American Samoa, Guam, the Commonwealth of the Northern Mariana Islands, Puerto Rico, and the U.S Virgin Islands as well as three independent countries in compacts of free association with the United States, Federated States of Micronesia, Republic of the Marshall Islands, and Republic of Palau. New York State’s reported case and death counts do not include New York City’s counts as they separately report nationally notifiable conditions to CDC.

    CDC COVID-19 data are available to the public as summary or aggregate count files, including total counts of cases and deaths, available by state and by county. These and other data on COVID-19 are available from multiple public locations, such as:

    https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/cases-in-us.html

    https://www.cdc.gov/covid-data-tracker/index.html

    https://www.cdc.gov/coronavirus/2019-ncov/covid-data/covidview/index.html

    https://www.cdc.gov/coronavirus/2019-ncov/php/open-america/surveillance-data-analytics.html

    Additional COVID-19 public use datasets, include line-level (patient-level) data, are available at: https://data.cdc.gov/browse?tags=covid-19.

    Archived Data Notes:

    November 3, 2022: Due to a reporting cadence issue, case rates for Missouri counties are calculated based on 11 days’ worth of case count data in the Weekly United States COVID-19 Cases and Deaths by State data released on November 3, 2022, instead of the customary 7 days’ worth of data.

    November 10, 2022: Due to a reporting cadence change, case rates for Alabama counties are calculated based on 13 days’ worth of case count data in the Weekly United States COVID-19 Cases and Deaths by State data released on November 10, 2022, instead of the customary 7 days’ worth of data.

    November 10, 2022: Per the request of the jurisdiction, cases and deaths among non-residents have been removed from all Hawaii county totals throughout the entire time series. Cumulative case and death counts reported by CDC will no longer match Hawaii’s COVID-19 Dashboard, which still includes non-resident cases and deaths. 

    November 17, 2022: Two new columns, weekly historic cases and weekly historic deaths, were added to this dataset on November 17, 2022. These columns reflect case and death counts that were reported that week but were historical in nature and not reflective of the current burden within the jurisdiction. These historical cases and deaths are not included in the new weekly case and new weekly death columns; however, they are reflected in the cumulative totals provided for each jurisdiction. These data are used to account for artificial increases in case and death totals due to batched reporting of historical data.

    December 1, 2022: Due to cadence changes over the Thanksgiving holiday, case rates for all Ohio counties are reported as 0 in the data released on December 1, 2022.

    January 5, 2023: Due to North Carolina’s holiday reporting cadence, aggregate case and death data will contain 14 days’ worth of data instead of the customary 7 days. As a result, case and death metrics will appear higher than expected in the January 5, 2023, weekly release.

    January 12, 2023: Due to data processing delays, Mississippi’s aggregate case and death data will be reported as 0. As a result, case and death metrics will appear lower than expected in the January 12, 2023, weekly release.

    January 19, 2023: Due to a reporting cadence issue, Mississippi’s aggregate case and death data will be calculated based on 14 days’ worth of data instead of the customary 7 days in the January 19, 2023, weekly release.

    January 26, 2023: Due to a reporting backlog of historic COVID-19 cases, case rates for two Michigan counties (Livingston and Washtenaw) were higher than expected in the January 19, 2023 weekly release.

    January 26, 2023: Due to a backlog of historic COVID-19 cases being reported this week, aggregate case and death counts in Charlotte County and Sarasota County, Florida, will appear higher than expected in the January 26, 2023 weekly release.

    January 26, 2023: Due to data processing delays, Mississippi’s aggregate case and death data will be reported as 0 in the weekly release posted on January 26, 2023.

    February 2, 2023: As of the data collection deadline, CDC observed an abnormally large increase in aggregate COVID-19 cases and deaths reported for Washington State. In response, totals for new cases and new deaths released on February 2, 2023, have been displayed as zero at the state level until the issue is addressed with state officials. CDC is working with state officials to address the issue.

    February 2, 2023: Due to a decrease reported in cumulative case counts by Wyoming, case rates will be reported as 0 in the February 2, 2023, weekly release. CDC is working with state officials to verify the data submitted.

    February 16, 2023: Due to data processing delays, Utah’s aggregate case and death data will be reported as 0 in the weekly release posted on February 16, 2023. As a result, case and death metrics will appear lower than expected and should be interpreted with caution.

    February 16, 2023: Due to a reporting cadence change, Maine’s

  20. T

    World Coronavirus COVID-19 Cases

    • tradingeconomics.com
    csv, excel, json, xml
    Updated Mar 9, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2020). World Coronavirus COVID-19 Cases [Dataset]. https://tradingeconomics.com/world/coronavirus-cases
    Explore at:
    csv, excel, xml, jsonAvailable download formats
    Dataset updated
    Mar 9, 2020
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 4, 2020 - May 17, 2023
    Area covered
    World
    Description

    The World Health Organization reported 766440796 Coronavirus Cases since the epidemic began. In addition, countries reported 6932591 Coronavirus Deaths. This dataset provides - World Coronavirus Cases- actual values, historical data, forecast, chart, statistics, economic calendar and news.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
New York Times, Coronavirus (Covid-19) Data in the United States [Dataset]. https://github.com/nytimes/covid-19-data

Coronavirus (Covid-19) Data in the United States

Explore at:
csvAvailable download formats
Dataset provided by
New York Times
License

https://github.com/nytimes/covid-19-data/blob/master/LICENSEhttps://github.com/nytimes/covid-19-data/blob/master/LICENSE

Description

The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.

Since the first reported coronavirus case in Washington State on Jan. 21, 2020, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.

We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.

The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.