Facebook
TwitterThe New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.
Since late January, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.
We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.
The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
From World Health Organization - On 31 December 2019, WHO was alerted to several cases of pneumonia in Wuhan City, Hubei Province of China. The virus did not match any other known virus. This raised concern because when a virus is new, we do not know how it affects people.
So daily level information on the affected people can give some interesting insights when it is made available to the broader data science community.
Johns Hopkins University has made an excellent dashboard using the affected cases data. Data is extracted from the google sheets associated and made available here.
Now data is available as csv files in the Johns Hopkins Github repository. Please refer to the github repository for the Terms of Use details. Uploading it here for using it in Kaggle kernels and getting insights from the broader DS community.
2019 Novel Coronavirus (2019-nCoV) is a virus (more specifically, a coronavirus) identified as the cause of an outbreak of respiratory illness first detected in Wuhan, China. Early on, many of the patients in the outbreak in Wuhan, China reportedly had some link to a large seafood and animal market, suggesting animal-to-person spread. However, a growing number of patients reportedly have not had exposure to animal markets, indicating person-to-person spread is occurring. At this time, it’s unclear how easily or sustainably this virus is spreading between people - CDC
This dataset has daily level information on the number of affected cases, deaths and recovery from 2019 novel coronavirus. Please note that this is a time series data and so the number of cases on any given day is the cumulative number.
The data is available from 22 Jan, 2020.
Here’s a polished version suitable for a professional Kaggle dataset description:
This dataset contains time-series and case-level records of the COVID-19 pandemic. The primary file is covid_19_data.csv, with supporting files for earlier records and individual-level line list data.
This is the primary dataset and contains aggregated COVID-19 statistics by location and date.
This file contains earlier COVID-19 records. It is no longer updated and is provided only for historical reference. For current analysis, please use covid_19_data.csv.
This file provides individual-level case information, obtained from an open data source. It includes patient demographics, travel history, and case outcomes.
Another individual-level case dataset, also obtained from public sources, with detailed patient-level information useful for micro-level epidemiological analysis.
✅ Use covid_19_data.csv for up-to-date aggregated global trends.
✅ Use the line list datasets for detailed, individual-level case analysis.
If you are interested in knowing country level data, please refer to the following Kaggle datasets:
India - https://www.kaggle.com/sudalairajkumar/covid19-in-india
South Korea - https://www.kaggle.com/kimjihoo/coronavirusdataset
Italy - https://www.kaggle.com/sudalairajkumar/covid19-in-italy
Brazil - https://www.kaggle.com/unanimad/corona-virus-brazil
USA - https://www.kaggle.com/sudalairajkumar/covid19-in-usa
Switzerland - https://www.kaggle.com/daenuprobst/covid19-cases-switzerland
Indonesia - https://www.kaggle.com/ardisragen/indonesia-coronavirus-cases
Johns Hopkins University for making the data available for educational and academic research purposes
MoBS lab - https://www.mobs-lab.org/2019ncov.html
World Health Organization (WHO): https://www.who.int/
DXY.cn. Pneumonia. 2020. http://3g.dxy.cn/newh5/view/pneumonia.
BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/
National Health Commission of the People’s Republic of China (NHC): http://www.nhc.gov.cn/xcs/yqtb/list_gzbd.shtml
China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm
Hong Kong Department of Health: https://www.chp.gov.hk/en/features/102465.html
Macau Government: https://www.ssm.gov.mo/portal/
Taiwan CDC: https://sites.google....
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the data for the 2019 Novel Coronavirus Visual Dashboard operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). Also, Supported by ESRI Living Atlas Team and the Johns Hopkins University Applied Physics Lab (JHU APL).Data SourcesWorld Health Organization (WHO): https://www.who.int/ DXY.cn. Pneumonia. 2020. http://3g.dxy.cn/newh5/view/pneumonia. BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/ National Health Commission of the People’s Republic of China (NHC): http://www.nhc.gov.cn/xcs/yqtb/list_gzbd.shtml China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm Hong Kong Department of Health: https://www.chp.gov.hk/en/features/102465.html Macau Government: https://www.ssm.gov.mo/portal/ Taiwan CDC: https://sites.google.com/cdc.gov.tw/2019ncov/taiwan?authuser=0 US CDC: https://www.cdc.gov/coronavirus/2019-ncov/index.html Government of Canada: https://www.canada.ca/en/public-health/services/diseases/coronavirus.html Australia Government Department of Health: https://www.health.gov.au/news/coronavirus-update-at-a-glance European Centre for Disease Prevention and Control (ECDC): https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-casesMinistry of Health Singapore (MOH): https://www.moh.gov.sg/covid-19Italy Ministry of Health: http://www.salute.gov.it/nuovocoronavirus
Facebook
TwitterDataset aims to facilitate a state by state comparison of potential risk factors that may heighten Covid 19 transmission rates or deaths. It includes state by state estimates of: covid 19 positives/deaths, flu/pneumonia deaths, major city population densities, available hospital resources, high risk health condition prevalance, population over 60, means of work transportation rates, housing characteristics (ie number of large apartment complexes/seniors living alone), and industry information.
The Data Includes:
1) Covid 19 Outcome Stats:
Covid_Death : Covid Deaths by State
Covid_Positive : Covid Positive Tests by State
2) US Major City Population Density by State: CBSA_Major_City_max_weighted_density
3) KFF Estimates of Total Hospital Beds by State:
Kaiser_Total_Hospital_Beds
4) 2018 Season Flu and Pneumonia Death Stats:
FLUVIEW_TOTAL_PNEUMONIA_DEATHS_Season_2018
FLUVIEW_TOTAL_INFLUENZA_DEATHS_Season_2018
5)US Total Rates of Flu Hospitalization by Underlying Condition:
Fluview_US_FLU_Hospitalization_Rate_....
6) State by State BRFSS Prevalance Rates of Conditions Associated with Higher Flu Hospitalization Rates
BRFSS_Diabetes_Prevalance
BRFSS_Asthma_Prevalance
BRFSS_COPD_Prevalance
BRFSS_Obesity BMI Prevalance
BRFSS_Other_Cancer_Prevalance
BRFSS_Kidney_Disease_Prevalance
BRFSS_Obesity BMI Prevalance
BRFSS_2017_High_Cholestoral_Prevalance
BRFSS_2017_High_Blood_Pressure_Prevalance
Census_Population_Over_60
7)State by state breakdown of Means of Work Transpotation:
COMMUTE_Census_Worker_Public_Transportation_Rate
8) State by state breakdown of Housing Characteristics
9) State by State breakdown of Industry Information
Links to data sources:
https://worldpopulationreview.com/states/
https://covidtracking.com/data/
https://gis.cdc.gov/GRASP/Fluview/FluHospRates.html https://www.kff.org/health-costs/issue-brief/state-data-and-policy-actions-to-address-coronavirus/#stateleveldata
Census Tables: ACSST1Y2018.S1811 ACSST1Y2018.S0102 ACSST1Y2018.S2403 ACSST1Y2018.S2501 ACSST1Y2018.S2504
https://www.census.gov/library/visualizations/2012/dec/c2010sr-01-density.html
https://gis.cdc.gov/grasp/fluview/mortality.html
I hope to show the existence of correlations that warrant a deeper county by county analysis to identify areas of increased risk requiring increased resource allocation or increased attention to preventative measures.
Facebook
TwitterAs global communities responded to COVID-19, we heard from public health officials that the same type of aggregated, anonymized insights we use in products such as Google Maps would be helpful as they made critical decisions to combat COVID-19. These Community Mobility Reports aimed to provide insights into what changed in response to policies aimed at combating COVID-19. The reports charted movement trends over time by geography, across different categories of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential.
Facebook
TwitterIn collaboration with the Public Health Agency of Canada (PHAC), this table provides Canadians and researchers with data to monitor only the confirmed cases of coronavirus (COVID-19) in Canada. This table will provide an aggregate summary of the data available in the publication 13-26-0003.
Facebook
TwitterNotice: For data on COVID-19 in the United States, please see https://www.cdc.gov/coronavirus/2019-ncov/cases-in-us.html. Notice: Data from California published in week 29 for years 2019 and 2020 were incomplete when originally published on July 24, 2020. On August 4, 2020, incomplete case counts were replaced with a "U" indicating case counts are not available for specified time period. NNDSS - TABLE 1FF. Severe acute respiratory syndrome-associated coronavirus disease to Shigellosis – 2020. In this Table, provisional cases* of notifiable diseases are displayed for United States, U.S. territories, and Non-U.S. residents. Note: This table contains provisional cases of national notifiable diseases from the National Notifiable Diseases Surveillance System (NNDSS). NNDSS data from the 50 states, New York City, the District of Columbia and the U.S. territories are collated and published weekly on the NNDSS Data and Statistics web page (https://wwwn.cdc.gov/nndss/data-and-statistics.html). Cases reported by state health departments to CDC for weekly publication are provisional because of the time needed to complete case follow-up. Therefore, numbers presented in later weeks may reflect changes made to these counts as additional information becomes available. The national surveillance case definitions used to define a case are available on the NNDSS web site at https://wwwn.cdc.gov/nndss/. Information about the weekly provisional data and guides to interpreting data are available at: https://wwwn.cdc.gov/nndss/infectious-tables.html. Footnotes: U: Unavailable — The reporting jurisdiction was unable to send the data to CDC or CDC was unable to process the data. -: No reported cases — The reporting jurisdiction did not submit any cases to CDC. N: Not reportable — The disease or condition was not reportable by law, statute, or regulation in the reporting jurisdiction. NN: Not nationally notifiable — This condition was not designated as being nationally notifiable. NP: Nationally notifiable but not published. NC: Not calculated — There is insufficient data available to support the calculation of this statistic. Cum: Cumulative year-to-date counts. Max: Maximum — Maximum case count during the previous 52 weeks. * Case counts for reporting years 2019 and 2020 are provisional and subject to change. Cases are assigned to the reporting jurisdiction submitting the case to NNDSS, if the case's country of usual residence is the U.S., a U.S. territory, unknown, or null (i.e. country not reported); otherwise, the case is assigned to the 'Non-U.S. Residents' category. Country of usual residence is currently not reported by all jurisdictions or for all conditions. For further information on interpretation of these data, see https://wwwn.cdc.gov/nndss/document/Users_guide_WONDER_tables_cleared_final.pdf. †Previous 52 week maximum and cumulative YTD are determined from periods of time when the condition was reportable in the jurisdiction (i.e., may be less than 52 weeks of data or incomplete YTD data).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dashboard is updated each Friday.
Laboratory surveillance data: California laboratories report SARS-CoV-2 test results to CDPH through electronic laboratory reporting. Los Angeles County SARS-CoV-2 lab data has a 7-day reporting lag. Test positivity is calculated using SARS-CoV-2 lab tests that has a specimen collection date reported during a given week. Specimens for testing are collected from patients in healthcare settings and do not reflect all testing for COVID-19 in California. Test positivity for a given week is calculated by dividing the number of positive COVID-19 results by the total number of specimens tested for that virus. Weekly laboratory surveillance data are defined as Sunday through Saturday.
Hospitalization data: Data on COVID-19 and influenza hospital admissions are from Centers for Disease Control and Prevention’s (CDC) National Healthcare Safety Network (NHSN) Hospitalization dataset. The requirement to report COVID-19-associated hospitalizations was effective November 1, 2024. CDPH pulls NHSN data from the CDC on the Wednesday prior to the publication of the report. Results may differ depending on which day data are pulled. Admission rates are calculated using population estimates from the P-3: Complete State and County Projections Dataset (https://dof.ca.gov/forecasting/demographics/projections/) provided by the State of California Department of Finance. Reported weekly admission rates for the entire season use the population estimates for the year the season started. For more information on NHSN data including the protocol and data collection information, see the CDC NHSN webpage (https://www.cdc.gov/nhsn/index.html). Weekly hospitalization data are defined as Sunday through Saturday.
Death certificate data: CDPH receives weekly year-to-date dynamic data on deaths occurring in California from the CDPH Center for Health Statistics and Informatics. These data are limited to deaths occurring among California residents and are analyzed to identify COVID-19-coded deaths. These deaths are not necessarily laboratory-confirmed and are an underestimate of all COVID-19-associated deaths in California. Weekly death data are defined as Sunday through Saturday.
Facebook
TwitterEffective June 28, 2023, this dataset will no longer be updated. Similar data are accessible from CDC WONDER (https://wonder.cdc.gov/mcd-icd10-provisional.html) Provisional count of deaths involving COVID-19 by county of occurrence, in the United States, 2020-2023.
Facebook
TwitterOfficial statistics are produced impartially and free from political influence.
Facebook
TwitterOfficial statistics are produced impartially and free from political influence.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
On March 2, 2022 DC Health announced the District’s new COVID-19 Community Level key metrics and reporting. COVID-19 cases are now reported on a weekly basis. The data in this table includes overall COVID-19 statistics for the District of Columbia hospitals. The number of hospital beds and ventilators available. Due to rapidly changing nature of COVID-19, data for March 2020 is limited.General Guidelines for Interpreting Disease Surveillance Data during a disease outbreak, the health department will collect, process, and analyze large amounts of information to understand and respond to the health impacts of the disease and its transmission in the community. The sources of disease surveillance information include contact tracing, medical record review, and laboratory information, and are considered protected health information. When interpreting the results of these analyses, it is important to keep in mind that the disease surveillance system may not capture the full picture of the outbreak, and that previously reported data may change over time as it undergoes data quality review or as additional information is added. These analyses, especially within populations with small samples, may be subject to large amounts of variation from day to day. Despite these limitations, data from disease surveillance is a valuable source of information to understand how to stop the spread of COVID19.
Facebook
TwitterOfficial statistics are produced impartially and free from political influence.
Facebook
TwitterOfficial statistics are produced impartially and free from political influence.
Facebook
TwitterOfficial statistics are produced impartially and free from political influence.
Facebook
TwitterNNDSS - TABLE 1FF. Severe acute respiratory syndrome-associated coronavirus disease to Shigellosis – 2022. In this Table, provisional cases* of notifiable diseases are displayed for United States, U.S. territories, and Non-U.S. residents. Notes: • These are weekly cases of selected infectious national notifiable diseases, from the National Notifiable Diseases Surveillance System (NNDSS). NNDSS data reported by the 50 states, New York City, the District of Columbia, and the U.S. territories are collated and published weekly as numbered tables available at https://www.cdc.gov/nndss/data-statistics/index.html. Cases reported by state health departments to CDC for weekly publication are subject to ongoing revision of information and delayed reporting. Therefore, numbers listed in later weeks may reflect changes made to these counts as additional information becomes available. Case counts in the tables are presented as published each week. See also Guide to Interpreting Provisional and Finalized NNDSS Data at https://www.cdc.gov/nndss/docs/Readers-Guide-WONDER-Tables-20210421-508.pdf. • Notices, errata, and other notes are available in the Notice To Data Users page at https://wonder.cdc.gov/nndss/NTR.html. • The list of national notifiable infectious diseases and conditions and their national surveillance case definitions are available at https://ndc.services.cdc.gov/. This list incorporates the Council of State and Territorial Epidemiologists (CSTE) position statements approved by CSTE for national surveillance. Footnotes: *Case counts for reporting years 2021 and 2022 are provisional and subject to change. Cases are assigned to the reporting jurisdiction submitting the case to NNDSS, if the case's country of usual residence is the U.S., a U.S. territory, unknown, or null (i.e. country not reported); otherwise, the case is assigned to the 'Non-U.S. Residents' category. Country of usual residence is currently not reported by all jurisdictions or for all conditions. For further information on interpretation of these data, see https://www.cdc.gov/nndss/docs/Readers-Guide-WONDER-Tables-20210421-508.pdf. †Previous 52 week maximum and cumulative YTD are determined from periods of time when the condition was reportable in the jurisdiction (i.e., may be less than 52 weeks of data or incomplete YTD data). U: Unavailable — The reporting jurisdiction was unable to send the data to CDC or CDC was unable to process the data. -: No reported cases — The reporting jurisdiction did not submit any cases to CDC. N: Not reportable — The disease or condition was not reportable by law, statute, or regulation in the reporting jurisdiction. NN: Not nationally notifiable — This condition was not designated as being nationally notifiable. NP: Nationally notifiable but not published. NC: Not calculated — There is insufficient data available to support the calculation of this statistic. Cum: Cumulative year-to-date counts. Max: Maximum — Maximum case count during the previous 52 weeks.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘NNDSS - TABLE 1FF. Severe acute respiratory syndrome-associated coronavirus disease to Shigellosis’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/029d9907-e060-4893-8b07-3bdef9bfee29 on 29 August 2021.
--- Dataset description provided by original source is as follows ---
NNDSS - TABLE 1FF. Severe acute respiratory syndrome-associated coronavirus disease to Shigellosis – 2021. In this Table, provisional cases* of notifiable diseases are displayed for United States, U.S. territories, and Non-U.S. residents.
Notice: Due to data processing issues at CDC, data for the following jurisdictions may be incomplete for week 7: Alaska, Arizona, California, Connecticut, Delaware, Florida, Hawaii, Louisiana, Maryland, Michigan, Missouri, North Dakota, New Hampshire, New York City, Oregon, Pennsylvania, and Rhode Island.
Note: This table contains provisional cases of national notifiable diseases from the National Notifiable Diseases Surveillance System (NNDSS). NNDSS data from the 50 states, New York City, the District of Columbia and the U.S. territories are collated and published weekly on the NNDSS Data and Statistics web page (https://wwwn.cdc.gov/nndss/data-and-statistics.html). Cases reported by state health departments to CDC for weekly publication are provisional because of the time needed to complete case follow-up. Therefore, numbers presented in later weeks may reflect changes made to these counts as additional information becomes available. The national surveillance case definitions used to define a case are available on the NNDSS web site at https://wwwn.cdc.gov/nndss/. Information about the weekly provisional data and guides to interpreting data are available at: https://wwwn.cdc.gov/nndss/infectious-tables.html.
Footnotes: U: Unavailable — The reporting jurisdiction was unable to send the data to CDC or CDC was unable to process the data. -: No reported cases — The reporting jurisdiction did not submit any cases to CDC. N: Not reportable — The disease or condition was not reportable by law, statute, or regulation in the reporting jurisdiction. NN: Not nationally notifiable — This condition was not designated as being nationally notifiable. NP: Nationally notifiable but not published. NC: Not calculated — There is insufficient data available to support the calculation of this statistic. Cum: Cumulative year-to-date counts. Max: Maximum — Maximum case count during the previous 52 weeks. * Case counts for reporting years 2020 and 2021 are provisional and subject to change. Cases are assigned to the reporting jurisdiction submitting the case to NNDSS, if the case's country of usual residence is the U.S., a U.S. territory, unknown, or null (i.e. country not reported); otherwise, the case is assigned to the 'Non-U.S. Residents' category. Country of usual residence is currently not reported by all jurisdictions or for all conditions. For further information on interpretation of these data, see https://wwwn.cdc.gov/nndss/document/Users_guide_WONDER_tables_cleared_final.pdf. †Previous 52 week maximum and cumulative YTD are determined from periods of time when the condition was reportable in the jurisdiction (i.e., may be less than 52 weeks of data or incomplete YTD data).
--- Original source retains full ownership of the source dataset ---
Facebook
TwitterThis is an Experimental Official Statistics publication produced by HM Revenue and Customs (HMRC) using HMRC’s Coronavirus Job Retention Scheme claims data.
This publication covers all Coronavirus Job Retention Scheme claims submitted by employers from the start of the scheme up to 30 September 2021. It includes statistics on the claims themselves and the jobs supported.
Data from HMRC’s Real Time Information (RTI) system has been matched with Coronavirus Job Retention Scheme data to produce analysis of claims by:
For more information on Experimental Statistics and governance of statistics produced by public bodies please see the https://uksa.statisticsauthority.gov.uk/about-the-authority/uk-statistical-system/types-of-official-statistics" class="govuk-link">UK Statistics Authority website.
Facebook
TwitterOfficial statistics are produced impartially and free from political influence.
Facebook
TwitterApproved funding or credit sources for businesses or organizations due to COVID-19, by North American Industry Classification System (NAICS), business employment size, type of business, business activity and majority ownership.
Facebook
TwitterThe New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.
Since late January, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.
We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.
The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.