Facebook
TwitterNew York has presented the most cases compared to all states across the U.S..There have also been critiques regarding how much more unnoticed impact the flu has caused. My dataset allows us to compare whether or not this is true according to the most recent data.
This COVID-19 data is from Kaggle whereas the New York influenza data comes from the U.S. government health data website. I merged the two datasets by county and FIPS code and listed the most recent reports of 2020 COVID-19 cases and deaths alongside the 2019 known influenza cases for comparison.
I am thankful to Kaggle and the U.S. government for making the data that made this possible openly available.
This data can be extended to answer the common misconceptions of the scale of the COVID-19 and common flu. My inspiration stems from supporting conclusions with data rather than simply intuition.
I would like my data to help answer how we can make U.S. citizens realize what diseases are most impactful.
Facebook
TwitterDeaths counts for influenza, pneumonia, and COVID-19 reported to NCHS by week ending date, by state and HHS region, and age group.
Facebook
Twitterhttps://www.usa.gov/government-workshttps://www.usa.gov/government-works
This dataset represents preliminary estimates of cumulative U.S. COVID-19 disease burden for the 2024-2025 period, including illnesses, outpatient visits, hospitalizations, and deaths. The weekly COVID-19-associated burden estimates are preliminary and based on continuously collected surveillance data from patients hospitalized with laboratory-confirmed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections. The data come from the Coronavirus Disease 2019 (COVID-19)-Associated Hospitalization Surveillance Network (COVID-NET), a surveillance platform that captures data from hospitals that serve about 10% of the U.S. population. Each week CDC estimates a range (i.e., lower estimate and an upper estimate) of COVID-19 -associated burden that have occurred since October 1, 2024.
Note: Data are preliminary and subject to change as more data become available. Rates for recent COVID-19-associated hospital admissions are subject to reporting delays; as new data are received each week, previous rates are updated accordingly.
References
Facebook
TwitterThis dataset is a per-state amalgamation of demographic, public health and other relevant predictors for COVID-19.
Used positive, death and totalTestResults from the API for, respectively, Infected, Deaths and Tested in this dataset.
Please read the documentation of the API for more context on those columns
Density is people per meter squared https://worldpopulationreview.com/states/
https://worldpopulationreview.com/states/gdp-by-state/
https://worldpopulationreview.com/states/per-capita-income-by-state/
https://en.wikipedia.org/wiki/List_of_U.S._states_by_Gini_coefficient
Rates from Feb 2020 and are percentage of labor force
https://www.bls.gov/web/laus/laumstrk.htm
Ratio is Male / Female
https://www.kff.org/other/state-indicator/distribution-by-gender/
https://worldpopulationreview.com/states/smoking-rates-by-state/
Death rate per 100,000 people
https://www.cdc.gov/nchs/pressroom/sosmap/flu_pneumonia_mortality/flu_pneumonia.htm
Death rate per 100,000 people
https://www.cdc.gov/nchs/pressroom/sosmap/lung_disease_mortality/lung_disease.htm
https://www.kff.org/other/state-indicator/total-active-physicians/
https://www.kff.org/other/state-indicator/total-hospitals
Includes spending for all health care services and products by state of residence. Hospital spending is included and reflects the total net revenue. Costs such as insurance, administration, research, and construction expenses are not included.
https://www.kff.org/other/state-indicator/avg-annual-growth-per-capita/
Pollution: Average exposure of the general public to particulate matter of 2.5 microns or less (PM2.5) measured in micrograms per cubic meter (3-year estimate)
https://www.americashealthrankings.org/explore/annual/measure/air/state/ALL
For each state, number of medium and large airports https://en.wikipedia.org/wiki/List_of_the_busiest_airports_in_the_United_States
Note that FL was incorrect in the table, but is corrected in the Hottest States paragraph
https://worldpopulationreview.com/states/average-temperatures-by-state/
District of Columbia temperature computed as the average of Maryland and Virginia
Urbanization as a percentage of the population https://www.icip.iastate.edu/tables/population/urban-pct-states
https://www.kff.org/other/state-indicator/distribution-by-age/
Schools that haven't closed are marked NaN https://www.edweek.org/ew/section/multimedia/map-coronavirus-and-school-closures.html
Note that some datasets above did not contain data for District of Columbia, this missing data was found via Google searches manually entered.
Facebook
TwitterData is from the California Department of Public Health (CDPH) Respiratory Virus Weekly Report.
The report is updated each Friday.
Laboratory surveillance data: California laboratories report SARS-CoV-2 test results to CDPH through electronic laboratory reporting. Los Angeles County SARS-CoV-2 lab data has a 7-day reporting lag. Test positivity is calculated using SARS-CoV-2 lab tests that has a specimen collection date reported during a given week.
Laboratory surveillance for influenza, respiratory syncytial virus (RSV), and other respiratory viruses (parainfluenza types 1-4, human metapneumovirus, non-SARS-CoV-2 coronaviruses, adenovirus, enterovirus/rhinovirus) involves the use of data from clinical sentinel laboratories (hospital, academic or private) located throughout California. Specimens for testing are collected from patients in healthcare settings and do not reflect all testing for influenza, respiratory syncytial virus, and other respiratory viruses in California. These laboratories report the number of laboratory-confirmed influenza, respiratory syncytial virus, and other respiratory virus detections and isolations, and the total number of specimens tested by virus type on a weekly basis.
Test positivity for a given week is calculated by dividing the number of positive COVID-19, influenza, RSV, or other respiratory virus results by the total number of specimens tested for that virus. Weekly laboratory surveillance data are defined as Sunday through Saturday.
Hospitalization data: Data on COVID-19 and influenza hospital admissions are from Centers for Disease Control and Prevention’s (CDC) National Healthcare Safety Network (NHSN) Hospitalization dataset. The requirement to report COVID-19 and influenza-associated hospitalizations was effective November 1, 2024. CDPH pulls NHSN data from the CDC on the Wednesday prior to the publication of the report. Results may differ depending on which day data are pulled. Admission rates are calculated using population estimates from the P-3: Complete State and County Projections Dataset provided by the State of California Department of Finance (https://dof.ca.gov/forecasting/demographics/projections/). Reported weekly admission rates for the entire season use the population estimates for the year the season started. For more information on NHSN data including the protocol and data collection information, see the CDC NHSN webpage (https://www.cdc.gov/nhsn/index.html).
CDPH collaborates with Northern California Kaiser Permanente (NCKP) to monitor trends in RSV admissions. The percentage of RSV admissions is calculated by dividing the number of RSV-related admissions by the total number of admissions during the same period. Admissions for pregnancy, labor and delivery, birth, and outpatient procedures are not included in total number of admissions. These admissions serve as a proxy for RSV activity and do not necessarily represent laboratory confirmed hospitalizations for RSV infections; NCKP members are not representative of all Californians.
Weekly hospitalization data are defined as Sunday through Saturday.
Death certificate data: CDPH receives weekly year-to-date dynamic data on deaths occurring in California from the CDPH Center for Health Statistics and Informatics. These data are limited to deaths occurring among California residents and are analyzed to identify influenza, respiratory syncytial virus, and COVID-19-coded deaths. These deaths are not necessarily laboratory-confirmed and are an underestimate of all influenza, respiratory syncytial virus, and COVID-19-associated deaths in California. Weekly death data are defined as Sunday through Saturday.
Wastewater data: This dataset represents statewide weekly SARS-CoV-2 wastewater summary values. SARS-CoV-2 wastewater concentrations from all sites in California are combined into a single, statewide, unit-less summary value for each week, using a method for data transformation and aggregation developed by the CDC National Wastewater Surveillance System (NWSS). Please see the CDC NWSS data methods page for a description of how these summary values are calculated. Weekly wastewater data are defined as Sunday through Saturday.
Facebook
TwitterEffective September 27, 2023, this dataset will no longer be updated. Similar data are accessible from wonder.cdc.gov. Deaths involving COVID-19, pneumonia, and influenza reported to NCHS by sex, age group, and jurisdiction of occurrence.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Why did I create this dataset? This is my first time creating a notebook in Kaggle and I am interested in learning more about COVID-19 and how different countries are affected by it and why. It might be useful to compare different metrics between different countries. And I also wanted to participate in a challenge, and I've decided to join the COVID-19 datasets challenge. While looking through the projects, I noticed https://www.kaggle.com/koryto/countryinfo and it inspired me to start this project.
My approach is to scour the Internet and Kaggle looking for country data that can potentially have an impact on how the COVID-19 pandemic spreads. In the end, I ended up with the following for each country:
See covid19_data - data_sources.csv for data source details.
Notebook: https://www.kaggle.com/bitsnpieces/covid19-data
Since I did not personally collect each datapoint, and because each datasource is different with different objectives, collected at different times, measured in different ways, any inferences from this dataset will need further investigation.
I want to acknowledge the authors of the datasets that made their data publicly available which has made this project possible. Banner image is by Brian.
I hope that the community finds this dataset useful. Feel free to recommend other datasets that you think will be useful / relevant! Thanks for looking.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset represents preliminary weekly hospital respiratory data and metrics aggregated to national and state/territory levels reported to CDC’s National Health Safety Network (NHSN) beginning August 2020. This dataset updates weekly on Wednesdays with preliminary data reported to NHSN for the previous reporting week (Sunday – Saturday).Data for reporting dates through April 30, 2024 represent data reported during a previous mandated reporting period as specified by the HHS Secretary. Data for reporting dates May 1, 2024 – October 31, 2024 represent voluntarily reported data in the absence of a mandate. Data for reporting dates beginning November 1, 2024 represent data reported during a current mandated reporting period. All data and metrics capturing information on respiratory syncytial virus (RSV) were voluntarily reported until November 1, 2024. All data included in this dataset represent aggregated counts, and include metrics capturing information specific to hospital capacity, occupancy, hospitalizations, and new hospital admissions with corresponding metrics indicating reporting coverage for a given reporting week. NHSN monitors national and local trends in healthcare system stress and capacity for all acute care and critical access hospitals in the United States.For more information on the reporting mandate per the Centers for Medicare and Medicaid Services (CMS) requirements, visit: Updates to the Condition of Participation (CoP) Requirements for Hospitals and Critical Access Hospitals (CAHs) To Report Acute Respiratory Illnesses.For more information regarding NHSN’s collection of these data, including full reporting guidance, visit: NHSN Hospital Respiratory Data.For data that is considered final for a given reporting week (Sunday – Saturday), and reflects that which is used in NHSN HRD dashboards for publication each Friday, visit: https://data.cdc.gov/Public-Health-Surveillance/Weekly-Hospital-Respiratory-Data-HRD-Metrics-by-Ju/ua7e-t2fy/about_data.CDC coordinates weekly forecasts of hospitalization admissions based on this data set. More information about flu forecasting can be found at About Flu Forecasting | FluSight | CDC, and information about COVID-19 forecasting and other modeling analyses for the Respiratory Virus Season are available at CFA's Insights for Respiratory Virus Season | CFA | CDC.Source: CDC National Healthcare Safety Network (NHSN).Data source description (updated November 15, 2024): As of October 9, 2024, Hospital Respiratory Data (HRD; formerly Respiratory Pathogen, Hospital Capacity, and Supply data or 'COVID-19 hospital data') are reported to HHS through CDC's National Healthcare Safety Network (NHSN) based on updated requirements from the Centers for Medicare and Medicaid Services (CMS). These data were voluntarily reported to NHSN May 1, 2024 until November 1, 2024, at which time CMS began requiring acute care and critical access hospitals to electronically report information via NHSN about COVID-19, influenza, and RSV, hospital bed census and capacity. Hospital bed capacity and occupancy data for all patients and for patients with COVID-19 or influenza for collection dates prior to May 1, 2024, represent data reported during a previously mandated reporting period as specified by the HHS Secretary, and data for collection dates May 1, 2024 – October 31, 2024 represent data reported voluntarily to NHSN. All RSV data through October 31, 2024 represent voluntarily reported data; as such, all voluntarily reported data included in this dataset represent reporting hospitals only for a given week and might not be complete or representative of all hospitals during the specified reporting periods.NHSN monitors national and local trends in healthcare system stress and capacity for all acute care and critical access hospitals in the United States. Data reported by hospitals to NHSN represent aggregated counts and include metrics capturing information specific to hospital capacity, occupancy, hospitalizations, and admissions. Find more information about reporting to NHSN: https://www.cdc.gov/nhsn/psc/hospital-respiratory-reporting.html.Data quality: This dataset represents preliminary weekly hospital respiratory data and metrics aggregated to national and state/territory levels reported to CDC’s National Health Safety Network (NHSN) beginning A
Facebook
TwitterASPR Treatments Locator Outpatient COVID-19 and influenza (flu) medications may be available at additional locations not listed on this dataset. The locations displayed have either self-attested they have inventory of COVID-19 or influenza antiviral medications within at least the last two months and/or reported participation in the Paxlovid Patient Assistance Program. All therapeutics identified in the locator not approved by the FDA must be used in alignment with the terms of the respective product’s Emergency Use Authorization. Visit the ASPR COVID-19 Treatments page for more information on all COVID-19 treatment options and CDC’s Flu Treatment page for more information on influenza treatment options. This dataset identifies sites that have commercially purchased inventory of COVID-19 and influenza treatments and, in some cases, may identify sites that have remaining, no-cost U.S. government distributed supply. Some sites may charge for services which may or may not be covered by insurance. Some sites may offer prescribing services for patients, including telehealth services. This dataset is intended for informational purposes only and does not serve as an endorsement or recommendation for use of any of the locations listed on the sites.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Age-standardised mortality rates for deaths involving coronavirus (COVID-19), non-COVID-19 deaths and all deaths by vaccination status, broken down by age group.
Facebook
TwitterA. SUMMARY This dataset includes weekly respiratory disease hospital admissions for Influenza, RSV, and COVID-19 into San Francisco hospitals. Columns in the dataset include a count and rate of hospital admissions per 100,000 people. The data are reported by week. B. HOW THE DATASET IS CREATED Hospital admission data is reported to the San Francisco Department of Public Health (SFDPH) from the United States Center for Disease Control’s (CDC) National Healthcare Safety Network (NHSN) program. San Francisco population estimates are pulled from a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2019-2023 5-year American Community Survey (ACS). C. UPDATE PROCESS The dataset is updated every Friday and includes data from the previous Sunday through Saturday. For example, the update on Friday, October 17th will include data through Saturday, October 11th. Data may change as more current information becomes available. D. HOW TO USE THIS DATASET Weekly data represent a count of confirmed admissions of Influenza, RSV, and COVID-19 patients to San Francisco hospitals by week. The admission rate per 100,000 is calculated by multiplying the count of admissions each week by 100,000 and dividing by the population estimate.
Facebook
TwitterThis dataset includes aggregated weekly respiratory virus laboratory data that the Chicago Department of Public Health (CDPH) uses to monitor influenza, COVID-19, respiratory syncytial virus (RSV), and other respiratory virus activity in Chicago. The data represents respiratory viral PCR tests performed by several hospital laboratories in Chicago as well as two commercial laboratories serving Chicago facilities. Data are voluntarily reported on a weekly basis and do not contain patient demographic or geographic information. The data reported represent both Chicago and non-Chicago residents tested by the reporting facility.
The respiratory viruses included are influenza, RSV, SARS-CoV-2, parainfluenza, rhinovirus/enterovirus, adenovirus, human metapneumovirus, and seasonal coronaviruses. Influenza laboratory data are available from 2010-2011 to present; for all other respiratory viruses data are available from 2019-2020 to present.
All data are provisional and subject to change. Information is updated as additional details are received. At any given time, this dataset reflects data currently known to CDPH. Numbers in this dataset may differ from other public sources.
Splitgraph serves as an HTTP API that lets you run SQL queries directly on this data to power Web applications. For example:
See the Splitgraph documentation for more information.
Facebook
TwitterThe Respiratory Virus Hospitalization Surveillance Network (RESP-NET) is a network that conducts, active, population-based surveillance for laboratory confirmed hospitalizations associated with Influenza, COVID-19, and RSV. The RESP-NET platforms have overlapping surveillance areas and use similar methods to collect data. Hospitalization rates show how many people in the surveillance area are hospitalized with influenza, COVID-19, and RSV compared to the total number of people residing in that area.
Data will be updated weekly. Data are preliminary and subject to change as more data become available.
Facebook
TwitterNSSP Emergency Department (ED) Visit Trajectories by State and Sub-State Regions- COVID-19, Flu, RSV, Combined. This dataset provides the percentage of emergency department patient visits for the specified pathogen of all ED patient visits for the specified geographic part of the country that were observed for the given week from data submitted to the National Syndromic Surveillance Program (NSSP). In addition, the trend over time is characterized as increasing, decreasing or no change, with exceptions for when there are no data available, the data are too sparse, or there are not enough data to compute a trend. These data are to provide awareness of how the weekly trend is changing for the given geographic region.
Note that the reported sub-state trends are from Health Service Areas (HSA) and the data reported from the health care facilities located within the given HSA. Health Service Areas are regions of one or more counties that align to patterns of care seeking. The HSA level data are reported for each county in the HSA.
More information on HSAs is available here.
For the emergency department time series, trajectory classifications reported on for sub-state (HSA) emergency department time series, trajectory classifications are based on approximations of the first derivative (slope) of trends that are smoothed using generalized additive models (GAMs). To determine time intervals in which the slope is sufficiently changing (i.e., rate of change distinguishable from 0), 95% confidence intervals for the slope approximations are calculated and assessed. Weeks with a 95% confidence interval not containing 0 are classified as increasing if the slope estimate is positive and decreasing if the slope estimate is negative. Weeks with a 95% confidence interval containing 0 are classified as stable. In the scenario that an HSA's time series is determined to be too sparse (i.e., many weeks with percentages of 0%), a model is not fit, and the HSA is classified as “sparse”.
For additional information, please see: Companion Guide: NSSP Emergency Department Data on Respiratory Illness
Updated once per week on Fridays.
Facebook
TwitterThe 2009 swine flu pandemic was an influenza pandemic that lasted for about 19 months, from January 2009 to August 2010, and the second of two pandemics involving H1N1 influenza virus.
- data.csv - contains day by day country wise no. of cases & deaths from 4th April to 6th July 2009
- Although the pandemic went on for more than 2 years the data is only from 24th April 2009 to 6th July 2009.
- Because the countries were no longer required to test and report individual cases from 6th July 2009.
- So that day by day data from 6th July 2009 is not available.
Photo from CDC Blog https://blogs.cdc.gov/publichealthmatters/2019/04/h1n1/
- COVID-19 - https://www.kaggle.com/imdevskp/corona-virus-report
- MERS - https://www.kaggle.com/imdevskp/mers-outbreak-dataset-20122019
- Ebola Western Africa 2014 Outbreak - https://www.kaggle.com/imdevskp/ebola-outbreak-20142016-complete-dataset
- H1N1 | Swine Flu 2009 Pandemic Dataset - https://www.kaggle.com/imdevskp/h1n1-swine-flu-2009-pandemic-dataset
- SARS 2003 Pandemic - https://www.kaggle.com/imdevskp/sars-outbreak-2003-complete-dataset
- HIV AIDS - https://www.kaggle.com/imdevskp/hiv-aids-dataset
Facebook
TwitterNOTE: This dataset is no longer being updated but is being kept for historical reference. For current data on respiratory illness visits and respiratory laboratory testing data please see Influenza, COVID-19, RSV, and Other Respiratory Virus Laboratory Surveillance and Inpatient, Emergency Department, and Outpatient Visits for Respiratory Illnesses.
This dataset includes aggregated weekly metrics of the surveillance indicators that the Department of Public Health uses to monitor influenza activity in Chicago. These indicators include:
Influenza-associated ICU hospitalizations for Chicago residents, which is a reportable condition in Illinois (HOSP_ columns)
Influenza laboratory data provided by participating sentinel laboratories in Chicago (LAB_ columns)
Influenza-like illness data for outpatient clinic visits and emergency department visits. (ILI_ columns)
For more information on ILINET, see https://www.cdc.gov/flu/weekly/overview.htm#anchor_1539281266932.
For more information on ESSENCE, see https://www.dph.illinois.gov/data-statistics/syndromic-surveillance
All data are provisional and subject to change. Information is updated as additional details are received. At any given time, this dataset reflects data currently known to CDPH. Numbers in this dataset may differ from other public sources.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Most cost-effective option depending on country income level, influenza, and COVID prevalence among patients with severe COVID-like illness.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
A SARS-like virus outbreak originating in Wuhan, China, is spreading into neighboring Asian countries, and as far afield as Australia, the US a and Europe.
On 31 December 2019, the Chinese authorities reported a case of pneumonia with an unknown cause in Wuhan, Hubei province, to the World Health Organisation (WHO)’s China Office. As more and more cases emerged, totaling 44 by 3 January, the country’s National Health Commission isolated the virus causing fever and flu-like symptoms and identified it as a novel coronavirus, now known to the WHO as 2019-nCoV.
The following dataset shows the numbers of spreading coronavirus across the globe.
Sno - Serial number Date - Date of the observation Province / State - Province or state of the observation Country - Country of observation Last Update - Recent update (not accurate in terms of time) Confirmed - Number of confirmed cases Deaths - Number of death cases Recovered - Number of recovered cases
Thanks to John Hopkins CSSE for the live updates on Coronavirus and data streaming. Source: https://github.com/CSSEGISandData/COVID-19 Dashboard: https://public.tableau.com/profile/vignesh.coumarane#!/vizhome/DashboardToupload/Dashboard12
Inspired by the following work: https://gisanddata.maps.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6
Facebook
TwitterData is from the California Department of Public Health (CDPH) Respiratory Virus Dashboard.
The respiratory virus dashboard shows statewide and regional, weekly data for the following conditions:
COVID-19
Influenza
Respiratory Syncytial Virus (RSV)
This dashboard provides an overview of respiratory virus activity (test positivity, flu typing, emergency department visits) and severity (hospital admissions, deaths, and pediatric deaths). The data update most Fridays but may change in future updates as more information becomes available.
Facebook
TwitterLatin America became an epicenter of the coronavirus pandemic in May, driven by Brazil’s ballooning caseload. Ten months after its first known case, Brazil has had more than 7.9 million cases and over 200,000 deaths.
In early June, Brazil began averaging about 1,000 deaths per day from Covid-19, joining the United States — and later India — as the countries with the world’s largest death tolls.
This dataset contains information about COVID-19 in Brazil extracted on the date 16/06/2021. It is the most updated dataset available about Covid in Brazil
🔍 date: date that the data was collected. format YYYY-MM-DD.
🔍 state: Abbreviation for States. Example: SP
🔍 city: Name of the city (if the value is NaN, they are referring to the State, not the city)
🔍 place_type: Can be City or State
🔍 order_for_place: Number that identifies the registering order for this location. The line that refers to the first log is going to be shown as 1, and the following information will start the count as an index.
🔍 is_last: Show if the line was the last update from that place, can be True or False
🔍 city_ibge_code: IBGE Code from the location
🔍confirmed: Number of confirmed cases.
🔍deaths: Number of deaths.
🔍estimated_population: Estimated population for this city/state in 2020. Data from IBGE
🔍estimated_population_2019: Estimated population for this city/state in 2019. Data from IBGE.
🔍confirmed_per_100k_inhabitants: Number of confirmed cases per 100.000 habitants (based on estimated_population).
🔍death_rate: Death rate (deaths / confirmed cases).
This dataset was downloaded from the URL bello. Thanks, Brasil.IO! Their main goal is to make all Brazilian data available to the public DATASET URL: https://brasil.io/dataset/covid19/files/ Cities map file https://geoftp.ibge.gov.br/organizacao_do_territorio/malhas_territoriais/malhas_municipais/municipio_2020/Brasil/BR/
COVID-19 - https://www.kaggle.com/rafaelherrero/covid19-brazil-full-cases-17062021 COVID-19 - https://www.kaggle.com/imdevskp/corona-virus-report MERS - https://www.kaggle.com/imdevskp/mers-outbreak-dataset-20122019 Ebola Western Africa 2014 Outbreak - https://www.kaggle.com/imdevskp/ebola-outbreak-20142016-complete-dataset H1N1 | Swine Flu 2009 Pandemic Dataset - https://www.kaggle.com/imdevskp/h1n1-swine-flu-2009-pandemic-dataset SARS 2003 Pandemic - https://www.kaggle.com/imdevskp/sars-outbreak-2003-complete-dataset HIV AIDS - https://www.kaggle.com/imdevskp/hiv-aids-dataset
Facebook
TwitterNew York has presented the most cases compared to all states across the U.S..There have also been critiques regarding how much more unnoticed impact the flu has caused. My dataset allows us to compare whether or not this is true according to the most recent data.
This COVID-19 data is from Kaggle whereas the New York influenza data comes from the U.S. government health data website. I merged the two datasets by county and FIPS code and listed the most recent reports of 2020 COVID-19 cases and deaths alongside the 2019 known influenza cases for comparison.
I am thankful to Kaggle and the U.S. government for making the data that made this possible openly available.
This data can be extended to answer the common misconceptions of the scale of the COVID-19 and common flu. My inspiration stems from supporting conclusions with data rather than simply intuition.
I would like my data to help answer how we can make U.S. citizens realize what diseases are most impactful.