Facebook
TwitterNew York has presented the most cases compared to all states across the U.S..There have also been critiques regarding how much more unnoticed impact the flu has caused. My dataset allows us to compare whether or not this is true according to the most recent data.
This COVID-19 data is from Kaggle whereas the New York influenza data comes from the U.S. government health data website. I merged the two datasets by county and FIPS code and listed the most recent reports of 2020 COVID-19 cases and deaths alongside the 2019 known influenza cases for comparison.
I am thankful to Kaggle and the U.S. government for making the data that made this possible openly available.
This data can be extended to answer the common misconceptions of the scale of the COVID-19 and common flu. My inspiration stems from supporting conclusions with data rather than simply intuition.
I would like my data to help answer how we can make U.S. citizens realize what diseases are most impactful.
Facebook
TwitterInfluenza death rates by county, all races (includes Hispanic/Latino), all sexes, all ages, 2019-2023. Death data were provided by the National Vital Statistics System. Death rates (deaths per 100,000 population per year) are age-adjusted to the 2000 US standard population (20 age groups: <1, 1-4, 5-9, ... , 80-84, 85-89, 90+). Rates calculated using SEER*Stat. Population counts for denominators are based on Census populations as modified by the National Cancer Institute. The US Population Data File is used for mortality data. The Average Annual Percent Change is based onthe APCs calculated by the Joinpoint Regression Program (Version 4.9.0.0). Due to data availability issues, the time period used in the calculation of the joinpoint regression model may differ for selected counties. Counties with a (3) after their name may have their joinpoint regresssion model calculated using a different time period due to data availability issues.
Facebook
TwitterThis dataset is a per-state amalgamation of demographic, public health and other relevant predictors for COVID-19.
Used positive, death and totalTestResults from the API for, respectively, Infected, Deaths and Tested in this dataset.
Please read the documentation of the API for more context on those columns
Density is people per meter squared https://worldpopulationreview.com/states/
https://worldpopulationreview.com/states/gdp-by-state/
https://worldpopulationreview.com/states/per-capita-income-by-state/
https://en.wikipedia.org/wiki/List_of_U.S._states_by_Gini_coefficient
Rates from Feb 2020 and are percentage of labor force
https://www.bls.gov/web/laus/laumstrk.htm
Ratio is Male / Female
https://www.kff.org/other/state-indicator/distribution-by-gender/
https://worldpopulationreview.com/states/smoking-rates-by-state/
Death rate per 100,000 people
https://www.cdc.gov/nchs/pressroom/sosmap/flu_pneumonia_mortality/flu_pneumonia.htm
Death rate per 100,000 people
https://www.cdc.gov/nchs/pressroom/sosmap/lung_disease_mortality/lung_disease.htm
https://www.kff.org/other/state-indicator/total-active-physicians/
https://www.kff.org/other/state-indicator/total-hospitals
Includes spending for all health care services and products by state of residence. Hospital spending is included and reflects the total net revenue. Costs such as insurance, administration, research, and construction expenses are not included.
https://www.kff.org/other/state-indicator/avg-annual-growth-per-capita/
Pollution: Average exposure of the general public to particulate matter of 2.5 microns or less (PM2.5) measured in micrograms per cubic meter (3-year estimate)
https://www.americashealthrankings.org/explore/annual/measure/air/state/ALL
For each state, number of medium and large airports https://en.wikipedia.org/wiki/List_of_the_busiest_airports_in_the_United_States
Note that FL was incorrect in the table, but is corrected in the Hottest States paragraph
https://worldpopulationreview.com/states/average-temperatures-by-state/
District of Columbia temperature computed as the average of Maryland and Virginia
Urbanization as a percentage of the population https://www.icip.iastate.edu/tables/population/urban-pct-states
https://www.kff.org/other/state-indicator/distribution-by-age/
Schools that haven't closed are marked NaN https://www.edweek.org/ew/section/multimedia/map-coronavirus-and-school-closures.html
Note that some datasets above did not contain data for District of Columbia, this missing data was found via Google searches manually entered.
Facebook
TwitterThe 2009 swine flu pandemic was an influenza pandemic that lasted for about 19 months, from January 2009 to August 2010, and the second of two pandemics involving H1N1 influenza virus.
- data.csv - contains day by day country wise no. of cases & deaths from 4th April to 6th July 2009
- Although the pandemic went on for more than 2 years the data is only from 24th April 2009 to 6th July 2009.
- Because the countries were no longer required to test and report individual cases from 6th July 2009.
- So that day by day data from 6th July 2009 is not available.
Photo from CDC Blog https://blogs.cdc.gov/publichealthmatters/2019/04/h1n1/
- COVID-19 - https://www.kaggle.com/imdevskp/corona-virus-report
- MERS - https://www.kaggle.com/imdevskp/mers-outbreak-dataset-20122019
- Ebola Western Africa 2014 Outbreak - https://www.kaggle.com/imdevskp/ebola-outbreak-20142016-complete-dataset
- H1N1 | Swine Flu 2009 Pandemic Dataset - https://www.kaggle.com/imdevskp/h1n1-swine-flu-2009-pandemic-dataset
- SARS 2003 Pandemic - https://www.kaggle.com/imdevskp/sars-outbreak-2003-complete-dataset
- HIV AIDS - https://www.kaggle.com/imdevskp/hiv-aids-dataset
Facebook
Twitterhttps://www.usa.gov/government-workshttps://www.usa.gov/government-works
This dataset represents preliminary estimates of cumulative U.S. COVID-19 disease burden for the 2024-2025 period, including illnesses, outpatient visits, hospitalizations, and deaths. The weekly COVID-19-associated burden estimates are preliminary and based on continuously collected surveillance data from patients hospitalized with laboratory-confirmed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections. The data come from the Coronavirus Disease 2019 (COVID-19)-Associated Hospitalization Surveillance Network (COVID-NET), a surveillance platform that captures data from hospitals that serve about 10% of the U.S. population. Each week CDC estimates a range (i.e., lower estimate and an upper estimate) of COVID-19 -associated burden that have occurred since October 1, 2024.
Note: Data are preliminary and subject to change as more data become available. Rates for recent COVID-19-associated hospital admissions are subject to reporting delays; as new data are received each week, previous rates are updated accordingly.
References
Facebook
TwitterRank, number of deaths, percentage of deaths, and age-specific mortality rates for the leading causes of death, by age group and sex, 2000 to most recent year.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Why did I create this dataset? This is my first time creating a notebook in Kaggle and I am interested in learning more about COVID-19 and how different countries are affected by it and why. It might be useful to compare different metrics between different countries. And I also wanted to participate in a challenge, and I've decided to join the COVID-19 datasets challenge. While looking through the projects, I noticed https://www.kaggle.com/koryto/countryinfo and it inspired me to start this project.
My approach is to scour the Internet and Kaggle looking for country data that can potentially have an impact on how the COVID-19 pandemic spreads. In the end, I ended up with the following for each country:
See covid19_data - data_sources.csv for data source details.
Notebook: https://www.kaggle.com/bitsnpieces/covid19-data
Since I did not personally collect each datapoint, and because each datasource is different with different objectives, collected at different times, measured in different ways, any inferences from this dataset will need further investigation.
I want to acknowledge the authors of the datasets that made their data publicly available which has made this project possible. Banner image is by Brian.
I hope that the community finds this dataset useful. Feel free to recommend other datasets that you think will be useful / relevant! Thanks for looking.
Facebook
TwitterIntroductionWe compared hospitalization outcomes of young children hospitalized with COVID-19 to those hospitalized with influenza in the United States.MethodsPatients aged 0-<5 years hospitalized with an admission diagnosis of acute COVID-19 (April 2021-March 2022) or influenza (April 2019-March 2020) were selected from the PINC AI Healthcare Database Special Release. Hospitalization outcomes included length of stay (LOS), intensive care unit (ICU) admission, oxygen supplementation, and mechanical ventilation (MV). Inverse probability of treatment weighting was used to adjust for confounders in logistic regression analyses.ResultsAmong children hospitalized with COVID-19 (n = 4,839; median age: 0 years), 21.3% had an ICU admission, 19.6% received oxygen supplementation, 7.9% received MV support, and 0.5% died. Among children hospitalized with influenza (n = 4,349; median age: 1 year), 17.4% were admitted to the ICU, 26.7% received oxygen supplementation, 7.6% received MV support, and 0.3% died. Compared to children hospitalized with influenza, those with COVID-19 were more likely to have an ICU admission (adjusted odds ratio [aOR]: 1.34; 95% confidence interval [CI]: 1.21–1.48). However, children with COVID-19 were less likely to receive oxygen supplementation (aOR: 0.71; 95% CI: 0.64–0.78), have a prolonged LOS (aOR: 0.81; 95% CI: 0.75–0.88), or a prolonged ICU stay (aOR: 0.56; 95% CI: 0.46–0.68). The likelihood of receiving MV was similar (aOR: 0.94; 95% CI: 0.81, 1.1).ConclusionsHospitalized children with either SARS-CoV-2 or influenza had severe complications including ICU admission and oxygen supplementation. Nearly 10% received MV support. Both SARS-CoV-2 and influenza have the potential to cause severe illness in young children.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Characteristics of patients hospitalised for COVID-19 and controls.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Post hoc analysis of specific hospitalisation/mortality outcomes within the mental health and cognitive category.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundThe impact of seasonal influenza vaccination (SIV) on mortality is still controversial; some studies have claimed that increasing vaccination coverage rates is beneficial, while others have found no significant association. This study aimed to construct a granular longitudinal dataset of local VCRs and assess their effect on pneumonia- and influenza-related (P&I) mortality among Italian adults aged ≥ 65 years.MethodsNUTS-3 (nomenclature of territorial units for statistics) level data on SIV coverage were collected via a survey of local data holders. Fixed- and random-effects panel regression modeling, when adjusted for potential confounders, was performed to assess the association between local SIV coverage rates and P&I mortality in older adults.ResultsA total of 1,144 local VCRs from 2003 to 2019 were ascertained. In the fully adjusted fixed-effects model, each 1% increase in vaccination coverage was associated (P < 0.001) with a 0.6% (95% CI: 0.3–0.9%) average over-time decrease in P&I mortality. With an annual average of 9,293 P&I deaths in Italy, this model suggested that 56 deaths could have been avoided each year by increasing SIV coverage by 1%. The random-effects model produced similar results. The base-case results were robust in a sensitivity analysis.ConclusionOver the last two decades, Italian jurisdictions with higher SIV uptake had, on average, fewer P&I deaths among older adults. Local policy-makers should implement effective strategies to increase SIV coverage in the Italian senior population.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterNew York has presented the most cases compared to all states across the U.S..There have also been critiques regarding how much more unnoticed impact the flu has caused. My dataset allows us to compare whether or not this is true according to the most recent data.
This COVID-19 data is from Kaggle whereas the New York influenza data comes from the U.S. government health data website. I merged the two datasets by county and FIPS code and listed the most recent reports of 2020 COVID-19 cases and deaths alongside the 2019 known influenza cases for comparison.
I am thankful to Kaggle and the U.S. government for making the data that made this possible openly available.
This data can be extended to answer the common misconceptions of the scale of the COVID-19 and common flu. My inspiration stems from supporting conclusions with data rather than simply intuition.
I would like my data to help answer how we can make U.S. citizens realize what diseases are most impactful.