8 datasets found
  1. COVID-19 and Influenza | New York Datasets

    • kaggle.com
    zip
    Updated May 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Angel Henriquez (2020). COVID-19 and Influenza | New York Datasets [Dataset]. https://www.kaggle.com/datasets/angelhenriquez1/covid19-influenza-newyorkdatasets/discussion
    Explore at:
    zip(648794 bytes)Available download formats
    Dataset updated
    May 9, 2020
    Authors
    Angel Henriquez
    Description

    Context

    New York has presented the most cases compared to all states across the U.S..There have also been critiques regarding how much more unnoticed impact the flu has caused. My dataset allows us to compare whether or not this is true according to the most recent data.

    Content

    This COVID-19 data is from Kaggle whereas the New York influenza data comes from the U.S. government health data website. I merged the two datasets by county and FIPS code and listed the most recent reports of 2020 COVID-19 cases and deaths alongside the 2019 known influenza cases for comparison.

    Acknowledgements

    I am thankful to Kaggle and the U.S. government for making the data that made this possible openly available.

    Inspiration

    This data can be extended to answer the common misconceptions of the scale of the COVID-19 and common flu. My inspiration stems from supporting conclusions with data rather than simply intuition.

    I would like my data to help answer how we can make U.S. citizens realize what diseases are most impactful.

  2. COVID-19 State Data

    • kaggle.com
    zip
    Updated Nov 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Night Ranger (2020). COVID-19 State Data [Dataset]. https://www.kaggle.com/nightranger77/covid19-state-data
    Explore at:
    zip(4501 bytes)Available download formats
    Dataset updated
    Nov 3, 2020
    Authors
    Night Ranger
    Description

    This dataset is a per-state amalgamation of demographic, public health and other relevant predictors for COVID-19.

    Deaths, Infections and Tests by State

    The COVID Tracking Project: https://covidtracking.com/data/api

    Used positive, death and totalTestResults from the API for, respectively, Infected, Deaths and Tested in this dataset. Please read the documentation of the API for more context on those columns

    Predictor Data and Sources

    Population (2020)

    Density is people per meter squared https://worldpopulationreview.com/states/

    ICU Beds and Age 60+

    https://khn.org/news/as-coronavirus-spreads-widely-millions-of-older-americans-live-in-counties-with-no-icu-beds/

    GDP

    https://worldpopulationreview.com/states/gdp-by-state/

    Income per capita (2018)

    https://worldpopulationreview.com/states/per-capita-income-by-state/

    Gini

    https://en.wikipedia.org/wiki/List_of_U.S._states_by_Gini_coefficient

    Unemployment (2020)

    Rates from Feb 2020 and are percentage of labor force
    https://www.bls.gov/web/laus/laumstrk.htm

    Sex (2017)

    Ratio is Male / Female
    https://www.kff.org/other/state-indicator/distribution-by-gender/

    Smoking Percentage (2020)

    https://worldpopulationreview.com/states/smoking-rates-by-state/

    Influenza and Pneumonia Death Rate (2018)

    Death rate per 100,000 people
    https://www.cdc.gov/nchs/pressroom/sosmap/flu_pneumonia_mortality/flu_pneumonia.htm

    Chronic Lower Respiratory Disease Death Rate (2018)

    Death rate per 100,000 people
    https://www.cdc.gov/nchs/pressroom/sosmap/lung_disease_mortality/lung_disease.htm

    Active Physicians (2019)

    https://www.kff.org/other/state-indicator/total-active-physicians/

    Hospitals (2018)

    https://www.kff.org/other/state-indicator/total-hospitals

    Health spending per capita

    Includes spending for all health care services and products by state of residence. Hospital spending is included and reflects the total net revenue. Costs such as insurance, administration, research, and construction expenses are not included.
    https://www.kff.org/other/state-indicator/avg-annual-growth-per-capita/

    Pollution (2019)

    Pollution: Average exposure of the general public to particulate matter of 2.5 microns or less (PM2.5) measured in micrograms per cubic meter (3-year estimate)
    https://www.americashealthrankings.org/explore/annual/measure/air/state/ALL

    Medium and Large Airports

    For each state, number of medium and large airports https://en.wikipedia.org/wiki/List_of_the_busiest_airports_in_the_United_States

    Temperature (2019)

    Note that FL was incorrect in the table, but is corrected in the Hottest States paragraph
    https://worldpopulationreview.com/states/average-temperatures-by-state/
    District of Columbia temperature computed as the average of Maryland and Virginia

    Urbanization (2010)

    Urbanization as a percentage of the population https://www.icip.iastate.edu/tables/population/urban-pct-states

    Age Groups (2018)

    https://www.kff.org/other/state-indicator/distribution-by-age/

    School Closure Dates

    Schools that haven't closed are marked NaN https://www.edweek.org/ew/section/multimedia/map-coronavirus-and-school-closures.html

    Note that some datasets above did not contain data for District of Columbia, this missing data was found via Google searches manually entered.

  3. NNDSS - TABLE 1R. Hepatitis C, perinatal infection to Influenza-associated...

    • catalog.data.gov
    • healthdata.gov
    • +4more
    Updated Jul 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2025). NNDSS - TABLE 1R. Hepatitis C, perinatal infection to Influenza-associated pediatric mortality [Dataset]. https://catalog.data.gov/dataset/nndss-table-1r-hepatitis-c-perinatal-infection-to-influenza-associated-pediatric-mortality
    Explore at:
    Dataset updated
    Jul 9, 2025
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Description

    NNDSS - TABLE 1R. Hepatitis C, perinatal infection to Influenza-associated pediatric mortality - 2020. In this Table, provisional cases* of notifiable diseases are displayed for United States, U.S. territories, and Non-U.S. residents. Notice: Data from California published in week 29 for years 2019 and 2020 were incomplete when originally published on July 24, 2020. On August 4, 2020, incomplete case counts were replaced with a "U" indicating case counts are not available for specified time period. Note: This table contains provisional cases of national notifiable diseases from the National Notifiable Diseases Surveillance System (NNDSS). NNDSS data from the 50 states, New York City, the District of Columbia and the U.S. territories are collated and published weekly on the NNDSS Data and Statistics web page (https://wwwn.cdc.gov/nndss/data-and-statistics.html). Cases reported by state health departments to CDC for weekly publication are provisional because of the time needed to complete case follow-up. Therefore, numbers presented in later weeks may reflect changes made to these counts as additional information becomes available. The national surveillance case definitions used to define a case are available on the NNDSS web site at https://wwwn.cdc.gov/nndss/. Information about the weekly provisional data and guides to interpreting data are available at: https://wwwn.cdc.gov/nndss/infectious-tables.html. Footnotes: U: Unavailable — The reporting jurisdiction was unable to send the data to CDC or CDC was unable to process the data. -: No reported cases — The reporting jurisdiction did not submit any cases to CDC. N: Not reportable — The disease or condition was not reportable by law, statute, or regulation in the reporting jurisdiction. NN: Not nationally notifiable — This condition was not designated as being nationally notifiable. NP: Nationally notifiable but not published. NC: Not calculated — There is insufficient data available to support the calculation of this statistic. Cum: Cumulative year-to-date counts. Max: Maximum — Maximum case count during the previous 52 weeks. * Case counts for reporting years 2019 and 2020 are provisional and subject to change. Cases are assigned to the reporting jurisdiction submitting the case to NNDSS, if the case's country of usual residence is the U.S., a U.S. territory, unknown, or null (i.e. country not reported); otherwise, the case is assigned to the 'Non-U.S. Residents' category. Country of usual residence is currently not reported by all jurisdictions or for all conditions. For further information on interpretation of these data, see https://wwwn.cdc.gov/nndss/document/Users_guide_WONDER_tables_cleared_final.pdf. †Previous 52 week maximum and cumulative YTD are determined from periods of time when the condition was reportable in the jurisdiction (i.e., may be less than 52 weeks of data or incomplete YTD data). § Please refer to the CDC WONDER publication for weekly updates to the footnote for this condition.

  4. COVID-19 Country Data

    • kaggle.com
    zip
    Updated May 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrick (2020). COVID-19 Country Data [Dataset]. https://www.kaggle.com/datasets/bitsnpieces/covid19-country-data/code
    Explore at:
    zip(190821 bytes)Available download formats
    Dataset updated
    May 3, 2020
    Authors
    Patrick
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Motivation

    Why did I create this dataset? This is my first time creating a notebook in Kaggle and I am interested in learning more about COVID-19 and how different countries are affected by it and why. It might be useful to compare different metrics between different countries. And I also wanted to participate in a challenge, and I've decided to join the COVID-19 datasets challenge. While looking through the projects, I noticed https://www.kaggle.com/koryto/countryinfo and it inspired me to start this project.

    Method

    My approach is to scour the Internet and Kaggle looking for country data that can potentially have an impact on how the COVID-19 pandemic spreads. In the end, I ended up with the following for each country:

    • Monthly temperature and precipitation from Worldbank
    • Latitude and longitude
    • Population, density, gender and age
    • Airport traffic from Worldbank
    • COVID-19 date of first case and number of cases and deaths as of March 26, 2020
    • 2009 H1N1 flu pandemic cases and deaths obtained from Wikipedia
    • Property affordability index and Health care index from Numbeo
    • Number of hospital beds and ICU beds from Wikipedia
    • Flu and pneumonia death rate from Worldlifeexpectancy.com (Age Adjusted Death Rate Estimates: 2017)
    • School closures due to COVID-19
    • Number of COVID-19 tests done
    • Number of COVID-19 genetic strains
    • US Social Distancing Policies from COVID19StatePolicy’s SocialDistancing repository on GitHub
    • DHL Global Connectedness Index 2018 (People Breadth scores)
    • Datasets have been merged by country name whenever possible. I needed to rename some countries by hand, e.g. US to United Sates, etc. but it's possible that I might have missed some. See the output file covid19_merged.csv for the merged result.

    See covid19_data - data_sources.csv for data source details.

    Notebook: https://www.kaggle.com/bitsnpieces/covid19-data

    Caveats

    Since I did not personally collect each datapoint, and because each datasource is different with different objectives, collected at different times, measured in different ways, any inferences from this dataset will need further investigation.

    Other interesting sources of information

    Acknowledgements

    I want to acknowledge the authors of the datasets that made their data publicly available which has made this project possible. Banner image is by Brian.

    I hope that the community finds this dataset useful. Feel free to recommend other datasets that you think will be useful / relevant! Thanks for looking.

  5. Leading causes of death, total population, by age group

    • www150.statcan.gc.ca
    • ouvert.canada.ca
    • +1more
    Updated Feb 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Canada, Statistics Canada (2025). Leading causes of death, total population, by age group [Dataset]. http://doi.org/10.25318/1310039401-eng
    Explore at:
    Dataset updated
    Feb 19, 2025
    Dataset provided by
    Statistics Canadahttps://statcan.gc.ca/en
    Area covered
    Canada
    Description

    Rank, number of deaths, percentage of deaths, and age-specific mortality rates for the leading causes of death, by age group and sex, 2000 to most recent year.

  6. All-cause, COVID-19, and non-COVID-19 ASDR for ages 25+ by state and time...

    • figshare.com
    • plos.figshare.com
    xls
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anneliese N. Luck; Andrew C. Stokes; Katherine Hempstead; Eugenio Paglino; Samuel H. Preston (2023). All-cause, COVID-19, and non-COVID-19 ASDR for ages 25+ by state and time period. [Dataset]. http://doi.org/10.1371/journal.pone.0281683.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Anneliese N. Luck; Andrew C. Stokes; Katherine Hempstead; Eugenio Paglino; Samuel H. Preston
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All-cause, COVID-19, and non-COVID-19 ASDR for ages 25+ by state and time period.

  7. Characteristics of patients hospitalised for COVID-19 and controls.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 5, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Krishnan Bhaskaran; Christopher T. Rentsch; George Hickman; William J. Hulme; Anna Schultze; Helen J. Curtis; Kevin Wing; Charlotte Warren-Gash; Laurie Tomlinson; Chris J. Bates; Rohini Mathur; Brian MacKenna; Viyaasan Mahalingasivam; Angel Wong; Alex J. Walker; Caroline E. Morton; Daniel Grint; Amir Mehrkar; Rosalind M. Eggo; Peter Inglesby; Ian J. Douglas; Helen I. McDonald; Jonathan Cockburn; Elizabeth J. Williamson; David Evans; John Parry; Frank Hester; Sam Harper; Stephen JW Evans; Sebastian Bacon; Liam Smeeth; Ben Goldacre (2023). Characteristics of patients hospitalised for COVID-19 and controls. [Dataset]. http://doi.org/10.1371/journal.pmed.1003871.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Krishnan Bhaskaran; Christopher T. Rentsch; George Hickman; William J. Hulme; Anna Schultze; Helen J. Curtis; Kevin Wing; Charlotte Warren-Gash; Laurie Tomlinson; Chris J. Bates; Rohini Mathur; Brian MacKenna; Viyaasan Mahalingasivam; Angel Wong; Alex J. Walker; Caroline E. Morton; Daniel Grint; Amir Mehrkar; Rosalind M. Eggo; Peter Inglesby; Ian J. Douglas; Helen I. McDonald; Jonathan Cockburn; Elizabeth J. Williamson; David Evans; John Parry; Frank Hester; Sam Harper; Stephen JW Evans; Sebastian Bacon; Liam Smeeth; Ben Goldacre
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Characteristics of patients hospitalised for COVID-19 and controls.

  8. Post hoc analysis of specific hospitalisation/mortality outcomes within the...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Krishnan Bhaskaran; Christopher T. Rentsch; George Hickman; William J. Hulme; Anna Schultze; Helen J. Curtis; Kevin Wing; Charlotte Warren-Gash; Laurie Tomlinson; Chris J. Bates; Rohini Mathur; Brian MacKenna; Viyaasan Mahalingasivam; Angel Wong; Alex J. Walker; Caroline E. Morton; Daniel Grint; Amir Mehrkar; Rosalind M. Eggo; Peter Inglesby; Ian J. Douglas; Helen I. McDonald; Jonathan Cockburn; Elizabeth J. Williamson; David Evans; John Parry; Frank Hester; Sam Harper; Stephen JW Evans; Sebastian Bacon; Liam Smeeth; Ben Goldacre (2023). Post hoc analysis of specific hospitalisation/mortality outcomes within the mental health and cognitive category. [Dataset]. http://doi.org/10.1371/journal.pmed.1003871.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 7, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Krishnan Bhaskaran; Christopher T. Rentsch; George Hickman; William J. Hulme; Anna Schultze; Helen J. Curtis; Kevin Wing; Charlotte Warren-Gash; Laurie Tomlinson; Chris J. Bates; Rohini Mathur; Brian MacKenna; Viyaasan Mahalingasivam; Angel Wong; Alex J. Walker; Caroline E. Morton; Daniel Grint; Amir Mehrkar; Rosalind M. Eggo; Peter Inglesby; Ian J. Douglas; Helen I. McDonald; Jonathan Cockburn; Elizabeth J. Williamson; David Evans; John Parry; Frank Hester; Sam Harper; Stephen JW Evans; Sebastian Bacon; Liam Smeeth; Ben Goldacre
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Post hoc analysis of specific hospitalisation/mortality outcomes within the mental health and cognitive category.

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Angel Henriquez (2020). COVID-19 and Influenza | New York Datasets [Dataset]. https://www.kaggle.com/datasets/angelhenriquez1/covid19-influenza-newyorkdatasets/discussion
Organization logo

COVID-19 and Influenza | New York Datasets

Useful Dataset to Compare COVID-19 and Influenza Death Tolls in New York

Explore at:
zip(648794 bytes)Available download formats
Dataset updated
May 9, 2020
Authors
Angel Henriquez
Description

Context

New York has presented the most cases compared to all states across the U.S..There have also been critiques regarding how much more unnoticed impact the flu has caused. My dataset allows us to compare whether or not this is true according to the most recent data.

Content

This COVID-19 data is from Kaggle whereas the New York influenza data comes from the U.S. government health data website. I merged the two datasets by county and FIPS code and listed the most recent reports of 2020 COVID-19 cases and deaths alongside the 2019 known influenza cases for comparison.

Acknowledgements

I am thankful to Kaggle and the U.S. government for making the data that made this possible openly available.

Inspiration

This data can be extended to answer the common misconceptions of the scale of the COVID-19 and common flu. My inspiration stems from supporting conclusions with data rather than simply intuition.

I would like my data to help answer how we can make U.S. citizens realize what diseases are most impactful.

Search
Clear search
Close search
Google apps
Main menu