100+ datasets found
  1. T

    CORONAVIRUS DEATHS by Country Dataset

    • tradingeconomics.com
    csv, excel, json, xml
    Updated Mar 4, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2020). CORONAVIRUS DEATHS by Country Dataset [Dataset]. https://tradingeconomics.com/country-list/coronavirus-deaths
    Explore at:
    csv, excel, xml, jsonAvailable download formats
    Dataset updated
    Mar 4, 2020
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    World
    Description

    This dataset provides values for CORONAVIRUS DEATHS reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.

  2. Country data on COVID-19

    • kaggle.com
    zip
    Updated Aug 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carla Oliveira (2023). Country data on COVID-19 [Dataset]. https://www.kaggle.com/datasets/carlaoliveira/country-data-on-covid19
    Explore at:
    zip(8634707 bytes)Available download formats
    Dataset updated
    Aug 6, 2023
    Authors
    Carla Oliveira
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The data is in CSV format and includes all historical data on the pandemic up to 03/01/2023, following a 1-line format per country and date.

    In the pre-processing of these data, missing data were checked. It was observed, for example, that the missing data referring to new_cases was where the total number of cases had not been changed and that most of the missing data related to vaccination, which actually at the beginning of the pandemic there was no data. Therefore, to solve these cases of missing data it was decided to replace the data containing “NaN” by zero. Some of these features were combined to generate new features. This process that creates new features (data) from existing data, aiming to improve the data before applying machine learning algorithms, is called feature engineering. The new features created were: - Vaccination rate (vaccination_ratio'): total number of people who received at least one dose of vaccine divided by the population at risk. This dose number was chosen because it has a higher correlation with new deaths. - Prevalence: existing cases of the disease at a given time divided by the population at risk of having the disease. Formula: COVID-19 cases ÷ Population at risk * 100. Example: 168,331 ÷ 210,000,000 * 100 = 0.08. - Incidence: new cases of the disease in a defined population during a specific period (one day, for example) divided by the population at risk. Formula: New COVID-19 cases in one day ÷ Population - Total cases * 100. Example: 5,632 ÷ 209,837,301 * 100 = 0.0026.

  3. T

    World Coronavirus COVID-19 Deaths

    • tradingeconomics.com
    csv, excel, json, xml
    Updated Mar 9, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2020). World Coronavirus COVID-19 Deaths [Dataset]. https://tradingeconomics.com/world/coronavirus-deaths
    Explore at:
    excel, csv, xml, jsonAvailable download formats
    Dataset updated
    Mar 9, 2020
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 4, 2020 - May 17, 2023
    Area covered
    World
    Description

    The World Health Organization reported 6932591 Coronavirus Deaths since the epidemic began. In addition, countries reported 766440796 Coronavirus Cases. This dataset provides - World Coronavirus Deaths- actual values, historical data, forecast, chart, statistics, economic calendar and news.

  4. Causes of death around all over the world .

    • kaggle.com
    zip
    Updated Nov 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tanzeela Shahzadi (2025). Causes of death around all over the world . [Dataset]. https://www.kaggle.com/datasets/tan5577/causes-of-death-around-all-over-the-world
    Explore at:
    zip(331562 bytes)Available download formats
    Dataset updated
    Nov 23, 2025
    Authors
    Tanzeela Shahzadi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    World
    Description

    About Dataset

    Context:

    A straightforward way to assess the health status of a population is to focus on mortality – or concepts like child mortality or life expectancy, which are based on mortality estimates. A focus on mortality, however, does not take into account that the burden of diseases is not only that they kill people, but that they cause suffering to people who live with them. Assessing health outcomes by both mortality and morbidity (the prevalent diseases) provides a more encompassing view on health outcomes. This is the topic of this entry. The sum of mortality and morbidity is referred to as the ‘burden of disease’ and can be measured by a metric called ‘Disability Adjusted Life Years‘ (DALYs).

    DALYs are measuring lost health and are a standardized metric that allow for direct comparisons of disease burdens of different diseases across countries, between different populations, and over time. Conceptually, one DALY is the equivalent of losing one year in good health because of either premature death or disease or disability. One DALY represents one lost year of healthy life. The first ‘Global Burden of Disease’ (GBD) was GBD 1990 and the DALY metric was prominently featured in the World Bank’s 1993 World Development Report. Today it is published by both the researchers at the Institute of Health Metrics and Evaluation (IHME) and the ‘Disease Burden Unit’ at the World Health Organization (WHO), which was created in 1998. The IHME continues the work that was started in the early 1990s and publishes the Global Burden of Disease study.

    Content:

    In this Dataset, we have Historical Data of different cause of deaths for all ages around the World. The key features of this Dataset are: Meningitis, Alzheimer's Disease and Other Dementias, Parkinson's Disease, Nutritional Deficiencies, Malaria, Drowning, Interpersonal Violence, Maternal Disorders, HIV/AIDS, Drug Use Disorders, Tuberculosis, Cardiovascular Diseases, Lower Respiratory Infections, Neonatal Disorders, Alcohol Use Disorders, Self-harm, Exposure to Forces of Nature, Diarrheal Diseases, Environmental Heat and Cold Exposure, Neoplasms, Conflict and Terrorism, Diabetes Mellitus, Chronic Kidney Disease, Poisonings, Protein-Energy Malnutrition, Road Injuries, Chronic Respiratory Diseases, Cirrhosis and Other Chronic Liver Diseases, Digestive Diseases, Fire, Heat, and Hot Substances, Acute Hepatitis.

    Dataset Glossary (Column-wise):

    1. Country/Territory - Name of the Country/Territory
    2. Code - Country/Territory Code
    3. Year - Year of the Incident
    4. Meningitis - No. of People died from Meningitis
    5. Alzheimer's Disease and Other Dementias - No. of People died from Alzheimer's Disease and Other Dementias
    6. Parkinson's Disease - No. of People died from Parkinson's Disease
    7. Nutritional Deficiencies - No. of People died from Nutritional Deficiencies
    8. Malaria - No. of People died from Malaria
    9. Drowning - No. of People died from Drowning
    10. Interpersonal Violence - No. of People died from Interpersonal Violence
    11. Maternal Disorders - No. of People died from Maternal Disorders
    12. Drug Use Disorders - No. of People died from Drug Use Disorders
    13. Tuberculosis - No. of People died from Tuberculosis
    14. Cardiovascular Diseases - No. of People died from Cardiovascular Diseases
    15. Lower Respiratory Infections - No. of People died from Lower Respiratory Infections
    16. Neonatal Disorders - No. of People died from Neonatal Disorders
    17. Alcohol Use Disorders - No. of People died from Alcohol Use Disorders
    18. Self-harm - No. of People died from Self-harm
    19. Exposure to Forces of Nature - No. of People died from Exposure to Forces of Nature
    20. Diarrheal Diseases - No. of People died from Diarrheal Diseases
    21. Environmental Heat and Cold Exposure - No. of People died from Environmental Heat and Cold Exposure
    22. Neoplasms - No. of People died from Neoplasms
    23. Conflict and Terrorism - No. of People died from Conflict and Terrorism
    24. Diabetes Mellitus - No. of People died from Diabetes Mellitus
    25. Chronic Kidney Disease - No. of People died from Chronic Kidney Disease
    26. Poisonings - No. of People died from Poisoning
    27. Protein-Energy Malnutrition - No. of People died from Protein-Energy Malnutrition
    28. Chronic Respiratory Diseases - No. of People died from Chronic Respiratory Diseases
    29. Cirrhosis and Other Chronic Liver Diseases - No. of People died from Cirrhosis and Other Chronic Liver Diseases
    30. Digestive Diseases - No. of People died from Digestive Diseases
    31. Fire, Heat, and Hot Substances - No. of People died from Fire or Heat or any Hot Substances
    32. Acute Hepatitis - No. of People died from Acute Hepatitis Structure of the Dataset

    Acknowledgement:

    This Dataset is created from Our World in Data. This Dataset falls under open access under the Creative Commons BY license. You can check the FAQ for more informa...

  5. _Global Health Outcomes Data_

    • kaggle.com
    zip
    Updated Jan 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). _Global Health Outcomes Data_ [Dataset]. https://www.kaggle.com/datasets/thedevastator/global-health-outcomes-data
    Explore at:
    zip(7031 bytes)Available download formats
    Dataset updated
    Jan 23, 2023
    Authors
    The Devastator
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Global Health Outcomes Data

    Impact on Mortality Rates and Malnutrition in Countries Around the World

    By Humanitarian Data Exchange [source]

    About this dataset

    This dataset provides comprehensive insights into critical health conditions around the world, such as mortality rate, malnutrition levels, and frequency of preventable diseases. It documents the prevalence of life-threatening diseases like malaria and tuberculosis, and are tracked alongside key health indicators like adult mortality rates, HIV prevalence, physicians per 10,000 people ratio and public health expenditures. Such metrics provide us with an accurate picture of how developed healthcare systems are in certain countries which ultimately leads to improvements in public policy formation and awareness amongst decision-makers. With this data it is possible to observe disparities between different regions of the world which can help inform global strategies for providing equitable care globally. This dataset is a valuable source for researchers interested in understanding global health trends over time or seeking to evaluate regional differences within countries

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset provides comprehensive global health outcome data for countries around the world. It includes vital information such as infant mortality rates, child malnutrition rates, adult mortality rates, deaths due to malaria and tuberculosis, HIV prevalence rates, life expectancy at age 60 and public health expenditure. This dataset can be used to gain valuable insight into the challenges faced by different countries in providing a good quality of life for their citizens.

    To use this dataset, first identify what questions you need answered and what outcomes you are looking to measure. You may want to look at specific disease-based indicators (e.g. malaria or tuberculosis), health-related indicators (e.g., nutrition), or overall population markers (e.g., life expectancy).

    Then decide which data points from the provided fields will help answer your questions and provide the results needed - e.g,. infant mortality rate or HIV prevalence rate - extracting these values from relevant columns like “Infants lacking immunization (% of one-year-olds) Measles 2013” or “HIV prevalence, adult (% ages 15Ð49) 2013” respectively

    Next extract other columnwise relevant information - e.g., country name — that could also aid your analysis using tools like Excel or Python's Pandas library; sorting through them based on any metric desired — e..g,, physicians per 10k people — while being mindful that some data points are missing in some cases (denoted by NA).

    Finally perform basic analyses with either your own scripting language, like R/Python libraries' numerical functions with accompanying visuals/graphs etc if elucidating trends is desired; drawing meaningful conclusions about overall state of global health outcomes accordingly before making informed decisions thereafter if needed too!

    Research Ideas

    • Create a world health map to visualize the differences in health outcomes across different countries and regions.
    • Develop an AI-based decision support tool that identifies optimal public health policies or interventions based on these metrics for different countries.
    • Design a dashboard or web app that displays and updates this data in real-time, to allow users to compare the current state of global health indicators and benchmark them against historical figures

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

    Columns

    File: health-outcomes-csv-1.csv | Column name | Description | |:-------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------| | Country | The name of the country. (String) ...

  6. d

    COVID-19 Cases and Deaths by Race/Ethnicity - ARCHIVE

    • catalog.data.gov
    • data.ct.gov
    • +2more
    Updated Aug 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.ct.gov (2023). COVID-19 Cases and Deaths by Race/Ethnicity - ARCHIVE [Dataset]. https://catalog.data.gov/dataset/covid-19-cases-and-deaths-by-race-ethnicity
    Explore at:
    Dataset updated
    Aug 12, 2023
    Dataset provided by
    data.ct.gov
    Description

    Note: DPH is updating and streamlining the COVID-19 cases, deaths, and testing data. As of 6/27/2022, the data will be published in four tables instead of twelve. The COVID-19 Cases, Deaths, and Tests by Day dataset contains cases and test data by date of sample submission. The death data are by date of death. This dataset is updated daily and contains information back to the beginning of the pandemic. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-Cases-Deaths-and-Tests-by-Day/g9vi-2ahj. The COVID-19 State Metrics dataset contains over 93 columns of data. This dataset is updated daily and currently contains information starting June 21, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-State-Level-Data/qmgw-5kp6 . The COVID-19 County Metrics dataset contains 25 columns of data. This dataset is updated daily and currently contains information starting June 16, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-County-Level-Data/ujiq-dy22 . The COVID-19 Town Metrics dataset contains 16 columns of data. This dataset is updated daily and currently contains information starting June 16, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-Town-Level-Data/icxw-cada . To protect confidentiality, if a town has fewer than 5 cases or positive NAAT tests over the past 7 days, those data will be suppressed. COVID-19 cases and associated deaths that have been reported among Connecticut residents, broken down by race and ethnicity. All data in this report are preliminary; data for previous dates will be updated as new reports are received and data errors are corrected. Deaths reported to the either the Office of the Chief Medical Examiner (OCME) or Department of Public Health (DPH) are included in the COVID-19 update. The following data show the number of COVID-19 cases and associated deaths per 100,000 population by race and ethnicity. Crude rates represent the total cases or deaths per 100,000 people. Age-adjusted rates consider the age of the person at diagnosis or death when estimating the rate and use a standardized population to provide a fair comparison between population groups with different age distributions. Age-adjustment is important in Connecticut as the median age of among the non-Hispanic white population is 47 years, whereas it is 34 years among non-Hispanic blacks, and 29 years among Hispanics. Because most non-Hispanic white residents who died were over 75 years of age, the age-adjusted rates are lower than the unadjusted rates. In contrast, Hispanic residents who died tend to be younger than 75 years of age which results in higher age-adjusted rates. The population data used to calculate rates is based on the CT DPH population statistics for 2019, which is available online here: https://portal.ct.gov/DPH/Health-Information-Systems--Reporting/Population/Population-Statistics. Prior to 5/10/2021, the population estimates from 2018 were used. Rates are standardized to the 2000 US Millions Standard population (data available here: https://seer.cancer.gov/stdpopulations/). Standardization was done using 19 age groups (0, 1-4, 5-9, 10-14, ..., 80-84, 85 years and older). More information about direct standardization for age adjustment is available here: https://www.cdc.gov/nchs/data/statnt/statnt06rv.pdf Categories are mutually exclusive. The category “multiracial” includes people who answered ‘yes’ to more than one race category. Counts may not add up to total case counts as data on race and ethnicity may be missing. Age adjusted rates calculated only for groups with more than 20 deaths. Abbreviation: NH=Non-Hispanic. Data on Connecticut deaths were obtained from the Connecticut Deaths Registry maintained by the DPH Office of Vital Records. Cause of death was determined by a death certifier (e.g., physician, APRN, medical

  7. o

    Deaths Involving COVID-19 by Fatality Type

    • data.ontario.ca
    • datasets.ai
    • +3more
    csv, xlsx
    Updated Dec 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Health (2024). Deaths Involving COVID-19 by Fatality Type [Dataset]. https://data.ontario.ca/dataset/deaths-involving-covid-19-by-fatality-type
    Explore at:
    xlsx(10965), xlsx(11076), csv(34979)Available download formats
    Dataset updated
    Dec 13, 2024
    Dataset authored and provided by
    Health
    License

    https://www.ontario.ca/page/open-government-licence-ontariohttps://www.ontario.ca/page/open-government-licence-ontario

    Time period covered
    Nov 14, 2024
    Area covered
    Ontario
    Description

    This dataset reports the daily reported number of deaths involving COVID-19 by fatality type.

    Learn how the Government of Ontario is helping to keep Ontarians safe during the 2019 Novel Coronavirus outbreak.

    Effective November 14, 2024 this page will no longer be updated. Information about COVID-19 and other respiratory viruses is available on Public Health Ontario’s interactive respiratory virus tool: https://www.publichealthontario.ca/en/Data-and-Analysis/Infectious-Disease/Respiratory-Virus-Tool

    Data includes:

    • Date on which the death occurred
    • Total number of deaths involving COVID-19
    • Number of deaths with “COVID-19 as the underlying cause of death”
    • Number of deaths with “COVID-19 contributed but not underlying cause”
    • Number of deaths where the “Cause of death unknown” or “Cause of death missing”

    Additional Notes

    The method used to count COVID-19 deaths has changed, effective December 1, 2022. Prior to December 1 2022, deaths were counted based on the date the death was updated in the public health unit’s system. Going forward, deaths are counted on the date they occurred.

    On November 30, 2023 the count of COVID-19 deaths was updated to include missing historical deaths from January 15, 2020 to March 31, 2023.

    CCM is a dynamic disease reporting system which allows ongoing update to data previously entered. As a result, data extracted from CCM represents a snapshot at the time of extraction and may differ from previous or subsequent results. Public Health Units continually clean up COVID-19 data, correcting for missing or overcounted cases and deaths. These corrections can result in data spikes and current totals being different from previously reported cases and deaths. Observed trends over time should be interpreted with caution for the most recent period due to reporting and/or data entry lags.

    As of December 1, 2022, data are based on the date on which the death occurred. This reporting method differs from the prior method which is based on net change in COVID-19 deaths reported day over day.

    Data are based on net change in COVID-19 deaths for which COVID-19 caused the death reported day over day. Deaths are not reported by the date on which death happened as reporting may include deaths that happened on previous dates.

    Spikes, negative numbers and other data anomalies: Due to ongoing data entry and data quality assurance activities in Case and Contact Management system (CCM) file, Public Health Units continually clean up COVID-19, correcting for missing or overcounted cases and deaths. These corrections can result in data spikes, negative numbers and current totals being different from previously reported case and death counts.

    Public Health Units report cause of death in the CCM based on information available to them at the time of reporting and in accordance with definitions provided by Public Health Ontario. The medical certificate of death is the official record and the cause of death could be different.

    Deaths are defined per the outcome field in CCM marked as “Fatal”. Deaths in COVID-19 cases identified as unrelated to COVID-19 are not included in the number of deaths involving COVID-19 reported.

    "_Cause of death unknown_" is the category of death for COVID-19 positive individuals with cause of death still under investigation, or for which the public health unit was unable to determine cause of death. The category may change later when the cause of death is confirmed either as “COVID-19 as the underlying cause of death”, “COVID-19 contributed but not underlying cause,” or “COVID-19 unrelated”.

    "_Cause of death missing_" is the category of death for COVID-19 positive individuals with the cause of death missing in CCM.

    Rates for the most recent days are subject to reporting lags

    All data reflects totals from 8 p.m. the previous day.

    This dataset is subject to change.

  8. n

    Coronavirus (Covid-19) Data in the United States

    • nytimes.com
    • openicpsr.org
    • +4more
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    New York Times, Coronavirus (Covid-19) Data in the United States [Dataset]. https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html
    Explore at:
    Dataset provided by
    New York Times
    Description

    The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.

    Since late January, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.

    We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.

    The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.

  9. WHO Data: Leading cause of DALYs & Death

    • kaggle.com
    zip
    Updated Apr 23, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sdawar (2021). WHO Data: Leading cause of DALYs & Death [Dataset]. https://www.kaggle.com/datasets/sdawar/top-25-economies-leading-cause-of-dalys-death/discussion
    Explore at:
    zip(10921346 bytes)Available download formats
    Dataset updated
    Apr 23, 2021
    Authors
    sdawar
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Top 25 Economies 2021 latest country-level estimates of cause-specific disability-adjusted life year (DALYs), and Deaths for the year 2000, 2010, 2015 and 2019 from World Health Organization.

    Content

    This dataset from World Health Organization contains The latest country-level estimates of cause-specific disability-adjusted life year (DALYs), and Deaths for the year 2000, 2010, 2015 and 2019 from World Health Organization.

    The burden of disease is calculated using the disability-adjusted life year (DALY). One DALY represents the loss of the equivalent of one year of full health. DALYs for a disease or health condition are the sum of years of life lost due to premature mortality (YLLs) and years of healthy life lost due to disability (YLDs) due to prevalent cases of the disease or health condition in a population.

    I have used a web scraper to scrape the data from WHO site. The original dataset had data for more than 180 countries, but due to memory restrictions, I have downloaded the dataset for the worlds Top 25 Economies 2021.

    Following are the columns in the dataset:

    • COUNTRY_CODE : The ISO 3166-1 alpha-3 country code
    • COUNTRY : The country name
    • GHE_CAUSE_CODE : Code for the condition/disease
    • GHE_CAUSE_TYPE : Type of the disease/condition
    • GHE_CAUSE_TITLE : Title for the condition/disease
    • YEAR : Reporting year
    • SEX_CODE : Sex of the population for the corresponding age group for the reporting year
    • AGEGROUP_CODE : Age group of the population for the corresponding sex for the reporting year
    • POPULATION : Total number of people in corresponding age group and sex for the reporting year
    • DEATHS : Total deaths occurred within the corresponding age group and sex for the reporting year due to the corresponding condition/disease
    • DEATHS_RATE : Death rate corresponding to the POPULATION due to the corresponding condition/disease
    • DEATHS_100K : Death rate per 100K POPULATION due to the corresponding condition/disease
    • DALY : DALY value within the corresponding age group and sex for the reporting year due to the corresponding condition/disease
    • DALY_RATE : DALY rate corresponding to the POPULATION due to the corresponding condition/disease
    • DALY_100K : DALY rate per 100K POPULATION due to the corresponding condition/disease

    Acknowledgements

    This dataset and further information on the topic can be found at WHO website.

    Inspiration

    The goal is to analyze and interpret the results across multiple variables and find interesting observations, co-relations or patterns.

  10. Deaths Involving COVID-19 by Vaccination Status

    • open.canada.ca
    • gimi9.com
    • +1more
    csv, docx, html, xlsx
    Updated Nov 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Ontario (2025). Deaths Involving COVID-19 by Vaccination Status [Dataset]. https://open.canada.ca/data/dataset/1375bb00-6454-4d3e-a723-4ae9e849d655
    Explore at:
    docx, csv, html, xlsxAvailable download formats
    Dataset updated
    Nov 12, 2025
    Dataset provided by
    Government of Ontariohttps://www.ontario.ca/
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Time period covered
    Mar 1, 2021 - Nov 12, 2024
    Description

    This dataset reports the daily reported number of the 7-day moving average rates of Deaths involving COVID-19 by vaccination status and by age group. Learn how the Government of Ontario is helping to keep Ontarians safe during the 2019 Novel Coronavirus outbreak. Effective November 14, 2024 this page will no longer be updated. Information about COVID-19 and other respiratory viruses is available on Public Health Ontario’s interactive respiratory virus tool: https://www.publichealthontario.ca/en/Data-and-Analysis/Infectious-Disease/Respiratory-Virus-Tool Data includes: * Date on which the death occurred * Age group * 7-day moving average of the last seven days of the death rate per 100,000 for those not fully vaccinated * 7-day moving average of the last seven days of the death rate per 100,000 for those fully vaccinated * 7-day moving average of the last seven days of the death rate per 100,000 for those vaccinated with at least one booster ##Additional notes As of June 16, all COVID-19 datasets will be updated weekly on Thursdays by 2pm. As of January 12, 2024, data from the date of January 1, 2024 onwards reflect updated population estimates. This update specifically impacts data for the 'not fully vaccinated' category. On November 30, 2023 the count of COVID-19 deaths was updated to include missing historical deaths from January 15, 2020 to March 31, 2023. CCM is a dynamic disease reporting system which allows ongoing update to data previously entered. As a result, data extracted from CCM represents a snapshot at the time of extraction and may differ from previous or subsequent results. Public Health Units continually clean up COVID-19 data, correcting for missing or overcounted cases and deaths. These corrections can result in data spikes and current totals being different from previously reported cases and deaths. Observed trends over time should be interpreted with caution for the most recent period due to reporting and/or data entry lags. The data does not include vaccination data for people who did not provide consent for vaccination records to be entered into the provincial COVaxON system. This includes individual records as well as records from some Indigenous communities where those communities have not consented to including vaccination information in COVaxON. “Not fully vaccinated” category includes people with no vaccine and one dose of double-dose vaccine. “People with one dose of double-dose vaccine” category has a small and constantly changing number. The combination will stabilize the results. Spikes, negative numbers and other data anomalies: Due to ongoing data entry and data quality assurance activities in Case and Contact Management system (CCM) file, Public Health Units continually clean up COVID-19, correcting for missing or overcounted cases and deaths. These corrections can result in data spikes, negative numbers and current totals being different from previously reported case and death counts. Public Health Units report cause of death in the CCM based on information available to them at the time of reporting and in accordance with definitions provided by Public Health Ontario. The medical certificate of death is the official record and the cause of death could be different. Deaths are defined per the outcome field in CCM marked as “Fatal”. Deaths in COVID-19 cases identified as unrelated to COVID-19 are not included in the Deaths involving COVID-19 reported. Rates for the most recent days are subject to reporting lags All data reflects totals from 8 p.m. the previous day. This dataset is subject to change.

  11. Effect of suicide rates on life expectancy dataset

    • zenodo.org
    • data.niaid.nih.gov
    csv
    Updated Apr 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Filip Zoubek; Filip Zoubek (2021). Effect of suicide rates on life expectancy dataset [Dataset]. http://doi.org/10.5281/zenodo.4694270
    Explore at:
    csvAvailable download formats
    Dataset updated
    Apr 16, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Filip Zoubek; Filip Zoubek
    License

    Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
    License information was derived automatically

    Description

    Effect of suicide rates on life expectancy dataset

    Abstract
    In 2015, approximately 55 million people died worldwide, of which 8 million committed suicide. In the USA, one of the main causes of death is the aforementioned suicide, therefore, this experiment is dealing with the question of how much suicide rates affects the statistics of average life expectancy.
    The experiment takes two datasets, one with the number of suicides and life expectancy in the second one and combine data into one dataset. Subsequently, I try to find any patterns and correlations among the variables and perform statistical test using simple regression to confirm my assumptions.

    Data

    The experiment uses two datasets - WHO Suicide Statistics[1] and WHO Life Expectancy[2], which were firstly appropriately preprocessed. The final merged dataset to the experiment has 13 variables, where country and year are used as index: Country, Year, Suicides number, Life expectancy, Adult Mortality, which is probability of dying between 15 and 60 years per 1000 population, Infant deaths, which is number of Infant Deaths per 1000 population, Alcohol, which is alcohol, recorded per capita (15+) consumption, Under-five deaths, which is number of under-five deaths per 1000 population, HIV/AIDS, which is deaths per 1 000 live births HIV/AIDS, GDP, which is Gross Domestic Product per capita, Population, Income composition of resources, which is Human Development Index in terms of income composition of resources, and Schooling, which is number of years of schooling.

    LICENSE

    THE EXPERIMENT USES TWO DATASET - WHO SUICIDE STATISTICS AND WHO LIFE EXPECTANCY, WHICH WERE COLLEECTED FROM WHO AND UNITED NATIONS WEBSITE. THEREFORE, ALL DATASETS ARE UNDER THE LICENSE ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE 3.0 IGO (https://creativecommons.org/licenses/by-nc-sa/3.0/igo/).

    [1] https://www.kaggle.com/szamil/who-suicide-statistics

    [2] https://www.kaggle.com/kumarajarshi/life-expectancy-who

  12. Statewide Death Profiles

    • data.chhs.ca.gov
    • data.ca.gov
    • +3more
    csv, zip
    Updated Dec 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2025). Statewide Death Profiles [Dataset]. https://data.chhs.ca.gov/dataset/statewide-death-profiles
    Explore at:
    csv(4689434), csv(164006), csv(5034), csv(476576), csv(2026589), csv(5401561), csv(463460), csv(419332), csv(200270), csv(16301), zipAvailable download formats
    Dataset updated
    Dec 2, 2025
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    Description

    This dataset contains counts of deaths for California as a whole based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.

    The final data tables include both deaths that occurred in California regardless of the place of residence (by occurrence) and deaths to California residents (by residence), whereas the provisional data table only includes deaths that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.

    The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.

  13. Novel Covid-19 Dataset

    • kaggle.com
    Updated Sep 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GHOST5612 (2025). Novel Covid-19 Dataset [Dataset]. https://www.kaggle.com/datasets/ghost5612/novel-covid-19-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 18, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    GHOST5612
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Context:

    From World Health Organization - On 31 December 2019, WHO was alerted to several cases of pneumonia in Wuhan City, Hubei Province of China. The virus did not match any other known virus. This raised concern because when a virus is new, we do not know how it affects people.

    So daily level information on the affected people can give some interesting insights when it is made available to the broader data science community.

    Johns Hopkins University has made an excellent dashboard using the affected cases data. Data is extracted from the google sheets associated and made available here.

    Edited:

    Now data is available as csv files in the Johns Hopkins Github repository. Please refer to the github repository for the Terms of Use details. Uploading it here for using it in Kaggle kernels and getting insights from the broader DS community.

    Content

    2019 Novel Coronavirus (2019-nCoV) is a virus (more specifically, a coronavirus) identified as the cause of an outbreak of respiratory illness first detected in Wuhan, China. Early on, many of the patients in the outbreak in Wuhan, China reportedly had some link to a large seafood and animal market, suggesting animal-to-person spread. However, a growing number of patients reportedly have not had exposure to animal markets, indicating person-to-person spread is occurring. At this time, it’s unclear how easily or sustainably this virus is spreading between people - CDC

    This dataset has daily level information on the number of affected cases, deaths and recovery from 2019 novel coronavirus. Please note that this is a time series data and so the number of cases on any given day is the cumulative number.

    The data is available from 22 Jan, 2020.

    Here’s a polished version suitable for a professional Kaggle dataset description:

    Dataset Description

    This dataset contains time-series and case-level records of the COVID-19 pandemic. The primary file is covid_19_data.csv, with supporting files for earlier records and individual-level line list data.

    Files and Columns

    1. covid_19_data.csv (Main File)

    This is the primary dataset and contains aggregated COVID-19 statistics by location and date.

    • Sno – Serial number of the record
    • ObservationDate – Date of the observation (MM/DD/YYYY)
    • Province/State – Province or state of the observation (may be missing for some entries)
    • Country/Region – Country of the observation
    • Last Update – Timestamp (UTC) when the record was last updated (not standardized, requires cleaning before use)
    • Confirmed – Cumulative number of confirmed cases on that date
    • Deaths – Cumulative number of deaths on that date
    • Recovered – Cumulative number of recoveries on that date

    2. 2019_ncov_data.csv (Legacy File)

    This file contains earlier COVID-19 records. It is no longer updated and is provided only for historical reference. For current analysis, please use covid_19_data.csv.

    3. COVID_open_line_list_data.csv

    This file provides individual-level case information, obtained from an open data source. It includes patient demographics, travel history, and case outcomes.

    4. COVID19_line_list_data.csv

    Another individual-level case dataset, also obtained from public sources, with detailed patient-level information useful for micro-level epidemiological analysis.

    ✅ Use covid_19_data.csv for up-to-date aggregated global trends.

    ✅ Use the line list datasets for detailed, individual-level case analysis.

    Country level datasets:

    If you are interested in knowing country level data, please refer to the following Kaggle datasets:

    India - https://www.kaggle.com/sudalairajkumar/covid19-in-india

    South Korea - https://www.kaggle.com/kimjihoo/coronavirusdataset

    Italy - https://www.kaggle.com/sudalairajkumar/covid19-in-italy

    Brazil - https://www.kaggle.com/unanimad/corona-virus-brazil

    USA - https://www.kaggle.com/sudalairajkumar/covid19-in-usa

    Switzerland - https://www.kaggle.com/daenuprobst/covid19-cases-switzerland

    Indonesia - https://www.kaggle.com/ardisragen/indonesia-coronavirus-cases

    Acknowledgements :

    Johns Hopkins University for making the data available for educational and academic research purposes

    MoBS lab - https://www.mobs-lab.org/2019ncov.html

    World Health Organization (WHO): https://www.who.int/

    DXY.cn. Pneumonia. 2020. http://3g.dxy.cn/newh5/view/pneumonia.

    BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/

    National Health Commission of the People’s Republic of China (NHC): http://www.nhc.gov.cn/xcs/yqtb/list_gzbd.shtml

    China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm

    Hong Kong Department of Health: https://www.chp.gov.hk/en/features/102465.html

    Macau Government: https://www.ssm.gov.mo/portal/

    Taiwan CDC: https://sites.google....

  14. 💀Deaths And Obesity - 🎀Health

    • kaggle.com
    zip
    Updated May 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    waticson (2024). 💀Deaths And Obesity - 🎀Health [Dataset]. https://www.kaggle.com/datasets/yutodennou/death-and-obesity
    Explore at:
    zip(224551 bytes)Available download formats
    Dataset updated
    May 24, 2024
    Authors
    waticson
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This data set summarizes obesity and the number of deaths caused by it in each country

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2993575%2Fb55c8c53db1eb6809cc0fb6b5a081195%2F2024-05-25%20093352.png?generation=1716597253375211&alt=media" alt="">

    💡I have already divided these into TRAIN data, TEST data, and ANSWER data so you guys can start working on the regression problem right away.

    • train.csv: Obesity and deaths data from 1990 to 2013
    • test.csv: The explanatory variable in 2014
    • answer.csv: The objective variable in 2014

    These data were created with the assumption that the number of deaths due to obesity in 2014 will be estimated from data from 1990 to 2013.

    There is also something called HINT data(hint.csv). This is data for 2015 and beyond. I have left it out of the train or test data because it has many missing values, but it may be useful for forecasting and for those who are interested in more recent data.

    VariablesDiscription
    Country205 country names
    CodeCountry code like AFG for Afghanistan
    YearYear of collecting data
    PopulationPopulation in a country
    Percentage-OverweightPercentage of defined as overweight, BMI >= 25(age-standardized estimate)(%),Sex: both sexes, Age group:18+
    Mean-Daily-Caloric-SupplyMean of daily supply of calories among overweight or obesity, BMI >= 25(age-standardized). Only about men
    Mean-BMIBMI, Age group:18+ years. 2 columns for both male and female
    Percentage-Overweighted-MalePercentage of adults who are overweight (age-standardized) - Age group: 18+ years. 2 columns for both male and female
    Prevalence-Hypertension-MalePrevalence of hypertension among adults aged 30-79 years(age-standardized). 2 columns for both male and female
    Prevalence-ObesityPrevalence of obesity among adults, BMI >= 30(age-standardized estimate)(%),Sex: both sexes, Age group:18+
    Death-By-High-BMIDeaths that are from all causes attributed to high body-mass index per 100,000 people, in both sexes aged age-standarized
  15. o

    WPS 9673 - Death and Destitution : The Global Distribution of Welfare Losses...

    • data.opendata.am
    Updated Jul 7, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). WPS 9673 - Death and Destitution : The Global Distribution of Welfare Losses from the COVID-19 Pandemic - Dataset - Data Catalog Armenia [Dataset]. https://data.opendata.am/dataset/dcwb0037527
    Explore at:
    Dataset updated
    Jul 7, 2023
    Description

    The COVID-19 pandemic has brought about massive declines in well-being around the world. This paper seeks to quantify and compare two important components of those losses—increased mortality and higher poverty—using years of human life as a common metric. The paper estimates that almost 20 million life-years were lost to COVID-19 by December 2020. Over the same period and by the most conservative definition, more than 120 million additional years were spent in poverty because of the pandemic. The mortality burden, whether estimated in lives or years of life lost, increases sharply with gross domestic product per capita. By contrast, the poverty burden declines with per capita national income when a constant absolute poverty line is used, or is uncorrelated with national income when a more relative approach is taken to poverty lines. In both cases, the poverty burden of the pandemic, relative to the mortality burden, is much higher for poor countries. The distribution of aggregate welfare losses—combining mortality and poverty and expressed in terms of life-years —depends on the choice of poverty line(s) and the relative weights placed on mortality and poverty. With a constant absolute poverty line and a relatively low welfare weight on mortality, poorer countries are found to bear a greater welfare loss from the pandemic. When poverty lines are set differently for poor, middle-income, and high-income countries and/or a greater welfare weight is placed on mortality, upper-middle-income and rich countries suffer the most.

  16. w

    Fire statistics data tables

    • gov.uk
    • s3.amazonaws.com
    Updated Oct 23, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ministry of Housing, Communities and Local Government (2025). Fire statistics data tables [Dataset]. https://www.gov.uk/government/statistical-data-sets/fire-statistics-data-tables
    Explore at:
    Dataset updated
    Oct 23, 2025
    Dataset provided by
    GOV.UK
    Authors
    Ministry of Housing, Communities and Local Government
    Description

    On 1 April 2025 responsibility for fire and rescue transferred from the Home Office to the Ministry of Housing, Communities and Local Government.

    This information covers fires, false alarms and other incidents attended by fire crews, and the statistics include the numbers of incidents, fires, fatalities and casualties as well as information on response times to fires. The Ministry of Housing, Communities and Local Government (MHCLG) also collect information on the workforce, fire prevention work, health and safety and firefighter pensions. All data tables on fire statistics are below.

    MHCLG has responsibility for fire services in England. The vast majority of data tables produced by the Ministry of Housing, Communities and Local Government are for England but some (0101, 0103, 0201, 0501, 1401) tables are for Great Britain split by nation. In the past the Department for Communities and Local Government (who previously had responsibility for fire services in England) produced data tables for Great Britain and at times the UK. Similar information for devolved administrations are available at https://www.firescotland.gov.uk/about/statistics/">Scotland: Fire and Rescue Statistics, https://statswales.gov.wales/Catalogue/Community-Safety-and-Social-Inclusion/Community-Safety">Wales: Community safety and https://www.nifrs.org/home/about-us/publications/">Northern Ireland: Fire and Rescue Statistics.

    If you use assistive technology (for example, a screen reader) and need a version of any of these documents in a more accessible format, please email alternativeformats@communities.gov.uk. Please tell us what format you need. It will help us if you say what assistive technology you use.

    Related content

    Fire statistics guidance
    Fire statistics incident level datasets

    Incidents attended

    https://assets.publishing.service.gov.uk/media/68f0f810e8e4040c38a3cf96/FIRE0101.xlsx">FIRE0101: Incidents attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 143 KB) Previous FIRE0101 tables

    https://assets.publishing.service.gov.uk/media/68f0ffd528f6872f1663ef77/FIRE0102.xlsx">FIRE0102: Incidents attended by fire and rescue services in England, by incident type and fire and rescue authority (MS Excel Spreadsheet, 2.12 MB) Previous FIRE0102 tables

    https://assets.publishing.service.gov.uk/media/68f20a3e06e6515f7914c71c/FIRE0103.xlsx">FIRE0103: Fires attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 197 KB) Previous FIRE0103 tables

    https://assets.publishing.service.gov.uk/media/68f20a552f0fc56403a3cfef/FIRE0104.xlsx">FIRE0104: Fire false alarms by reason for false alarm, England (MS Excel Spreadsheet, 443 KB) Previous FIRE0104 tables

    Dwelling fires attended

    https://assets.publishing.service.gov.uk/media/68f100492f0fc56403a3cf94/FIRE0201.xlsx">FIRE0201: Dwelling fires attended by fire and rescue services by motive, population and nation (MS Excel Spreadsheet, 192 KB) Previous FIRE0201 tables

    <span class="gem

  17. d

    Mass Killings in America, 2006 - present

    • data.world
    csv, zip
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Associated Press (2025). Mass Killings in America, 2006 - present [Dataset]. https://data.world/associatedpress/mass-killings-public
    Explore at:
    zip, csvAvailable download formats
    Dataset updated
    Dec 1, 2025
    Authors
    The Associated Press
    Time period covered
    Jan 1, 2006 - Nov 29, 2025
    Area covered
    Description

    THIS DATASET WAS LAST UPDATED AT 7:11 AM EASTERN ON DEC. 1

    OVERVIEW

    2019 had the most mass killings since at least the 1970s, according to the Associated Press/USA TODAY/Northeastern University Mass Killings Database.

    In all, there were 45 mass killings, defined as when four or more people are killed excluding the perpetrator. Of those, 33 were mass shootings . This summer was especially violent, with three high-profile public mass shootings occurring in the span of just four weeks, leaving 38 killed and 66 injured.

    A total of 229 people died in mass killings in 2019.

    The AP's analysis found that more than 50% of the incidents were family annihilations, which is similar to prior years. Although they are far less common, the 9 public mass shootings during the year were the most deadly type of mass murder, resulting in 73 people's deaths, not including the assailants.

    One-third of the offenders died at the scene of the killing or soon after, half from suicides.

    About this Dataset

    The Associated Press/USA TODAY/Northeastern University Mass Killings database tracks all U.S. homicides since 2006 involving four or more people killed (not including the offender) over a short period of time (24 hours) regardless of weapon, location, victim-offender relationship or motive. The database includes information on these and other characteristics concerning the incidents, offenders, and victims.

    The AP/USA TODAY/Northeastern database represents the most complete tracking of mass murders by the above definition currently available. Other efforts, such as the Gun Violence Archive or Everytown for Gun Safety may include events that do not meet our criteria, but a review of these sites and others indicates that this database contains every event that matches the definition, including some not tracked by other organizations.

    This data will be updated periodically and can be used as an ongoing resource to help cover these events.

    Using this Dataset

    To get basic counts of incidents of mass killings and mass shootings by year nationwide, use these queries:

    Mass killings by year

    Mass shootings by year

    To get these counts just for your state:

    Filter killings by state

    Definition of "mass murder"

    Mass murder is defined as the intentional killing of four or more victims by any means within a 24-hour period, excluding the deaths of unborn children and the offender(s). The standard of four or more dead was initially set by the FBI.

    This definition does not exclude cases based on method (e.g., shootings only), type or motivation (e.g., public only), victim-offender relationship (e.g., strangers only), or number of locations (e.g., one). The time frame of 24 hours was chosen to eliminate conflation with spree killers, who kill multiple victims in quick succession in different locations or incidents, and to satisfy the traditional requirement of occurring in a “single incident.”

    Offenders who commit mass murder during a spree (before or after committing additional homicides) are included in the database, and all victims within seven days of the mass murder are included in the victim count. Negligent homicides related to driving under the influence or accidental fires are excluded due to the lack of offender intent. Only incidents occurring within the 50 states and Washington D.C. are considered.

    Methodology

    Project researchers first identified potential incidents using the Federal Bureau of Investigation’s Supplementary Homicide Reports (SHR). Homicide incidents in the SHR were flagged as potential mass murder cases if four or more victims were reported on the same record, and the type of death was murder or non-negligent manslaughter.

    Cases were subsequently verified utilizing media accounts, court documents, academic journal articles, books, and local law enforcement records obtained through Freedom of Information Act (FOIA) requests. Each data point was corroborated by multiple sources, which were compiled into a single document to assess the quality of information.

    In case(s) of contradiction among sources, official law enforcement or court records were used, when available, followed by the most recent media or academic source.

    Case information was subsequently compared with every other known mass murder database to ensure reliability and validity. Incidents listed in the SHR that could not be independently verified were excluded from the database.

    Project researchers also conducted extensive searches for incidents not reported in the SHR during the time period, utilizing internet search engines, Lexis-Nexis, and Newspapers.com. Search terms include: [number] dead, [number] killed, [number] slain, [number] murdered, [number] homicide, mass murder, mass shooting, massacre, rampage, family killing, familicide, and arson murder. Offender, victim, and location names were also directly searched when available.

    This project started at USA TODAY in 2012.

    Contacts

    Contact AP Data Editor Justin Myers with questions, suggestions or comments about this dataset at jmyers@ap.org. The Northeastern University researcher working with AP and USA TODAY is Professor James Alan Fox, who can be reached at j.fox@northeastern.edu or 617-416-4400.

  18. Cleaned Countries Life Expectancy Dataset

    • kaggle.com
    zip
    Updated Aug 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    paperxd (2024). Cleaned Countries Life Expectancy Dataset [Dataset]. https://www.kaggle.com/datasets/paperxd/cleaned-life-expectancy-dataset/code
    Explore at:
    zip(375854 bytes)Available download formats
    Dataset updated
    Aug 28, 2024
    Authors
    paperxd
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Credits

    This dataset was from tekkum and the original file was in xlsx format.

    Context

    While numerous studies have explored the factors influencing life expectancy, most have focused on demographic variables, economic indicators, and mortality rates. However, there has been limited examination of the impact of immunization coverage, health expenditures, and educational attainment on life expectancy. This study seeks to address these gaps by developing a comprehensive dataset with no missing values analyses, utilizing data from many years across 193 different countries. Key immunizations such as Hepatitis B, Polio, and Diphtheria, along with factors like GDP, schooling, and health expenditure, are included in this dataset. This approach aims to identify the most significant predictors of life expectancy, allowing countries to prioritize interventions that could most effectively improve the health and longevity of their populations.

    Dataset

    The success of this analysis relies heavily on the accuracy and completeness of the data. The dataset used in this project has been sourced from the Global Health Observatory (GHO) data repository of the World Health Organization (WHO), which tracks health metrics and related factors for countries worldwide. The corresponding economic data was obtained from the United Nations. From the broad range of health-related variables available, this study focuses on those that are most representative and critical to understanding life expectancy. The dataset includes data for 193 countries and has been meticulously merged into a single file containing 22 columns and 2,938 rows, representing 20 predictive variables. The variables were categorized into four main groups: Immunization-related factors, Mortality factors, Economic factors, and Social factors. Countries with a lot of missing values were excluded, and some values were generated by Bayesian Ridge.

    Question

    This dataset aims to answer the following key questions:

    1. Do the selected predictive factors significantly impact life expectancy, and which variables are the most influential?

    2. Should countries with a lower life expectancy (below 65 years) increase healthcare expenditure to improve their population's lifespan?

    3. How do infant and adult mortality rates influence life expectancy across different regions?

    4. What is the relationship between life expectancy and lifestyle factors such as alcohol consumption?

    5. How does educational attainment, as measured by years of schooling, affect human lifespan?

    6. Is there a positive or negative correlation between alcohol consumption and life expectancy?

    7. What is the impact of immunization coverage on life expectancy, particularly regarding diseases like Hepatitis B, Polio, and Diphtheria?

  19. Deaths by vaccination status, England

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Aug 25, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2023). Deaths by vaccination status, England [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/datasets/deathsbyvaccinationstatusengland
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Aug 25, 2023
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    Age-standardised mortality rates for deaths involving coronavirus (COVID-19), non-COVID-19 deaths and all deaths by vaccination status, broken down by age group.

  20. Indicator 11.5.1: Number of missing persons due to disaster (number)

    • unstats-undesa.opendata.arcgis.com
    • sdgs.amerigeoss.org
    • +1more
    Updated Sep 9, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UN DESA Statistics Division (2021). Indicator 11.5.1: Number of missing persons due to disaster (number) [Dataset]. https://unstats-undesa.opendata.arcgis.com/datasets/indicator-11-5-1-number-of-missing-persons-due-to-disaster-number
    Explore at:
    Dataset updated
    Sep 9, 2021
    Dataset provided by
    United Nations Department of Economic and Social Affairshttps://www.un.org/en/desa
    Authors
    UN DESA Statistics Division
    Area covered
    Description

    Series Name: Number of missing persons due to disaster (number)Series Code: VC_DSR_MISSRelease Version: 2021.Q2.G.03 This dataset is part of the Global SDG Indicator Database compiled through the UN System in preparation for the Secretary-General's annual report on Progress towards the Sustainable Development Goals.Indicator 11.5.1: Number of deaths, missing persons and directly affected persons attributed to disasters per 100,000 populationTarget 11.5: By 2030, significantly reduce the number of deaths and the number of people affected and substantially decrease the direct economic losses relative to global gross domestic product caused by disasters, including water-related disasters, with a focus on protecting the poor and people in vulnerable situationsGoal 11: Make cities and human settlements inclusive, safe, resilient and sustainableFor more information on the compilation methodology of this dataset, see https://unstats.un.org/sdgs/metadata/

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
TRADING ECONOMICS (2020). CORONAVIRUS DEATHS by Country Dataset [Dataset]. https://tradingeconomics.com/country-list/coronavirus-deaths

CORONAVIRUS DEATHS by Country Dataset

CORONAVIRUS DEATHS by Country Dataset (2025)

Explore at:
16 scholarly articles cite this dataset (View in Google Scholar)
csv, excel, xml, jsonAvailable download formats
Dataset updated
Mar 4, 2020
Dataset authored and provided by
TRADING ECONOMICS
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Time period covered
2025
Area covered
World
Description

This dataset provides values for CORONAVIRUS DEATHS reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.

Search
Clear search
Close search
Google apps
Main menu