80 datasets found
  1. Effect of suicide rates on life expectancy dataset

    • zenodo.org
    • data.niaid.nih.gov
    csv
    Updated Apr 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Filip Zoubek; Filip Zoubek (2021). Effect of suicide rates on life expectancy dataset [Dataset]. http://doi.org/10.5281/zenodo.4694270
    Explore at:
    csvAvailable download formats
    Dataset updated
    Apr 16, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Filip Zoubek; Filip Zoubek
    License

    Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
    License information was derived automatically

    Description

    Effect of suicide rates on life expectancy dataset

    Abstract
    In 2015, approximately 55 million people died worldwide, of which 8 million committed suicide. In the USA, one of the main causes of death is the aforementioned suicide, therefore, this experiment is dealing with the question of how much suicide rates affects the statistics of average life expectancy.
    The experiment takes two datasets, one with the number of suicides and life expectancy in the second one and combine data into one dataset. Subsequently, I try to find any patterns and correlations among the variables and perform statistical test using simple regression to confirm my assumptions.

    Data

    The experiment uses two datasets - WHO Suicide Statistics[1] and WHO Life Expectancy[2], which were firstly appropriately preprocessed. The final merged dataset to the experiment has 13 variables, where country and year are used as index: Country, Year, Suicides number, Life expectancy, Adult Mortality, which is probability of dying between 15 and 60 years per 1000 population, Infant deaths, which is number of Infant Deaths per 1000 population, Alcohol, which is alcohol, recorded per capita (15+) consumption, Under-five deaths, which is number of under-five deaths per 1000 population, HIV/AIDS, which is deaths per 1 000 live births HIV/AIDS, GDP, which is Gross Domestic Product per capita, Population, Income composition of resources, which is Human Development Index in terms of income composition of resources, and Schooling, which is number of years of schooling.

    LICENSE

    THE EXPERIMENT USES TWO DATASET - WHO SUICIDE STATISTICS AND WHO LIFE EXPECTANCY, WHICH WERE COLLEECTED FROM WHO AND UNITED NATIONS WEBSITE. THEREFORE, ALL DATASETS ARE UNDER THE LICENSE ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE 3.0 IGO (https://creativecommons.org/licenses/by-nc-sa/3.0/igo/).

    [1] https://www.kaggle.com/szamil/who-suicide-statistics

    [2] https://www.kaggle.com/kumarajarshi/life-expectancy-who

  2. Suicides in England and Wales

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Oct 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2025). Suicides in England and Wales [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/datasets/suicidesintheunitedkingdomreferencetables
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 3, 2025
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Area covered
    England
    Description

    Number of suicides and suicide rates by sex and age in England and Wales. Includes information on conclusion type, the proportion of suicides by method, and the median registration delay.

  3. T

    Suicides And Attempts

    • data.cincinnati-oh.gov
    csv, xlsx, xml
    Updated Oct 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Cincinnati (2025). Suicides And Attempts [Dataset]. https://data.cincinnati-oh.gov/Safety/Suicides-And-Attempts/w92t-np3h
    Explore at:
    xlsx, xml, csvAvailable download formats
    Dataset updated
    Oct 12, 2025
    Authors
    City of Cincinnati
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    Fire Incident data includes all fire incident responses. This includes emergency medical services (EMS) calls, fires, rescue incidents, and all other services handled by the Fire Department.

    The source of this data is the City of Cincinnati's computer aided dispatch (CAD) database.

    This data is updated daily.

    DISCLAIMER: In compliance with privacy laws, all Public Safety datasets are anonymized and appropriately redacted prior to publication on the City of Cincinnati’s Open Data Portal. This means that for all public safety datasets: (1) the last two digits of all addresses have been replaced with “XX,” and in cases where there is a single digit street address, the entire address number is replaced with "X"; and (2) Latitude and Longitude have been randomly skewed to represent values within the same block area (but not the exact location) of the incident.

  4. Deaths; suicide (residents), various themes

    • cbs.nl
    • ckan.mobidatalab.eu
    • +2more
    xml
    Updated Aug 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centraal Bureau voor de Statistiek (2025). Deaths; suicide (residents), various themes [Dataset]. https://www.cbs.nl/en-gb/figures/detail/7022eng
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Aug 28, 2025
    Dataset provided by
    Statistics Netherlands
    Authors
    Centraal Bureau voor de Statistiek
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    1950 - 2024
    Area covered
    The Netherlands
    Description

    This table contains the number of victims of suicide arranged by marital status, method, motives, age and sex. They represent the number deaths by suicide in the resident population of the Netherlands.

    The figures in this table are equal to the suicide figures in the causes of death statistics, because they are based on the same files. The causes of death statistics do not contain information on the motive of suicide. For the years 1950-1995, this information is obtained from a historical data file on suicides. For the years 1996-now the motive is taken from the external causes of death (Niet-Natuurlijke dood) file. Before the 9th revision of the International Statistical Classification of Diseases and Related Health Problems (ICD), i.e. for the years 1950-1978, it was not possible to code "jumping in front of train/metro". For these years 1950-1978 "jumping in front of train/metro" has been left empty, and it has been counted in the group "other method".

    Relative figures have been calculated per 100 000 of the corresponding population group. The figures are calculated based on the average population of the corresponding year.

    Data available from: 1950

    Status of the figures: The figures up to and including 2023 are final, the figures for 2024 are provisional.

    Changes as of August 28th 2025: The provisional figures for 2024 are added.

    When will new figures be published: In the first quarter of 2026 the final figures for 2024 will be published.

  5. Number of suicides India 1971-2022

    • statista.com
    • tokrwards.com
    • +1more
    Updated May 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Number of suicides India 1971-2022 [Dataset]. https://www.statista.com/statistics/665354/number-of-suicides-india/
    Explore at:
    Dataset updated
    May 27, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    India
    Description

    Over *** thousand deaths due to suicides were recorded in India in 2022. Furthermore, majority of suicides were reported in the state of Tamil Nadu, followed by Rajasthan. The number of suicides that year had increased from the previous year. Some of the causes for suicides in the country were due to professional problems, abuse, violence, family problems, financial loss, sense of isolation and mental disorders. Depressive disorders and suicide As of 2015, over ****** million people worldwide suffered from some kind of depressive disorder. Furthermore, over ** percent of the total population in India suffer from different forms of mental disorders as of 2017. There exists a positive correlation between the number of suicide mortality rates and people with select mental disorders as opposed to those without. Risk factors for mental disorders Every ******* person in India suffers from some form of mental disorder. Today, depressive disorders are regarded as the leading contributor not only to disease burden and morbidity worldwide, but even suicide if not addressed. In 2022, the leading cause for suicide deaths in India was due to family problems. The second leading cause was due to illness. Some of the risk factors, relative to developing mental disorders including depressive and anxiety disorders, include bullying victimization, poverty, unemployment, childhood sexual abuse and intimate partner violence.

  6. Death rates for suicide, by sex, race, Hispanic origin, and age: United...

    • catalog.data.gov
    • healthdata.gov
    • +3more
    Updated Apr 23, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2025). Death rates for suicide, by sex, race, Hispanic origin, and age: United States [Dataset]. https://catalog.data.gov/dataset/death-rates-for-suicide-by-sex-race-hispanic-origin-and-age-united-states-020c1
    Explore at:
    Dataset updated
    Apr 23, 2025
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Area covered
    United States
    Description

    Data on death rates for suicide, by selected population characteristics. Please refer to the PDF or Excel version of this table in the HUS 2019 Data Finder (https://www.cdc.gov/nchs/hus/contents2019.htm) for critical information about measures, definitions, and changes over time. SOURCE: NCHS, National Vital Statistics System (NVSS); Grove RD, Hetzel AM. Vital statistics rates in the United States, 1940–1960. National Center for Health Statistics. 1968; numerator data from NVSS annual public-use Mortality Files; denominator data from U.S. Census Bureau national population estimates; and Murphy SL, Xu JQ, Kochanek KD, Arias E, Tejada-Vera B. Deaths: Final data for 2018. National Vital Statistics Reports; vol 69 no 13. Hyattsville, MD: National Center for Health Statistics. 2021. Available from: https://www.cdc.gov/nchs/products/nvsr.htm. For more information on the National Vital Statistics System, see the corresponding Appendix entry at https://www.cdc.gov/nchs/data/hus/hus19-appendix-508.pdf.

  7. Deaths caused by suicide by quarter in England

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Apr 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2025). Deaths caused by suicide by quarter in England [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/datasets/deathscausedbysuicidebyquarterinengland
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Apr 9, 2025
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    Provisional rate and number of suicide deaths registered in England per quarter. Includes 2001 to 2023 registrations and provisional data for Quarter 1 (Jan to Mar) to Quarter 4 (Oct to Dec) 2024. These are official statistics in development.

  8. Deaths from Suicide - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Jul 11, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2017). Deaths from Suicide - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/deaths-from-suicide
    Explore at:
    Dataset updated
    Jul 11, 2017
    Dataset provided by
    CKANhttps://ckan.org/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows deaths (of people age 10 and over) from Suicide and Undetermined Injury, numbers and rates by gender, as 3-year moving-averages. Suicide is a significant cause of premature deaths occurring generally at younger ages than other common causes of premature mortality. It may also be seen as an indicator of underlying rates of mental ill-health. Directly Age-Standardised Rates (DASR) are shown in the data, where numbers are sufficient, so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates. The figures in this dataset include deaths recorded as suicide (people age 10 and over) and undetermined injury (age 15 and over) as those are mostly likely also to have been caused by self-harm rather than unverifiable accident, neglect or abuse. The population denominators for rates are age 10 and over. Low numbers may result in zero values or missing data. Data source: Office for Health Improvement and Disparities (OHID), Public Health Outcomes Framework (PHOF) indicator 41001 (E10). This data is updated annually.

  9. Statewide Death Profiles

    • data.chhs.ca.gov
    • data.ca.gov
    • +3more
    csv, zip
    Updated Oct 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2025). Statewide Death Profiles [Dataset]. https://data.chhs.ca.gov/dataset/statewide-death-profiles
    Explore at:
    csv(419332), csv(5034), csv(5401561), csv(463460), csv(2026589), csv(16301), csv(200270), csv(4689434), zip, csv(164006), csv(429224)Available download formats
    Dataset updated
    Oct 2, 2025
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    Description

    This dataset contains counts of deaths for California as a whole based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.

    The final data tables include both deaths that occurred in California regardless of the place of residence (by occurrence) and deaths to California residents (by residence), whereas the provisional data table only includes deaths that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.

    The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.

  10. d

    Deaths from Suicide - Dataset - Datopian CKAN instance

    • demo.dev.datopian.com
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Deaths from Suicide - Dataset - Datopian CKAN instance [Dataset]. https://demo.dev.datopian.com/dataset/lcc--deaths-from-suicide
    Explore at:
    Dataset updated
    Oct 7, 2025
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows deaths (of people age 10 and over) from Suicide and Undetermined Injury, numbers and rates by gender, as 3-year moving-averages. Suicide is a significant cause of premature deaths occurring generally at younger ages than other common causes of premature mortality. It may also be seen as an indicator of underlying rates of mental ill-health. Directly Age-Standardised Rates (DASR) are shown in the data, where numbers are sufficient, so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates. The figures in this dataset include deaths recorded as suicide (people age 10 and over) and undetermined injury (age 15 and over) as those are mostly likely also to have been caused by self-harm rather than unverifiable accident, neglect or abuse. The population denominators for rates are age 10 and over. Low numbers may result in zero values or missing data. Data source: Office for Health Improvement and Disparities (OHID), Public Health Outcomes Framework (PHOF) indicator 41001 (E10). This data is updated annually.

  11. Mass Killings in America, 2006 - present

    • data.world
    csv, zip
    Updated Oct 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Associated Press (2025). Mass Killings in America, 2006 - present [Dataset]. https://data.world/associatedpress/mass-killings-public
    Explore at:
    zip, csvAvailable download formats
    Dataset updated
    Oct 10, 2025
    Dataset provided by
    data.world, Inc.
    Authors
    The Associated Press
    Time period covered
    Jan 1, 2006 - Sep 28, 2025
    Area covered
    Description

    THIS DATASET WAS LAST UPDATED AT 2:13 PM EASTERN ON OCT. 10

    OVERVIEW

    2019 had the most mass killings since at least the 1970s, according to the Associated Press/USA TODAY/Northeastern University Mass Killings Database.

    In all, there were 45 mass killings, defined as when four or more people are killed excluding the perpetrator. Of those, 33 were mass shootings . This summer was especially violent, with three high-profile public mass shootings occurring in the span of just four weeks, leaving 38 killed and 66 injured.

    A total of 229 people died in mass killings in 2019.

    The AP's analysis found that more than 50% of the incidents were family annihilations, which is similar to prior years. Although they are far less common, the 9 public mass shootings during the year were the most deadly type of mass murder, resulting in 73 people's deaths, not including the assailants.

    One-third of the offenders died at the scene of the killing or soon after, half from suicides.

    About this Dataset

    The Associated Press/USA TODAY/Northeastern University Mass Killings database tracks all U.S. homicides since 2006 involving four or more people killed (not including the offender) over a short period of time (24 hours) regardless of weapon, location, victim-offender relationship or motive. The database includes information on these and other characteristics concerning the incidents, offenders, and victims.

    The AP/USA TODAY/Northeastern database represents the most complete tracking of mass murders by the above definition currently available. Other efforts, such as the Gun Violence Archive or Everytown for Gun Safety may include events that do not meet our criteria, but a review of these sites and others indicates that this database contains every event that matches the definition, including some not tracked by other organizations.

    This data will be updated periodically and can be used as an ongoing resource to help cover these events.

    Using this Dataset

    To get basic counts of incidents of mass killings and mass shootings by year nationwide, use these queries:

    Mass killings by year

    Mass shootings by year

    To get these counts just for your state:

    Filter killings by state

    Definition of "mass murder"

    Mass murder is defined as the intentional killing of four or more victims by any means within a 24-hour period, excluding the deaths of unborn children and the offender(s). The standard of four or more dead was initially set by the FBI.

    This definition does not exclude cases based on method (e.g., shootings only), type or motivation (e.g., public only), victim-offender relationship (e.g., strangers only), or number of locations (e.g., one). The time frame of 24 hours was chosen to eliminate conflation with spree killers, who kill multiple victims in quick succession in different locations or incidents, and to satisfy the traditional requirement of occurring in a “single incident.”

    Offenders who commit mass murder during a spree (before or after committing additional homicides) are included in the database, and all victims within seven days of the mass murder are included in the victim count. Negligent homicides related to driving under the influence or accidental fires are excluded due to the lack of offender intent. Only incidents occurring within the 50 states and Washington D.C. are considered.

    Methodology

    Project researchers first identified potential incidents using the Federal Bureau of Investigation’s Supplementary Homicide Reports (SHR). Homicide incidents in the SHR were flagged as potential mass murder cases if four or more victims were reported on the same record, and the type of death was murder or non-negligent manslaughter.

    Cases were subsequently verified utilizing media accounts, court documents, academic journal articles, books, and local law enforcement records obtained through Freedom of Information Act (FOIA) requests. Each data point was corroborated by multiple sources, which were compiled into a single document to assess the quality of information.

    In case(s) of contradiction among sources, official law enforcement or court records were used, when available, followed by the most recent media or academic source.

    Case information was subsequently compared with every other known mass murder database to ensure reliability and validity. Incidents listed in the SHR that could not be independently verified were excluded from the database.

    Project researchers also conducted extensive searches for incidents not reported in the SHR during the time period, utilizing internet search engines, Lexis-Nexis, and Newspapers.com. Search terms include: [number] dead, [number] killed, [number] slain, [number] murdered, [number] homicide, mass murder, mass shooting, massacre, rampage, family killing, familicide, and arson murder. Offender, victim, and location names were also directly searched when available.

    This project started at USA TODAY in 2012.

    Contacts

    Contact AP Data Editor Justin Myers with questions, suggestions or comments about this dataset at jmyers@ap.org. The Northeastern University researcher working with AP and USA TODAY is Professor James Alan Fox, who can be reached at j.fox@northeastern.edu or 617-416-4400.

  12. Farmers Committed Suicide

    • kaggle.com
    Updated Jan 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jitesh Rathod (2024). Farmers Committed Suicide [Dataset]. https://www.kaggle.com/datasets/rathodjitesh/farmers-who-committed-suicide-state
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 18, 2024
    Dataset provided by
    Kaggle
    Authors
    Jitesh Rathod
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Report on Accidental Deaths & Suicides in India for relevant years, National Crime Records Bureau. Sl. No. (Serial Number): This column seems to represent a unique identifier or serial number assigned to each row in the table. It's used to distinguish different entries.

    State/UT (State/Union Territory):This column lists the names of different Indian states and union territories.

    Farmers/Cultivators - 2017 to 2021: These columns provide the count of farmers or cultivators for the respective years (2017 to 2021) for each State/UT. The numbers indicate the quantity of farmers or cultivators in each region for the specified years.

    37 Unique Values: It seems there are 37 unique values in the data, and the percentages in the "Other" categories indicate the distribution of the total values that fall outside the main categories.

    Label and Count: There are two sections with labels and counts. These sections seem to represent some categorization or grouping based on specific ranges. The "Label" represents a range, and the "Count" indicates how many entries fall into each range.

    Total (All India): This row at the bottom seems to provide the total count for all states and union territories in India for each respective year.

    NA (Not Available): Some entries have "NA" values, indicating that data is not available for those particular years or regions.

  13. Death in the United States

    • kaggle.com
    zip
    Updated Aug 3, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2017). Death in the United States [Dataset]. https://www.kaggle.com/datasets/cdc/mortality
    Explore at:
    zip(766333584 bytes)Available download formats
    Dataset updated
    Aug 3, 2017
    Dataset authored and provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    United States
    Description

    Every year the CDC releases the country’s most detailed report on death in the United States under the National Vital Statistics Systems. This mortality dataset is a record of every death in the country for 2005 through 2015, including detailed information about causes of death and the demographic background of the deceased.

    It's been said that "statistics are human beings with the tears wiped off." This is especially true with this dataset. Each death record represents somebody's loved one, often connected with a lifetime of memories and sometimes tragically too short.

    Putting the sensitive nature of the topic aside, analyzing mortality data is essential to understanding the complex circumstances of death across the country. The US Government uses this data to determine life expectancy and understand how death in the U.S. differs from the rest of the world. Whether you’re looking for macro trends or analyzing unique circumstances, we challenge you to use this dataset to find your own answers to one of life’s great mysteries.

    Overview

    This dataset is a collection of CSV files each containing one year's worth of data and paired JSON files containing the code mappings, plus an ICD 10 code set. The CSVs were reformatted from their original fixed-width file formats using information extracted from the CDC's PDF manuals using this script. Please note that this process may have introduced errors as the text extracted from the pdf is not a perfect match. If you have any questions or find errors in the preparation process, please leave a note in the forums. We hope to publish additional years of data using this method soon.

    A more detailed overview of the data can be found here. You'll find that the fields are consistent within this time window, but some of data codes change every few years. For example, the 113_cause_recode entry 069 only covers ICD codes (I10,I12) in 2005, but by 2015 it covers (I10,I12,I15). When I post data from years prior to 2005, expect some of the fields themselves to change as well.

    All data comes from the CDC’s National Vital Statistics Systems, with the exception of the Icd10Code, which are sourced from the World Health Organization.

    Project ideas

    • The CDC's mortality data was the basis of a widely publicized paper, by Anne Case and Nobel prize winner Angus Deaton, arguing that middle-aged whites are dying at elevated rates. One of the criticisms against the paper is that it failed to properly account for the exact ages within the broad bins available through the CDC's WONDER tool. What do these results look like with exact/not-binned age data?
    • Similarly, how sensitive are the mortality trends being discussed in the news to the choice of bin-widths?
    • As noted above, the data preparation process could have introduced errors. Can you find any discrepancies compared to the aggregate metrics on WONDER? If so, please let me know in the forums!
    • WONDER is cited in numerous economics, sociology, and public health research papers. Can you find any papers whose conclusions would be altered if they used the exact data available here rather than binned data from Wonder?

    Differences from the first version of the dataset

    • This version of the dataset was prepared in a completely different many. This has allowed us to provide a much larger volume of data and ensure that codes are available for every field.
    • We've replaced the batch of sql files with a single JSON per year. Kaggle's platform currently offer's better support for JSON files, and this keeps the number of files manageable.
    • A tutorial kernel providing a quick introduction to the new format is available here.
    • Lastly, I apologize if the transition has interrupted anyone's work! If need be, you can still download v1.
  14. Suicide occurrences, England and Wales

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Oct 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2025). Suicide occurrences, England and Wales [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/datasets/suicideinenglandandwales
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 3, 2025
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    Number of suicides and suicide rates, by sex and age, for England and Wales.

  15. C

    Chicago Suicides

    • data.cityofchicago.org
    csv, xlsx, xml
    Updated Oct 6, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Illinois Department of Public Health (IDPH) (2014). Chicago Suicides [Dataset]. https://data.cityofchicago.org/widgets/tit9-2u7i
    Explore at:
    xlsx, xml, csvAvailable download formats
    Dataset updated
    Oct 6, 2014
    Authors
    Illinois Department of Public Health (IDPH)
    Area covered
    Chicago
    Description

    This dataset contains the cumulative number of deaths, average number of deaths annually, average annual crude and adjusted death rates with corresponding 95% confidence intervals, and average annual years of potential life lost per 100,000 residents aged 75 and younger due to selected causes of death, by Chicago community area, for the years 2004 – 2008. A ranking for each measure is also provided, with the highest value indicated with a ranking of 1. See the full description at https://data.cityofchicago.org/api/assets/AEE362B2-8986-41D1-847E-B5A3ACAC5B76.

  16. Suicides in India during 2015

    • kaggle.com
    Updated Aug 22, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vidya Pb (2020). Suicides in India during 2015 [Dataset]. https://www.kaggle.com/vidyapb/suicides-in-india-during-2015/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 22, 2020
    Dataset provided by
    Kaggle
    Authors
    Vidya Pb
    Area covered
    India
    Description

    Context

    This dataset contains information on suicides which happened in India during 2015.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F4208638%2Ffab2e99b439f9780daf358511060f514%2FWorld-Suicide-Prevention-Day.jpg?generation=1598114750200382&alt=media" alt="">

    The singular age-old social precept of 'Lok Kya Kahenge?' (loosely translated: "What will people say?") suppresses the much-needed psychological care in India. It's high time that we understand why suicides happen and what are the reasons behind it. This dataset aims to spread awareness about suicides in India.

    Content

    I acquired this dataset from here. Have a look at the website.

    This dataset contains 9 files in .csv format. You can find a description for each column. Let me summarize it here as well.

    1. Cause-wise distribution of suicides in Central Armed Police Force (CAPF) during 2015.
    2. Economic Status-wise distribution of suicides during 2015.
    3. Educational Status-wise distribution of suicides during 2015.
    4. Farmer or Cultivators distribution of suicides during 2015.
    5. Profession-wise distribution of suicides during 2015.
    6. Social status-wise distribution of suicides during 2015.
    7. Cause-wise distribution of suicides during 2015.
    8. Suicides by Agricultural labourers during 2015.
    9. Suicides by means adopted during 2015.

    Inspiration

    We now have plenty of data to explore to draw some conclusions about suicides which happened in India during 2015. Let's start by answering these questions: - What are the top 5 states where Farmers' suicides occurred the most? - What's the top reason that agricultural labourers committed suicide? - Which Profession has the most suicides? What could be the reason? - How many Transgender suicides have occurred in different categories?

    I hope these questions interest you in starting to explore this dataset.

    Acknowledgements

    I thank the Indian Government for making it public under their Open Government Data (OGD) Platform India. Please use this dataset strictly for educational purposes. Thank you.

  17. e

    Deaths from Suicide

    • data.europa.eu
    csv, html
    Updated Nov 12, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lincolnshire County Council (2018). Deaths from Suicide [Dataset]. https://data.europa.eu/data/datasets/deaths-from-suicide?locale=en
    Explore at:
    html, csvAvailable download formats
    Dataset updated
    Nov 12, 2018
    Dataset authored and provided by
    Lincolnshire County Council
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows deaths (of people age 10 and over) from Suicide and Undetermined Injury, numbers and rates by gender, as 3-year moving-averages.

    Suicide is a significant cause of premature deaths occurring generally at younger ages than other common causes of premature mortality. It may also be seen as an indicator of underlying rates of mental ill-health.

    Directly Age-Standardised Rates (DASR) are shown in the data, where numbers are sufficient, so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates.

    The figures in this dataset include deaths recorded as suicide (people age 10 and over) and undetermined injury (age 15 and over) as those are mostly likely also to have been caused by self-harm rather than unverifiable accident, neglect or abuse. The population denominators for rates are age 10 and over. Low numbers may result in zero values or missing data.

    Data source: Office for Health Improvement and Disparities (OHID), Public Health Outcomes Framework (PHOF) indicator 41001 (E10). This data is updated annually.

  18. Data from: National Violent Death Reporting System (NVDRS)

    • catalog.data.gov
    • healthdata.gov
    • +4more
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention, Department of Health & Human Services (2025). National Violent Death Reporting System (NVDRS) [Dataset]. https://catalog.data.gov/dataset/national-violent-death-reporting-system-nvdrs
    Explore at:
    Dataset updated
    Aug 12, 2025
    Description

    The National Violent Death Reporting System (NVDRS) provides states and communities with a clearer understanding of violent deaths to guide local decisions about efforts to prevent violence and helps them track progress over time. To stop violent deaths, we must first understand all the facts. Created in 2002, the NVDRS is a surveillance system that pulls together data on violent deaths in 18 states (see map below), including information about homicides, such as homicides perpetrated by a intimate partner (e.g., boyfriend, girlfriend, wife, husband), child maltreatment (or child abuse) fatalities, suicides, deaths where individuals are killed by law enforcement in the line of duty, unintentional firearm injury deaths, and deaths of undetermined intent. These data are supported by WISQARS, an interactive query system that provides data on injury deaths, violent deaths, and nonfatal injuries.

  19. f

    Data Sheet 1_How many people die by suicide each year? Not 727,000: a...

    • frontiersin.figshare.com
    docx
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nicola Meda; Ludovica Angelozzi; Matteo Poletto; Angelo Patane’; Josephine Zammarrelli; Irene Slongo; Fabio Sambataro; Diego De Leo (2025). Data Sheet 1_How many people die by suicide each year? Not 727,000: a systematic review and meta-analysis of suicide underreporting across 71 countries over 122 years.docx [Dataset]. http://doi.org/10.3389/fpsyt.2025.1609580.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Aug 12, 2025
    Dataset provided by
    Frontiers
    Authors
    Nicola Meda; Ludovica Angelozzi; Matteo Poletto; Angelo Patane’; Josephine Zammarrelli; Irene Slongo; Fabio Sambataro; Diego De Leo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundSuicide underreporting undermines accurate public health assessments and resource allocation for suicide prevention. This study aims at synthesizing evidence on suicide underreporting and to estimate a global underreporting rate.MethodsWe conducted a PRISMA-compliant systematic review on suicide underreporting, following a pre-registered protocol. A meta-analytical synthesis was also conducted. Quantitative data from individual studies was extracted to provide an overall global estimate of suicide underreporting (42 studies covering 71 countries out of the initial 770 unique studies, spanning 1900–2021). Most studies used retrospective institutional datasets to estimate underreporting through reclassification of undetermined deaths or comparisons across databases. Demographic and geographic disparities were also examined.ResultsThe 42 studies selected provided some quantitative data on suicide underreporting for general or specific populations. 14 of these studies provided data to be meta-analyzed. The global suicide underreporting rate was estimated to be 17.9% (95% CI: 10.9–28.1%) with large differences between countries with high and low/very low data quality. In this scenario, the last WHO estimates of suicide deaths – corrected for underreporting – would be more than one million (1,000,638; 95% CI: 859,511–1,293,006) and not 727,000 suicides per year. Underreporting was higher in low- and middle-income countries (LMICs) with incomplete death registration systems, such as India and China (34.9%; 95% CI 20.3–53%), while high-income countries exhibited lower rates (11.5%; 95% CI 6.6–19.3%). Contributing factors included stigma, religiosity, limited forensic resources, and inconsistent use of International Classification of Diseases (ICD) codes. Gender and age disparities were notable; Female suicides and those among younger or older individuals were more likely to be misclassified.DiscussionAddressing suicide underreporting requires improving death registration systems globally, particularly in LMICs. Standardizing ICD usage, improving forensic capacity, and reducing stigma are critical steps to ensure accurate data. Heterogeneity, geographical disparities, temporal biases, and invariance of suicide underreporting for countries with low-quality data demand further corroboration of these findings.Systematic Review Registrationhttps://osf.io/9j8dg.

  20. E

    Suicides in Scotland 1982-2009

    • find.data.gov.scot
    • dtechtive.com
    xml, zip
    Updated Feb 21, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Edinburgh (2017). Suicides in Scotland 1982-2009 [Dataset]. http://doi.org/10.7488/ds/1799
    Explore at:
    xml(0.0048 MB), zip(30.31 MB)Available download formats
    Dataset updated
    Feb 21, 2017
    Dataset provided by
    University of Edinburgh
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Area covered
    Scotland
    Description

    This group of datasets describe the suicides in Scotland for the period 1982-2009. There are 4 separate datasets: All Suicides/Male Suicides/Female Suicides/All Suicide Rate (expressed per 100,000 people). The data is broken down into Local Authority Areas making it easier to investigate any spatial disparity in the suicide figures. A couple of points are worth noting are that it is unclear if the suicide data shows all suicides or just those of Adults. A recent Scottish Government report(http://www.scotland.gov.uk/Publications/2007/03/01145422/20) used deaths of people over 15 years old. Differences in the rates between this data and the results presented in the Scottish Government report may also be due to different population datasets being used. Suicide data sources form the Scottish Public Health Observatory (http://www.scotpho.org.uk/home/Healthwell-beinganddisease/suicide/suicide_data/suicide_la.asp) and the population data used to calculate the rates was sourced from ShareGeo Open (http://hdl.handle.net/10672/95) which uses mid-year estimates downloaded from Nomis (www.nomisweb.co.uk/. Datasets were joined to Local Authority (district, unitary authority and borough) boundaries downloaded from Ordnance Survey OpenData Boundary Line dataset. All spatial analysis was carried out in ArcGIS. GIS vector data. This dataset was first accessioned in the EDINA ShareGeo Open repository on 2011-01-13 and migrated to Edinburgh DataShare on 2017-02-21.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Filip Zoubek; Filip Zoubek (2021). Effect of suicide rates on life expectancy dataset [Dataset]. http://doi.org/10.5281/zenodo.4694270
Organization logo

Effect of suicide rates on life expectancy dataset

Explore at:
csvAvailable download formats
Dataset updated
Apr 16, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Filip Zoubek; Filip Zoubek
License

Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
License information was derived automatically

Description

Effect of suicide rates on life expectancy dataset

Abstract
In 2015, approximately 55 million people died worldwide, of which 8 million committed suicide. In the USA, one of the main causes of death is the aforementioned suicide, therefore, this experiment is dealing with the question of how much suicide rates affects the statistics of average life expectancy.
The experiment takes two datasets, one with the number of suicides and life expectancy in the second one and combine data into one dataset. Subsequently, I try to find any patterns and correlations among the variables and perform statistical test using simple regression to confirm my assumptions.

Data

The experiment uses two datasets - WHO Suicide Statistics[1] and WHO Life Expectancy[2], which were firstly appropriately preprocessed. The final merged dataset to the experiment has 13 variables, where country and year are used as index: Country, Year, Suicides number, Life expectancy, Adult Mortality, which is probability of dying between 15 and 60 years per 1000 population, Infant deaths, which is number of Infant Deaths per 1000 population, Alcohol, which is alcohol, recorded per capita (15+) consumption, Under-five deaths, which is number of under-five deaths per 1000 population, HIV/AIDS, which is deaths per 1 000 live births HIV/AIDS, GDP, which is Gross Domestic Product per capita, Population, Income composition of resources, which is Human Development Index in terms of income composition of resources, and Schooling, which is number of years of schooling.

LICENSE

THE EXPERIMENT USES TWO DATASET - WHO SUICIDE STATISTICS AND WHO LIFE EXPECTANCY, WHICH WERE COLLEECTED FROM WHO AND UNITED NATIONS WEBSITE. THEREFORE, ALL DATASETS ARE UNDER THE LICENSE ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE 3.0 IGO (https://creativecommons.org/licenses/by-nc-sa/3.0/igo/).

[1] https://www.kaggle.com/szamil/who-suicide-statistics

[2] https://www.kaggle.com/kumarajarshi/life-expectancy-who

Search
Clear search
Close search
Google apps
Main menu