Facebook
TwitterData Description: Since 1800, more than 37 million people worldwide have died while actively fighting in wars.
The number would be much higher still if it also considered the civilians who died due to the fighting, the increased number of deaths from hunger and disease resulting from these conflicts, and the deaths in smaller conflicts that are not considered wars.
Wars are also terrible in many other ways: they make peopleâs lives insecure, lower their living standards, destroy the environment, and, if fought between countries armed with nuclear weapons, can be an existential threat to humanity.
Looking at the news alone, it can be difficult to understand whether more or less people are dying as a result of war than in the past. One has to rely on statistics that are carefully collected so that they can be compared over time.
How many wars are avoided, and whether the trend of fewer deaths in them continues, is up to our own actions. Conflict deaths recently increased in the Middle East, Africa, and Europe, stressing that the future of these trends is uncertain.
In this dataset, there are 6 csv files in one zip one. Everything is clear but if you have any question, feel free to ask. Good luck.
This dataset belongs to Ourworldindata By: Bastian Herre, Lucas RodĂŠs-Guirao, Max Roser, Joe Hasell and Bobbie Macdonald
Facebook
TwitterThis dataset contains counts of deaths for California as a whole based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.
The final data tables include both deaths that occurred in California regardless of the place of residence (by occurrence) and deaths to California residents (by residence), whereas the provisional data table only includes deaths that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.
The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.
Facebook
Twitterhttps://github.com/nytimes/covid-19-data/blob/master/LICENSEhttps://github.com/nytimes/covid-19-data/blob/master/LICENSE
The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.
Since the first reported coronavirus case in Washington State on Jan. 21, 2020, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.
We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.
The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.
Facebook
TwitterThis dataset contains counts of deaths for California counties based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.
The final data tables include both deaths that occurred in each California county regardless of the place of residence (by occurrence) and deaths to residents of each California county (by residence), whereas the provisional data table only includes deaths that occurred in each county regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.
The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.
Facebook
TwitterTotal deaths for Maryland and its jurisdictions are derived from the U.S. Census Bureauâs Population Estimates Program. These estimates reflect revisions to the entire time series, beginning with the estimate base of April 1, 2020, through July 1 of the current year (referred to as the 'vintage year,' or V2024). Each time series incorporates updated administrative records, geographic boundary changes, and methodological improvements. The data is updated annually. Source: U.S. Census Bureau, Population Estimates Program, March 2025.
Facebook
TwitterU.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Note: DPH is updating and streamlining the COVID-19 cases, deaths, and testing data. As of 6/27/2022, the data will be published in four tables instead of twelve.
The COVID-19 Cases, Deaths, and Tests by Day dataset contains cases and test data by date of sample submission. The death data are by date of death. This dataset is updated daily and contains information back to the beginning of the pandemic. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-Cases-Deaths-and-Tests-by-Day/g9vi-2ahj.
The COVID-19 State Metrics dataset contains over 93 columns of data. This dataset is updated daily and currently contains information starting June 21, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-State-Level-Data/qmgw-5kp6 .
The COVID-19 County Metrics dataset contains 25 columns of data. This dataset is updated daily and currently contains information starting June 16, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-County-Level-Data/ujiq-dy22 .
The COVID-19 Town Metrics dataset contains 16 columns of data. This dataset is updated daily and currently contains information starting June 16, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-Town-Level-Data/icxw-cada . To protect confidentiality, if a town has fewer than 5 cases or positive NAAT tests over the past 7 days, those data will be suppressed.
Count of COVID-19-associated deaths by date of death. Deaths reported to either the OCME or DPH are included in the COVID-19 data. COVID-19-associated deaths include persons who tested positive for COVID-19 around the time of death and persons who were not tested for COVID-19 whose death certificate lists COVID-19 disease as a cause of death or a significant condition contributing to death.
Data on Connecticut deaths were obtained from the Connecticut Deaths Registry maintained by the DPH Office of Vital Records. Cause of death was determined by a death certifier (e.g., physician, APRN, medical examiner) using their best clinical judgment. Additionally, all COVID-19 deaths, including suspected or related, are required to be reported to OCME. On April 4, 2020, CT DPH and OCME released a joint memo to providers and facilities within Connecticut providing guidelines for certifying deaths due to COVID-19 that were consistent with the CDCâs guidelines and a reminder of the required reporting to OCME.25,26 As of July 1, 2021, OCME had reviewed every case reported and performed additional investigation on about one-third of reported deaths to better ascertain if COVID-19 did or did not cause or contribute to the death. Some of these investigations resulted in the OCME performing postmortem swabs for PCR testing on individuals whose deaths were suspected to be due to COVID-19, but antemortem diagnosis was unable to be made.31 The OCME issued or re-issued about 10% of COVID-19 death certificates and, when appropriate, removed COVID-19 from the death certificate. For standardization and tabulation of mortality statistics, written cause of death statements made by the certifiers on death certificates are sent to the National Center for Health Statistics (NCHS) at the CDC which assigns cause of death codes according to the International Causes of Disease 10th Revision (ICD-10) classification system.25,26 COVID-19 deaths in this report are defined as those for which the death certificate has an ICD-10 code of U07.1 as either a primary (underlying) or a contributing cause of death. More information on COVID-19 mortality can be found at the following link: https://portal.ct.gov/DPH/Health-Information-Systems--Reporting/Mortality/Mortality-Statistics
Note the counts in this dataset may vary from the death counts in the other COVID-19-related datasets published on data.ct.gov, where deaths are counted on the date reported rather than the date of death.
Starting in July 2020, this dataset will be updated every weekday. Data are subject to future revision as reporting changes.
Facebook
TwitterThe American Civil War is the conflict with the largest number of American military fatalities in history. In fact, the Civil War's death toll is comparable to all other major wars combined, the deadliest of which were the World Wars, which have a combined death toll of more than 520,000 American fatalities. The ongoing series of conflicts and interventions in the Middle East and North Africa, collectively referred to as the War on Terror in the west, has a combined death toll of more than 7,000 for the U.S. military since 2001. Other records In terms of the number of deaths per day, the American Civil War is still at the top, with an average of 425 deaths per day, while the First and Second World Wars have averages of roughly 100 and 200 fatalities per day respectively. Technically, the costliest battle in U.S. military history was the Battle of Elsenborn Ridge, which was a part of the Battle of the Bulge in the Second World War, and saw upwards of 5,000 deaths over 10 days. However, the Battle of Gettysburg had more military fatalities of American soldiers, with almost 3,200 Union deaths and over 3,900 Confederate deaths, giving a combined total of more than 7,000. The Battle of Antietam is viewed as the bloodiest day in American military history, with over 3,600 combined fatalities and almost 23,000 total casualties on September 17, 1862. Revised Civil War figures For more than a century, the total death toll of the American Civil War was generally accepted to be around 620,000, a number which was first proposed by Union historians William F. Fox and Thomas L. Livermore in 1888. This number was calculated by using enlistment figures, battle reports, and census data, however many prominent historians since then have thought the number should be higher. In 2011, historian J. David Hacker conducted further investigations and claimed that the number was closer to 750,000 (and possibly as high as 850,000). While many Civil War historians agree that this is possible, and even likely, obtaining consistently accurate figures has proven to be impossible until now; both sides were poor at keeping detailed records throughout the war, and much of the Confederacy's records were lost by the war's end. Many Confederate widows also did not register their husbands death with the authorities, as they would have then been ineligible for benefits.
Facebook
TwitterThe Black Death was the largest and deadliest pandemic of Yersinia pestis recorded in human history, and likely the most infamous individual pandemic ever documented. The plague originated in the Eurasian Steppes, before moving with Mongol hordes to the Black Sea, where it was then brought by Italian merchants to the Mediterranean. From here, the Black Death then spread to almost all corners of Europe, the Middle East, and North Africa. While it was never endemic to these regions, it was constantly re-introduced via trade routes from Asia (such as the Silk Road), and plague was present in Western Europe until the seventeenth century, and the other regions until the nineteenth century. Impact on Europe In Europe, the major port cities and metropolitan areas were hit the hardest. The plague spread through south-western Europe, following the arrival of Italian galleys in Sicily, Genoa, Venice, and Marseilles, at the beginning of 1347. It is claimed that Venice, Florence, and Siena lost up to two thirds of their total population during epidemic's peak, while London, which was hit in 1348, is said to have lost at least half of its population. The plague then made its way around the west of Europe, and arrived in Germany and Scandinavia in 1348, before travelling along the Baltic coast to Russia by 1351 (although data relating to the death tolls east of Germany is scarce). Some areas of Europe remained untouched by the plague for decades; for example, plague did not arrive in Iceland until 1402, however it swept across the island with devastating effect, causing the population to drop from 120,000 to 40,000 within two years. Reliability While the Black Death affected three continents, there is little recorded evidence of its impact outside of Southern or Western Europe. In Europe, however, many sources conflict and contrast with one another, often giving death tolls exceeding the estimated population at the time (such as London, where the death toll is said to be three times larger than the total population). Therefore, the precise death tolls remain uncertain, and any figures given should be treated tentatively.
Facebook
TwitterA. SUMMARY This dataset shows San Francisco COVID-19 deaths by population characteristics. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available.âŻBecause of this, death totals may increase or decrease.
Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how deaths have been distributed among different subgroups. This information can reveal trends and disparities among groups.
B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death certificates are maintained by the California Department of Public Health.
Data on the population characteristics of COVID-19 deaths are from: *Case reports *Medical records *Electronic lab reports *Death certificates
Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.
To protect resident privacy, we summarize COVID-19 data by only one population characteristic at a time. Data are not shown until cumulative citywide deaths reach five or more.
Data notes on select population characteristic types are listed below.
Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases.
Gender * The City collects information on gender identity using these guidelines.
C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.
Dataset will not update on the business day following any federal holiday.
D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a dataset based on the San Francisco Population and Demographic Census dataset.These population estimates are from the 2018-2022 5-year American Community Survey (ACS).
This dataset includes several characteristic types. Filter the âCharacteristic Typeâ column to explore a topic area. Then, the âCharacteristic Groupâ column shows each group or category within that topic area and the number of cumulative deaths.
Cumulative deaths are the running total of all San Francisco COVID-19 deaths in that characteristic group up to the date listed.
To explore data on the total number of deaths, use the COVID-19 Deaths Over Time dataset.
E. CHANGE LOG
Facebook
TwitterThis file contains COVID-19 death counts, death rates, and percent of total deaths by jurisdiction of residence. The data is grouped by different time periods including 3-month period, weekly, and total (cumulative since January 1, 2020). United States death counts and rates include the 50 states, plus the District of Columbia and New York City. New York state estimates exclude New York City. Puerto Rico is included in HHS Region 2 estimates. Deaths with confirmed or presumed COVID-19, coded to ICDâ10 code U07.1. Number of deaths reported in this file are the total number of COVID-19 deaths received and coded as of the date of analysis and may not represent all deaths that occurred in that period. Counts of deaths occurring before or after the reporting period are not included in the file. Data during recent periods are incomplete because of the lag in time between when the death occurred and when the death certificate is completed, submitted to NCHS and processed for reporting purposes. This delay can range from 1 week to 8 weeks or more, depending on the jurisdiction and cause of death. Death counts should not be compared across states. Data timeliness varies by state. Some states report deaths on a daily basis, while other states report deaths weekly or monthly. The ten (10) United States Department of Health and Human Services (HHS) regions include the following jurisdictions. Region 1: Connecticut, Maine, Massachusetts, New Hampshire, Rhode Island, Vermont; Region 2: New Jersey, New York, New York City, Puerto Rico; Region 3: Delaware, District of Columbia, Maryland, Pennsylvania, Virginia, West Virginia; Region 4: Alabama, Florida, Georgia, Kentucky, Mississippi, North Carolina, South Carolina, Tennessee; Region 5: Illinois, Indiana, Michigan, Minnesota, Ohio, Wisconsin; Region 6: Arkansas, Louisiana, New Mexico, Oklahoma, Texas; Region 7: Iowa, Kansas, Missouri, Nebraska; Region 8: Colorado, Montana, North Dakota, South Dakota, Utah, Wyoming; Region 9: Arizona, California, Hawaii, Nevada; Region 10: Alaska, Idaho, Oregon, Washington. Rates were calculated using the population estimates for 2021, which are estimated as of July 1, 2021 based on the Blended Base produced by the US Census Bureau in lieu of the April 1, 2020 decennial population count. The Blended Base consists of the blend of Vintage 2020 postcensal population estimates, 2020 Demographic Analysis Estimates, and 2020 Census PL 94-171 Redistricting File (see https://www2.census.gov/programs-surveys/popest/technical-documentation/methodology/2020-2021/methods-statement-v2021.pdf). Rates are based on deaths occurring in the specified week/month and are age-adjusted to the 2000 standard population using the direct method (see https://www.cdc.gov/nchs/data/nvsr/nvsr70/nvsr70-08-508.pdf). These rates differ from annual age-adjusted rates, typically presented in NCHS publications based on a full year of data and annualized weekly/monthly age-adjusted rates which have been adjusted to allow comparison with annual rates. Annualization rates presents deaths per year per 100,000 population that would be expected in a year if the observed period specific (weekly/monthly) rate prevailed for a full year. Sub-national death counts between 1-9 are suppressed in accordance with NCHS data confidentiality standards. Rates based on death counts less than 20 are suppressed in accordance with NCHS standards of reliability as specified in NCHS Data Presentation Standards for Proportions (available from: https://www.cdc.gov/nchs/data/series/sr_02/sr02_175.pdf.).
Facebook
Twitterhttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
Subjected dataset is extracted using world bank and UN websites to find population collapse according to countries and regions. The code generates data for seven indicators based on the current date and is available from Year 2000 to the year 2021.
This code is useful for research purposes, there are nine distinct CSV files associated with this code, seven of them deals with indicators, one CSV file pertaining to country groups and last CSV file is analysis for 20 years between seven indicators. Below are seven indicators extracted from the world bank and the United Nations websites.
Total Population, Population Growth, Life Expectancy at birth, Fertility Rate, Death Rate (per 1,000 people)), Birth Rate (per 1,000 people), Median Age
Population collapse is calculated using Total Population, Population Growth, Life Expectancy at birth, Fertility Rate, Death Rate, Birth Rate and Median Age, for that various criteria were applied to extract data:
The data was filtered based on several attributes, first ids and title has been extracted from the world bank data then timeframe and columns provided to extract data. This filtering process ensured that only relevant data meeting the specified criteria. For median age UN website is used and data is extracted for all countries. Median age data is not available for groups or regions; however, it could be calculated as median age data is available for all countries of the globe.
Variables: Economy, Seven Indicators Years from 2000 to 2021
For country group files, all countries are assigned according to regions, groups, by lending, by income, etc. so for this file each country is repeated as one country is member of more than one group.
Below screenshot is extracted for those countries whose population does fall in 20 years and death rate is increased while birth rate is decrease. So, for instance Ukraine population in Year 2002 was 48.2M while as per Year 2021 there population is decreased by 9% to 43.8M, similarly there death rate is increase from 15.7 to 18.5 (per 1000 people) and birth rate is decrease by 10% from 8.10 to 7.30.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15657145%2Ffeaff87cec8a478065eb06229045d7f1%2FPopulation%20Collapse.JPG?generation=1691841930935324&alt=media" alt="">
Facebook
TwitterBy Data Society [source]
This dataset contains key demographic, health status indicators and leading cause of death data to help us understand the current trends and health outcomes in communities across the United States. By looking at this data, it can be seen how different states, counties and populations have changed over time. With this data we can analyze levels of national health services use such as vaccination rates or mammography rates; review leading causes of death to create public policy initiatives; as well as identify risk factors for specific conditions that may be associated with certain populations or regions. The information from these files includes State FIPS Code, County FIPS Code, CHSI County Name, CHSI State Name, CHSI State Abbreviation, Influenza B (FluB) report count & expected cases rate per 100K population , Hepatitis A (HepA) Report Count & expected cases rate per 100K population , Hepatitis B (HepB) Report Count & expected cases rate per 100K population , Measles (Meas) Report Count & expected cases rate per 100K population , Pertussis(Pert) Report Count & expected case rate per 100K population , CRS report count & expected case rate per 100K population , Syphilis report count and expected case rate per 100k popuation. We also look at measures related to preventive care services such as Pap smear screen among women aged 18-64 years old check lower/upper confidence intervals seperately ; Mammogram checks among women aged 40-64 years old specified lower/upper conifence intervals separetly ; Colonosopy/ Proctoscpushy among men aged 50+ measured in lower/upper limits ; Pneumonia Vaccination amongst 65+ with loewr/upper confidence level detail Additionally we have some interesting trend indicating variables like measures of birth adn death which includes general fertility ratye ; Teen Birth Rate by Mother's age group etc Summary Measures covers mortality trend following life expectancy by sex&age categories Vressionable populations access info gives us insight into disablilty ratio + access to envtiromental issues due to poor quality housing facilities Finally Risk Factors cover speicfic hoslitic condtiions suchs asthma diagnosis prevelance cancer diabetes alcholic abuse smoking trends All these information give a good understanding on Healthy People 2020 target setings demograpihcally speaking hence will aid is generating more evience backed policies
For more datasets, click here.
- đ¨ Your notebook can be here! đ¨!
What the Dataset Contains
This dataset contains valuable information about public health relevant to each county in the United States, broken down into 9 indicator domains: Demographics, Leading Causes of Death, Summary Measures of Health, Measures of Birth and Death Rates, Relative Health Importance, Vulnerable Populations and Environmental Health Conditions, Preventive Services Use Data from BRFSS Survey System Data , Risk Factors and Access to Care/Health Insurance Coverage & State Developed Types of Measurements such as CRS with Multiple Categories Identified for Each Type . The data includes indicators such as percentages or rates for influenza (FLU), hepatitis (HepA/B), measles(MEAS) pertussis(PERT), syphilis(Syphilis) , cervical cancer (CI_Min_Pap_Smear - CI_Max\Pap \Smear), breast cancer (CI\Min Mammogram - CI \Max \Mammogram ) proctoscopy (CI Min Proctoscopy - CI Max Proctoscopy ), pneumococcal vaccinations (Ci min Pneumo Vax - Ci max Pneumo Vax )and flu vaccinations (Ci min Flu Vac - Ci Max Flu Vac). Additionally , it provides information on leading causes of death at both county levels & national level including age-adjusted mortality rates due to suicide among teens aged between 15-19 yrs per 100000 population etc.. Furthermore , summary measures such as age adjusted percentage who consider their physical health fair or poor are provided; vulnerable populations related indicators like relative importance score for disabled adults ; preventive service use related ones ranging from self reported vaccination coverage among men40-64 yrs old against hepatitis B virus etc...
Getting Started With The Dataset
To get started with exploring this dataset first your need to understand what each column in the table represents: State FIPS Code identifies a unique identifier used by various US government agencies which denote states . County FIPS code denotes counties wi...
Facebook
TwitterTHIS DATASET WAS LAST UPDATED AT 7:11 AM EASTERN ON DEC. 1
2019 had the most mass killings since at least the 1970s, according to the Associated Press/USA TODAY/Northeastern University Mass Killings Database.
In all, there were 45 mass killings, defined as when four or more people are killed excluding the perpetrator. Of those, 33 were mass shootings . This summer was especially violent, with three high-profile public mass shootings occurring in the span of just four weeks, leaving 38 killed and 66 injured.
A total of 229 people died in mass killings in 2019.
The AP's analysis found that more than 50% of the incidents were family annihilations, which is similar to prior years. Although they are far less common, the 9 public mass shootings during the year were the most deadly type of mass murder, resulting in 73 people's deaths, not including the assailants.
One-third of the offenders died at the scene of the killing or soon after, half from suicides.
The Associated Press/USA TODAY/Northeastern University Mass Killings database tracks all U.S. homicides since 2006 involving four or more people killed (not including the offender) over a short period of time (24 hours) regardless of weapon, location, victim-offender relationship or motive. The database includes information on these and other characteristics concerning the incidents, offenders, and victims.
The AP/USA TODAY/Northeastern database represents the most complete tracking of mass murders by the above definition currently available. Other efforts, such as the Gun Violence Archive or Everytown for Gun Safety may include events that do not meet our criteria, but a review of these sites and others indicates that this database contains every event that matches the definition, including some not tracked by other organizations.
This data will be updated periodically and can be used as an ongoing resource to help cover these events.
To get basic counts of incidents of mass killings and mass shootings by year nationwide, use these queries:
To get these counts just for your state:
Mass murder is defined as the intentional killing of four or more victims by any means within a 24-hour period, excluding the deaths of unborn children and the offender(s). The standard of four or more dead was initially set by the FBI.
This definition does not exclude cases based on method (e.g., shootings only), type or motivation (e.g., public only), victim-offender relationship (e.g., strangers only), or number of locations (e.g., one). The time frame of 24 hours was chosen to eliminate conflation with spree killers, who kill multiple victims in quick succession in different locations or incidents, and to satisfy the traditional requirement of occurring in a âsingle incident.â
Offenders who commit mass murder during a spree (before or after committing additional homicides) are included in the database, and all victims within seven days of the mass murder are included in the victim count. Negligent homicides related to driving under the influence or accidental fires are excluded due to the lack of offender intent. Only incidents occurring within the 50 states and Washington D.C. are considered.
Project researchers first identified potential incidents using the Federal Bureau of Investigationâs Supplementary Homicide Reports (SHR). Homicide incidents in the SHR were flagged as potential mass murder cases if four or more victims were reported on the same record, and the type of death was murder or non-negligent manslaughter.
Cases were subsequently verified utilizing media accounts, court documents, academic journal articles, books, and local law enforcement records obtained through Freedom of Information Act (FOIA) requests. Each data point was corroborated by multiple sources, which were compiled into a single document to assess the quality of information.
In case(s) of contradiction among sources, official law enforcement or court records were used, when available, followed by the most recent media or academic source.
Case information was subsequently compared with every other known mass murder database to ensure reliability and validity. Incidents listed in the SHR that could not be independently verified were excluded from the database.
Project researchers also conducted extensive searches for incidents not reported in the SHR during the time period, utilizing internet search engines, Lexis-Nexis, and Newspapers.com. Search terms include: [number] dead, [number] killed, [number] slain, [number] murdered, [number] homicide, mass murder, mass shooting, massacre, rampage, family killing, familicide, and arson murder. Offender, victim, and location names were also directly searched when available.
This project started at USA TODAY in 2012.
Contact AP Data Editor Justin Myers with questions, suggestions or comments about this dataset at jmyers@ap.org. The Northeastern University researcher working with AP and USA TODAY is Professor James Alan Fox, who can be reached at j.fox@northeastern.edu or 617-416-4400.
Facebook
TwitterNote: This COVID-19 data set is no longer being updated as of December 1, 2023. Access current COVID-19 data on the CDPH respiratory virus dashboard (https://www.cdph.ca.gov/Programs/CID/DCDC/Pages/Respiratory-Viruses/RespiratoryDashboard.aspx) or in open data format (https://data.chhs.ca.gov/dataset/respiratory-virus-dashboard-metrics).
As of August 17, 2023, data is being updated each Friday.
For death data after December 31, 2022, California uses Provisional Deaths from the Center for Disease Control and Preventionâs National Center for Health Statistics (NCHS) National Vital Statistics System (NVSS). Prior to January 1, 2023, death data was sourced from the COVID-19 registry. The change in data source occurred in July 2023 and was applied retroactively to all 2023 data to provide a consistent source of death data for the year of 2023.
As of May 11, 2023, data on cases, deaths, and testing is being updated each Thursday. Metrics by report date have been removed, but previous versions of files with report date metrics are archived below.
All metrics include people in state and federal prisons, US Immigration and Customs Enforcement facilities, US Marshal detention facilities, and Department of State Hospitals facilities. Members of California's tribal communities are also included.
The "Total Tests" and "Positive Tests" columns show totals based on the collection date. There is a lag between when a specimen is collected and when it is reported in this dataset. As a result, the most recent dates on the table will temporarily show NONE in the "Total Tests" and "Positive Tests" columns. This should not be interpreted as no tests being conducted on these dates. Instead, these values will be updated with the number of tests conducted as data is received.
Facebook
TwitterThis dataset of U.S. mortality trends since 1900 highlights the differences in age-adjusted death rates and life expectancy at birth by race and sex. Age-adjusted death rates (deaths per 100,000) after 1998 are calculated based on the 2000 U.S. standard population. Populations used for computing death rates for 2011â2017 are postcensal estimates based on the 2010 census, estimated as of July 1, 2010. Rates for census years are based on populations enumerated in the corresponding censuses. Rates for noncensus years between 2000 and 2010 are revised using updated intercensal population estimates and may differ from rates previously published. Data on age-adjusted death rates prior to 1999 are taken from historical data (see References below). Life expectancy data are available up to 2017. Due to changes in categories of race used in publications, data are not available for the black population consistently before 1968, and not at all before 1960. More information on historical data on age-adjusted death rates is available at https://www.cdc.gov/nchs/nvss/mortality/hist293.htm. SOURCES CDC/NCHS, National Vital Statistics System, historical data, 1900-1998 (see https://www.cdc.gov/nchs/nvss/mortality_historical_data.htm); CDC/NCHS, National Vital Statistics System, mortality data (see http://www.cdc.gov/nchs/deaths.htm); and CDC WONDER (see http://wonder.cdc.gov). REFERENCES National Center for Health Statistics, Data Warehouse. Comparability of cause-of-death between ICD revisions. 2008. Available from: http://www.cdc.gov/nchs/nvss/mortality/comparability_icd.htm. National Center for Health Statistics. Vital statistics data available. Mortality multiple cause files. Hyattsville, MD: National Center for Health Statistics. Available from: https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm. Kochanek KD, Murphy SL, Xu JQ, Arias E. Deaths: Final data for 2017. National Vital Statistics Reports; vol 68 no 9. Hyattsville, MD: National Center for Health Statistics. 2019. Available from: https://www.cdc.gov/nchs/data/nvsr/nvsr68/nvsr68_09-508.pdf. Arias E, Xu JQ. United States life tables, 2017. National Vital Statistics Reports; vol 68 no 7. Hyattsville, MD: National Center for Health Statistics. 2019. Available from: https://www.cdc.gov/nchs/data/nvsr/nvsr68/nvsr68_07-508.pdf. National Center for Health Statistics. Historical Data, 1900-1998. 2009. Available from: https://www.cdc.gov/nchs/nvss/mortality_historical_data.htm.
Facebook
TwitterNOTE: This dataset has been retired and marked as historical-only. This dataset is a companion to the COVID-19 Daily Cases and Deaths dataset (https://data.cityofchicago.org/d/naz8-j4nc). The major difference in this dataset is that the case, death, and hospitalization corresponding rates per 100,000 population are not those for the single date indicated. They are rolling averages for the seven-day period ending on that date. This rolling average is used to account for fluctuations that may occur in the data, such as fewer cases being reported on weekends, and small numbers. The intent is to give a more representative view of the ongoing COVID-19 experience, less affected by what is essentially noise in the data. All rates are per 100,000 population in the indicated group, or Chicago, as a whole, for âTotalâ columns. Only Chicago residents are included based on the home address as provided by the medical provider. Cases with a positive molecular (PCR) or antigen test are included in this dataset. Cases are counted based on the date the test specimen was collected. Deaths among cases are aggregated by day of death. Hospitalizations are reported by date of first hospital admission. Demographic data are based on what is reported by medical providers or collected by CDPH during follow-up investigation. Denominators are from the U.S. Census Bureau American Community Survey 1-year estimate for 2018 and can be seen in the Citywide, 2018 row of the Chicago Population Counts dataset (https://data.cityofchicago.org/d/85cm-7uqa). All data are provisional and subject to change. Information is updated as additional details are received and it is, in fact, very common for recent dates to be incomplete and to be updated as time goes on. At any given time, this dataset reflects cases and deaths currently known to CDPH. Numbers in this dataset may differ from other public sources due to definitions of COVID-19-related cases and deaths, sources used, how cases and deaths are associated to a specific date, and similar factors. Data Source: Illinois National Electronic Disease Surveillance System, Cook County Medical Examinerâs Office, U.S. Census Bureau American Community Survey
Facebook
TwitterA. SUMMARY This archived dataset includes data for population characteristics that are no longer being reported publicly. The date on which each population characteristic type was archived can be found in the field âdata_loaded_atâ. B. HOW THE DATASET IS CREATED Data on the population characteristics of COVID-19 cases are from:⯠* Case interviews⯠* Laboratories⯠* Medical providers⯠⯠These multiple streams of data are merged, deduplicated, and undergo data verification processes. ⯠Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases. * The population estimates for the "Other" or âMulti-racialâ groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups. Gender * The City collects information on gender identity using these guidelines. Skilled Nursing Facility (SNF) occupancy * A Skilled Nursing Facility (SNF) is a type of long-term care facility that provides care to individuals, generally in their 60s and older, who need functional assistance in their daily lives.⯠* This dataset includes data for COVID-19 cases reported in Skilled Nursing Facilities (SNFs) through 12/31/2022, archived on 1/5/2023. These data were identified where âCharacteristic_Typeâ = âSkilled Nursing Facility Occupancyâ. Sexual orientation * The City began asking adults 18 years old or older for their sexual orientation identification during case interviews as of April 28, 2020. Sexual orientation data prior to this date is unavailable. * The City doesnât collect or report information about sexual orientation for persons under 12 years of age. * Case investigation interviews transitioned to the California Department of Public Health, Virtual Assistant information gathering beginning December 2021. The Virtual Assistant is only sent to adults who are 18+ years old.âŻLearn more about our data collection guidelines pertaining to sexual orientation. Comorbidities * Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death. Homelessness Persons are identified as homeless based on several data sources: * self-reported living situation * the location at the time of testing * Department of Public Health homelessness and health databases * Residents in Single-Room Occupancy hotels are not included in these figures. These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions. Single Room Occupancy (SRO) tenancy * SRO buildings are defined by the San Francisco Housing Code as having six or more "residential guest rooms" which may be attached to shared bathrooms, kitchens, and living spaces. * The details of a person's living arrangements are verified during case interviews. Transmission Type * Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown. C. UPDATE PROCESS This dataset has been archived and will no longer update as of 9/11/2023. D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco po
Facebook
TwitterThis is a chronology of deaths. Deaths of people and even non-humans are reported if they have their own Wikipedia article. The cause of death was not reported for all individuals, but the dataset still provides an interesting snapshot of some of the most famous deaths that occurred.
This dataset contains information about celebrity deaths that occurred. The data includes the name of the deceased, their age, and a short biography. The cause of death is also included if it was reported
- Look at the mortality rates of celebrities over time
- Research the causes of death for celebrities.
- Study the biography of celebrities
The columns in the dataset are: date of death, name, age, bio, cause of death
License
Unknown License - Please check the dataset description for more information.
File: celebrity_deaths_2016.csv | Column name | Description | |:-------------------|:---------------------------------------------------------| | date of death | The date on which the celebrity died. (Date) | | name | The name of the celebrity. (String) | | age | The age of the celebrity at the time of death. (Integer) | | bio | A short biography of the celebrity. (String) | | cause of death | The cause of death of the celebrity. (String) |
Facebook
TwitterEstimates for the total death count of the Second World War generally range somewhere between 70 and 85 million people. The Soviet Union suffered the highest number of fatalities of any single nation, with estimates mostly falling between 22 and 27 million deaths. China then suffered the second greatest, at around 20 million, although these figures are less certain and often overlap with the Chinese Civil War. Over 80 percent of all deaths were of those from Allied countries, and the majority of these were civilians. In contrast, 15 to 20 percent were among the Axis powers, and the majority of these were military deaths, as shown in the death ratios of Germany and Japan. Civilian deaths and atrocities It is believed that 60 to 67 percent of all deaths were civilian fatalities, largely resulting from war-related famine or disease, and war crimes or atrocities. Systematic genocide, extermination campaigns, and forced labor, particularly by the Germans, Japanese, and Soviets, led to the deaths of millions. In this regard, Nazi activities alone resulted in 17 million deaths, including six million Jews in what is now known as The Holocaust. Not only was the scale of the conflict larger than any that had come before, but the nature of and reasoning behind this loss make the Second World War stand out as one of the most devastating and cruelest conflicts in history. Problems with these statistics Although the war is considered by many to be the defining event of the 20th century, exact figures for death tolls have proven impossible to determine, for a variety of reasons. Countries such as the U.S. have fairly consistent estimates due to preserved military records and comparatively few civilian casualties, although figures still vary by source. For most of Europe, records are less accurate. Border fluctuations and the upheaval of the interwar period mean that pre-war records were already poor or non-existent for many regions. The rapid and chaotic nature of the war then meant that deaths could not be accurately recorded at the time, and mass displacement or forced relocation resulted in the deaths of many civilians outside of their homeland, which makes country-specific figures more difficult to find. Early estimates of the warâs fatalities were also taken at face value and formed the basis of many historical works; these were often very inaccurate, but the validity of the source means that the figures continue to be cited today, despite contrary evidence.
In comparison to Europe, estimate ranges are often greater across Asia, where populations were larger but pre-war data was in short supply. Many of the Asian countries with high death tolls were European colonies, and the actions of authorities in the metropoles, such as the diversion of resources from Asia to Europe, led to millions of deaths through famine and disease. Additionally, over one million African soldiers were drafted into Europeâs armies during the war, yet individual statistics are unavailable for most of these colonies or successor states (notably Algeria and Libya). Thousands of Asian and African military deaths went unrecorded or are included with European or Japanese figures, and there are no reliable figures for deaths of millions from countries across North Africa or East Asia. Additionally, many concentration camp records were destroyed, and such records in Africa and Asia were even sparser than in Europe. While the Second World War is one of the most studied academic topics of the past century, it is unlikely that we will ever have a clear number for the lives lost in the conflict.
Facebook
TwitterBy data.world's Admin [source]
Weâre asking how stories live on in digital spacesâand what that reveals about our broader cultural values. We invite you to consider this data from a socially critical perspective to explore which people get their stories honored long term via massive online attention⌠is it limited to celebrities or do more ordinary people have a place? What can we learn when we look closely at those who are memorialized through vast networks? Join us in attempting this open-ended conversation through exploring Wiki Deaths!
For more datasets, click here.
- đ¨ Your notebook can be here! đ¨!
This dataset contains information about notable deaths from 2015-2018, and can be used to analyze the public impact of these people after their deaths on Wikipedia.
Here are a few steps you can use to work with this dataset:
Explore the available variables - The dataset includes fields such as name, year of birth and death, pageviews and other related information which can be used to compare the impact of different individuals after their death.
Investigate differences between years - Use this data set to compare how public interest changes across years by looking at variables such as median pageviews after death.
Identify outliers - Take a look at maximum pageviews compared to median pageviews before death in order to identify individual cases that had particularly high increases in traffic or particularly dramatic falls in traffic following the person's death.
Analyze trends and patterns - Look through extracted HTML fields for specific patterns related to notable deaths from 2015-2018 in order to gain better understanding of what topics were popular during that time frame and where interest has been growing or declining since then
- Investigating the effects of trends on notable deaths and related pageview activity. For example, analyzing how the peak popularity of a celebrity within a certain year may impact their Wikipedia pageviews posthumously.
- Exploring the impact of social media campaigns surrounding a notable death and the potential increase in web traffic that follows during this period.
- Examining how display practices (e.g., smaller thumbnails, fewer clickable links) can influence user engagement with certain Wikipedia pages as well as collective memorialization behaviors post-death of notables on the platform
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: people.csv | Column name | Description | |:-----------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------| | link | The link to the Wikipedia page of the notable person. (String) | | name | The name of the notable person. (String) | | year_of_birth | The year the notable person was born. (Integer) | | year_of_death | The year the notable person died. (Integer) | | date_of_death | The date the notable person died. (Date) | | timestamp_of_death | The timestamp of the notable person's death. (Timestamp) | | median_views | The median pageviews of the notable person's Wikipedia page. (Integer) | | median_views_before | The median pageviews of the notable person's Wikiped...
Facebook
TwitterData Description: Since 1800, more than 37 million people worldwide have died while actively fighting in wars.
The number would be much higher still if it also considered the civilians who died due to the fighting, the increased number of deaths from hunger and disease resulting from these conflicts, and the deaths in smaller conflicts that are not considered wars.
Wars are also terrible in many other ways: they make peopleâs lives insecure, lower their living standards, destroy the environment, and, if fought between countries armed with nuclear weapons, can be an existential threat to humanity.
Looking at the news alone, it can be difficult to understand whether more or less people are dying as a result of war than in the past. One has to rely on statistics that are carefully collected so that they can be compared over time.
How many wars are avoided, and whether the trend of fewer deaths in them continues, is up to our own actions. Conflict deaths recently increased in the Middle East, Africa, and Europe, stressing that the future of these trends is uncertain.
In this dataset, there are 6 csv files in one zip one. Everything is clear but if you have any question, feel free to ask. Good luck.
This dataset belongs to Ourworldindata By: Bastian Herre, Lucas RodĂŠs-Guirao, Max Roser, Joe Hasell and Bobbie Macdonald