Facebook
TwitterData Description: Since 1800, more than 37 million people worldwide have died while actively fighting in wars.
The number would be much higher still if it also considered the civilians who died due to the fighting, the increased number of deaths from hunger and disease resulting from these conflicts, and the deaths in smaller conflicts that are not considered wars.
Wars are also terrible in many other ways: they make people’s lives insecure, lower their living standards, destroy the environment, and, if fought between countries armed with nuclear weapons, can be an existential threat to humanity.
Looking at the news alone, it can be difficult to understand whether more or less people are dying as a result of war than in the past. One has to rely on statistics that are carefully collected so that they can be compared over time.
How many wars are avoided, and whether the trend of fewer deaths in them continues, is up to our own actions. Conflict deaths recently increased in the Middle East, Africa, and Europe, stressing that the future of these trends is uncertain.
In this dataset, there are 6 csv files in one zip one. Everything is clear but if you have any question, feel free to ask. Good luck.
This dataset belongs to Ourworldindata By: Bastian Herre, Lucas Rodés-Guirao, Max Roser, Joe Hasell and Bobbie Macdonald
Facebook
TwitterThis dataset contains counts of deaths for California as a whole based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.
The final data tables include both deaths that occurred in California regardless of the place of residence (by occurrence) and deaths to California residents (by residence), whereas the provisional data table only includes deaths that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.
The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.
Facebook
Twitterhttps://github.com/nytimes/covid-19-data/blob/master/LICENSEhttps://github.com/nytimes/covid-19-data/blob/master/LICENSE
The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.
Since the first reported coronavirus case in Washington State on Jan. 21, 2020, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.
We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.
The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.
Facebook
TwitterThis dataset contains counts of deaths for California counties based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.
The final data tables include both deaths that occurred in each California county regardless of the place of residence (by occurrence) and deaths to residents of each California county (by residence), whereas the provisional data table only includes deaths that occurred in each county regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.
The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.
Facebook
TwitterBy Health [source]
This fascinating dataset takes a look at the leading causes of death in the United States from 1980-2009, broken down by sex, race, and Hispanic origin. This data sheds light on how mortality in the US has changed over time among these categories. Accounting for everything from heart disease to cancer to suicide, this insight can be used by health researchers and policy makers to gain a better understanding of disparities in healthcare and deaths across different groups. Whether studying questions related to public health or more targeted population issues such as gender biases in death rates, this dataset provides an important resource for anyone interested in examining mortality across demographic lines
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset can be used to explore some of the leading causes of death in the United States from 1980 to 2009, broken down by sex, race, and Hispanic origin. This data can be used to better understand mortality trends and risk factors associated with different populations in America.
By using this dataset you can compare and contrast mortality rates across different gender, racial, and ethnic groups during this time period. You can also compare different causes of death within these demographic categories to see if there are any patterns over time or notable differences between groups.
You could even use this data to track changes across population groups as a whole or look at details for specific years or types of causes of death in particular groups. With this information one may gain insight into health disparities across population segments in America— aiding advocates for social change & public policy shifts toward improved health outcomes for all Americans!
- Analyzing regional or state-level differences in mortality rates over time.
- Examining the beahvioral factors or risk factors associated with each cause of death for different genders and populations.
- Examining the prevalence of each cause of death as a proportion to an overall population trend in different socio-economic categories such as race or income level
If you use this dataset in your research, please credit the original authors. Data Source
License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.
File: Selected_Trend_Table_from_Health_United_States_2011._Leading_causes_of_death_and_numbers_of_deaths_by_sex_race_and_Hispanic_origin_United_States_1980_and_2009.csv | Column name | Description | |:-------------------|:---------------------------------------------------------------------------------------------------------| | Group | The group of people the cause of death applies to (e.g. men, women, whites, blacks, hispanics). (String) | | Year | The year the cause of death was recorded. (Integer) | | Cause of death | The cause of death. (String) | | Flag | A flag indicating whether the cause of death is considered a leading cause. (Boolean) | | Deaths | The number of deaths attributed to the cause of death. (Integer) |
If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Health.
Facebook
TwitterThe Black Death was the largest and deadliest pandemic of Yersinia pestis recorded in human history, and likely the most infamous individual pandemic ever documented. The plague originated in the Eurasian Steppes, before moving with Mongol hordes to the Black Sea, where it was then brought by Italian merchants to the Mediterranean. From here, the Black Death then spread to almost all corners of Europe, the Middle East, and North Africa. While it was never endemic to these regions, it was constantly re-introduced via trade routes from Asia (such as the Silk Road), and plague was present in Western Europe until the seventeenth century, and the other regions until the nineteenth century. Impact on Europe In Europe, the major port cities and metropolitan areas were hit the hardest. The plague spread through south-western Europe, following the arrival of Italian galleys in Sicily, Genoa, Venice, and Marseilles, at the beginning of 1347. It is claimed that Venice, Florence, and Siena lost up to two thirds of their total population during epidemic's peak, while London, which was hit in 1348, is said to have lost at least half of its population. The plague then made its way around the west of Europe, and arrived in Germany and Scandinavia in 1348, before travelling along the Baltic coast to Russia by 1351 (although data relating to the death tolls east of Germany is scarce). Some areas of Europe remained untouched by the plague for decades; for example, plague did not arrive in Iceland until 1402, however it swept across the island with devastating effect, causing the population to drop from 120,000 to 40,000 within two years. Reliability While the Black Death affected three continents, there is little recorded evidence of its impact outside of Southern or Western Europe. In Europe, however, many sources conflict and contrast with one another, often giving death tolls exceeding the estimated population at the time (such as London, where the death toll is said to be three times larger than the total population). Therefore, the precise death tolls remain uncertain, and any figures given should be treated tentatively.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Context:
This dataset provides data on death rates for suicide categorized by selected population characteristics including sex, race, Hispanic origin, and age in the United States. It includes critical information about measures, definitions, and changes over time.
Source: - NCHS, National Vital Statistics System (NVSS) - Grove RD, Hetzel AM. Vital statistics rates in the United States, 1940–1960. National Center for Health Statistics. 1968 - Numerator data from NVSS annual public-use Mortality Files - Denominator data from U.S. Census Bureau national population estimates - Murphy SL, Xu JQ, Kochanek KD, Arias E, Tejada-Vera B. Deaths: Final data for 2018. National Vital Statistics Reports; vol 69 no 13. Hyattsville, MD: National Center for Health Statistics. 2021
Source URLs:
Death rates for suicide by sex, race, Hispanic origin, and age: United States - HUS 2019 Data Finder - National Vital Statistics Reports - NVSS Appendix Entry
The dataset consists of data collected from the National Vital Statistics System (NVSS) and the U.S. Census Bureau, providing a comprehensive overview of suicide death rates across different demographics in the United States from 1950 to 2001.
| Column Name | Description |
|---|---|
| INDICATOR | Indicator for the data type, e.g., Death rate |
| UNIT | Unit of measurement, e.g., Deaths per 100,000 population |
| UNIT_NU | Numerical value representing the unit |
| STUB_NA | Stub name for category, e.g., Total |
| STUB_LA | Label for the stub category, e.g., All persons |
| STUB_LA_1 | Additional label information for the stub category |
| YEAR | The year the data was recorded |
| YEAR_NUM | Numerical value representing the year |
| AGE | Age group category, e.g., All ages |
| AGE_NUM | Numerical value representing the age group |
| ESTIMATE | Estimated death rate |
Facebook
TwitterTotal deaths for Maryland and its jurisdictions are derived from the U.S. Census Bureau’s Population Estimates Program. These estimates reflect revisions to the entire time series, beginning with the estimate base of April 1, 2020, through July 1 of the current year (referred to as the 'vintage year,' or V2024). Each time series incorporates updated administrative records, geographic boundary changes, and methodological improvements. The data is updated annually. Source: U.S. Census Bureau, Population Estimates Program, March 2025.
Facebook
TwitterU.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Note: DPH is updating and streamlining the COVID-19 cases, deaths, and testing data. As of 6/27/2022, the data will be published in four tables instead of twelve.
The COVID-19 Cases, Deaths, and Tests by Day dataset contains cases and test data by date of sample submission. The death data are by date of death. This dataset is updated daily and contains information back to the beginning of the pandemic. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-Cases-Deaths-and-Tests-by-Day/g9vi-2ahj.
The COVID-19 State Metrics dataset contains over 93 columns of data. This dataset is updated daily and currently contains information starting June 21, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-State-Level-Data/qmgw-5kp6 .
The COVID-19 County Metrics dataset contains 25 columns of data. This dataset is updated daily and currently contains information starting June 16, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-County-Level-Data/ujiq-dy22 .
The COVID-19 Town Metrics dataset contains 16 columns of data. This dataset is updated daily and currently contains information starting June 16, 2022 to the present. The data can be found at https://data.ct.gov/Health-and-Human-Services/COVID-19-Town-Level-Data/icxw-cada . To protect confidentiality, if a town has fewer than 5 cases or positive NAAT tests over the past 7 days, those data will be suppressed.
Count of COVID-19-associated deaths by date of death. Deaths reported to either the OCME or DPH are included in the COVID-19 data. COVID-19-associated deaths include persons who tested positive for COVID-19 around the time of death and persons who were not tested for COVID-19 whose death certificate lists COVID-19 disease as a cause of death or a significant condition contributing to death.
Data on Connecticut deaths were obtained from the Connecticut Deaths Registry maintained by the DPH Office of Vital Records. Cause of death was determined by a death certifier (e.g., physician, APRN, medical examiner) using their best clinical judgment. Additionally, all COVID-19 deaths, including suspected or related, are required to be reported to OCME. On April 4, 2020, CT DPH and OCME released a joint memo to providers and facilities within Connecticut providing guidelines for certifying deaths due to COVID-19 that were consistent with the CDC’s guidelines and a reminder of the required reporting to OCME.25,26 As of July 1, 2021, OCME had reviewed every case reported and performed additional investigation on about one-third of reported deaths to better ascertain if COVID-19 did or did not cause or contribute to the death. Some of these investigations resulted in the OCME performing postmortem swabs for PCR testing on individuals whose deaths were suspected to be due to COVID-19, but antemortem diagnosis was unable to be made.31 The OCME issued or re-issued about 10% of COVID-19 death certificates and, when appropriate, removed COVID-19 from the death certificate. For standardization and tabulation of mortality statistics, written cause of death statements made by the certifiers on death certificates are sent to the National Center for Health Statistics (NCHS) at the CDC which assigns cause of death codes according to the International Causes of Disease 10th Revision (ICD-10) classification system.25,26 COVID-19 deaths in this report are defined as those for which the death certificate has an ICD-10 code of U07.1 as either a primary (underlying) or a contributing cause of death. More information on COVID-19 mortality can be found at the following link: https://portal.ct.gov/DPH/Health-Information-Systems--Reporting/Mortality/Mortality-Statistics
Note the counts in this dataset may vary from the death counts in the other COVID-19-related datasets published on data.ct.gov, where deaths are counted on the date reported rather than the date of death.
Starting in July 2020, this dataset will be updated every weekday. Data are subject to future revision as reporting changes.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
A straightforward way to assess the health status of a population is to focus on mortality – or concepts like child mortality or life expectancy, which are based on mortality estimates. A focus on mortality, however, does not take into account that the burden of diseases is not only that they kill people, but that they cause suffering to people who live with them. Assessing health outcomes by both mortality and morbidity (the prevalent diseases) provides a more encompassing view on health outcomes. This is the topic of this entry. The sum of mortality and morbidity is referred to as the ‘burden of disease’ and can be measured by a metric called ‘Disability Adjusted Life Years‘ (DALYs).
DALYs are measuring lost health and are a standardized metric that allow for direct comparisons of disease burdens of different diseases across countries, between different populations, and over time. Conceptually, one DALY is the equivalent of losing one year in good health because of either premature death or disease or disability. One DALY represents one lost year of healthy life. The first ‘Global Burden of Disease’ (GBD) was GBD 1990 and the DALY metric was prominently featured in the World Bank’s 1993 World Development Report. Today it is published by both the researchers at the Institute of Health Metrics and Evaluation (IHME) and the ‘Disease Burden Unit’ at the World Health Organization (WHO), which was created in 1998. The IHME continues the work that was started in the early 1990s and publishes the Global Burden of Disease study.
In this Dataset, we have Historical Data of different cause of deaths for all ages around the World. The key features of this Dataset are: Meningitis, Alzheimer's Disease and Other Dementias, Parkinson's Disease, Nutritional Deficiencies, Malaria, Drowning, Interpersonal Violence, Maternal Disorders, HIV/AIDS, Drug Use Disorders, Tuberculosis, Cardiovascular Diseases, Lower Respiratory Infections, Neonatal Disorders, Alcohol Use Disorders, Self-harm, Exposure to Forces of Nature, Diarrheal Diseases, Environmental Heat and Cold Exposure, Neoplasms, Conflict and Terrorism, Diabetes Mellitus, Chronic Kidney Disease, Poisonings, Protein-Energy Malnutrition, Road Injuries, Chronic Respiratory Diseases, Cirrhosis and Other Chronic Liver Diseases, Digestive Diseases, Fire, Heat, and Hot Substances, Acute Hepatitis.
This Dataset is created from Our World in Data. This Dataset falls under open access under the Creative Commons BY license. You can check the FAQ for more informa...
Facebook
TwitterThe American Civil War is the conflict with the largest number of American military fatalities in history. In fact, the Civil War's death toll is comparable to all other major wars combined, the deadliest of which were the World Wars, which have a combined death toll of more than 520,000 American fatalities. The ongoing series of conflicts and interventions in the Middle East and North Africa, collectively referred to as the War on Terror in the west, has a combined death toll of more than 7,000 for the U.S. military since 2001. Other records In terms of the number of deaths per day, the American Civil War is still at the top, with an average of 425 deaths per day, while the First and Second World Wars have averages of roughly 100 and 200 fatalities per day respectively. Technically, the costliest battle in U.S. military history was the Battle of Elsenborn Ridge, which was a part of the Battle of the Bulge in the Second World War, and saw upwards of 5,000 deaths over 10 days. However, the Battle of Gettysburg had more military fatalities of American soldiers, with almost 3,200 Union deaths and over 3,900 Confederate deaths, giving a combined total of more than 7,000. The Battle of Antietam is viewed as the bloodiest day in American military history, with over 3,600 combined fatalities and almost 23,000 total casualties on September 17, 1862. Revised Civil War figures For more than a century, the total death toll of the American Civil War was generally accepted to be around 620,000, a number which was first proposed by Union historians William F. Fox and Thomas L. Livermore in 1888. This number was calculated by using enlistment figures, battle reports, and census data, however many prominent historians since then have thought the number should be higher. In 2011, historian J. David Hacker conducted further investigations and claimed that the number was closer to 750,000 (and possibly as high as 850,000). While many Civil War historians agree that this is possible, and even likely, obtaining consistently accurate figures has proven to be impossible until now; both sides were poor at keeping detailed records throughout the war, and much of the Confederacy's records were lost by the war's end. Many Confederate widows also did not register their husbands death with the authorities, as they would have then been ineligible for benefits.
Facebook
TwitterNote: Note: Starting October 10th, 2025 this dataset is deprecated and is no longer being updated. As of April 27, 2023 updates changed from daily to weekly. Summary The cumulative number of confirmed COVID-19 deaths among Maryland residents within a single Maryland jurisdiction. Description The MD COVID-19 - Confirmed Deaths by County data layer is a collection of the statewide confirmed COVID-19 related deaths that have been reported each day by the Vital Statistics Administration that have occurred in each Maryland jurisdiction. A death is classified as confirmed if the person had a laboratory-confirmed positive COVID-19 test result. Some data on deaths may be unavailable due to the time lag between the death, typically reported by a hospital or other facility, and the submission of the complete death certificate. This data layer does not include probable deaths. Probable deaths are available from the MD COVID-19 - Probable Deaths by County data layer. Terms of Use The Spatial Data, and the information therein, (collectively the "Data") is provided "as is" without warranty of any kind, either expressed, implied, or statutory. The user assumes the entire risk as to quality and performance of the Data. No guarantee of accuracy is granted, nor is any responsibility for reliance thereon assumed. In no event shall the State of Maryland be liable for direct, indirect, incidental, consequential or special damages of any kind. The State of Maryland does not accept liability for any damages or misrepresentation caused by inaccuracies in the Data or as a result to changes to the Data, nor is there responsibility assumed to maintain the Data in any manner or form. The Data can be freely distributed as long as the metadata entry is not modified or deleted. Any data derived from the Data must acknowledge the State of Maryland in the metadata.
Facebook
Twitterhttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
Subjected dataset is extracted using world bank and UN websites to find population collapse according to countries and regions. The code generates data for seven indicators based on the current date and is available from Year 2000 to the year 2021.
This code is useful for research purposes, there are nine distinct CSV files associated with this code, seven of them deals with indicators, one CSV file pertaining to country groups and last CSV file is analysis for 20 years between seven indicators. Below are seven indicators extracted from the world bank and the United Nations websites.
Total Population, Population Growth, Life Expectancy at birth, Fertility Rate, Death Rate (per 1,000 people)), Birth Rate (per 1,000 people), Median Age
Population collapse is calculated using Total Population, Population Growth, Life Expectancy at birth, Fertility Rate, Death Rate, Birth Rate and Median Age, for that various criteria were applied to extract data:
The data was filtered based on several attributes, first ids and title has been extracted from the world bank data then timeframe and columns provided to extract data. This filtering process ensured that only relevant data meeting the specified criteria. For median age UN website is used and data is extracted for all countries. Median age data is not available for groups or regions; however, it could be calculated as median age data is available for all countries of the globe.
Variables: Economy, Seven Indicators Years from 2000 to 2021
For country group files, all countries are assigned according to regions, groups, by lending, by income, etc. so for this file each country is repeated as one country is member of more than one group.
Below screenshot is extracted for those countries whose population does fall in 20 years and death rate is increased while birth rate is decrease. So, for instance Ukraine population in Year 2002 was 48.2M while as per Year 2021 there population is decreased by 9% to 43.8M, similarly there death rate is increase from 15.7 to 18.5 (per 1000 people) and birth rate is decrease by 10% from 8.10 to 7.30.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15657145%2Ffeaff87cec8a478065eb06229045d7f1%2FPopulation%20Collapse.JPG?generation=1691841930935324&alt=media" alt="">
Facebook
TwitterNote: Note: Starting October 10th, 2025 this dataset is deprecated and is no longer being updated. As of April 27, 2023 updates changed from daily to weekly. Summary The cumulative number of confirmed COVID-19 deaths among Maryland residents by age: 0-9; 10-19; 20-29; 30-39; 40-49; 50-59; 60-69; 70-79; 80+; Unknown. Description The MD COVID-19 - Confirmed Deaths by Age Distribution data layer is a collection of the statewide confirmed COVID-19 related deaths that have been reported each day by the Vital Statistics Administration by designated age ranges. A death is classified as confirmed if the person had a laboratory-confirmed positive COVID-19 test result. Some data on deaths may be unavailable due to the time lag between the death, typically reported by a hospital or other facility, and the submission of the complete death certificate. Probable deaths are available from the MD COVID-19 - Probable Deaths by Age Distribution data layer. Terms of Use The Spatial Data, and the information therein, (collectively the "Data") is provided "as is" without warranty of any kind, either expressed, implied, or statutory. The user assumes the entire risk as to quality and performance of the Data. No guarantee of accuracy is granted, nor is any responsibility for reliance thereon assumed. In no event shall the State of Maryland be liable for direct, indirect, incidental, consequential or special damages of any kind. The State of Maryland does not accept liability for any damages or misrepresentation caused by inaccuracies in the Data or as a result to changes to the Data, nor is there responsibility assumed to maintain the Data in any manner or form. The Data can be freely distributed as long as the metadata entry is not modified or deleted. Any data derived from the Data must acknowledge the State of Maryland in the metadata.
Facebook
TwitterNote: This COVID-19 data set is no longer being updated as of December 1, 2023. Access current COVID-19 data on the CDPH respiratory virus dashboard (https://www.cdph.ca.gov/Programs/CID/DCDC/Pages/Respiratory-Viruses/RespiratoryDashboard.aspx) or in open data format (https://data.chhs.ca.gov/dataset/respiratory-virus-dashboard-metrics).
As of August 17, 2023, data is being updated each Friday.
For death data after December 31, 2022, California uses Provisional Deaths from the Center for Disease Control and Prevention’s National Center for Health Statistics (NCHS) National Vital Statistics System (NVSS). Prior to January 1, 2023, death data was sourced from the COVID-19 registry. The change in data source occurred in July 2023 and was applied retroactively to all 2023 data to provide a consistent source of death data for the year of 2023.
As of May 11, 2023, data on cases, deaths, and testing is being updated each Thursday. Metrics by report date have been removed, but previous versions of files with report date metrics are archived below.
All metrics include people in state and federal prisons, US Immigration and Customs Enforcement facilities, US Marshal detention facilities, and Department of State Hospitals facilities. Members of California's tribal communities are also included.
The "Total Tests" and "Positive Tests" columns show totals based on the collection date. There is a lag between when a specimen is collected and when it is reported in this dataset. As a result, the most recent dates on the table will temporarily show NONE in the "Total Tests" and "Positive Tests" columns. This should not be interpreted as no tests being conducted on these dates. Instead, these values will be updated with the number of tests conducted as data is received.
Facebook
TwitterA. SUMMARY This dataset shows San Francisco COVID-19 deaths by population characteristics. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available. Because of this, death totals may increase or decrease.
Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how deaths have been distributed among different subgroups. This information can reveal trends and disparities among groups.
B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death certificates are maintained by the California Department of Public Health.
Data on the population characteristics of COVID-19 deaths are from: *Case reports *Medical records *Electronic lab reports *Death certificates
Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.
To protect resident privacy, we summarize COVID-19 data by only one population characteristic at a time. Data are not shown until cumulative citywide deaths reach five or more.
Data notes on select population characteristic types are listed below.
Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases.
Gender * The City collects information on gender identity using these guidelines.
C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.
Dataset will not update on the business day following any federal holiday.
D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a dataset based on the San Francisco Population and Demographic Census dataset.These population estimates are from the 2018-2022 5-year American Community Survey (ACS).
This dataset includes several characteristic types. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cumulative deaths.
Cumulative deaths are the running total of all San Francisco COVID-19 deaths in that characteristic group up to the date listed.
To explore data on the total number of deaths, use the COVID-19 Deaths Over Time dataset.
E. CHANGE LOG
Facebook
TwitterThis file contains COVID-19 death counts and rates by month and year of death, jurisdiction of residence (U.S., HHS Region) and demographic characteristics (sex, age, race and Hispanic origin, and age/race and Hispanic origin). United States death counts and rates include the 50 states, plus the District of Columbia. Deaths with confirmed or presumed COVID-19, coded to ICD–10 code U07.1. Number of deaths reported in this file are the total number of COVID-19 deaths received and coded as of the date of analysis and may not represent all deaths that occurred in that period. Counts of deaths occurring before or after the reporting period are not included in the file. Data during recent periods are incomplete because of the lag in time between when the death occurred and when the death certificate is completed, submitted to NCHS and processed for reporting purposes. This delay can range from 1 week to 8 weeks or more, depending on the jurisdiction and cause of death. Death counts should not be compared across jurisdictions. Data timeliness varies by state. Some states report deaths on a daily basis, while other states report deaths weekly or monthly. The ten (10) United States Department of Health and Human Services (HHS) regions include the following jurisdictions. Region 1: Connecticut, Maine, Massachusetts, New Hampshire, Rhode Island, Vermont; Region 2: New Jersey, New York; Region 3: Delaware, District of Columbia, Maryland, Pennsylvania, Virginia, West Virginia; Region 4: Alabama, Florida, Georgia, Kentucky, Mississippi, North Carolina, South Carolina, Tennessee; Region 5: Illinois, Indiana, Michigan, Minnesota, Ohio, Wisconsin; Region 6: Arkansas, Louisiana, New Mexico, Oklahoma, Texas; Region 7: Iowa, Kansas, Missouri, Nebraska; Region 8: Colorado, Montana, North Dakota, South Dakota, Utah, Wyoming; Region 9: Arizona, California, Hawaii, Nevada; Region 10: Alaska, Idaho, Oregon, Washington. Rates were calculated using the population estimates for 2021, which are estimated as of July 1, 2021 based on the Blended Base produced by the US Census Bureau in lieu of the April 1, 2020 decennial population count. The Blended Base consists of the blend of Vintage 2020 postcensal population estimates, 2020 Demographic Analysis Estimates, and 2020 Census PL 94-171 Redistricting File (see https://www2.census.gov/programs-surveys/popest/technical-documentation/methodology/2020-2021/methods-statement-v2021.pdf). Rate are based on deaths occurring in the specified week and are age-adjusted to the 2000 standard population using the direct method (see https://www.cdc.gov/nchs/data/nvsr/nvsr70/nvsr70-08-508.pdf). These rates differ from annual age-adjusted rates, typically presented in NCHS publications based on a full year of data and annualized weekly age-adjusted rates which have been adjusted to allow comparison with annual rates. Annualization rates presents deaths per year per 100,000 population that would be expected in a year if the observed period specific (weekly) rate prevailed for a full year. Sub-national death counts between 1-9 are suppressed in accordance with NCHS data confidentiality standards. Rates based on death counts less than 20 are suppressed in accordance with NCHS standards of reliability as specified in NCHS Data Presentation Standards for Proportions (available from: https://www.cdc.gov/nchs/data/series/sr_02/sr02_175.pdf.).
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
NHS UK - COVID-19 Daily Deaths
This section contains information on deaths of patients who have died in hospitals in England and had tested positive for COVID-19 at time of death. All deaths are recorded against the date of death rather than the date the deaths were announced. Interpretation of the figures should take into account the fact that totals by date of death, particularly for most recent days, are likely to be updated in future releases. For example as deaths are confirmed as testing positive for COVID-19, as more post-mortem tests are processed and data from them are validated. Any changes are made clear in the daily files.
These figures do not include deaths outside hospital, such as those in care homes. This approach makes it possible to compile deaths data on a daily basis using up to date figures.
Dataset Content
These figures will be updated at 2pm each day and include confirmed cases reported at 5pm the previous day. Confirmation of COVID-19 diagnosis, death notification and reporting in central figures can take up to several days and the hospitals providing the data are under significant operational pressure. This means that the totals reported at 5pm on each day may not include all deaths that occurred on that day or on recent prior days.
The original dataset is sourced directly from the NHS source site, this original dataset is then cleaned and converted to a csv format available for inclusion into a Kaggle notebook.
There are 3 files considered within the data :- 1. Fatalities_by_age_uk 2.Fatalities_by_region_uk 3.Fatalities_by_trust_uk
Data runs from March 1st up to the current day. Any discrepancies will be outlined. The first is cumulative for any previous days leading up to of relevance. The following days are not cumulative and represent the updated value for the date under consideration.
A start kernel is provided to demonstrate using the dataset.
Citations
This dataset is sourced from the NHS statistical work areas:- https://www.england.nhs.uk/statistics/statistical-work-areas/
This dataset has been sourced and provided to aid in the following competition:- https://www.kaggle.com/c/covid19-global-forecasting-week-4
Facebook
TwitterA. SUMMARY This archived dataset includes data for population characteristics that are no longer being reported publicly. The date on which each population characteristic type was archived can be found in the field “data_loaded_at”. B. HOW THE DATASET IS CREATED Data on the population characteristics of COVID-19 cases are from: * Case interviews * Laboratories * Medical providers These multiple streams of data are merged, deduplicated, and undergo data verification processes. Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases. * The population estimates for the "Other" or “Multi-racial” groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups. Gender * The City collects information on gender identity using these guidelines. Skilled Nursing Facility (SNF) occupancy * A Skilled Nursing Facility (SNF) is a type of long-term care facility that provides care to individuals, generally in their 60s and older, who need functional assistance in their daily lives. * This dataset includes data for COVID-19 cases reported in Skilled Nursing Facilities (SNFs) through 12/31/2022, archived on 1/5/2023. These data were identified where “Characteristic_Type” = ‘Skilled Nursing Facility Occupancy’. Sexual orientation * The City began asking adults 18 years old or older for their sexual orientation identification during case interviews as of April 28, 2020. Sexual orientation data prior to this date is unavailable. * The City doesn’t collect or report information about sexual orientation for persons under 12 years of age. * Case investigation interviews transitioned to the California Department of Public Health, Virtual Assistant information gathering beginning December 2021. The Virtual Assistant is only sent to adults who are 18+ years old. Learn more about our data collection guidelines pertaining to sexual orientation. Comorbidities * Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death. Homelessness Persons are identified as homeless based on several data sources: * self-reported living situation * the location at the time of testing * Department of Public Health homelessness and health databases * Residents in Single-Room Occupancy hotels are not included in these figures. These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions. Single Room Occupancy (SRO) tenancy * SRO buildings are defined by the San Francisco Housing Code as having six or more "residential guest rooms" which may be attached to shared bathrooms, kitchens, and living spaces. * The details of a person's living arrangements are verified during case interviews. Transmission Type * Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown. C. UPDATE PROCESS This dataset has been archived and will no longer update as of 9/11/2023. D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco po
Facebook
TwitterEstimates for the total death count of the Second World War generally range somewhere between 70 and 85 million people. The Soviet Union suffered the highest number of fatalities of any single nation, with estimates mostly falling between 22 and 27 million deaths. China then suffered the second greatest, at around 20 million, although these figures are less certain and often overlap with the Chinese Civil War. Over 80 percent of all deaths were of those from Allied countries, and the majority of these were civilians. In contrast, 15 to 20 percent were among the Axis powers, and the majority of these were military deaths, as shown in the death ratios of Germany and Japan. Civilian deaths and atrocities It is believed that 60 to 67 percent of all deaths were civilian fatalities, largely resulting from war-related famine or disease, and war crimes or atrocities. Systematic genocide, extermination campaigns, and forced labor, particularly by the Germans, Japanese, and Soviets, led to the deaths of millions. In this regard, Nazi activities alone resulted in 17 million deaths, including six million Jews in what is now known as The Holocaust. Not only was the scale of the conflict larger than any that had come before, but the nature of and reasoning behind this loss make the Second World War stand out as one of the most devastating and cruelest conflicts in history. Problems with these statistics Although the war is considered by many to be the defining event of the 20th century, exact figures for death tolls have proven impossible to determine, for a variety of reasons. Countries such as the U.S. have fairly consistent estimates due to preserved military records and comparatively few civilian casualties, although figures still vary by source. For most of Europe, records are less accurate. Border fluctuations and the upheaval of the interwar period mean that pre-war records were already poor or non-existent for many regions. The rapid and chaotic nature of the war then meant that deaths could not be accurately recorded at the time, and mass displacement or forced relocation resulted in the deaths of many civilians outside of their homeland, which makes country-specific figures more difficult to find. Early estimates of the war’s fatalities were also taken at face value and formed the basis of many historical works; these were often very inaccurate, but the validity of the source means that the figures continue to be cited today, despite contrary evidence.
In comparison to Europe, estimate ranges are often greater across Asia, where populations were larger but pre-war data was in short supply. Many of the Asian countries with high death tolls were European colonies, and the actions of authorities in the metropoles, such as the diversion of resources from Asia to Europe, led to millions of deaths through famine and disease. Additionally, over one million African soldiers were drafted into Europe’s armies during the war, yet individual statistics are unavailable for most of these colonies or successor states (notably Algeria and Libya). Thousands of Asian and African military deaths went unrecorded or are included with European or Japanese figures, and there are no reliable figures for deaths of millions from countries across North Africa or East Asia. Additionally, many concentration camp records were destroyed, and such records in Africa and Asia were even sparser than in Europe. While the Second World War is one of the most studied academic topics of the past century, it is unlikely that we will ever have a clear number for the lives lost in the conflict.
Facebook
TwitterData Description: Since 1800, more than 37 million people worldwide have died while actively fighting in wars.
The number would be much higher still if it also considered the civilians who died due to the fighting, the increased number of deaths from hunger and disease resulting from these conflicts, and the deaths in smaller conflicts that are not considered wars.
Wars are also terrible in many other ways: they make people’s lives insecure, lower their living standards, destroy the environment, and, if fought between countries armed with nuclear weapons, can be an existential threat to humanity.
Looking at the news alone, it can be difficult to understand whether more or less people are dying as a result of war than in the past. One has to rely on statistics that are carefully collected so that they can be compared over time.
How many wars are avoided, and whether the trend of fewer deaths in them continues, is up to our own actions. Conflict deaths recently increased in the Middle East, Africa, and Europe, stressing that the future of these trends is uncertain.
In this dataset, there are 6 csv files in one zip one. Everything is clear but if you have any question, feel free to ask. Good luck.
This dataset belongs to Ourworldindata By: Bastian Herre, Lucas Rodés-Guirao, Max Roser, Joe Hasell and Bobbie Macdonald