This dataset contains counts of deaths for California counties based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.
The final data tables include both deaths that occurred in each California county regardless of the place of residence (by occurrence) and deaths to residents of each California county (by residence), whereas the provisional data table only includes deaths that occurred in each county regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.
The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.
Mortality Rates for Lake County, Illinois. Explanation of field attributes: Average Age of Death – The average age at which a people in the given zip code die. Cancer Deaths – Cancer deaths refers to individuals who have died of cancer as the underlying cause. This is a rate per 100,000. Heart Disease Related Deaths – Heart Disease Related Deaths refers to individuals who have died of heart disease as the underlying cause. This is a rate per 100,000. COPD Related Deaths – COPD Related Deaths refers to individuals who have died of chronic obstructive pulmonary disease (COPD) as the underlying cause. This is a rate per 100,000.
https://www.icpsr.umich.edu/web/ICPSR/studies/36603/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/36603/terms
This collection contains information on county-level vital events that occurred in the United States from 1915-2007. When sources allow, data are disaggregated by county of occurrence, county of residence, and race. The data include information on vital events such as the number of infant deaths, births to unmarried women, births in the presence of hospital attendants, and infant birth weight.
Data on county socioeconomic status for 2,132 US counties and each county’s average annual cardiovascular mortality rate (CMR) and total PM2.5 concentration for 21 years (1990-2010). County CMR, PM2.5, and socioeconomic data were obtained from the U.S. National Center for Health Statistics, U.S. Environmental Protection Agency’s Community Multiscale Air Quality modeling system, and the U.S. Census, respectively. A socioeconomic index was created using seven county-level measures from the 1990 US census using factor analysis. Quintiles of this index were used to generate categories of county socioeconomic status. This dataset is associated with the following publication: Wyatt, L., G. Peterson, T. Wade, L. Neas, and A. Rappold. The contribution of improved air quality to reduced cardiovascular mortality: Declines in socioeconomic differences over time. ENVIRONMENT INTERNATIONAL. Elsevier B.V., Amsterdam, NETHERLANDS, 136: 105430, (2020).
Kenya recorded a crude death rate of 10.5 deaths per 1,000 population in 2019. Makueni registered the lowest rate among Kenyan counties, at 5.5 deaths per 1,000 population. On the other hand, Siaya had the highest: 15.5 deaths per 1,000 population.
As of March 10, 2023, the death rate from COVID-19 in the state of New York was 397 per 100,000 people. New York is one of the states with the highest number of COVID-19 cases.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Infant mortality rate is number of infant deaths per 1,000 live births. Data are for Santa Clara County residents. The measure is summarized for total county population, by race/ethnicity and Asian/Pacific Islander subgroups. Data are presented for single years at county level and pooled years combined for population subgroups. Source: Santa Clara County Public Health Department, 2007-2015 Birth Statistical Master File; Santa Clara County Public Health Department, VRBIS, 2007-2015. Data as of 05/26/2017.METADATA:Notes (String): Lists table title, sourceYear (String): Year of death. Pooled data years are used for certain categories to meet the minimum data requirements.Category (String): Lists the category representing the data: Santa Clara County is for total population, race/ethnicity: African American, Asian/Pacific Islander, Latino and White (non-Hispanic White only), and Asian/Pacific Islander subgroups: Asian Indian, Chinese, Filipino, Japanese, Korean, Vietnamese and Pacific Islanders.Rate per 1,000 live births (Numeric): Infant mortality rate is number of infant (under the age of 1 year) deaths in a year per 1,000 live births in the same time period.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Fetal mortality occurs after 20 weeks of gestation and before labor. Infant mortality occurs before the first year of age and is a sum of Neonatal (the first 28 days after birth) and Postneonatal (from 28 days up to 1 year) mortality. Rates are calculated per every 1000 births; rates are not available for disaggregated race/ethnicities. Fetal and infant mortality values are available for given race/ethnicities. Connecticut Department of Public Health collects and reports data annually. CTData.org carries 1-, 3- and 5-Year aggregations.
https://www.icpsr.umich.edu/web/ICPSR/studies/2526/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/2526/terms
This data collection is a portion of the historical data collected by the project, "Early Indicators of Later Work Levels, Disease, and Death," which is collecting military, medical, and socioeconomic data on a sample of white males mustered into the Union Army during the Civil War. During 1850, 1860, and 1870, mortality information was gathered at the county level as an addendum to the population census. These data examine the impact of environmental factors on life outcomes and look at the influence of infectious disease rates on economic and health patterns at late ages. Part 1, Disease Data, looks at cause of death from 66 disease classifications. Part 2, General Disease Data, also examines cause of death but through 18 broad disease categories. Variables included in both parts are state, county, year of death, and frequency of death by disease.
https://data.gov.tw/licensehttps://data.gov.tw/license
Every year, statistics on the death rate of the population by gender and single year of age are provided for each county and city.
By Noah Rippner [source]
This dataset provides comprehensive information on county-level cancer death and incidence rates, as well as various related variables. It includes data on age-adjusted death rates, average deaths per year, recent trends in cancer death rates, recent 5-year trends in death rates, and average annual counts of cancer deaths or incidence. The dataset also includes the federal information processing standards (FIPS) codes for each county.
Additionally, the dataset indicates whether each county met the objective of a targeted death rate of 45.5. The recent trend in cancer deaths or incidence is also captured for analysis purposes.
The purpose of the death.csv file within this dataset is to offer detailed information specifically concerning county-level cancer death rates and related variables. On the other hand, the incd.csv file contains data on county-level cancer incidence rates and additional relevant variables.
To provide more context and understanding about the included data points, there is a separate file named cancer_data_notes.csv. This file serves to provide informative notes and explanations regarding the various aspects of the cancer data used in this dataset.
Please note that this particular description provides an overview for a linear regression walkthrough using this dataset based on Python programming language. It highlights how to source and import the data properly before moving into data preparation steps such as exploratory analysis. The walkthrough further covers model selection and important model diagnostics measures.
It's essential to bear in mind that this example serves as an initial attempt at creating a multivariate Ordinary Least Squares regression model using these datasets from various sources like cancer.gov along with US Census American Community Survey data. This baseline model allows easy comparisons with future iterations intended for improvements or refinements.
Important columns found within this extensively documented Kaggle dataset include County names along with their corresponding FIPS codes—a standardized coding system by Federal Information Processing Standards (FIPS). Moreover,Met Objective of 45.5? (1) column denotes whether a specific county achieved the targeted objective of a death rate of 45.5 or not.
Overall, this dataset aims to offer valuable insights into county-level cancer death and incidence rates across various regions, providing policymakers, researchers, and healthcare professionals with essential information for analysis and decision-making purposes
Familiarize Yourself with the Columns:
- County: The name of the county.
- FIPS: The Federal Information Processing Standards code for the county.
- Met Objective of 45.5? (1): Indicates whether the county met the objective of a death rate of 45.5 (Boolean).
- Age-Adjusted Death Rate: The age-adjusted death rate for cancer in the county.
- Average Deaths per Year: The average number of deaths per year due to cancer in the county.
- Recent Trend (2): The recent trend in cancer death rates/incidence in the county.
- Recent 5-Year Trend (2) in Death Rates: The recent 5-year trend in cancer death rates/incidence in the county.
- Average Annual Count: The average annual count of cancer deaths/incidence in the county.
Determine Counties Meeting Objective: Use this dataset to identify counties that have met or not met an objective death rate threshold of 45.5%. Look for entries where Met Objective of 45.5? (1) is marked as True or False.
Analyze Age-Adjusted Death Rates: Study and compare age-adjusted death rates across different counties using Age-Adjusted Death Rate values provided as floats.
Explore Average Deaths per Year: Examine and compare average annual counts and trends regarding deaths caused by cancer, using Average Deaths per Year as a reference point.
Investigate Recent Trends: Assess recent trends related to cancer deaths or incidence by analyzing data under columns such as Recent Trend, Recent Trend (2), and Recent 5-Year Trend (2) in Death Rates. These columns provide information on how cancer death rates/incidence have changed over time.
Compare Counties: Utilize this dataset to compare counties based on their cancer death rates and related variables. Identify counties with lower or higher average annual counts, age-adjusted death rates, or recent trends to analyze and understand the factors contributing ...
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Fetal mortality occurs after 20 weeks of gestation and before labor. Infant mortality occurs before the first year of age and is a sum of Neonatal (the first 28 days after birth) and Postneonatal (from 28 days up to 1 year) mortality. Rates are calculated per every 1000 births; rates are not available for disaggregated race/ethnicities. Fetal and infant mortality values are available for given race/ethnicities. Connecticut Department of Public Health collects and reports data annually. CTData.org carries 1-, 3- and 5-Year aggregations.
This dataset describes drug poisoning deaths at the county level by selected demographic characteristics and includes age-adjusted death rates for drug poisoning from 1999 to 2015. Deaths are classified using the International Classification of Diseases, Tenth Revision (ICD–10). Drug-poisoning deaths are defined as having ICD–10 underlying cause-of-death codes X40–X44 (unintentional), X60–X64 (suicide), X85 (homicide), or Y10–Y14 (undetermined intent). Estimates are based on the National Vital Statistics System multiple cause-of-death mortality files (1). Age-adjusted death rates (deaths per 100,000 U.S. standard population for 2000) are calculated using the direct method. Populations used for computing death rates for 2011–2015 are postcensal estimates based on the 2010 U.S. census. Rates for census years are based on populations enumerated in the corresponding censuses. Rates for noncensus years before 2010 are revised using updated intercensal population estimates and may differ from rates previously published. Estimate does not meet standards of reliability or precision. Death rates are flagged as “Unreliable” in the chart when the rate is calculated with a numerator of 20 or less. Death rates for some states and years may be low due to a high number of unresolved pending cases or misclassification of ICD–10 codes for unintentional poisoning as R99, “Other ill-defined and unspecified causes of mortality” (2). For example, this issue is known to affect New Jersey in 2009 and West Virginia in 2005 and 2009 but also may affect other years and other states. Estimates should be interpreted with caution. Smoothed county age-adjusted death rates (deaths per 100,000 population) were obtained according to methods described elsewhere (3–5). Briefly, two-stage hierarchical models were used to generate empirical Bayes estimates of county age-adjusted death rates due to drug poisoning for each year during 1999–2015. These annual county-level estimates “borrow strength” across counties to generate stable estimates of death rates where data are sparse due to small population size (3,5). Estimates are unavailable for Broomfield County, Colo., and Denali County, Alaska, before 2003 (6,7). Additionally, Bedford City, Virginia was added to Bedford County in 2015 and no longer appears in the mortality file in 2015. County boundaries are consistent with the vintage 2005-2007 bridged-race population file geographies (6).
Age-adjustment mortality rates are rates of deaths that are computed using a statistical method to create a metric based on the true death rate so that it can be compared over time for a single population (i.e. comparing 2006-2008 to 2010-2012), as well as enable comparisons across different populations with possibly different age distributions in their populations (i.e. comparing Hispanic residents to Asian residents).
Age adjustment methods applied to Montgomery County rates are consistent with US Centers for Disease Control and Prevention (CDC), National Center for Health Statistics (NCHS) as well as Maryland Department of Health and Mental Hygiene’s Vital Statistics Administration (DHMH VSA).
PHS Planning and Epidemiology receives an annual data file of Montgomery County resident deaths registered with Maryland Department of Health and Mental Hygiene’s Vital Statistics Administration (DHMH VSA).
Using SAS analytic software, MCDHHS standardizes, aggregates, and calculates age-adjusted rates for each of the leading causes of death category consistent with state and national methods and by subgroups based on age, gender, race, and ethnicity combinations. Data are released in compliance with Data Use Agreements between DHMH VSA and MCDHHS. This dataset will be updated Annually.
This data package contains data on public health indicators, mortality and morbidity. Specifically this accelerator contains mortality and morbidity rates for groups of diseases in the United States by state and county from 1980 to 2014.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Annual data on death registrations by area of usual residence in the UK. Summary tables including age-standardised mortality rates.
These data represent the Age-Adjusted Colorado County Mortality Rate Per 100,000 Persons for Motor Vehicle Accident as the Underlying Cause of Death (2015-2019). Population estimates for the denominator are calculated from the 2015-2019 American Community Survey. These data are from the Colorado Department of Public Health and Environment Vital Records Death Dataset and are published annually by the Colorado Department of Public Health and Environment.
The Mortality - Multiple Cause of Death data on CDC WONDER are county-level national mortality and population data spanning the yehttps://healthdata.gov/d/2sz9-6c59ars 1999-2006. These data are available in two separate data sets: one data set for years 1999-2004 with 3 race groups, and another data set for years 2005-2006 with 4 race groups and 3 Hispanic origin categories. Data are based on death certificates for U.S. residents. Each death certificate contains a single underlying cause of death, up to twenty additional multiple causes, and demographic data. The number of deaths, crude death rates, age-adjusted death rates, standard errors and 95% confidence intervals for death rates can be obtained by place of residence (total U.S., state, and county), age group (including infants), race, Hispanic ethnicity (years 2005-2006 only), sex, year of death, and cause-of-death (4-digit ICD-10 code or group of codes). The data are produced by the National Center for Health Statistics.
The Detailed Mortality - Underlying Cause of Death data on CDC WONDER are county-level national mortality and population data spanning the years 1999-2009. Data are based on death certificates for U.S. residents. Each death certificate contains a single underlying cause of death, and demographic data. The number of deaths, crude death rates, age-adjusted death rates, standard errors and 95% confidence intervals for death rates can be obtained by place of residence (total U.S., region, state, and county), age group (including infants and single-year-of-age cohorts), race (4 groups), Hispanic ethnicity, sex, year of death, and cause-of-death (4-digit ICD-10 code or group of codes, injury intent and mechanism categories, or drug and alcohol related causes), year, month and week day of death, place of death and whether an autopsy was performed. The data are produced by the National Center for Health Statistics.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Create maps of U.S. heart disease death rates by county. Data can be stratified by age, race/ethnicity, and sex. Visit the CDC/DHDSP Atlas of Heart Disease and Stroke for additional data and maps. Atlas of Heart Disease and StrokeData SourceMortality data were obtained from the National Vital Statistics System. Bridged-Race Postcensal Population Estimates were obtained from the National Center for Health Statistics. International Classification of Diseases, 10th Revision (ICD-10) codes: I00-I09, I11, I13, I20-I51; underlying cause of death.Data DictionaryData for counties with small populations are not displayed when a reliable rate could not be generated. These counties are represented in the data with values of '-1.' CDC/DHDSP excludes these values when classifying the data on a map, indicating those counties as 'Insufficient Data.' Data field names and descriptionsstcty_fips: state FIPS code + county FIPS codeOther fields use the following format: RRR_S_aaaa (e.g., API_M_35UP) RRR: 3 digits represent race/ethnicity All - Overall AIA - American Indian and Alaska Native, non-Hispanic API - Asian and Pacific Islander, non-Hispanic BLK - Black, non-Hispanic HIS - Hispanic WHT - White, non-Hispanic S: 1 digit represents sex A - All F - Female M - Male aaaa: 4 digits represent age. The first 2 digits are the lower bound for age and the last 2 digits are the upper bound for age. 'UP' indicates the data includes the maximum age available and 'LT' indicates ages less than the upper bound. Example: The column 'BLK_M_65UP' displays rates per 100,000 black men aged 65 years and older.MethodologyRates are calculated using a 3-year average and are age-standardized in 10-year age groups using the 2000 U.S. Standard Population. Rates are calculated and displayed per 100,000 population. Rates were spatially smoothed using a Local Empirical Bayes algorithm to stabilize risk by borrowing information from neighboring geographic areas, making estimates more statistically robust and stable for counties with small populations. Data for counties with small populations are coded as '-1' when a reliable rate could not be generated. County-level rates were generated when the following criteria were met over a 3-year time period within each of the filters (e.g., age, race, and sex).At least one of the following 3 criteria: At least 20 events occurred within the county and its adjacent neighbors.ORAt least 16 events occurred within the county.ORAt least 5,000 population years within the county.AND all 3 of the following criteria:At least 6 population years for each age group used for age adjustment if that age group had 1 or more event.The number of population years in an age group was greater than the number of events.At least 100 population years within the county.More Questions?Interactive Atlas of Heart Disease and StrokeData SourcesStatistical Methods
This dataset contains counts of deaths for California counties based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.
The final data tables include both deaths that occurred in each California county regardless of the place of residence (by occurrence) and deaths to residents of each California county (by residence), whereas the provisional data table only includes deaths that occurred in each county regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.
The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.