Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Facebook
Twitterhttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
There's a story behind every dataset and here's your opportunity to share yours.
This Data consists of some world statistics published by the World Bank since 1961
Variables:
1) Agriculture and Rural development - 42 indicators published on this website. https://data.worldbank.org/topic/agriculture-and-rural-development
2) Access to electricity (% of the population) - Access to electricity is the percentage of the population with access to electricity. Electrification data are collected from industry, national surveys, and international sources.
3) CPIA gender equality rating (1=low to 6=high) - Gender equality assesses the extent to which the country has installed institutions and programs to enforce laws and policies that promote equal access for men and women in education, health, the economy, and protection under law.
4) Mineral rents (% of GDP) - Mineral rents are the difference between the value of production for a stock of minerals at world prices and their total costs of production. Minerals included in the calculation are tin, gold, lead, zinc, iron, copper, nickel, silver, bauxite, and phosphate.
5) GDP per capita (current US$) - GDP per capita is gross domestic product divided by midyear population. GDP is the sum of gross value added by all resident producers in the economy plus any product taxes and minus any subsidies not included in the value of the products. It is calculated without making deductions for depreciation of fabricated assets or for depletion and degradation of natural resources. Data are in current U.S. dollars.
6) Literacy rate, adult total (% of people ages 15 and above)- Adult literacy rate is the percentage of people ages 15 and above who can both read and write with understanding a short simple statement about their everyday life.
7) Net migration - Net migration is the net total of migrants during the period, that is, the total number of immigrants less the annual number of emigrants, including both citizens and noncitizens. Data are five-year estimates.
8) Birth rate, crude (per 1,000 people) - Crude birth rate indicates the number of live births occurring during the year, per 1,000 population estimated at midyear. Subtracting the crude death rate from the crude birth rate provides the rate of natural increase, which is equal to the rate of population change in the absence of migration.
9) Death rate, crude (per 1,000 people) - Crude death rate indicates the number of deaths occurring during the year, per 1,000 population estimated at midyear. Subtracting the crude death rate from the crude birth rate provides the rate of natural increase, which is equal to the rate of population change in the absence of migration.
10) Mortality rate, infant (per 1,000 live births) - Infant mortality rate is the number of infants dying before reaching one year of age, per 1,000 live births in a given year.
11) Population, total - Total population is based on the de facto definition of population, which counts all residents regardless of legal status or citizenship. The values shown are midyear estimates.
These datasets are publicly available for anyone to use under the following terms provided by the Dataset Source https://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
Banner photo by https://population.un.org/wpp/Maps/
Subsaharan Africa and east Asia record high population total, actually Subsaharan Africa population bypassed Europe and central Asia population by 2010, has this been influenced by crop and food production, large arable land, high crude birth rates(influx), low mortality rates(exits from the population) or Net migration.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Demographic and Health Surveys (DHS) are a major source for many demographic and health indicators in developing countries. Although these indicators are well defined in the literature, using survey data to calculate some of these indicators has never been an easy task for data users. This paper presents the DHS.rates software, a user-friendly R package developed to calculate fertility indicators, such as the total fertility rate, general fertility rate, and age-specific fertility rates, and childhood mortality indicators, such as the neonatal mortality rate, post-neonatal mortality rate, infant mortality rate, child mortality rate, and under-5 mortality rate, from the DHS data. The package allows for national and subnational indicators. In addition, the package calculates sampling error indicators such as standard error, design effect, relative standard error, and confidence interval for each demographic indicator. The package can also be used to calculate the same indicators from other population surveys such as the Multiple Indicator Cluster Survey (MICS).
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This data shows premature deaths (Age under 75), numbers and rates by gender, as 3-year moving-averages. All-Cause Mortality rates are a summary indicator of population health status. All-cause mortality is related to Life Expectancy, and both may be influenced by health inequalities. Directly Age-Standardised Rates (DASR) are shown in the data (where numbers are sufficient) so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates. A limitation on using mortalities as a proxy for prevalence of health conditions is that mortalities may give an incomplete view of health conditions in an area, as ill-health might not lead to premature death. Data source: Office for Health Improvement and Disparities (OHID), Public Health Outcomes Framework (PHOF) indicator ID 108. This data is updated annually.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This data shows premature deaths (Age under 75), numbers and rates by gender, as 3-year moving-averages. All-Cause Mortality rates are a summary indicator of population health status. All-cause mortality is related to Life Expectancy, and both may be influenced by health inequalities. Directly Age-Standardised Rates (DASR) are shown in the data (where numbers are sufficient) so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates. A limitation on using mortalities as a proxy for prevalence of health conditions is that mortalities may give an incomplete view of health conditions in an area, as ill-health might not lead to premature death. Data source: Office for Health Improvement and Disparities (OHID), Public Health Outcomes Framework (PHOF) indicator ID 108. This data is updated annually.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
License information was derived automatically
Dear Kaggler! This dataset consists of a main CSV file: Adult mortality rate (2019-2021).csv. This file has been processed, cleaned and prepared for your use. The dataset contains information on mortality rates in different countries of the world and some factors that may affect this rate for 2019-2023.
The data contains the following columns:
Countries: Country of study.
Continent: Continent location of the country.
Average_Pop(thousands people): Average population of the country under study for 2019-2021 in thousands.
Average_GDP(M$): Average GDP of the country under study for 2019-2021 in millions of dollars.
Average_GDP_per_capita: Average GDP per capita of the country under study for 2019-2021 in dollars.
Average_HEXP($): Health Expenditure Per Capita in the country under study in dollars.
Development_level: Level of development of the state under study (calculated by GDP per capita of the country). Please note that in this dataset we calculate this indicator only by calculating GDP per capita! Despite the fact that the United Nations (UN) does not have an unambiguous classification of countries into developed, developing and backward based on only one indicator, such as the amount of GDP per capita. It uses a wider range of economic, social and quality indicators to determine the level of development of countries.
AMR_female(per_1000_female_adults): Average mortality of adult women in the country under study (per 1000 adult women per year) for 2019-2023.
AMR_male(per_1000_male_adults): Average mortality of adult men in the country under study (per 1000 adult men per year) for 2019-2023.
Average_CDR: Average crude mortality rate for 2019–2021 in the country under study.
The dataset also contains additional files: Draft_AMR.csv, Draft_CDR.csv, Draft_Expenses.csv, Draft_GDP.csv, Draft_Population.csv. In fact, the main dataset consists of parts of these files. If you are interested in working more deeply on data cleaning and preparation, you can of course use these files. You can also use these files to create your own dataset. And be careful! Additional files may contain a different number of rows and columns with different names and data types. And of course these files are not cleaned. You will see not only the NaN values, but also other symbols in their place.
Enjoy your training, my dear Kaggler!
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F16192307%2Fc9a98b25b85b43718e5b8109712ba36a%2FDepositphotos_68536025_s-2019.jpg?generation=1711099905559419&alt=media" alt="">
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Top 20 counties that were most affected by COVID-19 associated deaths in 2020 using different metrics of COVID-19 death rate: Crude death rate calculated from the death counts reported in USAFacts and age-standardized death rate calculated from the imputation model M1.
Facebook
TwitterThe Global Subnational Infant Mortality Rates, Version 2.01 consist of Infant Mortality Rate (IMR) estimates for 234 countries and territories, 143 of which include subnational Units. The data are benchmarked to the year 2015 (Version 1 was benchmarked to the year 2000), and are drawn from national offices, Demographic and Health Surveys (DHS), Multiple Indicator Cluster Surveys (MICS), and other sources from 2006 to 2014. In addition to Infant Mortality Rates, Version 2.01 includes crude estimates of births and infant deaths, which could be aggregated or disaggregated to different geographies to calculate infant mortality rates at different scales or resolutions, where births are the rate denominator and infant deaths are the rate numerator. Boundary inputs are derived primarily from the Gridded Population of the World, Version 4 (GPWv4) data collection. National and subnational data are mapped to grid cells at a spatial resolution of 30 arc-seconds (~1 km) (Version 1 has a spatial resolution of 1/4 degree, ~28 km at the equator), allowing for easy integration with demographic, environmental, and other spatial data.
Facebook
Twitterhttps://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
Years of life lost due to mortality from all circulatory diseases (ICD-10 I00-I99). Years of life lost (YLL) is a measure of premature mortality. Its primary purpose is to compare the relative importance of different causes of premature death within a particular population and it can therefore be used by health planners to define priorities for the prevention of such deaths. It can also be used to compare the premature mortality experience of different populations for a particular cause of death. The concept of years of life lost is to estimate the length of time a person would have lived had they not died prematurely. By inherently including the age at which the death occurs, rather than just the fact of its occurrence, the calculation is an attempt to better quantify the burden, or impact, on society from the specified cause of mortality. Legacy unique identifier: P00520
Facebook
TwitterU.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
This data is compiled by the Cook County Department of Public Health using data from the Illinois Department of Public Health Vital Statistics. It includes the annual number of deaths, crude and age-adjusted death rates by selected causes of death. Further analysis is available by age group, race/ethnicity, gender and decedent's place of residence in suburban Cook County at the time of their death. Table of Contents and other information can be found at http://opendocs.cookcountyil.gov/docs/Death_Table_of_Contents2_jh9b-icit.pdf. Note: * Counts suppressed for events between 1 and 4, - Rates not calculated for events less than 20
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The infant mortality rate is defined as the number of deaths of children under one year of age, expressed per 1 000 live births. Some of the international variation in infant mortality rates is due to variations among countries in registering practices for premature infants. The United States and Canada are two countries which register a much higher proportion of babies weighing less than 500g, with low odds of survival, resulting in higher reported infant mortality. In Europe, several countries apply a minimum gestational age of 22 weeks (or a birth weight threshold of 500g) for babies to be registered as live births. This indicator is measured in terms of deaths per 1 000 live births.
This indicator is a summary measure of premature mortality, providing an explicit way of weighting deaths occurring at younger ages, which may be preventable. The calculation of Potential Years of Life Lost (PYLL) involves summing up deaths occurring at each age and multiplying this with the number of remaining years to live up to a selected age limit (age 75 is used in OECD Health Statistics). In order to assure cross-country and trend comparison, the PYLL are standardised, for each country and each year. The total OECD population in 2010 is taken as the reference population for age standardisation. This indicator is presented as a total and per gender. It is measured in years lost per 100 000 inhabitants (total), per 100 000 men and per 100 000 women, aged 0-69.
Life expectancy at birth is defined as how long, on average, a newborn can expect to live, if current death rates do not change. However, the actual age-specific death rate of any particular birth cohort cannot be known in advance. If rates are falling, actual life spans will be higher than life expectancy calculated using current death rates. Life expectancy at birth is one of the most frequently used health status indicators. Gains in life expectancy at birth can be attributed to a number of factors, including rising living standards, improved lifestyle and better education, as well as greater access to quality health services. This indicator is presented as a total and per gender and is measured in years.
Facebook
TwitterThe Global Subnational Infant Mortality Rates, Version 2.01 consist of Infant Mortality Rate (IMR) estimates for 234 countries and territories, 143 of which include subnational Units. The data are benchmarked to the year 2015 (Version 1 was benchmarked to the year 2000), and are drawn from national offices, Demographic and Health Surveys (DHS), Multiple Indicator Cluster Surveys (MICS), and other sources from 2006 to 2014. In addition to Infant Mortality Rates, Version 2.01 includes crude estimates of births and infant deaths, which could be aggregated or disaggregated to different geographies to calculate infant mortality rates at different scales or resolutions, where births are the rate denominator and infant deaths are the rate numerator. Boundary inputs are derived primarily from the Gridded Population of the World, Version 4 (GPWv4) data collection. National and subnational data are mapped to grid cells at a spatial resolution of 30 arc-seconds (~1 km) (Version 1 has a spatial resolution of 1/4 degree, ~28 km at the equator), allowing for easy integration with demographic, environmental, and other spatial data.
Facebook
TwitterDeath rates for all causes (per 1,000 population) for Glasgow and Scotland from 1991 to 2012. The Glasgow death rates are given for the crude death rate or as standardised using the age/sex- specific rates for Scotland. They were calculated using the 'rebased' mid-year population estimates for 2002 to 2011. More information about this is available from Births and Deaths Rates: breaks in series circa 2011 Data extracted 2014-04-09 from the General Register Office for Scotland Licence: None
Facebook
TwitterData set taken from WHO: See Life Tables by Country (CUBA) & Life Expectancy at Birth (CUBA) Detailed information on year-wise deaths by age group, and population left alive by age group - this data permits calculations of Life Expectancies for Cuba. This is data for a lecture on computation of life-expectancies, which is part of a course on Real Statistics: An Islamic Approach. Lecture linked below provides further details on how to compute life expectancies from this data: Computing Life Expectancies from Mortality Tables.
Rows 3 to 21 provide Age-Specific death rates for 5 year groups 0-5. 5-10, and so on up to 80-85, and 85+ Rows 22 to 40 provide probability of dying in each of these same age-categories. Rows 41 to 59 provide Number of people left alive in each of these 5- year age groups Rows 60 to 78 provide number of people who die in each of these age categories Rows 79 to 97 provide number of person-years lived by each of these 5-year age cohorts Rows 98 to 116 provide number of person-years lived ABOVE given age group Rows 117 to 135 provide life expectancy within each age category
We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.
Your data will be in front of the world's largest data science community. What questions do you want to see answered?
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Objective: To examine if the rankings of state HIV age-standardized death rates (ASDRs) changed if different standard population (SP) was used. Design: A cross-sectional population-based observational study. Setting 36 states in the United States. Participants: People died from 2015 to 2019. Main outcome measures: State HIV ASDR using 4 SPs, namely WHO2000, US2000, US2mor020, and Eur2011–2030. Results: The rankings of 19 states did not change when ASDRs were calculated using US2000 and US2020. Of the 17 states whose rankings changed, the rankings of 9 states calculated using US2000 were higher than those calculated using US2020; in 8 states, the rankings were lower. The states with the greatest changes in rankings between US2000 and US2020 were Kentucky (12th and 9th, respectively) and Massachusetts (8th and 11th, respectively). Conclusions: State ASDRs calculated using the current official SP (US2000) weigh middle-age HIV death rates more heavily than older-age HIV death rates, resulting in lower ASDRs among states with higher older-age HIV death rates. Methods The data were extracted from CDC WONDER.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Excel workbook of age-standardised baseline mortality rates (BMRs) for each US county by race and ethnicity used for calculating racial-ethnic disparities in health burdens for air pollution from the major oil and gas lifecycle stages in the United States.The workbook includes 3 sheets:BMRs for all-cause mortality in 25+ years population for calculating premature mortality from exposure to fine particular matter (PM2.5).BMRs for all-cause mortality in 65+ years population for calculating premature mortality from exposure to nitrogen dioxide (NO2), andBMRs for all-ages chronic obstructive pulmonary disease (COPD) mortality from exposure to ozone air pollution.Raw BMRs from the US US Centers for Disease Control and Prevention Wide-ranging ONline Data for Epidemiologic Research (CDC WONDER) are processed to gap fill data not reported at the county level. This data gap filling is detailed in Vohra et al. (2025) Science Advances, "The health burden and racial-ethnic disparities of air pollution from the major oil and gas lifecycle stages in the United States", doi:10.1126/sciadv.adu2241.
Facebook
TwitterTable of directly (DSR) age-standardised rates of suicides per 100,000 population, and Indirectly (SMR) (Includes undetermined Injuries), all ages and age 15 plus, three year (pooled) average and annual, by sex. Deaths from intentional self-harm and injury undetermined whether accidentally or purposely inflicted (ICD-10 X60-X84, Y10-Y34 exc Y33.9, ICD-9 E950-E959 and E980-E989 exc E988.8), registered in the respective calendar year(s). DSR stands for Directly age-Standardised Rates. Mortality rates are age standardised using the European Standard Population as defined by the World Health Organisation. 3 year average rates are calculated as the average of single year rates for 3 successive years. Standardised Mortality Ratio (SMR), England = 100. The annual rates at borough level are likely to be subject to relatively high levels of variability of numbers of suicides from year to year because of the relatively small numebrs of suicides that occur within boroughs. When comparing boroughs against each other, the three-year combined rate would provide a higher level of confidence. NHS mental health information can be found here. Various other suicide indicators are available from IC NHS website, including years of life lost, crude death rates, and indirectly standardised ratios (SMR). Follow: Compendium of population health indicators > Illness and Condition > Mental health and behavioural disorders
Facebook
TwitterCOVID-19 rate of death, or the known deaths divided by confirmed cases, was over ten percent in Yemen, the only country that has 1,000 or more cases. This according to a calculation that combines coronavirus stats on both deaths and registered cases for 221 different countries. Note that death rates are not the same as the chance of dying from an infection or the number of deaths based on an at-risk population. By April 26, 2022, the virus had infected over 510.2 million people worldwide, and led to a loss of 6.2 million. The source seemingly does not differentiate between "the Wuhan strain" (2019-nCOV) of COVID-19, "the Kent mutation" (B.1.1.7) that appeared in the UK in late 2020, the 2021 Delta variant (B.1.617.2) from India or the Omicron variant (B.1.1.529) from South Africa.
Where are these numbers coming from?
The numbers shown here were collected by Johns Hopkins University, a source that manually checks the data with domestic health authorities. For the majority of countries, this is from national authorities. In some cases, like China, the United States, Canada or Australia, city reports or other various state authorities were consulted. In this statistic, these separately reported numbers were put together. Note that Statista aims to also provide domestic source material for a more complete picture, and not to just look at one particular source. Examples are these statistics on the confirmed coronavirus cases in Russia or the COVID-19 cases in Italy, both of which are from domestic sources. For more information or other freely accessible content, please visit our dedicated Facts and Figures page.
A word on the flaws of numbers like this
People are right to ask whether these numbers are at all representative or not for several reasons. First, countries worldwide decide differently on who gets tested for the virus, meaning that comparing case numbers or death rates could to some extent be misleading. Germany, for example, started testing relatively early once the country’s first case was confirmed in Bavaria in January 2020, whereas Italy tests for the coronavirus postmortem. Second, not all people go to see (or can see, due to testing capacity) a doctor when they have mild symptoms. Countries like Norway and the Netherlands, for example, recommend people with non-severe symptoms to just stay at home. This means not all cases are known all the time, which could significantly alter the death rate as it is presented here. Third and finally, numbers like this change very frequently depending on how the pandemic spreads or the national healthcare capacity. It is therefore recommended to look at other (freely accessible) content that dives more into specifics, such as the coronavirus testing capacity in India or the number of hospital beds in the UK. Only with additional pieces of information can you get the full picture, something that this statistic in its current state simply cannot provide.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This data shows premature deaths (Age under 75) from Circulatory Disease, numbers and rates by gender, as 3-year moving-averages.
Circulatory diseases include heart diseases and stroke, and others. Socio-economic and lifestyle factors are associated with circulatory disease deaths and inequalities in circulatory disease rates. Modifiable risk factors include smoking, excess weight, diet, and physical inactivity.
Directly Age-Standardised Rates (DASR) are shown in the data, where numbers are sufficient, so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates.
A limitation on using mortalities as a proxy for prevalence of health conditions is that mortalities may give an incomplete view of health conditions in an area, as ill-health might not lead to premature death.
Data source: NHS Digital Compendium hub, dataset unique identifier P00395. This data is updated annually.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This data shows premature deaths (Age under 75) from Cardiovascular Disease, numbers and rates by gender, as 3-year moving-averages. Cardiovascular Disease include heart diseases and stroke, and others. Socio-economic and lifestyle factors are associated with circulatory disease deaths and inequalities in circulatory disease rates. Modifiable risk factors include smoking, excess weight, diet, and physical inactivity. Directly Age-Standardised Rates (DASR) are shown in the data, where numbers are sufficient, so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates. A limitation on using mortalities as a proxy for prevalence of health conditions is that mortalities may give an incomplete view of health conditions in an area, as ill-health might not lead to premature death. Data source: NHS Digital (now part of NHS England) Compendium hub, dataset unique identifier P00395. This data is updated annually. Note: Compendium Mortality Consultation 2022 NHS Digital is currently analysing the results of the consultation that closed on 14 September 2022. In the meantime the next publication is on hold. 6 February 2023 10:55 AM
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically