100+ datasets found

Life Expectancy
kaggle.com
zip
Updated Mar 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ignacio Azua (2025). Life Expectancy [Dataset]. https://www.kaggle.com/datasets/ignacioazua/life-expectancy
Explore at:
zip(3032 bytes)Available download formats
Dataset updated
Mar 4, 2025
Authors
Ignacio Azua
Description
Life Expectancy of the World Population

The dataset from Worldometer provides a ranked list of countries based on life expectancy at birth, which represents the average number of years a newborn is expected to live under current mortality rates. It includes global, regional, and country-specific life expectancy figures, with separate data for males and females. The dataset highlights disparities in longevity across nations, with countries like Hong Kong, Japan, and South Korea having the highest life expectancies. This data serves as a key indicator of public health, quality of life, and healthcare effectiveness, offering valuable insights for policymakers, researchers, and global health organizations.

Data Analysis & Machine Learning Approaches for Life Expectancy Data

Data Analysis Approaches Life expectancy data can be analyzed using descriptive statistics (mean, variance, distribution) and correlation analysis to identify relationships with factors like GDP, healthcare, and education. Time series analysis helps track longevity trends over time, while clustering techniques (e.g., K-Means) group countries with similar patterns. Additionally, geospatial analysis can visualize regional disparities in life expectancy.

Machine Learning Models For prediction, linear and multiple regression models estimate life expectancy based on socioeconomic indicators, while polynomial regression captures non-linear trends. Decision trees and Random Forests classify countries into high- and low-life expectancy groups. Deep learning techniques like neural networks (ANNs) can model complex relationships, while LSTMs are useful for time-series forecasting.

For pattern detection, K-Means clustering groups countries based on life expectancy trends, and DBSCAN identifies anomalies. Principal Component Analysis (PCA) helps in feature selection, improving model efficiency. These methods provide insights into longevity trends, helping policymakers and researchers improve public health strategies.

Life expectancy at birth. Data based on the latest United Nations Population Division estimates.

Source: https://www.worldometers.info/demographics/life-expectancy/#countries-ranked-by-life-expectancy
Life Expectancy Averaged Dataset
kaggle.com
zip
Updated Dec 4, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shreyas (2024). Life Expectancy Averaged Dataset [Dataset]. https://www.kaggle.com/datasets/shreyasg23/life-expectancy-averaged-dataset
Explore at:
zip(11410 bytes)Available download formats
Dataset updated
Dec 4, 2024
Authors
Shreyas
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset provides aggregated life expectancy data averaged over multiple years for various countries, along with associated socio-economic and health-related factors. It aims to facilitate analysis of global health trends, the relationship between life expectancy and development indicators, and regional disparities.

Key Features

Country & Region: Includes data from a diverse range of countries across continents, categorized by their respective regions.

Life Expectancy: The average life expectancy in years for each country.

Socio-economic Indicators:

GDP per capita

Schooling (average years of education)

Alcohol consumption

Economy status (developed or developing)

Health Indicators:

Infant mortality

Adult mortality

HIV prevalence

Immunization rates (e.g., Hepatitis B, Polio)

Incidence of diseases like measles and diphtheria

BMI (Body Mass Index) data

Thinness prevalence among children and adolescents

Purpose of the Dataset

This dataset can be used for: 1. Exploratory Data Analysis (EDA): Understand trends in life expectancy across different regions and economic statuses. 2. Data Visualization: Create meaningful plots (e.g., choropleth maps, scatter plots, pair plots) to analyze relationships between variables. 3. Machine Learning: Develop predictive models for life expectancy based on socio-economic and health factors. 4. Policy Research: Support policy-making by identifying key factors influencing life expectancy.

Inspiration

What are the primary factors affecting life expectancy in different countries?

How does life expectancy vary by region or economic development?

Can we predict life expectancy using other health and economic indicators?

License

This dataset is shared under the CC BY 4.0 License. Proper attribution is required for reuse.
Effect of suicide rates on life expectancy dataset
zenodo.org
data.niaid.nih.gov
csv
Updated Apr 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Filip Zoubek; Filip Zoubek (2021). Effect of suicide rates on life expectancy dataset [Dataset]. http://doi.org/10.5281/zenodo.4694270
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4694270
Dataset updated
Apr 16, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Filip Zoubek; Filip Zoubek
License
Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
License information was derived automatically
Description
Effect of suicide rates on life expectancy dataset

Abstract
In 2015, approximately 55 million people died worldwide, of which 8 million committed suicide. In the USA, one of the main causes of death is the aforementioned suicide, therefore, this experiment is dealing with the question of how much suicide rates affects the statistics of average life expectancy.
The experiment takes two datasets, one with the number of suicides and life expectancy in the second one and combine data into one dataset. Subsequently, I try to find any patterns and correlations among the variables and perform statistical test using simple regression to confirm my assumptions.

Data

The experiment uses two datasets - WHO Suicide Statistics[1] and WHO Life Expectancy[2], which were firstly appropriately preprocessed. The final merged dataset to the experiment has 13 variables, where country and year are used as index: Country, Year, Suicides number, Life expectancy, Adult Mortality, which is probability of dying between 15 and 60 years per 1000 population, Infant deaths, which is number of Infant Deaths per 1000 population, Alcohol, which is alcohol, recorded per capita (15+) consumption, Under-five deaths, which is number of under-five deaths per 1000 population, HIV/AIDS, which is deaths per 1 000 live births HIV/AIDS, GDP, which is Gross Domestic Product per capita, Population, Income composition of resources, which is Human Development Index in terms of income composition of resources, and Schooling, which is number of years of schooling.

LICENSE

THE EXPERIMENT USES TWO DATASET - WHO SUICIDE STATISTICS AND WHO LIFE EXPECTANCY, WHICH WERE COLLEECTED FROM WHO AND UNITED NATIONS WEBSITE. THEREFORE, ALL DATASETS ARE UNDER THE LICENSE ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE 3.0 IGO (https://creativecommons.org/licenses/by-nc-sa/3.0/igo/).

[1] https://www.kaggle.com/szamil/who-suicide-statistics

[2] https://www.kaggle.com/kumarajarshi/life-expectancy-who
Life Expectancy 1960 to present (Global)
kaggle.com
zip
Updated Mar 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Frederick Salazar Sanchez (2025). Life Expectancy 1960 to present (Global) [Dataset]. https://www.kaggle.com/datasets/fredericksalazar/life-expectancy-1960-to-present-global
Explore at:
zip(111509 bytes)Available download formats
Dataset updated
Mar 13, 2025
Authors
Frederick Salazar Sanchez
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
PLEASE if you use or like this dataset UPVOTE 👁️

This dataset offers a detailed historical record of global life expectancy, covering data from 1960 to the present. It is meticulously curated to enable deep analysis of trends and gender disparities in life expectancy worldwide.

Dataset Structure & Key Columns:

Country Code (🔤): Unique identifier for each country.

Country Name (🌍): Official name of the country.

Region (🌐): Broad geographical area (e.g., Asia, Europe, Africa).

Sub-Region (🗺️): More specific regional classification within the broader region.

Intermediate Region (🔍): Additional granular geographical grouping when applicable.

Year (📅): The specific year to which the data pertains.

Life Expectancy for Women (👩‍⚕️): Average years a woman is expected to live in that country and year.

Life Expectancy for Men (👨‍⚕️): Average years a man is expected to live in that country and year.

Context & Use Cases:

This dataset is a rich resource for exploring long-term trends in global health and demography. By comparing life expectancy data over decades, researchers can:

Analyze Time Series Trends: Forecast future changes in life expectancy and evaluate the impact of health interventions over time.

Study Gender Disparities: Investigate the differences between life expectancy for women and men, providing insights into social, economic, and healthcare factors influencing these trends.

Regional & Sub-Regional Analysis: Compare and contrast life expectancy across various regions and sub-regions to understand geographical disparities and their underlying causes.

Support Public Policy Research: Inform policymakers by linking life expectancy trends with public health policies, socioeconomic developments, and other key indicators.

Educational & Data Science Applications: Serve as a comprehensive teaching tool for courses on public health, global development, and data analysis, as well as for Kaggle competitions and projects.

With its detailed, structured format and broad temporal coverage, this dataset is ideal for anyone looking to gain a nuanced understanding of global health trends and to drive impactful analyses in public health, social sciences, and beyond.

Feel free to ask for further customizations or additional details as needed!
Public Health Statistics - Life Expectancy By Community Area - Historical
healthdata.gov
data.cityofchicago.org
+2more
csv, xlsx, xml
Updated Apr 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.cityofchicago.org (2025). Public Health Statistics - Life Expectancy By Community Area - Historical [Dataset]. https://healthdata.gov/dataset/Public-Health-Statistics-Life-Expectancy-By-Commun/iw98-x3d2
Explore at:
xlsx, xml, csvAvailable download formats
Dataset updated
Apr 8, 2025
Dataset provided by
data.cityofchicago.org
Description
Note: This dataset is historical only and there are not corresponding datasets for more recent time periods. For that more-recent information, please visit the Chicago Health Atlas at https://chicagohealthatlas.org.

This dataset gives the average life expectancy and corresponding confidence intervals for each Chicago community area for the years 1990, 2000 and 2010. See the full description at: https://data.cityofchicago.org/api/views/qjr3-bm53/files/AAu4x8SCRz_bnQb8SVUyAXdd913TMObSYj6V40cR6p8?download=true&filename=P:\EPI\OEPHI\MATERIALS\REFERENCES\Life Expectancy\Dataset description - LE by community area.pdf
d
Replication Data for: The Association Between Income and Life Expectancy in...
dataone.org
Updated Nov 12, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bergeron, Augustin; Chetty, Raj; Cutler, David; Scuderi, Benjamin; Stepner, Michael; Turner, Nicholas (2023). Replication Data for: The Association Between Income and Life Expectancy in the United States, 2001-2014 [Dataset]. http://doi.org/10.7910/DVN/VVW76J
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/VVW76J
Dataset updated
Nov 12, 2023
Dataset provided by
Harvard Dataverse
Authors
Bergeron, Augustin; Chetty, Raj; Cutler, David; Scuderi, Benjamin; Stepner, Michael; Turner, Nicholas
Area covered
United States
Description
This dataset contains replication files for "The Association Between Income and Life Expectancy in the United States, 2001-2014" by Augustin Bergeron, Raj Chetty, David Cutler, Benjamin Scuderi, Michael Stepner, and Nicholas Turner. For more information, see https://opportunityinsights.org/paper/lifeexpectancy/. A summary of the related publication follows. How can we reduce socioeconomic disparities in health outcomes? Although it is well known that there are significant differences in health and longevity between income groups, debate remains about the magnitudes and determinants of these differences. We use new data from 1.4 billion anonymous earnings and mortality records to construct more precise estimates of the relationship between income and life expectancy at the national level than was feasible in prior work. We then construct new local area (county and metro area) estimates of life expectancy by income group and identify factors that are associated with higher levels of life expectancy for low-income individuals. Our findings show that disparities in life expectancy are not inevitable. There are cities throughout America — from New York to San Francisco to Birmingham, AL — where gaps in life expectancy are relatively small or are narrowing over time. Replicating these successes more broadly will require targeted local efforts, focusing on improving health behaviors among the poor in cities such as Las Vegas and Detroit. Our findings also imply that federal programs such as Social Security and Medicare are less redistributive than they might appear because low-income individuals obtain these benefits for significantly fewer years than high-income individuals, especially in cities like Detroit. Going forward, the challenge is to understand the mechanisms that lead to better health and longevity for low-income individuals in some parts of the U.S. To facilitate future research and monitor local progress, we have posted annual statistics on life expectancy by income group and geographic area (state, CZ, and county) at The Health Inequality Project website. Using these data, researchers will be able to study why certain places have high or improving levels of life expectancy and ultimately apply these lessons to reduce health disparities in other parts of the country.
🌱Life Expectation
kaggle.com
zip
Updated Sep 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
meer atif magsi (2023). 🌱Life Expectation [Dataset]. https://www.kaggle.com/datasets/meeratif/life-expection
Explore at:
zip(2156 bytes)Available download formats
Dataset updated
Sep 7, 2023
Authors
meer atif magsi
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The dataset contains information on various demographic and health indicators for different countries. It is organized into several columns, each providing essential information about these countries. Here's a description of each column:

1. Country: This column represents the names of different countries or regions included in the dataset. Each row corresponds to a specific country or region, and this column serves as the identifier for each entry.

2. Life Expectancy Males: This column contains data on the average life expectancy of males in each of the listed countries. Life expectancy is a crucial health indicator and provides an estimate of the average number of years a male can expect to live, given current mortality rates and health conditions.

3. Life Expectancy Females: Similar to the "Life Expectancy Males" column, this column provides data on the average life expectancy of females in the same countries. It reflects the average number of years a female can expect to live, considering the prevailing health and mortality conditions.

4. Birth Rate: The "Birth Rate" column contains information about the birth rate in each country. Birth rate is a demographic indicator that represents the number of live births per 1,000 people in a given population over a specific period, usually a year. It can provide insights into a country's population growth or decline.

5. Death Rate: This column presents data on the death rate in each of the listed countries. The death rate is another crucial demographic indicator and represents the number of deaths per 1,000 people in a population over a specific period, often a year. It helps gauge the overall health and mortality conditions within a country.
Health state life expectancy, all ages, UK
ons.gov.uk
cy.ons.gov.uk
xlsx
Updated Dec 12, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics (2024). Health state life expectancy, all ages, UK [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/healthandlifeexpectancies/datasets/healthstatelifeexpectancyallagesuk
Explore at:
xlsxAvailable download formats
Dataset updated
Dec 12, 2024
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Area covered
United Kingdom
Description
Pivot table for healthy life expectancy by sex and area type, divided by three-year intervals starting from 2011 to 2013.
Vital Signs: Life Expectancy – by ZIP Code
data.bayareametro.gov
csv, xlsx, xml
Updated Apr 12, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
State of California, Department of Health: Death Records (2017). Vital Signs: Life Expectancy – by ZIP Code [Dataset]. https://data.bayareametro.gov/dataset/Vital-Signs-Life-Expectancy-by-ZIP-Code/xym8-u3kc
Explore at:
csv, xlsx, xmlAvailable download formats
Dataset updated
Apr 12, 2017
Dataset provided by
California Department of Public Healthhttps://www.cdph.ca.gov/
Authors
State of California, Department of Health: Death Records
Description
VITAL SIGNS INDICATOR Life Expectancy (EQ6)

FULL MEASURE NAME Life Expectancy

LAST UPDATED April 2017

DESCRIPTION Life expectancy refers to the average number of years a newborn is expected to live if mortality patterns remain the same. The measure reflects the mortality rate across a population for a point in time.

DATA SOURCE State of California, Department of Health: Death Records (1990-2013) No link

California Department of Finance: Population Estimates Annual Intercensal Population Estimates (1990-2010) Table P-2: County Population by Age (2010-2013) http://www.dof.ca.gov/Forecasting/Demographics/Estimates/

U.S. Census Bureau: Decennial Census ZCTA Population (2000-2010) http://factfinder.census.gov

U.S. Census Bureau: American Community Survey 5-Year Population Estimates (2013) http://factfinder.census.gov

CONTACT INFORMATION vitalsigns.info@mtc.ca.gov

METHODOLOGY NOTES (across all datasets for this indicator) Life expectancy is commonly used as a measure of the health of a population. Life expectancy does not reflect how long any given individual is expected to live; rather, it is an artificial measure that captures an aspect of the mortality rates across a population that can be compared across time and populations. More information about the determinants of life expectancy that may lead to differences in life expectancy between neighborhoods can be found in the Bay Area Regional Health Inequities Initiative (BARHII) Health Inequities in the Bay Area report at http://www.barhii.org/wp-content/uploads/2015/09/barhii_hiba.pdf. Vital Signs measures life expectancy at birth (as opposed to cohort life expectancy). A statistical model was used to estimate life expectancy for Bay Area counties and ZIP Codes based on current life tables which require both age and mortality data. A life table is a table which shows, for each age, the survivorship of a people from a certain population.

Current life tables were created using death records and population estimates by age. The California Department of Public Health provided death records based on the California death certificate information. Records include age at death and residential ZIP Code. Single-year age population estimates at the regional- and county-level comes from the California Department of Finance population estimates and projections for ages 0-100+. Population estimates for ages 100 and over are aggregated to a single age interval. Using this data, death rates in a population within age groups for a given year are computed to form unabridged life tables (as opposed to abridged life tables). To calculate life expectancy, the probability of dying between the jth and (j+1)st birthday is assumed uniform after age 1. Special consideration is taken to account for infant mortality.

For the ZIP Code-level life expectancy calculation, it is assumed that postal ZIP Codes share the same boundaries as ZIP Code Census Tabulation Areas (ZCTAs). More information on the relationship between ZIP Codes and ZCTAs can be found at http://www.census.gov/geo/reference/zctas.html. ZIP Code-level data uses three years of mortality data to make robust estimates due to small sample size. Year 2013 ZIP Code life expectancy estimates reflects death records from 2011 through 2013. 2013 is the last year with available mortality data. Death records for ZIP Codes with zero population (like those associated with P.O. Boxes) were assigned to the nearest ZIP Code with population. ZIP Code population for 2000 estimates comes from the Decennial Census. ZIP Code population for 2013 estimates are from the American Community Survey (5-Year Average). ACS estimates are adjusted using Decennial Census data for more accurate population estimates. An adjustment factor was calculated using the ratio between the 2010 Decennial Census population estimates and the 2012 ACS 5-Year (with middle year 2010) population estimates. This adjustment factor is particularly important for ZCTAs with high homeless population (not living in group quarters) where the ACS may underestimate the ZCTA population and therefore underestimate the life expectancy. The ACS provides ZIP Code population by age in five-year age intervals. Single-year age population estimates were calculated by distributing population within an age interval to single-year ages using the county distribution. Counties were assigned to ZIP Codes based on majority land-area.

ZIP Codes in the Bay Area vary in population from over 10,000 residents to less than 20 residents. Traditional life expectancy estimation (like the one used for the regional- and county-level Vital Signs estimates) cannot be used because they are highly inaccurate for small populations and may result in over/underestimation of life expectancy. To avoid inaccurate estimates, ZIP Codes with populations of less than 5,000 were aggregated with neighboring ZIP Codes until the merged areas had a population of more than 5,000. ZIP Code 94103, representing Treasure Island, was dropped from the dataset due to its small population and having no bordering ZIP Codes. In this way, the original 305 Bay Area ZIP Codes were reduced to 217 ZIP Code areas for 2013 estimates. Next, a form of Bayesian random-effects analysis was used which established a prior distribution of the probability of death at each age using the regional distribution. This prior is used to shore up the life expectancy calculations where data were sparse.
Life Expectancy WHO
kaggle.com
zip
Updated Jun 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
vikram amin (2023). Life Expectancy WHO [Dataset]. https://www.kaggle.com/datasets/vikramamin/life-expectancy-who
Explore at:
zip(121472 bytes)Available download formats
Dataset updated
Jun 19, 2023
Authors
vikram amin
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The objective behind attempting this dataset was to understand the predictors that contribute to the life expectancy around the world. I have used Linear Regression, Decision Tree and Random Forest for this purpose. Steps Involved: - Read the csv file - Data Cleaning: - Variables Country and Status were showing as having character data types. These had to be converted to factor - 2563 missing values were encountered with Population variable having the most of the missing values i.e 652 - Missing rows were dropped before we could run the analysis. 3) Run Linear Regression - Before running linear regression, 3 variables were dropped as they were not found to be having that much of an effect on the dependent variable i.e Life Expectancy. These 3 variables were Country, Year & Status. This meant we are now working with 19 variables (1 dependent and 18 independent variables) - We run the linear regression. Multiple R squared is 83% which means that independent variables can explain 83% change or variance in the dependent variable. - OULTLIER DETECTION. We check for outliers using IQR and find 54 outliers. These outliers are then removed before we run the regression analysis once again. Multiple R squared increased from 83% to 86%. - MULTICOLLINEARITY. We check for multicollinearity using the VIF model(Variance Inflation Factor). This is being done in case when two or more independent variables showing high correlation. The thumb rule is that absolute VIF values above 5 should be removed. We find 6 variables that have a VIF value higher than 5 namely Infant.deaths, percentage.expenditure,Under.five.deaths,GDP,thinness1.19,thinness5.9. Infant deaths and Under Five deaths have strong collinearity so we drop infant deaths(which has the higher VIF value). - When we run the linear regression model again, VIF value of Under.Five.Deaths goes down from 211.46 to 2.74 while the other variable's VIF values reduce very less. Variable thinness1.19 is now dropped and we run the regression once more. - Variable thinness5.9 whose absolute VIF value was 7.61 has now dropped to 1.95. GDP and Population are still having VIF value more than 5 but I decided against dropping these as I consider them to be important independent variables. - SET THE SEED AND SPLIT THE DATA INTO TRAIN AND TEST DATA. We run the train data and get multiple R squared of 86% and p value less than that of alpha which states that it is statistically significant. We use the train data to predict the test data to find out the RMSE and MAPE. We run the library(Metrics) for this purpose. - In Linear Regression, RMSE (Root Mean Squared Error) is 3.2. This indicates that on an average, the predicted values have an error of 3.2 years as compared to the actual life expectancy values. - MAPE (Mean Absolute Percentage Error) is 0.037. This indicates an accuracy prediction of 96.20% (1-0.037). - MAE (Mean Absolute Error) is 2.55. This indicates that on an average, the predicted values deviate by approximately 2.83 years from the actual values.

We use DECISION TREE MODEL for the analysis.

Run the required libraries (rpart, rpart.plot, RColorBrewer, rattle).

We run the decision tree analysis using rpart and plot the tree. We use fancyRpartPlot.

We use 5 fold cross validation method with CP (complexity parameter) being 0.01.

In Decision Tree , RMSE (Root Mean Squared Error) is 3.06. This indicates that on an average, the predicted values have an error of 3.06 years as compared to the actual life expectancy values.

MAPE (Mean Absolute Percentage Error) is 0.035. This indicates an accuracy prediction of 96.45% (1-0.035).

MAE (Mean Absolute Error) is 2.35. This indicates that on an average, the predicted values deviate by approximately 2.35 years from the actual values.

We use RANDOM FOREST for the analysis.

Run library(randomForest)

We use varImpPlot to find out which variables are most significant and least significant. Income composition is the most important followed by adult mortality and the least relevant independent variable is Population.

Predict Life expectancy through random forest model.

In Random Forest , RMSE (Root Mean Squared Error) is 1.73. This indicates that on an average, the predicted values have an error of 1.73 years as compared to the actual life expectancy values.

MAPE (Mean Absolute Percentage Error) is 0.01. This indicates an accuracy prediction of 98.27% (1-0.01).

MAE (Mean Absolute Error) is 1.14. This indicates that on an average, the predicted values deviate by approximately 1.14 years from the actual values.

Conclusion: Random Forest is the best model for predicting the life expectancy values as it has the lowest RMSE, MAPE and MAE.
d
Public Health Statistics - Life Expectancy By Race Ethnicity - Historical
catalog.data.gov
data.cityofchicago.org
+2more
Updated Dec 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.cityofchicago.org (2023). Public Health Statistics - Life Expectancy By Race Ethnicity - Historical [Dataset]. https://catalog.data.gov/dataset/public-health-statistics-life-expectancy-by-race-ethnicity
Explore at:
Dataset updated
Dec 2, 2023
Dataset provided by
data.cityofchicago.org
Description
Note: This dataset is historical only and there are not corresponding datasets for more recent time periods. For that more-recent information, please visit the Chicago Health Atlas at https://chicagohealthatlas.org. This dataset gives the average life expectancy and corresponding confidence intervals for sex and racial-ethnic groups in Chicago for the years 1990, 2000 and 2010. See the full description at: https://data.cityofchicago.org/api/views/3qdj-cqb8/files/pJ3PVVyubnsS2SpGO5P5IOPtNgCJZTE3LNOeLagC3mw?download=true&filename=P:\EPI\OEPHI\MATERIALS\REFERENCES\Life Expectancy\Dataset description_LE_ Sex_Race_Ethnicity.pdf
Life expectancy at various ages, by population group and sex, Canada
www150.statcan.gc.ca
datasets.ai
+2more
Updated Dec 17, 2015
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Canada, Statistics Canada (2015). Life expectancy at various ages, by population group and sex, Canada [Dataset]. http://doi.org/10.25318/1310013401-eng
Explore at:
Unique identifier
https://doi.org/10.25318/1310013401-eng
Dataset updated
Dec 17, 2015
Dataset provided by
Statistics Canadahttps://statcan.gc.ca/en
Government of Canadahttp://www.gg.ca/
Area covered
Canada
Description
This table contains 2394 series, with data for years 1991 - 1991 (not all combinations necessarily have data for all years). This table contains data described by the following dimensions (Not all combinations are available): Geography (1 items: Canada ...), Population group (19 items: Entire cohort; Income adequacy quintile 1 (lowest);Income adequacy quintile 2;Income adequacy quintile 3 ...), Age (14 items: At 25 years; At 30 years; At 40 years; At 35 years ...), Sex (3 items: Both sexes; Females; Males ...), Characteristics (3 items: Life expectancy; High 95% confidence interval; life expectancy; Low 95% confidence interval; life expectancy ...).
Healthy and Disability-Free Life Expectancy - Dataset - data.gov.uk
ckan.publishing.service.gov.uk
Updated Jun 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ckan.publishing.service.gov.uk (2025). Healthy and Disability-Free Life Expectancy - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/healthy-and-disability-free-life-expectancy
Explore at:
Dataset updated
Jun 9, 2025
Dataset provided by
CKANhttps://ckan.org/
Description
This dataset contains healthy life expectancy and disability-free life expectancy by gender, from birth and age 65. Health life expectancy is defined as the average number of years a person aged 'x' would live in good/fairly good health if he or she experiences the particular area's age-specific mortality and health rates throughout their life. Disability-free life expectancy is defined as the average number of years a person aged 'x' would live disability-free (no limiting long-term illness) if he or she experienced the particular area's age-specific mortality and health rates throughout their life. The estimates are calculated by combining age and sex specific mortality rates, with age and sex specific rates on general health and limiting long-term illness. For more information see the ONS website: https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/healthandlifeexpectancies
d
Life Expectancy - Dataset - Datopian CKAN instance
demo.dev.datopian.com
Updated Jun 22, 2007
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2007). Life Expectancy - Dataset - Datopian CKAN instance [Dataset]. https://demo.dev.datopian.com/dataset/lcc--life-expectancy
Explore at:
Dataset updated
Jun 22, 2007
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
Life expectancy is a summary measure of the all-cause mortality rates in an area in a given period. It shows an estimate of the average number of years a newborn baby would survive if he or she experienced the age-specific mortality rates for that area and time period throughout his or her life. Figures reflect mortality among those living in an area in the given time period, not the life expectancy of newborn children. That is because both the mortality rates of the area are likely to change in the future, and because many of those born in the area will live elsewhere for at least some part of their lives. Life expectancy is a summary measure of a population's health. It may be influenced by premature mortalities and health inequalities. Data source: Office for Health Improvement and Disparities (ODHI), indicator 90366.
d
SHIP Life Expectancy 2010-2021
catalog.data.gov
opendata.maryland.gov
+1more
Updated Feb 24, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
opendata.maryland.gov (2024). SHIP Life Expectancy 2010-2021 [Dataset]. https://catalog.data.gov/dataset/ship-life-expectancy-2010-2017
Explore at:
Dataset updated
Feb 24, 2024
Dataset provided by
opendata.maryland.gov
Description
Life Expectancy - This indicator shows life expectancy from birth, in years. Life expectancy is a summary measure used to describe overall health. Life expectancy at birth is the average number of years a newborn is expected to live given current conditions. The life expectancy in the US is the highest in recorded history thanks to public health interventions such as improvements in sanitation and food safety, development and use of vaccines, and health promotion efforts. Link to Data Details
C
Colombia CO: Life Expectancy at Birth: Total
ceicdata.com
Updated Dec 15, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2022). Colombia CO: Life Expectancy at Birth: Total [Dataset]. https://www.ceicdata.com/en/colombia/social-health-statistics/co-life-expectancy-at-birth-total
Explore at:
Dataset updated
Dec 15, 2022
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2011 - Dec 1, 2022
Area covered
Colombia
Description
Colombia CO: Life Expectancy at Birth: Total data was reported at 77.725 Year in 2023. This records an increase from the previous number of 76.508 Year for 2022. Colombia CO: Life Expectancy at Birth: Total data is updated yearly, averaging 68.768 Year from Dec 1960 (Median) to 2023, with 64 observations. The data reached an all-time high of 77.725 Year in 2023 and a record low of 56.609 Year in 1960. Colombia CO: Life Expectancy at Birth: Total data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Colombia – Table CO.World Bank.WDI: Social: Health Statistics. Life expectancy at birth indicates the number of years a newborn infant would live if prevailing patterns of mortality at the time of its birth were to stay the same throughout its life.;(1) United Nations Population Division. World Population Prospects: 2024 Revision; or derived from male and female life expectancy at birth from sources such as: (2) Statistical databases and publications from national statistical offices; (3) Eurostat: Demographic Statistics.;Weighted average;
m
Life expectancy at birth, female (years) - Sierra Leone
macro-rankings.com
csv, excel
Updated Nov 4, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
macro-rankings (2025). Life expectancy at birth, female (years) - Sierra Leone [Dataset]. https://www.macro-rankings.com/sierra-leone/life-expectancy-at-birth-female-(years)
Explore at:
csv, excelAvailable download formats
Dataset updated
Nov 4, 2025
Dataset authored and provided by
macro-rankings
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Sierra Leone
Description
Time series data for the statistic Life expectancy at birth, female (years) and country Sierra Leone. Indicator Definition:Life expectancy at birth indicates the number of years a newborn infant would live if prevailing patterns of mortality at the time of its birth were to stay the same throughout its life.The indicator "Life expectancy at birth, female (years)" stands at 63.50 as of 12/31/2023, the highest value at least since 12/31/1961, the period currently displayed. Regarding the One-Year-Change of the series, the current value constitutes an increase of 0.7664 percent compared to the value the year prior.The 1 year change in percent is 0.7664.The 3 year change in percent is 3.76.The 5 year change in percent is 5.40.The 10 year change in percent is 15.55.The Serie's long term average value is 46.83. It's latest available value, on 12/31/2023, is 35.59 percent higher, compared to it's long term average value.The Serie's change in percent from it's minimum value, on 12/31/1960, to it's latest available value, on 12/31/2023, is +83.33%.The Serie's change in percent from it's maximum value, on 12/31/2023, to it's latest available value, on 12/31/2023, is 0.0%.
S
San Marino SM: Life Expectancy at Birth
ceicdata.com
Updated Nov 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2022). San Marino SM: Life Expectancy at Birth [Dataset]. https://www.ceicdata.com/en/san-marino/social-demography-non-oecd-member-annual/sm-life-expectancy-at-birth
Explore at:
Dataset updated
Nov 13, 2022
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2010 - Dec 1, 2021
Area covered
San Marino
Description
San Marino Life Expectancy at Birth data was reported at 80.880 Year in 2021. This records an increase from the previous number of 79.590 Year for 2020. San Marino Life Expectancy at Birth data is updated yearly, averaging 80.825 Year from Dec 1990 (Median) to 2021, with 32 observations. The data reached an all-time high of 82.990 Year in 2019 and a record low of 78.670 Year in 1990. San Marino Life Expectancy at Birth data remains active status in CEIC and is reported by Organisation for Economic Co-operation and Development. The data is categorized under Global Database’s San Marino – Table SM.OECD.GGI: Social: Demography: Non OECD Member: Annual.
C
Cambodia KH: Life Expectancy at Birth: Total
ceicdata.com
Updated Jul 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2024). Cambodia KH: Life Expectancy at Birth: Total [Dataset]. https://www.ceicdata.com/en/cambodia/social-health-statistics/kh-life-expectancy-at-birth-total
Explore at:
Dataset updated
Jul 15, 2024
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2011 - Dec 1, 2022
Area covered
Cambodia
Description
Cambodia KH: Life Expectancy at Birth: Total data was reported at 70.668 Year in 2023. This records an increase from the previous number of 70.528 Year for 2022. Cambodia KH: Life Expectancy at Birth: Total data is updated yearly, averaging 55.665 Year from Dec 1960 (Median) to 2023, with 64 observations. The data reached an all-time high of 70.668 Year in 2023 and a record low of 11.295 Year in 1977. Cambodia KH: Life Expectancy at Birth: Total data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Cambodia – Table KH.World Bank.WDI: Social: Health Statistics. Life expectancy at birth indicates the number of years a newborn infant would live if prevailing patterns of mortality at the time of its birth were to stay the same throughout its life.;(1) United Nations Population Division. World Population Prospects: 2024 Revision; or derived from male and female life expectancy at birth from sources such as: (2) Statistical databases and publications from national statistical offices; (3) Eurostat: Demographic Statistics.;Weighted average;
Z
Global Country Information 2023
data.niaid.nih.gov
zenodo.org
Updated Jun 15, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Elgiriyewithana, Nidula (2024). Global Country Information 2023 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8165228
Explore at:
Dataset updated
Jun 15, 2024
Authors
Elgiriyewithana, Nidula
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Description

This comprehensive dataset provides a wealth of information about all countries worldwide, covering a wide range of indicators and attributes. It encompasses demographic statistics, economic indicators, environmental factors, healthcare metrics, education statistics, and much more. With every country represented, this dataset offers a complete global perspective on various aspects of nations, enabling in-depth analyses and cross-country comparisons.

Key Features

Country: Name of the country.

Density (P/Km2): Population density measured in persons per square kilometer.

Abbreviation: Abbreviation or code representing the country.

Agricultural Land (%): Percentage of land area used for agricultural purposes.

Land Area (Km2): Total land area of the country in square kilometers.

Armed Forces Size: Size of the armed forces in the country.

Birth Rate: Number of births per 1,000 population per year.

Calling Code: International calling code for the country.

Capital/Major City: Name of the capital or major city.

CO2 Emissions: Carbon dioxide emissions in tons.

CPI: Consumer Price Index, a measure of inflation and purchasing power.

CPI Change (%): Percentage change in the Consumer Price Index compared to the previous year.

Currency_Code: Currency code used in the country.

Fertility Rate: Average number of children born to a woman during her lifetime.

Forested Area (%): Percentage of land area covered by forests.

Gasoline_Price: Price of gasoline per liter in local currency.

GDP: Gross Domestic Product, the total value of goods and services produced in the country.

Gross Primary Education Enrollment (%): Gross enrollment ratio for primary education.

Gross Tertiary Education Enrollment (%): Gross enrollment ratio for tertiary education.

Infant Mortality: Number of deaths per 1,000 live births before reaching one year of age.

Largest City: Name of the country's largest city.

Life Expectancy: Average number of years a newborn is expected to live.

Maternal Mortality Ratio: Number of maternal deaths per 100,000 live births.

Minimum Wage: Minimum wage level in local currency.

Official Language: Official language(s) spoken in the country.

Out of Pocket Health Expenditure (%): Percentage of total health expenditure paid out-of-pocket by individuals.

Physicians per Thousand: Number of physicians per thousand people.

Population: Total population of the country.

Population: Labor Force Participation (%): Percentage of the population that is part of the labor force.

Tax Revenue (%): Tax revenue as a percentage of GDP.

Total Tax Rate: Overall tax burden as a percentage of commercial profits.

Unemployment Rate: Percentage of the labor force that is unemployed.

Urban Population: Percentage of the population living in urban areas.

Latitude: Latitude coordinate of the country's location.

Longitude: Longitude coordinate of the country's location.

Potential Use Cases

Analyze population density and land area to study spatial distribution patterns.

Investigate the relationship between agricultural land and food security.

Examine carbon dioxide emissions and their impact on climate change.

Explore correlations between economic indicators such as GDP and various socio-economic factors.

Investigate educational enrollment rates and their implications for human capital development.

Analyze healthcare metrics such as infant mortality and life expectancy to assess overall well-being.

Study labor market dynamics through indicators such as labor force participation and unemployment rates.

Investigate the role of taxation and its impact on economic development.

Explore urbanization trends and their social and environmental consequences.

Facebook

Twitter

Click to copy link

Link copied

Cite

Ignacio Azua (2025). Life Expectancy [Dataset]. https://www.kaggle.com/datasets/ignacioazua/life-expectancy

Life Expectancy

Data based on the latest United Nations Population Division estimates

Explore at:

zip(3032 bytes)Available download formats

Dataset updated

Mar 4, 2025

Authors

Ignacio Azua

Description

Life Expectancy of the World Population

The dataset from Worldometer provides a ranked list of countries based on life expectancy at birth, which represents the average number of years a newborn is expected to live under current mortality rates. It includes global, regional, and country-specific life expectancy figures, with separate data for males and females. The dataset highlights disparities in longevity across nations, with countries like Hong Kong, Japan, and South Korea having the highest life expectancies. This data serves as a key indicator of public health, quality of life, and healthcare effectiveness, offering valuable insights for policymakers, researchers, and global health organizations.

Data Analysis & Machine Learning Approaches for Life Expectancy Data

Data Analysis Approaches Life expectancy data can be analyzed using descriptive statistics (mean, variance, distribution) and correlation analysis to identify relationships with factors like GDP, healthcare, and education. Time series analysis helps track longevity trends over time, while clustering techniques (e.g., K-Means) group countries with similar patterns. Additionally, geospatial analysis can visualize regional disparities in life expectancy.

Machine Learning Models For prediction, linear and multiple regression models estimate life expectancy based on socioeconomic indicators, while polynomial regression captures non-linear trends. Decision trees and Random Forests classify countries into high- and low-life expectancy groups. Deep learning techniques like neural networks (ANNs) can model complex relationships, while LSTMs are useful for time-series forecasting.

For pattern detection, K-Means clustering groups countries based on life expectancy trends, and DBSCAN identifies anomalies. Principal Component Analysis (PCA) helps in feature selection, improving model efficiency. These methods provide insights into longevity trends, helping policymakers and researchers improve public health strategies.

Life expectancy at birth. Data based on the latest United Nations Population Division estimates.

Source: https://www.worldometers.info/demographics/life-expectancy/#countries-ranked-by-life-expectancy

Clear search

Close search

Google apps

Main menu

Life Expectancy

Life Expectancy Averaged Dataset

Key Features

Purpose of the Dataset

Inspiration

License

Effect of suicide rates on life expectancy dataset

Life Expectancy 1960 to present (Global)

Public Health Statistics - Life Expectancy By Community Area - Historical

Replication Data for: The Association Between Income and Life Expectancy in...

🌱Life Expectation

Health state life expectancy, all ages, UK

Vital Signs: Life Expectancy – by ZIP Code

Life Expectancy WHO

We use DECISION TREE MODEL for the analysis.

We use RANDOM FOREST for the analysis.

Public Health Statistics - Life Expectancy By Race Ethnicity - Historical

Life expectancy at various ages, by population group and sex, Canada

Healthy and Disability-Free Life Expectancy - Dataset - data.gov.uk

Life Expectancy - Dataset - Datopian CKAN instance

SHIP Life Expectancy 2010-2021

Colombia CO: Life Expectancy at Birth: Total

Life expectancy at birth, female (years) - Sierra Leone

San Marino SM: Life Expectancy at Birth

Cambodia KH: Life Expectancy at Birth: Total

Global Country Information 2023

Life Expectancy

Data based on the latest United Nations Population Division estimates