100+ datasets found

Healthcare Diabetes Dataset
kaggle.com
zip
Updated Aug 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nandita Pore (2023). Healthcare Diabetes Dataset [Dataset]. https://www.kaggle.com/datasets/nanditapore/healthcare-diabetes
Explore at:
zip(27316 bytes)Available download formats
Dataset updated
Aug 23, 2023
Authors
Nandita Pore
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Description: Welcome to the Diabetes Prediction Dataset, a valuable resource for researchers, data scientists, and medical professionals interested in the field of diabetes risk assessment and prediction. This dataset contains a diverse range of health-related attributes, meticulously collected to aid in the development of predictive models for identifying individuals at risk of diabetes. By sharing this dataset, we aim to foster collaboration and innovation within the data science community, leading to improved early diagnosis and personalized treatment strategies for diabetes.

Columns: 1. Id: Unique identifier for each data entry. 2. Pregnancies: Number of times pregnant. 3. Glucose: Plasma glucose concentration over 2 hours in an oral glucose tolerance test. 4. BloodPressure: Diastolic blood pressure (mm Hg). 5. SkinThickness: Triceps skinfold thickness (mm). 6. Insulin: 2-Hour serum insulin (mu U/ml). 7. BMI: Body mass index (weight in kg / height in m^2). 8. DiabetesPedigreeFunction: Diabetes pedigree function, a genetic score of diabetes. 9. Age: Age in years. 10. Outcome: Binary classification indicating the presence (1) or absence (0) of diabetes.

Utilize this dataset to explore the relationships between various health indicators and the likelihood of diabetes. You can apply machine learning techniques to develop predictive models, feature selection strategies, and data visualization to uncover insights that may contribute to more accurate risk assessments. As you embark on your journey with this dataset, remember that your discoveries could have a profound impact on diabetes prevention and management.

Please ensure that you adhere to ethical guidelines and respect the privacy of individuals represented in this dataset. Proper citation and recognition of this dataset's source are appreciated to promote collaboration and knowledge sharing.

Start your exploration of the Diabetes Prediction Dataset today and contribute to the ongoing efforts to combat diabetes through data-driven insights and innovations.
C
Diabetes
data.wprdc.org
gimi9.com
+2more
csv, html, xlsx
Updated Jun 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Allegheny County (2024). Diabetes [Dataset]. https://data.wprdc.org/dataset/diabetes
Explore at:
html, xlsx, csv(16013), csvAvailable download formats
Dataset updated
Jun 3, 2024
Dataset authored and provided by
Allegheny County
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
These datasets provide de-identified insurance data for diabetes. The data is provided by three managed care organizations in Allegheny County (Gateway Health Plan, Highmark Health, and UPMC) and represents their insured population for the 2015 and calendar years.

Disclaimer: Users should be cautious of using administrative claims data as a measure of disease prevalence and interpreting trends over time, as data provided were collected for purposes other than surveillance. Limitations of these data include but are not limited to: misclassification, duplicate individuals, exclusion of individuals who did not seek care in past two years and those who are: uninsured, enrolled in plans not represented in the dataset, or were not enrolled in one of the represented plans for at least 90 days.

Support for Health Equity datasets and tools provided by Amazon Web Services (AWS) through their Health Equity Initiative.
De-Identified Insurance Data for Diabetes
kaggle.com
zip
Updated Apr 30, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Utkarsh Singh (2023). De-Identified Insurance Data for Diabetes [Dataset]. https://www.kaggle.com/datasets/utkarshx27/de-identified-insurance-data-for-diabetes
Explore at:
zip(7054 bytes)Available download formats
Dataset updated
Apr 30, 2023
Authors
Utkarsh Singh
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
These datasets provide de-identified insurance data for diabetes. The data is provided by three managed care organizations in Allegheny County (Gateway Health Plan, Highmark Health, and UPMC) and represents their insured population for the 2015 and calendar years.
Diabetes Clinical Dataset(100k rows)
kaggle.com
Updated Feb 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ziya (2025). Diabetes Clinical Dataset(100k rows) [Dataset]. https://www.kaggle.com/datasets/ziya07/diabetes-clinical-dataset100k-rows
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 7, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ziya
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Detailed dataset comprising health and demographic data of 100,000 individuals, aimed at facilitating diabetes-related research and predictive modeling. This dataset includes information on gender, age, location, race, hypertension, heart disease, smoking history, BMI, HbA1c level, blood glucose level, and diabetes status.

Dataset Use Cases This dataset can be used for various analytical and machine learning purposes, such as:

Predictive Modeling: Build models to predict the likelihood of diabetes based on demographic and health-related features. Health Analytics: Analyze the correlation between different health metrics (e.g., BMI, HbA1c level) and diabetes. Demographic Studies: Examine the distribution of diabetes across different demographic groups and locations. Public Health Research: Identify risk factors for diabetes and target interventions to high-risk groups. Clinical Research: Study the relationship between comorbid conditions like hypertension and heart disease with diabetes. Potential Analyses Descriptive Statistics: Summarize the dataset to understand the central tendencies and dispersion of features. Correlation Analysis: Identify the relationships between features. Classification Models: Use machine learning algorithms to classify individuals as diabetic or non-diabetic. Trend Analysis: Analyze trends over the years to see how diabetes prevalence has changed. clinical_notes: clinical summaries based on patient attributes
G
Health indicator : diabetes : age-sex specific incidence rate by First...
open.canada.ca
open.alberta.ca
+1more
html
Updated Aug 14, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Alberta (2024). Health indicator : diabetes : age-sex specific incidence rate by First Nations status [Dataset]. https://open.canada.ca/data/en/dataset/3b0f3ead-a82e-45eb-b5dd-26ed54c3e561
Explore at:
htmlAvailable download formats
Dataset updated
Aug 14, 2024
Dataset provided by
Government of Alberta
License
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Description
This dataset presents information on age-sex specific incidence rates of diabetes by First Nations status for Alberta, expressed as per 100,000 population.
d
Diabetes + Hypertension (comorbidity)
catalog.data.gov
data.wprdc.org
+2more
Updated Mar 14, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Allegheny County (2023). Diabetes + Hypertension (comorbidity) [Dataset]. https://catalog.data.gov/dataset/diabetes-hypertension-comorbidity
Explore at:
Dataset updated
Mar 14, 2023
Dataset provided by
Allegheny County
Description
This data set provides de-identified population data for diabetes and hypertension comorbidity prevalence in Allegheny County. The data is provided by three managed care organizations in Allegheny County (Gateway Health Plan, Highmark Health, and UPMC) and represents their insured population for the 2015 and 2016 calendar years. Disclaimer: Users should be cautious of using administrative claims data as a measure of disease prevalence and interpreting trends over time, as data provided were collected for purposes other than surveillance. Limitations of these data include but are not limited to: misclassification, duplicate individuals, exclusion of individuals who did not seek care in past two years and those who are: uninsured, enrolled in plans not represented in the dataset, or were not enrolled in one of the represented plans for at least 90 days.
Diabetes control is associated with environmental quality in the U.S.
catalog.data.gov
Updated Jul 21, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2022). Diabetes control is associated with environmental quality in the U.S. [Dataset]. https://catalog.data.gov/dataset/diabetes-control-is-associated-with-environmental-quality-in-the-u-s
Explore at:
Dataset updated
Jul 21, 2022
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Area covered
United States
Description
Population-based county-level estimates for prevalence of DC were obtained from the Institute for Health Metrics and Evaluation (IHME) for the years 2004-2012 (16). DC prevalence rate was defined as the propor-tion of people within a county who had previously been diagnosed with diabetes (high fasting plasma glu-cose 126 mg/dL, hemoglobin A1c (HbA1c) of 6.5%, or diabetes diagnosis) but do not currently have high fasting plasma glucose or HbA1c for the period 2004-2012. DC prevalence estimates were calculated using a two-stage approach. The first stage used National Health and Nutrition Examination Survey (NHANES) data to predict high fasting plasma glucose (FPG) levels (≥126 mg/dL) and/or HbA1C levels (≥6.5% [48 mmol/mol]) based on self-reported demographic and behavioral characteristics (16). This model was then applied to Behavioral Risk Factor Surveillance System (BRFSS) data to impute high FPG and/or HbA1C status for each BRFSS respondent (16). The second stage used the imputed BRFSS data to fit a series of small area models, which were used to predict county-level prevalence of diabetes-related outcomes, including DC (16). The EQI was constructed for 2006-2010 for all US counties and is composed of five domains (air, water, built, land, and sociodemographic), each composed of variables to represent the environmental quality of that domain. Domain-specific EQIs were developed using principal components analysis (PCA) to reduce these variables within each domain while the overall EQI was constructed from a second PCA from these individual domains (L. C. Messer et al., 2014). To account for differences in environment across rural and urban counties, the overall and domain-specific EQIs were stratified by rural urban continuum codes (RUCCs) (U.S. Department of Agriculture, 2015). Results are reported as prevalence rate differences (PRD) with 95% confidence intervals (CIs) comparing the highest quintile/worst environmental quality to the lowest quintile/best environmental quality expo-sure metrics. PRDs are representative of the entire period of interest, 2004-2012. Due to availability of DC data and covariate data, not all counties were captured, however, the majority, 3134 of 3142 were utilized in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Human health data are not available publicly. EQI data are available at: https://edg.epa.gov/data/Public/ORD/NHEERL/EQI. Format: Data are stored as csv files. This dataset is associated with the following publication: Jagai, J., A. Krajewski, K. Price, D. Lobdell, and R. Sargis. Diabetes control is associated with environmental quality in the USA. Endocrine Connections. BioScientifica Ltd., Bristol, UK, 10(9): 1018-1026, (2021).
c
Diabetes Health Indicators Dataset
cubig.ai
zip
Updated Jun 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Diabetes Health Indicators Dataset [Dataset]. https://cubig.ai/store/products/399/diabetes-health-indicators-dataset
Explore at:
zipAvailable download formats
Dataset updated
Jun 5, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
Description
1) Data Introduction • The Diabetes Health Indicators Dataset is a large health dataset that collects various health indicators and lifestyle information related to diabetes diagnosis based on health surveys and medical records of the U.S. population.

2) Data Utilization (1) Diabetes Health Indicators Dataset has characteristics that: • The dataset consists of more than 250,000 samples and contains more than 20 health and demographic variables, including diabetes (binary or triage label), age, gender, BMI, blood pressure, cholesterol, smoking and drinking habits, physical activity, mental health, income, and education level. (2) Diabetes Health Indicators Dataset can be used to: • Diabetes prediction model development: It can be used to develop machine learning-based classification models that use health indicators and lifestyle data to predict the risk of developing diabetes. • A Study on the Correlation between Lifestyle and Diabetes: It can be used in epidemiological and public health studies to analyze the effects of various lifestyle and demographic variables such as smoking, drinking, exercise, and eating habits on diabetes incidence.
m
Pediatric Diabetes Data
mass.gov
Updated Mar 26, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Population Health Information Tool (2019). Pediatric Diabetes Data [Dataset]. https://www.mass.gov/info-details/pediatric-diabetes-data
Explore at:
Dataset updated
Mar 26, 2019
Dataset provided by
Department of Public Health
Population Health Information Tool
Bureau of Climate and Environmental Health
Area covered
Massachusetts
Description
Find data on pediatric diabetes in Massachusetts. This dataset contains information on the number of cases and prevalence of Type 1 and Type 2 diabetes among students, grades K-8, in Massachusetts.
U
United States US: Diabetes Prevalence: % of Population Aged 20-79
ceicdata.com
Updated Mar 15, 2009
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2009). United States US: Diabetes Prevalence: % of Population Aged 20-79 [Dataset]. https://www.ceicdata.com/en/united-states/health-statistics/us-diabetes-prevalence--of-population-aged-2079
Explore at:
Dataset updated
Mar 15, 2009
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2017
Area covered
United States
Description
United States US: Diabetes Prevalence: % of Population Aged 20-79 data was reported at 10.790 % in 2017. United States US: Diabetes Prevalence: % of Population Aged 20-79 data is updated yearly, averaging 10.790 % from Dec 2017 (Median) to 2017, with 1 observations. United States US: Diabetes Prevalence: % of Population Aged 20-79 data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s USA – Table US.World Bank: Health Statistics. Diabetes prevalence refers to the percentage of people ages 20-79 who have type 1 or type 2 diabetes.; ; International Diabetes Federation, Diabetes Atlas.; Weighted average;
Diabetes Dataset
kaggle.com
zip
Updated Jul 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Milad Hashemi (2023). Diabetes Dataset [Dataset]. https://www.kaggle.com/datasets/hashemi221022/diabetes
Explore at:
zip(9128 bytes)Available download formats
Dataset updated
Jul 10, 2023
Authors
Milad Hashemi
Description
Diabetes is a chronic disease that affects the way the body processes blood sugar, also known as glucose. Glucose is an important source of energy for the body's cells, and insulin, a hormone produced by the pancreas, helps to regulate glucose levels in the blood.

In people with diabetes, the body either doesn't produce enough insulin, or it can't effectively use the insulin it produces. This causes glucose to build up in the blood, leading to a range of health problems over time.

There are two main types of diabetes: type 1 and type 2. Type 1 diabetes, also known as juvenile diabetes, is usually diagnosed in children and young adults. It occurs when the body's immune system attacks and destroys the cells in the pancreas that produce insulin. People with type 1 diabetes need to take insulin injections or use an insulin pump to manage their blood sugar levels.

Type 2 diabetes is the most common form of diabetes, accounting for around 90% of all cases. It usually develops in adults, but can also occur in children and teenagers. In type 2 diabetes, the body becomes resistant to the effects of insulin, and the pancreas may not produce enough insulin to keep blood sugar levels in check. Lifestyle changes, such as a healthy diet and regular exercise, can help manage type 2 diabetes, and some people may also need medication or insulin therapy.

Both types of diabetes can lead to serious health complications over time, including heart disease, stroke, kidney disease, nerve damage, and eye problems. It's important for people with diabetes to work closely with their healthcare team to manage their condition and prevent these complications.

دیابت بیماری مزمنی است که نحوه پردازش قند خون را در بدن تحت تأثیر قرار می‌دهد. قند یک منبع مهم انرژی برای سلول‌های بدن است و انسولین، یک هورمون توسط پانکراس تولید شده، به کنترل سطح قند خون در بدن کمک می‌کند

در افراد دیابتی، بدن یا انسولین کافی تولید نمی‌کند یا نمی‌تواند به طور موثر از انسولینی که تولید می‌شود، استفاده کند. این باعث می‌شود که قند در خون تجمع پیدا کند که به مشکلات سلامتی در طول زمان منجر می‌شود

دو نوع اصلی دیابت وجود دارد: نوع ۱ و نوع ۲. دیابت نوع ۱ یا دیابت جوانان، معمولاً در کودکان و جوانان بزرگسال تشخیص داده می‌شود. این بیماری زمانی رخ می‌دهد که سیستم ایمنی بدن سلول‌های پانکراسی را که انسولین تولید می‌کنند، حمله می‌کند و از بین می‌برد. افراد دیابتی نوع ۱ باید تزریقات انسولین یا استفاده از پمپ انسولین برای کنترل سطح قند خون خود استفاده کنند

دیابت نوع ۲ شایع‌ترین نوع دیابت است که حدود ۹۰٪ از کل موارد را شامل می‌شود. این نوع بیماری معمولاً در بزرگسالان ایجاد می‌شود، اما ممکن است در کودکان و نوجوانان نیز رخ دهد. در دیابت نوع ۲، بدن به اثرات انسولین مقاومت پیدا می‌کند و پانکراس ممکن است انسولین کافی برای کنترل سطح قند خون تولید نکند. تغییرات سبک زندگی مانند رژیم غذایی سالم و ورزش منظم می‌تواند به مدیریت دیابت نوع ۲ کمک کند و برخی افراد ممکن است نیاز به دارو یا درمان انسولین داشته باشند

هر دو نوع دیابت می‌تواند منجر به مشکلات سلامتی جدی در طول زمان شود، از جمله بیماری قلبی، سکته مغزی، بیماری کلیه، آسیب عصبی و مشکلات چشمی. برای افراد دارای دیابت، مهم است که به همراه تیم مراقبت از سلامتی خود همکاری کرده و برای جلوگیری از این مشکلات تلاش کنند

Diabetes ist eine chronische Krankheit, die die Art und Weise beeinflusst, wie der Körper Blutzucker, auch als Glukose, verarbeitet. Glukose ist eine wichtige Energiequelle für die Zellen des Körpers, und Insulin, ein Hormon, das von der Bauchspeicheldrüse produziert wird, hilft bei der Regulierung des Glukosespiegels im Blut.

Bei Menschen mit Diabetes produziert der Körper entweder nicht genug Insulin oder kann das Insulin, das er produziert, nicht effektiv nutzen. Dies führt dazu, dass sich Glukose im Blut ansammelt, was im Laufe der Zeit zu einer Reihe von Gesundheitsproblemen führen kann.

Es gibt zwei Haupttypen von Diabetes: Typ 1 und Typ 2. Diabetes Typ 1, auch als juveniler Diabetes bekannt, wird in der Regel bei Kindern und jungen Erwachsenen diagnostiziert. Es tritt auf, wenn das Immunsystem des Körpers die Zellen in der Bauchspeicheldrüse angreift und zerstört, die Insulin produzieren. Menschen mit Diabetes Typ 1 müssen Insulininjektionen oder eine Insulinpumpe verwenden, um ihren Blutzuckerspiegel zu kontrollieren.

Diabetes Typ 2 ist die häufigste Form von Diabetes und macht etwa 90% aller Fälle aus. Es entwickelt sich in der Regel bei Erwachsenen, kann aber auch bei Kindern und Jugendlichen auftreten. Bei Diabetes Typ 2 wird der Körper gegenüber den Wirkungen von Insulin resistent, und die Ba...
Diabetes - Dataset - data.gov.uk
ckan.publishing.service.gov.uk
Updated Jul 12, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ckan.publishing.service.gov.uk (2017). Diabetes - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/diabetes
Explore at:
Dataset updated
Jul 12, 2017
Dataset provided by
CKANhttps://ckan.org/
Description
This public health factsheet describes facts, assets, and strategies related to diabetes in Camden.
c
Diabetes mellitus (in persons aged 17 and over): England
data.catchmentbasedapproach.org
Updated Apr 7, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Rivers Trust (2021). Diabetes mellitus (in persons aged 17 and over): England [Dataset]. https://data.catchmentbasedapproach.org/datasets/diabetes-mellitus-in-persons-aged-17-and-over-england
Explore at:
Dataset updated
Apr 7, 2021
Dataset authored and provided by
The Rivers Trust
Area covered

Description
SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of diabetes mellitus in persons (aged 17+). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to diabetes mellitus in persons (aged 17+).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (aged 17+) with diabetes mellitus was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with diabetes mellitus was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with depression, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have diabetes mellitusB) the NUMBER of people within that MSOA who are estimated to have diabetes mellitusAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have diabetes mellitus, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from diabetes mellitus, and where those people make up a large percentage of the population, indicating there is a real issue with diabetes mellitus within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of diabetes mellitus, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of diabetes mellitus.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.
a
Health indicator : diabetes : age-standardized incidence rate
open.alberta.ca
datasets.ai
+2more
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Health indicator : diabetes : age-standardized incidence rate [Dataset]. https://open.alberta.ca/dataset/health-indicator-diabetes-age-standardized-incidence-rate
Explore at:
Description
This dataset presents information on age-standardized incidence rates of diabetes for Alberta, for selected geographic areas , expressed as per 100,000 population.
Diabetes_Dataset_With_18_Features
kaggle.com
zip
Updated Jul 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Parisa Karimi Darabi (2023). Diabetes_Dataset_With_18_Features [Dataset]. https://www.kaggle.com/datasets/pkdarabi/diabetes-dataset-with-18-features
Explore at:
zip(104322 bytes)Available download formats
Dataset updated
Jul 28, 2023
Authors
Parisa Karimi Darabi
Description
Diabetes is the fourth leading cause of death in the world and one of the most common endocrine disorders. According to studies, Type 2 diabetes kills thousands of people around the world every year and imposes huge costs on societies in the form of surgeries and other treatment programs, as well as controlling complications and disability. Therefore, predicting and early diagnosis of this disease can greatly help governments and patients.

This dataset is the output of a Chinese research study conducted in 2016. It includes 1304 samples of patients who tested positive for diabetes, and the age of the participants ranges from 21 to 99 years old. The dataset was collected according to the indicators and standards of the World Health Organization, making it a reliable source for building diabetes diagnosis models. Researchers and healthcare professionals can use this dataset to train and test machine learning models to predict and diagnose diabetes in patients.

Features of Dataset: Age Gender BMI SBP (Systolic Blood Pressure) DBP (Diastolic Blood Pressure) FPG (Fasting Plasma Glucose) FFPG (Final Fasting Plasma Glucose) Cholesterol Triglyceride HDL (High-Density Lipoprotein) LDL (Low-Density Lipoprotein) ALT (Alanine Aminotransferase) BUN (Blood urea nitrogen) CCR (Creatinine Clearance) Smoking Status: (1: Current Smoker, 2: Ever Smoker, 3: Never Smoker) Drinking Status: (1: Current Drinker, 2: Ever Drinker, 3: Never Drinker) Family History of Diabetes: (1: Yes, 0: No) Diabetes

More details about dataset: The main dataset, without cleaning, is available at the following link: https://datadryad.org/stash/dataset/doi:10.5061/dryad.ft8750v. The main article corresponding to the dataset can be found at: https://doi.org/10.11.../bmjopen-2018-021768
f
Healthrise diabetes dataset noPII.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Dec 11, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Madela, Sanele Listen Mandlenkosi; Harriman, Nigel Walsh; Sifunda, Sibusiso; Sewpaul, Ronel; Williams, David R; Mbewu, Anthony David; Nyembezi, Anam; Manyaapelo, Thabang; Reddy, Sasiragha Priscilla (2023). Healthrise diabetes dataset noPII. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001058022
Explore at:
Dataset updated
Dec 11, 2023
Authors
Madela, Sanele Listen Mandlenkosi; Harriman, Nigel Walsh; Sifunda, Sibusiso; Sewpaul, Ronel; Williams, David R; Mbewu, Anthony David; Nyembezi, Anam; Manyaapelo, Thabang; Reddy, Sasiragha Priscilla
Description
South Africa is experiencing a rapidly growing diabetes epidemic that threatens its healthcare system. Research on the determinants of diabetes in South Africa receives considerable attention due to the lifestyle changes accompanying South Africa’s rapid urbanization since the fall of Apartheid. However, few studies have investigated how segments of the Black South African population, who continue to endure Apartheid’s institutional discriminatory legacy, experience this transition. This paper explores the association between individual and area-level socioeconomic status and diabetes prevalence, awareness, treatment, and control within a sample of Black South Africans aged 45 years or older in three municipalities in KwaZulu-Natal. Cross-sectional data were collected on 3,685 participants from February 2017 to February 2018. Individual-level socioeconomic status was assessed with employment status and educational attainment. Area-level deprivation was measured using the most recent South African Multidimensional Poverty Index scores. Covariates included age, sex, BMI, and hypertension diagnosis. The prevalence of diabetes was 23% (n = 830). Of those, 769 were aware of their diagnosis, 629 were receiving treatment, and 404 had their diabetes controlled. Compared to those with no formal education, Black South Africans with some high school education had increased diabetes prevalence, and those who had completed high school had lower prevalence of treatment receipt. Employment status was negatively associated with diabetes prevalence. Black South Africans living in more deprived wards had lower diabetes prevalence, and those residing in wards that became more deprived from 2001 to 2011 had a higher prevalence diabetes, as well as diabetic control. Results from this study can assist policymakers and practitioners in identifying modifiable risk factors for diabetes among Black South Africans to intervene on. Potential community-based interventions include those focused on patient empowerment and linkages to care. Such interventions should act in concert with policy changes, such as expanding the existing sugar-sweetened beverage tax.
f
Data from: Prevalence and Incidence of Diabetes in Stockholm County...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Aug 14, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andersson, Tomas; Ahlbom, Anders; Carlsson, Sofia; Magnusson, Cecilia (2014). Prevalence and Incidence of Diabetes in Stockholm County 1990-2010 [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001178390
Explore at:
Dataset updated
Aug 14, 2014
Authors
Andersson, Tomas; Ahlbom, Anders; Carlsson, Sofia; Magnusson, Cecilia
Area covered
Stockholm County
Description
BackgroundDiabetes is on the rise in the western world, but data from Scandinavia are inconsistent with indications of stable or even reverse trends. To shed new light on this issue, we investigated diabetes trends in Stockholm County 1990–2010, taking into account trends in risk factors and mortality.MethodsWe used data from a large population-based quadrennial public health survey conducted between 1990 and 2010 in Stockholm County (∼2.1 million inhabitants), supplemented with data from national registers. The age-standardized prevalence and incidence rates of diabetes and related risk factors 1990–2010 were estimated in adult inhabitants. We also modelled the influence of potential risk factors on the diabetes trends and estimated the life time risk of diabetes.ResultsThe prevalence of diabetes was 4.6% (95% confidence interval (CI); 4.5–4.8%) in 2010 compared to 2.8% (95% CI; 2.3–3.5%) in 1990. Between 1990 and 2002 the prevalence rose annually by 3.8% (95% CI; 2.1–5.5). Incidence rates showed a similar pattern and rose by 3.0% (95% CI; 0.3–5.7%) annually 1990–2002. The rising incidence was mainly attributable to increasing prevalence of overweight/obesity, which rose by 46% during the observation period. In 2010, the lifetime risk of adult onset diabetes was 28% (95% CI; 26–31%) in men and 19% (95% CI; 17–21%) in women.ConclusionsDiabetes rates have been increasing in Stockholm over the last decades, both in terms of incidence and prevalence, and this development is most likely the result of increasing overweight and obesity in the population.
Years of Life Lost (YLL): Diabetes - Dataset - data.gov.uk
ckan.publishing.service.gov.uk
Updated Feb 9, 2010
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ckan.publishing.service.gov.uk (2010). Years of Life Lost (YLL): Diabetes - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/years_of_life_lost_yll_-_diabetes
Explore at:
Dataset updated
Feb 9, 2010
Dataset provided by
CKANhttps://ckan.org/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
Years of Life Lost (YLL) as a result of death from diabetes. Directly age-Standardised Rates (DSR) per 100,000 population Source: Office for National Statistics (ONS) Publisher: Information Centre (IC) - Clinical and Health Outcomes Knowledge Base Geographies: Local Authority District (LAD), Government Office Region (GOR), National, Strategic Health Authority (SHA) Geographic coverage: England Time coverage: 2005-07, 2007 Type of data: Administrative data
u
Diabetes Mellitus death rates by county, 2019-2023 - Dataset - Healthy...
midb.uspatial.umn.edu
Updated Oct 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Diabetes Mellitus death rates by county, 2019-2023 - Dataset - Healthy Communities Data Portal [Dataset]. https://midb.uspatial.umn.edu/hcdp/dataset/diabetes-mellitus-death-rates-by-county-2019-2023
Explore at:
Dataset updated
Oct 24, 2025
Description
Diabetes Mellitus death rates by county, all races (includes Hispanic/Latino), all sexes, all ages, 2019-2023. Death data were provided by the National Vital Statistics System. Death rates (deaths per 100,000 population per year) are age-adjusted to the 2000 US standard population (20 age groups: <1, 1-4, 5-9, ... , 80-84, 85-89, 90+). Rates calculated using SEER*Stat. Population counts for denominators are based on Census populations as modified by the National Cancer Institute. The US Population Data File is used for mortality data. The Average Annual Percent Change is based onthe APCs calculated by the Joinpoint Regression Program (Version 4.9.0.0). Due to data availability issues, the time period used in the calculation of the joinpoint regression model may differ for selected counties. Counties with a (3) after their name may have their joinpoint regresssion model calculated using a different time period due to data availability issues.
G
Health indicator : diabetes : age-sex specific incidence rate
open.canada.ca
open.alberta.ca
html
Updated Jul 24, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Alberta (2024). Health indicator : diabetes : age-sex specific incidence rate [Dataset]. https://open.canada.ca/data/en/dataset/650baf4c-d8c4-42b5-82c2-bbfb88b927fb
Explore at:
htmlAvailable download formats
Dataset updated
Jul 24, 2024
Dataset provided by
Government of Alberta
License
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Description
This dataset present information on age-sex specific incidence rates of diabetes for Alberta and AHS continuum zone, expressed as per 100,000 population.

Facebook

Twitter

Click to copy link

Link copied

Cite

Nandita Pore (2023). Healthcare Diabetes Dataset [Dataset]. https://www.kaggle.com/datasets/nanditapore/healthcare-diabetes

Healthcare Diabetes Dataset

A Comprehensive Dataset for Diabetes Risk Assessment

Explore at:

zip(27316 bytes)Available download formats

Dataset updated

Aug 23, 2023

Authors

Nandita Pore

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Description: Welcome to the Diabetes Prediction Dataset, a valuable resource for researchers, data scientists, and medical professionals interested in the field of diabetes risk assessment and prediction. This dataset contains a diverse range of health-related attributes, meticulously collected to aid in the development of predictive models for identifying individuals at risk of diabetes. By sharing this dataset, we aim to foster collaboration and innovation within the data science community, leading to improved early diagnosis and personalized treatment strategies for diabetes.

Columns: 1. Id: Unique identifier for each data entry. 2. Pregnancies: Number of times pregnant. 3. Glucose: Plasma glucose concentration over 2 hours in an oral glucose tolerance test. 4. BloodPressure: Diastolic blood pressure (mm Hg). 5. SkinThickness: Triceps skinfold thickness (mm). 6. Insulin: 2-Hour serum insulin (mu U/ml). 7. BMI: Body mass index (weight in kg / height in m^2). 8. DiabetesPedigreeFunction: Diabetes pedigree function, a genetic score of diabetes. 9. Age: Age in years. 10. Outcome: Binary classification indicating the presence (1) or absence (0) of diabetes.

Utilize this dataset to explore the relationships between various health indicators and the likelihood of diabetes. You can apply machine learning techniques to develop predictive models, feature selection strategies, and data visualization to uncover insights that may contribute to more accurate risk assessments. As you embark on your journey with this dataset, remember that your discoveries could have a profound impact on diabetes prevention and management.

Please ensure that you adhere to ethical guidelines and respect the privacy of individuals represented in this dataset. Proper citation and recognition of this dataset's source are appreciated to promote collaboration and knowledge sharing.

Start your exploration of the Diabetes Prediction Dataset today and contribute to the ongoing efforts to combat diabetes through data-driven insights and innovations.

Clear search

Close search

Google apps

Main menu

Healthcare Diabetes Dataset

Diabetes

De-Identified Insurance Data for Diabetes

Diabetes Clinical Dataset(100k rows)

Health indicator : diabetes : age-sex specific incidence rate by First...

Diabetes + Hypertension (comorbidity)

Diabetes control is associated with environmental quality in the U.S.

Diabetes Health Indicators Dataset

Pediatric Diabetes Data

United States US: Diabetes Prevalence: % of Population Aged 20-79

Diabetes Dataset

Diabetes - Dataset - data.gov.uk

Diabetes mellitus (in persons aged 17 and over): England

Health indicator : diabetes : age-standardized incidence rate

Diabetes_Dataset_With_18_Features

Healthrise diabetes dataset noPII.

Data from: Prevalence and Incidence of Diabetes in Stockholm County...

Years of Life Lost (YLL): Diabetes - Dataset - data.gov.uk

Diabetes Mellitus death rates by county, 2019-2023 - Dataset - Healthy...

Health indicator : diabetes : age-sex specific incidence rate

Healthcare Diabetes Dataset

A Comprehensive Dataset for Diabetes Risk Assessment