100+ datasets found

Comprehensive Diabetes Clinical Dataset(100k rows)
kaggle.com
zip
Updated Jul 20, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Priyam Choksi (2024). Comprehensive Diabetes Clinical Dataset(100k rows) [Dataset]. https://www.kaggle.com/datasets/priyamchoksi/100000-diabetes-clinical-dataset
Explore at:
zip(917848 bytes)Available download formats
Dataset updated
Jul 20, 2024
Authors
Priyam Choksi
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Detailed dataset comprising health and demographic data of 100,000 individuals, aimed at facilitating diabetes-related research and predictive modeling. This dataset includes information on gender, age, location, race, hypertension, heart disease, smoking history, BMI, HbA1c level, blood glucose level, and diabetes status.

Dataset Use Cases

This dataset can be used for various analytical and machine learning purposes, such as:

Predictive Modeling: Build models to predict the likelihood of diabetes based on demographic and health-related features.

Health Analytics: Analyze the correlation between different health metrics (e.g., BMI, HbA1c level) and diabetes.

Demographic Studies: Examine the distribution of diabetes across different demographic groups and locations.

Public Health Research: Identify risk factors for diabetes and target interventions to high-risk groups.

Clinical Research: Study the relationship between comorbid conditions like hypertension and heart disease with diabetes.

Potential Analyses

Descriptive Statistics: Summarize the dataset to understand the central tendencies and dispersion of features.

Correlation Analysis: Identify the relationships between features.

Classification Models: Use machine learning algorithms to classify individuals as diabetic or non-diabetic.

Trend Analysis: Analyze trends over the years to see how diabetes prevalence has changed.

Diabetes Health Indicators Dataset

kaggle.com

Updated Sep 21, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Mohan Krishna Thalla (2025). Diabetes Health Indicators Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/13128284

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Unique identifier

https://doi.org/10.34740/kaggle/dsv/13128284

Dataset updated

Sep 21, 2025

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Mohan Krishna Thalla

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Diabetes Health Indicators Dataset

Overview

This dataset contains 100,000 patient records designed for diabetes risk prediction, analysis, and machine learning applications. The dataset is clean, preprocessed, and ready for use in classification, regression, feature engineering, statistical analysis, and data visualization.

Rows: 100,000
Columns: 35+
File: diabetes_dataset.csv

Dataset Description

The dataset includes patient profiles with features based on demographics, lifestyle habits, family history, and clinical measurements that are well-established indicators of diabetes risk. All data is generated using statistical distributions inspired by real-world medical research, ensuring privacy preservation while reflecting realistic health patterns.

Features

Column	Type	Description	Values/Range
patient_id	Integer	Unique patient identifier	1–100000
age	Integer	Age of patient in years	18–90
gender	String	Patient gender	'Male', 'Female', 'Other'
ethnicity	String	Ethnic background	'White', 'Hispanic', 'Black', 'Asian', 'Other'
education_level	String	Highest completed education	'No formal', 'Highschool', 'Graduate', 'Postgraduate'
income_level	String	Income category	'Low', 'Medium', 'High'
employment_status	String	Employment type	'Employed', 'Unemployed', 'Retired', 'Student'
smoking_status	String	Smoking behavior	'Never', 'Former', 'Current'
alcohol_consumption_per_week	Float	Drinks consumed per week	0–30
physical_activity_minutes_per_week	Integer	Physical activity (weekly minutes)	0–600
diet_score	Integer	Diet quality (higher = healthier)	0–10
sleep_hours_per_day	Float	Average daily sleep hours	3–12
screen_time_hours_per_day	Float	Average daily screen time hours	0–12
family_history_diabetes	Integer	Family history of diabetes	0 = No, 1 = Yes
hypertension_history	Integer	Hypertension history	0 = No, 1 = Yes
cardiovascular_history	Integer	Cardiovascular history	0 = No, 1 = Yes
bmi	Float	Body Mass Index (kg/m²)	15–45
waist_to_hip_ratio	Float	Waist-to-hip ratio	0.7–1.2
systolic_bp	Integer	Systolic blood pressure (mmHg)	90–180
diastolic_bp	Integer	Diastolic blood pressure (mmHg)	60–120
heart_rate	Integer	Resting heart rate (bpm)	50–120
cholesterol_total	Float	Total cholesterol (mg/dL)	120–300
hdl_cholesterol	Float	HDL cholesterol (mg/dL)	20–100
ldl_cholesterol	Float	LDL cholesterol (mg/dL)	50–200
triglycerides	Float	Triglycerides (mg/dL)	50–500
glucose_fasting	Float	Fasting glucose (mg/dL)	70–250
glucose_postprandial	Float	Post-meal glucose (mg/dL)	90–350
insulin_level	Float	Blood insulin level (µU/mL)	2–50
hba1c	Float	HbA1c (%)	4–14
diabetes_risk_score	Integer	Risk score (calculated, 0–100)	0–100
diabetes_stage	String	Stage of diabetes	'No Diabetes', 'Pre-Diabetes', 'Type 1', 'Type 2', 'Gestational'
diagnosed_diabetes	Integer	Target: Diabetes diagnosis	0 = No, 1 = Yes

Data Quality

✅ Complete: No missing values or duplicates
✅ Clean: All values fall within medically realistic ranges
✅ Balanced Features: Distribution matches realistic population health patterns
✅ Target Distribution: ~20–25% diagnosed cases (balanced for ML classification)

Use Cases

🩺 Binary Classification → Predict diagnosed_diabetes (Yes/No)
🧮 Multiclass Classification → Predict diabetes_stage
📊 Regression → Predict glucose_fasting, hba1c, or diabetes_risk_score
🔍 EDA & Visualization → Explore lifestyle and clinical health patterns
🧠 Machine Learning → Train ML/DL models for healthcare prediction tasks
📈 Statistical Testing → Hypothesis testing on health indicators

f
Healthrise diabetes dataset noPII.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Dec 11, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Madela, Sanele Listen Mandlenkosi; Harriman, Nigel Walsh; Sifunda, Sibusiso; Sewpaul, Ronel; Williams, David R; Mbewu, Anthony David; Nyembezi, Anam; Manyaapelo, Thabang; Reddy, Sasiragha Priscilla (2023). Healthrise diabetes dataset noPII. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001058022
Explore at:
Dataset updated
Dec 11, 2023
Authors
Madela, Sanele Listen Mandlenkosi; Harriman, Nigel Walsh; Sifunda, Sibusiso; Sewpaul, Ronel; Williams, David R; Mbewu, Anthony David; Nyembezi, Anam; Manyaapelo, Thabang; Reddy, Sasiragha Priscilla
Description
South Africa is experiencing a rapidly growing diabetes epidemic that threatens its healthcare system. Research on the determinants of diabetes in South Africa receives considerable attention due to the lifestyle changes accompanying South Africa’s rapid urbanization since the fall of Apartheid. However, few studies have investigated how segments of the Black South African population, who continue to endure Apartheid’s institutional discriminatory legacy, experience this transition. This paper explores the association between individual and area-level socioeconomic status and diabetes prevalence, awareness, treatment, and control within a sample of Black South Africans aged 45 years or older in three municipalities in KwaZulu-Natal. Cross-sectional data were collected on 3,685 participants from February 2017 to February 2018. Individual-level socioeconomic status was assessed with employment status and educational attainment. Area-level deprivation was measured using the most recent South African Multidimensional Poverty Index scores. Covariates included age, sex, BMI, and hypertension diagnosis. The prevalence of diabetes was 23% (n = 830). Of those, 769 were aware of their diagnosis, 629 were receiving treatment, and 404 had their diabetes controlled. Compared to those with no formal education, Black South Africans with some high school education had increased diabetes prevalence, and those who had completed high school had lower prevalence of treatment receipt. Employment status was negatively associated with diabetes prevalence. Black South Africans living in more deprived wards had lower diabetes prevalence, and those residing in wards that became more deprived from 2001 to 2011 had a higher prevalence diabetes, as well as diabetic control. Results from this study can assist policymakers and practitioners in identifying modifiable risk factors for diabetes among Black South Africans to intervene on. Potential community-based interventions include those focused on patient empowerment and linkages to care. Such interventions should act in concert with policy changes, such as expanding the existing sugar-sweetened beverage tax.
Diabetes
kaggle.com
zip
Updated Oct 8, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohamadreza Momeni (2023). Diabetes [Dataset]. https://www.kaggle.com/datasets/imtkaggleteam/diabetes
Explore at:
zip(11477 bytes)Available download formats
Dataset updated
Oct 8, 2023
Authors
Mohamadreza Momeni
Description
Dataset Description: Several hundred rural African-American patients were included. The diabetes.csv file contains the raw data of all patients, including those with missing data. This can be used for descriptive statistics. The data dictionary to explain the columns can be found here: here and here

The Diabetes_Classification file was cleaned and manipulated. Any patient without a hemoglobin A1c was excluded. If their hemoglobin A1 c was 6.5 or greater they were labelled with diabetes = yes [column = "glyhb"]. Sixty patients out of 390 were found to be diabetic. A code book of the variables is included in one of the tabs. The goal is to use machine learning (classification algorithm) to predict diabetes based on demographic and laboratory variables. What are the strongest predictors? If you exclude glucose, how strong is the prediction?
Diabetes control is associated with environmental quality in the U.S.
catalog.data.gov
Updated Jul 21, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2022). Diabetes control is associated with environmental quality in the U.S. [Dataset]. https://catalog.data.gov/dataset/diabetes-control-is-associated-with-environmental-quality-in-the-u-s
Explore at:
Dataset updated
Jul 21, 2022
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Area covered
United States
Description
Population-based county-level estimates for prevalence of DC were obtained from the Institute for Health Metrics and Evaluation (IHME) for the years 2004-2012 (16). DC prevalence rate was defined as the propor-tion of people within a county who had previously been diagnosed with diabetes (high fasting plasma glu-cose 126 mg/dL, hemoglobin A1c (HbA1c) of 6.5%, or diabetes diagnosis) but do not currently have high fasting plasma glucose or HbA1c for the period 2004-2012. DC prevalence estimates were calculated using a two-stage approach. The first stage used National Health and Nutrition Examination Survey (NHANES) data to predict high fasting plasma glucose (FPG) levels (≥126 mg/dL) and/or HbA1C levels (≥6.5% [48 mmol/mol]) based on self-reported demographic and behavioral characteristics (16). This model was then applied to Behavioral Risk Factor Surveillance System (BRFSS) data to impute high FPG and/or HbA1C status for each BRFSS respondent (16). The second stage used the imputed BRFSS data to fit a series of small area models, which were used to predict county-level prevalence of diabetes-related outcomes, including DC (16). The EQI was constructed for 2006-2010 for all US counties and is composed of five domains (air, water, built, land, and sociodemographic), each composed of variables to represent the environmental quality of that domain. Domain-specific EQIs were developed using principal components analysis (PCA) to reduce these variables within each domain while the overall EQI was constructed from a second PCA from these individual domains (L. C. Messer et al., 2014). To account for differences in environment across rural and urban counties, the overall and domain-specific EQIs were stratified by rural urban continuum codes (RUCCs) (U.S. Department of Agriculture, 2015). Results are reported as prevalence rate differences (PRD) with 95% confidence intervals (CIs) comparing the highest quintile/worst environmental quality to the lowest quintile/best environmental quality expo-sure metrics. PRDs are representative of the entire period of interest, 2004-2012. Due to availability of DC data and covariate data, not all counties were captured, however, the majority, 3134 of 3142 were utilized in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Human health data are not available publicly. EQI data are available at: https://edg.epa.gov/data/Public/ORD/NHEERL/EQI. Format: Data are stored as csv files. This dataset is associated with the following publication: Jagai, J., A. Krajewski, K. Price, D. Lobdell, and R. Sargis. Diabetes control is associated with environmental quality in the USA. Endocrine Connections. BioScientifica Ltd., Bristol, UK, 10(9): 1018-1026, (2021).
u
Data from: T1DiabetesGranada: a longitudinal multi-modal dataset of type 1...
produccioncientifica.ugr.es
data.niaid.nih.gov
Updated 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rodriguez-Leon, Ciro; Aviles Perez, Maria Dolores; Banos, Oresti; Quesada-Charneco, Miguel; Lopez-Ibarra, Pablo J; Villalonga, Claudia; Munoz-Torres, Manuel; Rodriguez-Leon, Ciro; Aviles Perez, Maria Dolores; Banos, Oresti; Quesada-Charneco, Miguel; Lopez-Ibarra, Pablo J; Villalonga, Claudia; Munoz-Torres, Manuel (2023). T1DiabetesGranada: a longitudinal multi-modal dataset of type 1 diabetes mellitus [Dataset]. https://produccioncientifica.ugr.es/documentos/668fc429b9e7c03b01bd53b7
Explore at:
Dataset updated
2023
Authors
Rodriguez-Leon, Ciro; Aviles Perez, Maria Dolores; Banos, Oresti; Quesada-Charneco, Miguel; Lopez-Ibarra, Pablo J; Villalonga, Claudia; Munoz-Torres, Manuel; Rodriguez-Leon, Ciro; Aviles Perez, Maria Dolores; Banos, Oresti; Quesada-Charneco, Miguel; Lopez-Ibarra, Pablo J; Villalonga, Claudia; Munoz-Torres, Manuel
Description
T1DiabetesGranada

A longitudinal multi-modal dataset of type 1 diabetes mellitus

Documented by:

Rodriguez-Leon, C., Aviles-Perez, M. D., Banos, O., Quesada-Charneco, M., Lopez-Ibarra, P. J., Villalonga, C., & Munoz-Torres, M. (2023). T1DiabetesGranada: a longitudinal multi-modal dataset of type 1 diabetes mellitus. Scientific Data, 10(1), 916. https://doi.org/10.1038/s41597-023-02737-4

Background

Type 1 diabetes mellitus (T1D) patients face daily difficulties in keeping their blood glucose levels within appropriate ranges. Several techniques and devices, such as flash glucose meters, have been developed to help T1D patients improve their quality of life. Most recently, the data collected via these devices is being used to train advanced artificial intelligence models to characterize the evolution of the disease and support its management. The main problem for the generation of these models is the scarcity of data, as most published works use private or artificially generated datasets. For this reason, this work presents T1DiabetesGranada, a open under specific permission longitudinal dataset that not only provides continuous glucose levels, but also patient demographic and clinical information. The dataset includes 257780 days of measurements over four years from 736 T1D patients from the province of Granada, Spain. This dataset progresses significantly beyond the state of the art as one the longest and largest open datasets of continuous glucose measurements, thus boosting the development of new artificial intelligence models for glucose level characterization and prediction.

Data Records

The data are stored in four comma-separated values (CSV) files which are available in T1DiabetesGranada.zip. These files are described in detail below.

Patient_info.csv

Patient_info.csv is the file containing information about the patients, such as demographic data, start and end dates of blood glucose level measurements and biochemical parameters, number of biochemical parameters or number of diagnostics. This file is composed of 736 records, one for each patient in the dataset, and includes the following variables:

Patient_ID – Unique identifier of the patient. Format: LIB19XXXX.

Sex – Sex of the patient. Values: F (for female), masculine (for male)

Birth_year – Year of birth of the patient. Format: YYYY.

Initial_measurement_date – Date of the first blood glucose level measurement of the patient in the Glucose_measurements.csv file. Format: YYYY-MM-DD.

Final_measurement_date – Date of the last blood glucose level measurement of the patient in the Glucose_measurements.csv file. Format: YYYY-MM-DD.

Number_of_days_with_measures – Number of days with blood glucose level measurements of the patient, extracted from the Glucose_measurements.csv file. Values: ranging from 8 to 1463.

Number_of_measurements – Number of blood glucose level measurements of the patient, extracted from the Glucose_measurements.csv file. Values: ranging from 400 to 137292.

Initial_biochemical_parameters_date – Date of the first biochemical test to measure some biochemical parameter of the patient, extracted from the Biochemical_parameters.csv file. Format: YYYY-MM-DD.

Final_biochemical_parameters_date – Date of the last biochemical test to measure some biochemical parameter of the patient, extracted from the Biochemical_parameters.csv file. Format: YYYY-MM-DD.

Number_of_biochemical_parameters – Number of biochemical parameters measured on the patient, extracted from the Biochemical_parameters.csv file. Values: ranging from 4 to 846.

Number_of_diagnostics – Number of diagnoses realized to the patient, extracted from the Diagnostics.csv file. Values: ranging from 1 to 24.

Glucose_measurements.csv

Glucose_measurements.csv is the file containing the continuous blood glucose level measurements of the patients. The file is composed of more than 22.6 million records that constitute the time series of continuous blood glucose level measurements. It includes the following variables:

Patient_ID – Unique identifier of the patient. Format: LIB19XXXX.

Measurement_date – Date of the blood glucose level measurement. Format: YYYY-MM-DD.

Measurement_time – Time of the blood glucose level measurement. Format: HH:MM:SS.

Measurement – Value of the blood glucose level measurement in mg/dL. Values: ranging from 40 to 500.

Biochemical_parameters.csv

Biochemical_parameters.csv is the file containing data of the biochemical tests performed on patients to measure their biochemical parameters. This file is composed of 87482 records and includes the following variables:

Patient_ID – Unique identifier of the patient. Format: LIB19XXXX.

Reception_date – Date of receipt in the laboratory of the sample to measure the biochemical parameter. Format: YYYY-MM-DD.

Name – Name of the measured biochemical parameter. Values: 'Potassium', 'HDL cholesterol', 'Gammaglutamyl Transferase (GGT)', 'Creatinine', 'Glucose', 'Uric acid', 'Triglycerides', 'Alanine transaminase (GPT)', 'Chlorine', 'Thyrotropin (TSH)', 'Sodium', 'Glycated hemoglobin (Ac)', 'Total cholesterol', 'Albumin (urine)', 'Creatinine (urine)', 'Insulin', 'IA ANTIBODIES'.

Value – Value of the biochemical parameter. Values: ranging from -4.0 to 6446.74.

Diagnostics.csv

Diagnostics.csv is the file containing diagnoses of diabetes mellitus complications or other diseases that patients have in addition to type 1 diabetes mellitus. This file is composed of 1757 records and includes the following variables:

Patient_ID – Unique identifier of the patient. Format: LIB19XXXX.

Code – ICD-9-CM diagnosis code. Values: subset of 594 of the ICD-9-CM codes (https://www.cms.gov/Medicare/Coding/ICD9ProviderDiagnosticCodes/codes).

Description – ICD-9-CM long description. Values: subset of 594 of the ICD-9-CM long description (https://www.cms.gov/Medicare/Coding/ICD9ProviderDiagnosticCodes/codes).

Technical Validation

Blood glucose level measurements are collected using FreeStyle Libre devices, which are widely used for healthcare in patients with T1D. Abbott Diabetes Care, Inc., Alameda, CA, USA, the manufacturer company, has conducted validation studies of these devices concluding that the measurements made by their sensors compare to YSI analyzer devices (Xylem Inc.), the gold standard, yielding results of 99.9% of the time within zones A and B of the consensus error grid. In addition, other studies external to the company concluded that the accuracy of the measurements is adequate.

Moreover, it was also checked in most cases the blood glucose level measurements per patient were continuous (i.e. a sample at least every 15 minutes) in the Glucose_measurements.csv file as they should be.

Usage Notes

For data downloading, it is necessary to be authenticated on the Zenodo platform, accept the Data Usage Agreement and send a request specifying full name, email, and the justification of the data use. This request will be processed by the Secretary of the Department of Computer Engineering, Automatics, and Robotics of the University of Granada and access to the dataset will be granted.

The files that compose the dataset are CSV type files delimited by commas and are available in T1DiabetesGranada.zip. A Jupyter Notebook (Python v. 3.8) with code that may help to a better understanding of the dataset, with graphics and statistics, is available in UsageNotes.zip.

Graphs_and_stats.ipynb

The Jupyter Notebook generates tables, graphs and statistics for a better understanding of the dataset. It has four main sections, one dedicated to each file in the dataset. In addition, it has useful functions such as calculating the patient age, deleting a patient list from a dataset file and leaving only a patient list in a dataset file.

Code Availability

The dataset was generated using some custom code located in CodeAvailability.zip. The code is provided as Jupyter Notebooks created with Python v. 3.8. The code was used to conduct tasks such as data curation and transformation, and variables extraction.

Original_patient_info_curation.ipynb

In the Jupyter Notebook is preprocessed the original file with patient data. Mainly irrelevant rows and columns are removed, and the sex variable is recoded.

Glucose_measurements_curation.ipynb

In the Jupyter Notebook is preprocessed the original file with the continuous glucose level measurements of the patients. Principally rows without information or duplicated rows are removed and the variable with the timestamp is transformed into two new variables, measurement date and measurement time.

Biochemical_parameters_curation.ipynb

In the Jupyter Notebook is preprocessed the original file with patient data of the biochemical tests performed on patients to measure their biochemical parameters. Mainly irrelevant rows and columns are removed and the variable with the name of the measured biochemical parameter is translated.

Diagnostic_curation.ipynb

In the Jupyter Notebook is preprocessed the original file with patient data of the diagnoses of diabetes mellitus complications or other diseases that patients have in addition to T1D.

Get_patient_info_variables.ipynb

In the Jupyter Notebook it is coded the feature extraction process from the files Glucose_measurements.csv, Biochemical_parameters.csv and Diagnostics.csv to complete the file Patient_info.csv. It is divided into six sections, the first three to extract the features from each of the mentioned files and the next three to add the extracted features to the resulting new file.

Data Usage Agreement

The conditions for use are as follows:

You confirm that you will not attempt to re-identify research participants for any reason, including for re-identification theory research.

You commit to keeping the T1DiabetesGranada dataset confidential and secure and will not redistribute data or Zenodo account credentials.

You will require
g
PA: Rate of Hospitalizations for Short-Term Complications of Diabetes per...
gimi9.com
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
PA: Rate of Hospitalizations for Short-Term Complications of Diabetes per 10,000 by Age Range and County: Latest Year | gimi9.com [Dataset]. https://gimi9.com/dataset/ny_dbq9-t7nz
Explore at:
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This chart shows the rate of hospitalizations for short- term complications of diabetes for the most recent data year by age range and county. It also shows the 2017 objective by age range. This chart is based on one of three datasets related to the Prevention Agenda Tracking Indicators county level data posted on this site. Each dataset consists of county level data for 68 health tracking indicators and sub-indicators for the Prevention Agenda 2013-2017: New York State’s Health Improvement Plan. A health tracking indicator is a metric through which progress on a certain area of health improvement can be assessed. The indicators are organized by the Priority Area of the Prevention Agenda as well as the Focus Area under each Priority Area. Each dataset includes tracking indicators for the five Priority Areas of the Prevention Agenda 2013-2017. The most recent year dataset includes the most recent county level data for all indicators. The trend dataset includes the most recent county level data and historical data, where available. Each dataset also includes the Prevention Agenda 2017 state targets for the indicators. Sub-indicators are included in these datasets to measure health disparities among socioeconomic groups. For more information, check out: http://www.health.ny.gov/prevention/prevention_agenda/2013-2017/ and https://www.health.ny.gov/PreventionAgendaDashboard. The "About" tab contains additional details concerning this dataset.
f
Type 1 diabetes incidence rates in individuals aged 0–14 and 0–19 years by...
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jun 21, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Apoorva Gomber; Zachary J. Ward; Carlo Ross; Maira Owais; Carol Mita; Jennifer M. Yeh; Ché L. Reddy; Rifat Atun (2023). Type 1 diabetes incidence rates in individuals aged 0–14 and 0–19 years by WHO regions and income. [Dataset]. http://doi.org/10.1371/journal.pgph.0001099.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pgph.0001099.t001
Dataset updated
Jun 21, 2023
Dataset provided by
PLOS Global Public Health
Authors
Apoorva Gomber; Zachary J. Ward; Carlo Ross; Maira Owais; Carol Mita; Jennifer M. Yeh; Ché L. Reddy; Rifat Atun
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
*(Age-standardised incidence rates per 100,000 individuals per year with 95% confidence intervals. † For cells labeled as NA, 95% CIs could not be estimated as there was only 1 data point).
f
Data_Sheet_1_The effect of diabetes on COVID-19 incidence and mortality:...
datasetcatalog.nlm.nih.gov
frontiersin.figshare.com
Updated Mar 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rossi, Paolo Giorgi; group, Reggio Emilia COVID-19 working; Bartolini, Letizia; Ottone, Marta; Bonvicini, Laura (2023). Data_Sheet_1_The effect of diabetes on COVID-19 incidence and mortality: Differences between highly-developed-country and high-migratory-pressure-country populations.pdf [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000980767
Explore at:
Dataset updated
Mar 8, 2023
Authors
Rossi, Paolo Giorgi; group, Reggio Emilia COVID-19 working; Bartolini, Letizia; Ottone, Marta; Bonvicini, Laura
Description
The objective of this study was to compare the effect of diabetes and pathologies potentially related to diabetes on the risk of infection and death from COVID-19 among people from Highly-Developed-Country (HDC), including Italians, and immigrants from the High-Migratory-Pressure-Countries (HMPC). Among the population with diabetes, whose prevalence is known to be higher among immigrants, we compared the effect of body mass index among HDC and HMPC populations. A population-based cohort study was conducted, using population registries and routinely collected surveillance data. The population was stratified into HDC and HMPC, according to the place of birth; moreover, a focus was set on the South Asiatic population. Analyses restricted to the population with type-2 diabetes were performed. We reported incidence (IRR) and mortality rate ratios (MRR) and hazard ratios (HR) with 95% confidence interval (CI) to estimate the effect of diabetes on SARS-CoV-2 infection and COVID-19 mortality. Overall, IRR of infection and MRR from COVID-19 comparing HMPC with HDC group were 0.84 (95% CI 0.82–0.87) and 0.67 (95% CI 0.46–0.99), respectively. The effect of diabetes on the risk of infection and death from COVID-19 was slightly higher in the HMPC population than in the HDC population (HRs for infection: 1.37 95% CI 1.22–1.53 vs. 1.20 95% CI 1.14–1.25; HRs for mortality: 3.96 95% CI 1.82–8.60 vs. 1.71 95% CI 1.50–1.95, respectively). No substantial difference in the strength of the association was observed between obesity or other comorbidities and SARS-CoV-2 infection. Similarly for COVID-19 mortality, HRs for obesity (HRs: 18.92 95% CI 4.48–79.87 vs. 3.91 95% CI 2.69–5.69) were larger in HMPC than in the HDC population, but differences could be due to chance. Among the population with diabetes, the HMPC group showed similar incidence (IRR: 0.99 95% CI: 0.88–1.12) and mortality (MRR: 0.89 95% CI: 0.49–1.61) to that of HDC individuals. The effect of obesity on incidence was similar in both HDC and HMPC populations (HRs: 1.73 95% CI 1.41–2.11 among HDC vs. 1.41 95% CI 0.63–3.17 among HMPC), although the estimates were very imprecise. Despite a higher prevalence of diabetes and a stronger effect of diabetes on COVID-19 mortality in HMPC than in the HDC population, our cohort did not show an overall excess risk of COVID-19 mortality in immigrants.
f
Adult Arabs have higher risk for diabetes mellitus than Jews in Israel
plos.figshare.com
datasetcatalog.nlm.nih.gov
docx
Updated Jun 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anat Jaffe; Shmuel Giveon; Liat Wulffhart; Bernice Oberman; Maslama Baidousi; Arnona Ziv; Ofra Kalter-Leibovici (2023). Adult Arabs have higher risk for diabetes mellitus than Jews in Israel [Dataset]. http://doi.org/10.1371/journal.pone.0176661
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0176661
Dataset updated
Jun 2, 2023
Dataset provided by
PLOS ONE
Authors
Anat Jaffe; Shmuel Giveon; Liat Wulffhart; Bernice Oberman; Maslama Baidousi; Arnona Ziv; Ofra Kalter-Leibovici
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Israel
Description
ObjectiveDiabetes mellitus is an emerging epidemic in the Arab world. Although high diabetes prevalence is documented in Israeli Arabs, information from cohort studies is scant.MethodsThis is a population study, based on information derived between 2007–2011, from the electronic database of the largest health fund in Israel, among Arabs and Jews. Prevalence, 4-year-incidence and diabetes hazard ratios [HRs], adjusted for sex and the metabolic-syndrome [MetS]-components, were determined in 3 age groups (
c
Diabetes mellitus (in persons aged 17 and over): England
data.catchmentbasedapproach.org
Updated Apr 7, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Rivers Trust (2021). Diabetes mellitus (in persons aged 17 and over): England [Dataset]. https://data.catchmentbasedapproach.org/datasets/diabetes-mellitus-in-persons-aged-17-and-over-england
Explore at:
Dataset updated
Apr 7, 2021
Dataset authored and provided by
The Rivers Trust
Area covered

Description
SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of diabetes mellitus in persons (aged 17+). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to diabetes mellitus in persons (aged 17+).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (aged 17+) with diabetes mellitus was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with diabetes mellitus was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with depression, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have diabetes mellitusB) the NUMBER of people within that MSOA who are estimated to have diabetes mellitusAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have diabetes mellitus, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from diabetes mellitus, and where those people make up a large percentage of the population, indicating there is a real issue with diabetes mellitus within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of diabetes mellitus, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of diabetes mellitus.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.
Community Health Obesity and Diabetes Related Indicators: 2008 - 2012
healthdata.gov
gimi9.com
+2more
csv, xlsx, xml
Updated Apr 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
health.data.ny.gov (2025). Community Health Obesity and Diabetes Related Indicators: 2008 - 2012 [Dataset]. https://healthdata.gov/State/Community-Health-Obesity-and-Diabetes-Related-Indi/3dgd-idb7
Explore at:
xml, xlsx, csvAvailable download formats
Dataset updated
Apr 8, 2025
Dataset provided by
health.data.ny.gov
Description
This subset of the community health indicator report data will not be updated. A dataset containing all of the community health indicators is now available. To view the latest community health obesity and diabetes related indicators, see the featured content section. This Obesity and Diabetes Related Indicators dataset provides a subset of data (40 indicators) for the two topics: Obesity and Diabetes. The dataset includes percentage or rate for Cirrhosis/Diabetes and Obesity and Related Indicators, where available, for all counties, regions and state.
New York State Community Health Indicator Reports (CHIRS) were developed in 2012, and annually updated to provide data for over 300 health indicators, organized by 15 health topic and data for all counties, regions and state are presented in table format with links to trend graphs and maps. Most recent county and state level data are provided. Multiple year combined data offers stable estimates for the burden and risk factors for these two health topics.
The association between environmental quality and diabetes in the U.S.
s.cnmilf.com
catalog.data.gov
Updated Nov 12, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2020). The association between environmental quality and diabetes in the U.S. [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/the-association-between-environmental-quality-and-diabetes-in-the-u-s
Explore at:
Dataset updated
Nov 12, 2020
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
Population-based county-level estimates for diagnosed (DDP), undiagnosed (UDP), and total diabetes prevalence (TDP) were acquired from the Institute for Health Metrics and Evaluation (IHME) for the years 2004-2012 (Evaluation 2017). Prevalence estimates were calculated using a two-stage approach. The first stage used National Health and Nutrition Examination Survey (NHANES) data to predict high fasting plasma glucose (FPG) levels (≥126 mg/dL) and/or hemoglobin A1C (HbA1C) levels (≥6.5% [48 mmol/mol]) based on self-reported demographic and behavioral characteristics (Dwyer-Lindgren, Mackenbach et al. 2016). This model was then applied to Behavioral Risk Factor Surveillance System (BRFSS) data to impute high FPG and/or A1C status for each BRFSS respondent (Dwyer-Lindgren, Mackenbach et al. 2016). The second stage used the imputed BRFSS data to fit a series of small area models, which were used to predict the county-level prevalence of each of the diabetes-related outcomes (Dwyer-Lindgren, Mackenbach et al. 2016). Diagnosed diabetes was defined as the proportion of adults (age 20+ years) who reported a previous diabetes diagnosis, represented as an age-standardized prevalence percentage. Undiagnosed diabetes was defined as proportion of adults (age 20+ years) who have a high FPG or HbA1C but did not report a previous diagnosis of diabetes. Total diabetes was defined as the proportion of adults (age 20+ years) who reported a previous diabetes diagnosis and/or had a high FPG/HbA1C. The age-standardized diabetes prevalence (%) was used as the outcome. The EQI was constructed for 2000-2005 for all US counties and is composed of five domains (air, water, built, land, and sociodemographic), each composed of variables to represent the environmental quality of that _domain. Domain-specific EQIs were developed using principal components analysis (PCA) to reduce these variables within each _domain while the overall EQI was constructed from a second PCA from these individual domains (L. C. Messer et al., 2014). To account for differences in environment across rural and urban counties, the overall and _domain-specific EQIs were stratified by rural urban continuum codes (RUCCs) (U.S. Department of Agriculture, 2015). This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Human health data are not available publicly. EQI data are available at: https://edg.epa.gov/data/Public/ORD/NHEERL/EQI. Format: Data are stored as csv files. This dataset is associated with the following publication: Jagai, J., A. Krajewski, S. Shaikh, D. Lobdell, and R. Sargis. Association between environmental quality and diabetes in the U.S.A.. Journal of Diabetes Investigation. John Wiley & Sons, Inc., Hoboken, NJ, USA, 11(2): 315-324, (2020).
f
Data_Sheet_1_The global burden of type 2 diabetes attributable to high body...
datasetcatalog.nlm.nih.gov
frontiersin.figshare.com
Updated Sep 9, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wu, Yufei; Ni, Tian; Li, Qiuyan; Zhang, Xuexue; Tang, Wei; Wang, Miaoran; Gu, Jiyu; Wang, Xujie; Hu, Biaoyan (2022). Data_Sheet_1_The global burden of type 2 diabetes attributable to high body mass index in 204 countries and territories, 1990–2019: An analysis of the Global Burden of Disease Study.docx [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000290581
Explore at:
Dataset updated
Sep 9, 2022
Authors
Wu, Yufei; Ni, Tian; Li, Qiuyan; Zhang, Xuexue; Tang, Wei; Wang, Miaoran; Gu, Jiyu; Wang, Xujie; Hu, Biaoyan
Description
BackgroundHigh body mass index (BMI) plays a critical role in the initiation and development of type 2 diabetes (T2D). Up to now, far too little attention has been paid to the global burden of T2D attributable to high BMI. This study aims to report the deaths and disability-adjusted life years (DALYs) of T2D related to high BMI in 204 countries and territories from 1990 to 2019.MethodsData on T2D burden attributable to high BMI were retrieved from the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2019. The global cases, age-standardized rates of mortality (ASMR), and disability-adjusted life years (ASDR) attributable to high BMI were estimated by age, sex, geographical location, and socio-demographic index (SDI). The estimated annual percentage change (EAPC) was calculated to quantify the trends of ASMR and ASDR during the period 1990–2019.ResultsGlobally, there were 619,494.8 deaths and 34,422,224.8 DALYs of T2D attributed to high BMI in 2019, more than triple in 1990. Moreover, the pace of increase in ASMR and ASDR accelerated during 1990–2019, with EAPC of 1.36 (95% CI: 1.27 to 1.45) and 2.13 (95% CI: 2.10 to 2.17) separately, especially in men, South Asia, and low-middle SDI regions. Oceania was the high-risk area of standardized T2D deaths and DALYs attributable to high BMI in 2019, among which Fiji was the country with the heaviest burden. In terms of SDI, middle SDI regions had the biggest T2D-related ASMR and ASDR in 2019.ConclusionThe global deaths and DALYs of T2D attributable to high BMI substantially increased from 1990 to 2019. High BMI as a major public health problem needs to be tackled properly and timely in patients with T2D.
d
Diabetes + Hypertension (comorbidity)
catalog.data.gov
data.wprdc.org
+2more
Updated Mar 14, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Allegheny County (2023). Diabetes + Hypertension (comorbidity) [Dataset]. https://catalog.data.gov/dataset/diabetes-hypertension-comorbidity
Explore at:
Dataset updated
Mar 14, 2023
Dataset provided by
Allegheny County
Description
This data set provides de-identified population data for diabetes and hypertension comorbidity prevalence in Allegheny County. The data is provided by three managed care organizations in Allegheny County (Gateway Health Plan, Highmark Health, and UPMC) and represents their insured population for the 2015 and 2016 calendar years. Disclaimer: Users should be cautious of using administrative claims data as a measure of disease prevalence and interpreting trends over time, as data provided were collected for purposes other than surveillance. Limitations of these data include but are not limited to: misclassification, duplicate individuals, exclusion of individuals who did not seek care in past two years and those who are: uninsured, enrolled in plans not represented in the dataset, or were not enrolled in one of the represented plans for at least 90 days.
C
Cambodia KH: Diabetes Prevalence: % of Population Aged 20-79
ceicdata.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com, Cambodia KH: Diabetes Prevalence: % of Population Aged 20-79 [Dataset]. https://www.ceicdata.com/en/cambodia/social-health-statistics/kh-diabetes-prevalence--of-population-aged-2079
Explore at:
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2011 - Dec 1, 2021
Area covered
Cambodia
Description
Cambodia KH: Diabetes Prevalence: % of Population Aged 20-79 data was reported at 7.300 % in 2021. This records an increase from the previous number of 2.900 % for 2011. Cambodia KH: Diabetes Prevalence: % of Population Aged 20-79 data is updated yearly, averaging 5.100 % from Dec 2011 (Median) to 2021, with 2 observations. The data reached an all-time high of 7.300 % in 2021 and a record low of 2.900 % in 2011. Cambodia KH: Diabetes Prevalence: % of Population Aged 20-79 data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Cambodia – Table KH.World Bank.WDI: Social: Health Statistics. Diabetes prevalence refers to the percentage of people ages 20-79 who have type 1 or type 2 diabetes. It is calculated by adjusting to a standard population age-structure.;International Diabetes Federation, Diabetes Atlas.;Weighted average;
M
Mali ML: Prevalence of Overweight: Weight for Height: % of Children Under 5
ceicdata.com
Updated Nov 27, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2021). Mali ML: Prevalence of Overweight: Weight for Height: % of Children Under 5 [Dataset]. https://www.ceicdata.com/en/mali/health-statistics/ml-prevalence-of-overweight-weight-for-height--of-children-under-5
Explore at:
Dataset updated
Nov 27, 2021
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 1987 - Dec 1, 2015
Area covered
Mali
Description
Mali ML: Prevalence of Overweight: Weight for Height: % of Children Under 5 data was reported at 1.900 % in 2015. This records an increase from the previous number of 1.000 % for 2010. Mali ML: Prevalence of Overweight: Weight for Height: % of Children Under 5 data is updated yearly, averaging 2.100 % from Dec 1987 (Median) to 2015, with 6 observations. The data reached an all-time high of 4.700 % in 2006 and a record low of 0.500 % in 1987. Mali ML: Prevalence of Overweight: Weight for Height: % of Children Under 5 data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Mali – Table ML.World Bank: Health Statistics. Prevalence of overweight children is the percentage of children under age 5 whose weight for height is more than two standard deviations above the median for the international reference population of the corresponding age as established by the WHO's new child growth standards released in 2006.; ; UNICEF, WHO, World Bank: Joint child malnutrition estimates (JME). Aggregation is based on UNICEF, WHO, and the World Bank harmonized dataset (adjusted, comparable data) and methodology.; Linear mixed-effect model estimates; Estimates of overweight children are also from national survey data. Once considered only a high-income economy problem, overweight children have become a growing concern in developing countries. Research shows an association between childhood obesity and a high prevalence of diabetes, respiratory disease, high blood pressure, and psychosocial and orthopedic disorders (de Onis and Blössner 2003). Childhood obesity is associated with a higher chance of obesity, premature death, and disability in adulthood. In addition to increased future risks, obese children experience breathing difficulties and increased risk of fractures, hypertension, early markers of cardiovascular disease, insulin resistance, and psychological effects. Children in low- and middle-income countries are more vulnerable to inadequate nutrition before birth and in infancy and early childhood. Many of these children are exposed to high-fat, high-sugar, high-salt, calorie-dense, micronutrient-poor foods, which tend be lower in cost than more nutritious foods. These dietary patterns, in conjunction with low levels of physical activity, result in sharp increases in childhood obesity, while under-nutrition continues
C
Central African Republic CF: Prevalence of Overweight: Weight for Height:...
ceicdata.com
Updated Feb 27, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2018). Central African Republic CF: Prevalence of Overweight: Weight for Height: Female: % of Children Under 5 [Dataset]. https://www.ceicdata.com/en/central-african-republic/social-health-statistics
Explore at:
Dataset updated
Feb 27, 2018
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 1994 - Dec 1, 2019
Area covered
Central African Republic
Description
CF: Prevalence of Overweight: Weight for Height: Female: % of Children Under 5 data was reported at 1.500 % in 2019. This records an increase from the previous number of 0.700 % for 2018. CF: Prevalence of Overweight: Weight for Height: Female: % of Children Under 5 data is updated yearly, averaging 1.850 % from Dec 1994 (Median) to 2019, with 8 observations. The data reached an all-time high of 11.000 % in 2000 and a record low of 0.700 % in 2018. CF: Prevalence of Overweight: Weight for Height: Female: % of Children Under 5 data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Central African Republic – Table CF.World Bank.WDI: Social: Health Statistics. Prevalence of overweight, female, is the percentage of girls under age 5 whose weight for height is more than two standard deviations above the median for the international reference population of the corresponding age as established by the WHO's 2006 Child Growth Standards.;UNICEF, WHO, World Bank: Joint child Malnutrition Estimates (JME). Aggregation is based on UNICEF, WHO, and the World Bank harmonized dataset (adjusted, comparable data) and methodology.;;Estimates of overweight children are from national survey data. Once considered only a high-income economy problem, overweight children have become a growing concern in developing countries. Research shows an association between childhood obesity and a high prevalence of diabetes, respiratory disease, high blood pressure, and psychosocial and orthopedic disorders (de Onis and Blössner 2003). Childhood obesity is associated with a higher chance of obesity, premature death, and disability in adulthood. In addition to increased future risks, obese children experience breathing difficulties and increased risk of fractures, hypertension, early markers of cardiovascular disease, insulin resistance, and psychological effects. Children in low- and middle-income countries are more vulnerable to inadequate nutrition before birth and in infancy and early childhood. Many of these children are exposed to high-fat, high-sugar, high-salt, calorie-dense, micronutrient-poor foods, which tend be lower in cost than more nutritious foods. These dietary patterns, in conjunction with low levels of physical activity, result in sharp increases in childhood obesity, while under-nutrition continues.
UHB Linked Diabetic Eye Disease in Acute Diabetic Hospital Admissions
healthdatagateway.org
unknown
Updated Oct 8, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
University Hospitals Birmingham NHS Foundation Trust (2024). UHB Linked Diabetic Eye Disease in Acute Diabetic Hospital Admissions [Dataset]. https://healthdatagateway.org/en/dataset/98
Explore at:
unknownAvailable download formats
Dataset updated
Oct 8, 2024
Dataset authored and provided by
University Hospitals Birmingham NHS Foundation Trusthttp://www.uhb.nhs.uk/
License
https://www.insight.hdrhub.org/https://www.insight.hdrhub.org/
Description
Background: Diabetes mellitus affects over 3.9 million people in the United Kingdom (UK), with over 2.6 million people in England alone. More than 1 million people living with diabetes are acutely admitted to hospital due to complications of their illness every year. Complications include Diabetic emergencies such as Diabetic Comas, Hypoglycaemia, Diabetic ketoacidosis and Diabetic Hyperosmolar Hyperglycaemic State. Diabetic retinopathy (DR) is a common microvascular complication of type 1 and type 2 diabetes and remains a major cause of vision loss and blindness in those of working age. This dataset includes acute all diabetic admissions to University Hospitals Birmingham NHS Trust from 2000 onwards with linked eye data including the national screening diabetic grade category (seven categories from R0M0 to R3M1) from the Birmingham, Solihull and Black Country DR screening program (a member of the National Health Service (NHS) Diabetic Eye Screening Programme) and the University Hospitals Birmingham NHS Trust Ophthalmology clinic at Queen Elizabeth Hospital, Birmingham .

Geography: The West Midlands has a population of 5.9 million. The region includes a diverse ethnic, and socio-economic mix, with a higher than UK average of minority ethnic groups. It has a large number of elderly residents but is the youngest population in the UK. There are particularly high rates of diabetes, physical inactivity, obesity, and smoking.

Data sources:
1. The Birmingham, Solihull and Black Country Data Set, University Hospitals Birmingham NHS Foundation Trust, Birmingham, United Kingdom. They manage over 200,000 diabetic patients, with longitudinal follow-up up to 15 years, making this the largest urban diabetic eye screening scheme in Europe. 2. The Electronic Health Records held at University Hospitals Birmingham NHS Foundation Trust is one of the largest NHS Trusts in England, providing direct acute services and specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds and 100 ITU beds. UHB runs a fully electronic healthcare record both for systemic disease as well as the Ophthalmology records.

Scope: All hospitalised patients admitted to UHB with a diabetes related health concern from 2000 onwards. Longitudinal and individually linked with their diabetic eye care from primary screening data and secondary care ophthalmology data including • Demographic information (including age, sex and ethnicity) • Diabetes status • Diabetes type • Length of time since diagnosis of diabetes • Visual acuity • The national screening diabetic screening grade category (seven categories from R0M0 to R3M1) • Diabetic eye clinical features • Reason for sight and severe sight impairment • ICD-10 and SNOMED-CT codes pertaining to diabetes • Diagnosis for the acute/emergency admission • Co-morbid conditions • Medications • Outcome
s
Data from: Dysregulation of MMP2-dependent TGF-ß2 activation impairs fibrous...
figshare.scilifelab.se
datasetcatalog.nlm.nih.gov
+2more
Updated Jan 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pratibha Singh; Jiangming Sun; Michele Cavalera; Dania Al-Sharify; Frank Matthes; Mohammad Barghouth; Christoffer Tengryd; Pontus Dunér; Ana Persson; Lena Sundius; Mihaela Nitulescu; Eva Bengtsson; Sara Rattik; Daniel Engelbertsen; Marju Orho-Melander; Jan Nilsson; Claudia Monaco; Isabel Goncalves; Andreas Edsfeldt (2025). Dysregulation of MMP2-dependent TGF-ß2 activation impairs fibrous cap formation in type 2 diabetes associated atherosclerosis [Dataset]. http://doi.org/10.17044/scilifelab.26063056.v1
Explore at:
Unique identifier
https://doi.org/10.17044/scilifelab.26063056.v1
Dataset updated
Jan 15, 2025
Dataset provided by
Lund University
Authors
Pratibha Singh; Jiangming Sun; Michele Cavalera; Dania Al-Sharify; Frank Matthes; Mohammad Barghouth; Christoffer Tengryd; Pontus Dunér; Ana Persson; Lena Sundius; Mihaela Nitulescu; Eva Bengtsson; Sara Rattik; Daniel Engelbertsen; Marju Orho-Melander; Jan Nilsson; Claudia Monaco; Isabel Goncalves; Andreas Edsfeldt
License
https://www.scilifelab.se/data/restricted-access/https://www.scilifelab.se/data/restricted-access/
Description
Cohort description:Human carotid plaques used in this study were obtained from the Carotid Plaque Imaging Project biobank (Malmö, Sweden; ClinicalTrials.gov ID NCT05821894). These plaques were collected from patients undergoing carotid endarterectomy (CEA) at Skåne University Hospital's Vascular department in Malmö, Sweden. The indications for surgery were: plaques associated with ipsilateral symptoms (transitory ischemic attack, stroke, or amaurosis fugax) and a degree of stenosis greater than 70% (measured by duplex ultrasound), or plaques from asymptomatic patients, with a degree of stenosis greater than 80%. The study has received ethical approval, and all patients provided oral and written informed consent. All work involving human subjects adheres to the principles of the 1975 Declaration of Helsinki.Dataset description:Bulk RNA sequencing was performed on carotid plaques from patients with type 2 diabetes (T2D). Total RNA was extracted with the Trizol method, and libraries were prepared using the ScriptSeq™ v2 Kit. Sequencing was performed on Illumina HiSeq2000 and NextSeq platforms. Reads were aligned to the GRCh38 genome with STAR and quantified using Salmon with GENCODE V27 annotations. Counts were normalized with edgeR and expressed as log2-transformed counts per million (CPM) after batch corrected for sequencing platforms.Single-cell transcriptomes were generated from human atherosclerotic plaque cells. Live cells were stained for CD45 and FACS sorted into 384-well plates, separating CD45+ and CD45- populations. RNA libraries were prepared and sequenced using the SmartSeq2 protocol at the SciLife Eukaryotic Genomics Facility. Cells with fewer than 10,000 raw reads, fewer than 500 detected genes, or more than 15% ERCC RNA Spike-In were excluded. Counts were log-normalized, scaled, and the top 2000 highly variable genes were used for dimensional reduction.Spatial transcriptomics was performed on 10µm carotid plaque sections using the Visium Spatial Gene Expression Slide & Reagent Kit (PN-1000184) and the standard protocols (CG000239 RevD,10x Genomics, Pleasanton, CA, USA). Libraries were sequenced on the NextSeq 500/550 platform with the High Output Kit v2.5 (150 cycles) at a depth of 400 million read pairs per sample. FASTQ files and corresponding histological images were processed with Space Ranger v1.0.0, using STAR v2.5.1b for genome alignment against the hg38 reference genome.Data availability:Current European data regulations preclude the open sharing of sensitive data from living humans, including genetic and sequencing data. The sequencing data used in this study can be accessed by making a reasonable request to the corresponding author, provided the legal terms for access are met.Terms for access:- The human data complies with GDPR regulations and is available, upon request, to qualified academic investigators solely for the purpose of replicating the procedures and findings outlined in the article.- Access to the dataset is contingent upon successful completion of a data sharing agreement with the principal investigator (thus ensuring compliance with GDPR) as well as written approvals from the ethical review board of Sweden, Region Skåne, and Lund University, respectively.- The existing ethical permit restricts the sharing of raw individual data due to its sensitive character, allowing only the sharing of aggregated data. Should researchers seek access to raw individual data, a separate written ethical approval and other legal requirements must be provided, in order for the request to be considered.- Approved researchers must refrain from attempting to identify or contact individual study participants represented in the dataset. They may not generate information that could compromise participants' identities.- Users are prohibited from using the datasets or any derivatives thereof for commercial purposes.- Approved investigators are obligated to immediately report any unauthorised data sharing or breaches of data security on their behalf to the data access committee. Such reports should include comprehensive details to facilitate resolution and ensure data confidentiality.

Facebook

Twitter

Click to copy link

Link copied

Cite

Priyam Choksi (2024). Comprehensive Diabetes Clinical Dataset(100k rows) [Dataset]. https://www.kaggle.com/datasets/priyamchoksi/100000-diabetes-clinical-dataset

Comprehensive Diabetes Clinical Dataset(100k rows)

100,000 Diabetes Dataset for Predictive Modeling and Health Analytics

Explore at:

4 scholarly articles cite this dataset (View in Google Scholar)

zip(917848 bytes)Available download formats

Dataset updated

Jul 20, 2024

Authors

Priyam Choksi

License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

Detailed dataset comprising health and demographic data of 100,000 individuals, aimed at facilitating diabetes-related research and predictive modeling. This dataset includes information on gender, age, location, race, hypertension, heart disease, smoking history, BMI, HbA1c level, blood glucose level, and diabetes status.

Dataset Use Cases

This dataset can be used for various analytical and machine learning purposes, such as:

Predictive Modeling: Build models to predict the likelihood of diabetes based on demographic and health-related features.
Health Analytics: Analyze the correlation between different health metrics (e.g., BMI, HbA1c level) and diabetes.
Demographic Studies: Examine the distribution of diabetes across different demographic groups and locations.
Public Health Research: Identify risk factors for diabetes and target interventions to high-risk groups.
Clinical Research: Study the relationship between comorbid conditions like hypertension and heart disease with diabetes.

Potential Analyses

Descriptive Statistics: Summarize the dataset to understand the central tendencies and dispersion of features.
Correlation Analysis: Identify the relationships between features.
Classification Models: Use machine learning algorithms to classify individuals as diabetic or non-diabetic.
Trend Analysis: Analyze trends over the years to see how diabetes prevalence has changed.

Clear search

Close search

Google apps

Main menu

Comprehensive Diabetes Clinical Dataset(100k rows)

Dataset Use Cases

Potential Analyses

Diabetes Health Indicators Dataset

Diabetes Health Indicators Dataset

Overview

Dataset Description

Features

Data Quality

Use Cases

Healthrise diabetes dataset noPII.

Diabetes

Diabetes control is associated with environmental quality in the U.S.

Data from: T1DiabetesGranada: a longitudinal multi-modal dataset of type 1...

PA: Rate of Hospitalizations for Short-Term Complications of Diabetes per...

Type 1 diabetes incidence rates in individuals aged 0–14 and 0–19 years by...

Data_Sheet_1_The effect of diabetes on COVID-19 incidence and mortality:...

Adult Arabs have higher risk for diabetes mellitus than Jews in Israel

Diabetes mellitus (in persons aged 17 and over): England

Community Health Obesity and Diabetes Related Indicators: 2008 - 2012

The association between environmental quality and diabetes in the U.S.

Data_Sheet_1_The global burden of type 2 diabetes attributable to high body...

Diabetes + Hypertension (comorbidity)

Cambodia KH: Diabetes Prevalence: % of Population Aged 20-79

Mali ML: Prevalence of Overweight: Weight for Height: % of Children Under 5

Central African Republic CF: Prevalence of Overweight: Weight for Height:...

UHB Linked Diabetic Eye Disease in Acute Diabetic Hospital Admissions

Data from: Dysregulation of MMP2-dependent TGF-ß2 activation impairs fibrous...

Comprehensive Diabetes Clinical Dataset(100k rows)See More Versions

100,000 Diabetes Dataset for Predictive Modeling and Health Analytics

Dataset Use Cases

Potential Analyses

Comprehensive Diabetes Clinical Dataset(100k rows)