100+ datasets found
  1. Diabetes control is associated with environmental quality in the U.S.

    • s.cnmilf.com
    • catalog.data.gov
    Updated Jul 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2022). Diabetes control is associated with environmental quality in the U.S. [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/diabetes-control-is-associated-with-environmental-quality-in-the-u-s
    Explore at:
    Dataset updated
    Jul 21, 2022
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Area covered
    United States
    Description

    Population-based county-level estimates for prevalence of DC were obtained from the Institute for Health Metrics and Evaluation (IHME) for the years 2004-2012 (16). DC prevalence rate was defined as the propor-tion of people within a county who had previously been diagnosed with diabetes (high fasting plasma glu-cose 126 mg/dL, hemoglobin A1c (HbA1c) of 6.5%, or diabetes diagnosis) but do not currently have high fasting plasma glucose or HbA1c for the period 2004-2012. DC prevalence estimates were calculated using a two-stage approach. The first stage used National Health and Nutrition Examination Survey (NHANES) data to predict high fasting plasma glucose (FPG) levels (≥126 mg/dL) and/or HbA1C levels (≥6.5% [48 mmol/mol]) based on self-reported demographic and behavioral characteristics (16). This model was then applied to Behavioral Risk Factor Surveillance System (BRFSS) data to impute high FPG and/or HbA1C status for each BRFSS respondent (16). The second stage used the imputed BRFSS data to fit a series of small area models, which were used to predict county-level prevalence of diabetes-related outcomes, including DC (16). The EQI was constructed for 2006-2010 for all US counties and is composed of five domains (air, water, built, land, and sociodemographic), each composed of variables to represent the environmental quality of that _domain. Domain-specific EQIs were developed using principal components analysis (PCA) to reduce these variables within each _domain while the overall EQI was constructed from a second PCA from these individual domains (L. C. Messer et al., 2014). To account for differences in environment across rural and urban counties, the overall and _domain-specific EQIs were stratified by rural urban continuum codes (RUCCs) (U.S. Department of Agriculture, 2015). Results are reported as prevalence rate differences (PRD) with 95% confidence intervals (CIs) comparing the highest quintile/worst environmental quality to the lowest quintile/best environmental quality expo-sure metrics. PRDs are representative of the entire period of interest, 2004-2012. Due to availability of DC data and covariate data, not all counties were captured, however, the majority, 3134 of 3142 were utilized in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Human health data are not available publicly. EQI data are available at: https://edg.epa.gov/data/Public/ORD/NHEERL/EQI. Format: Data are stored as csv files. This dataset is associated with the following publication: Jagai, J., A. Krajewski, K. Price, D. Lobdell, and R. Sargis. Diabetes control is associated with environmental quality in the USA. Endocrine Connections. BioScientifica Ltd., Bristol, UK, 10(9): 1018-1026, (2021).

  2. C

    Diabetes

    • data.wprdc.org
    • datasets.ai
    • +2more
    csv, html, xlsx
    Updated Jun 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allegheny County (2024). Diabetes [Dataset]. https://data.wprdc.org/dataset/diabetes
    Explore at:
    csv(16013), csv, xlsx, htmlAvailable download formats
    Dataset updated
    Jun 3, 2024
    Dataset provided by
    Allegheny County
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    These datasets provide de-identified insurance data for diabetes. The data is provided by three managed care organizations in Allegheny County (Gateway Health Plan, Highmark Health, and UPMC) and represents their insured population for the 2015 and calendar years.

    Disclaimer: Users should be cautious of using administrative claims data as a measure of disease prevalence and interpreting trends over time, as data provided were collected for purposes other than surveillance. Limitations of these data include but are not limited to: misclassification, duplicate individuals, exclusion of individuals who did not seek care in past two years and those who are: uninsured, enrolled in plans not represented in the dataset, or were not enrolled in one of the represented plans for at least 90 days.

    Support for Health Equity datasets and tools provided by Amazon Web Services (AWS) through their Health Equity Initiative.

  3. i

    Type 2 Diabetes Dataset

    • ieee-dataport.org
    Updated Jan 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kabrambam Singh (2024). Type 2 Diabetes Dataset [Dataset]. https://ieee-dataport.org/documents/type-2-diabetes-dataset
    Explore at:
    Dataset updated
    Jan 17, 2024
    Authors
    Kabrambam Singh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Germany. There are eight features in the dataset. Among the 2000 samples

  4. d

    Type 2 Diabetes

    • catalog.data.gov
    • data.ok.gov
    • +3more
    Updated Nov 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.ok.gov (2024). Type 2 Diabetes [Dataset]. https://catalog.data.gov/dataset/type-2-diabetes
    Explore at:
    Dataset updated
    Nov 22, 2024
    Dataset provided by
    data.ok.gov
    Description

    Decrease the percentage of people with Type 2 diabetes from 11.2% in 2014 to 10.1% by 2019.

  5. c

    Diabetes mellitus (in persons aged 17 and over): England

    • data.catchmentbasedapproach.org
    Updated Apr 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Diabetes mellitus (in persons aged 17 and over): England [Dataset]. https://data.catchmentbasedapproach.org/datasets/diabetes-mellitus-in-persons-aged-17-and-over-england
    Explore at:
    Dataset updated
    Apr 7, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of diabetes mellitus in persons (aged 17+). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to diabetes mellitus in persons (aged 17+).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (aged 17+) with diabetes mellitus was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with diabetes mellitus was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with depression, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have diabetes mellitusB) the NUMBER of people within that MSOA who are estimated to have diabetes mellitusAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have diabetes mellitus, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from diabetes mellitus, and where those people make up a large percentage of the population, indicating there is a real issue with diabetes mellitus within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of diabetes mellitus, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of diabetes mellitus.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  6. The association between environmental quality and diabetes in the U.S.

    • catalog.data.gov
    Updated Nov 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2020). The association between environmental quality and diabetes in the U.S. [Dataset]. https://catalog.data.gov/dataset/the-association-between-environmental-quality-and-diabetes-in-the-u-s
    Explore at:
    Dataset updated
    Nov 12, 2020
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Area covered
    United States
    Description

    Population-based county-level estimates for diagnosed (DDP), undiagnosed (UDP), and total diabetes prevalence (TDP) were acquired from the Institute for Health Metrics and Evaluation (IHME) for the years 2004-2012 (Evaluation 2017). Prevalence estimates were calculated using a two-stage approach. The first stage used National Health and Nutrition Examination Survey (NHANES) data to predict high fasting plasma glucose (FPG) levels (≥126 mg/dL) and/or hemoglobin A1C (HbA1C) levels (≥6.5% [48 mmol/mol]) based on self-reported demographic and behavioral characteristics (Dwyer-Lindgren, Mackenbach et al. 2016). This model was then applied to Behavioral Risk Factor Surveillance System (BRFSS) data to impute high FPG and/or A1C status for each BRFSS respondent (Dwyer-Lindgren, Mackenbach et al. 2016). The second stage used the imputed BRFSS data to fit a series of small area models, which were used to predict the county-level prevalence of each of the diabetes-related outcomes (Dwyer-Lindgren, Mackenbach et al. 2016). Diagnosed diabetes was defined as the proportion of adults (age 20+ years) who reported a previous diabetes diagnosis, represented as an age-standardized prevalence percentage. Undiagnosed diabetes was defined as proportion of adults (age 20+ years) who have a high FPG or HbA1C but did not report a previous diagnosis of diabetes. Total diabetes was defined as the proportion of adults (age 20+ years) who reported a previous diabetes diagnosis and/or had a high FPG/HbA1C. The age-standardized diabetes prevalence (%) was used as the outcome. The EQI was constructed for 2000-2005 for all US counties and is composed of five domains (air, water, built, land, and sociodemographic), each composed of variables to represent the environmental quality of that domain. Domain-specific EQIs were developed using principal components analysis (PCA) to reduce these variables within each domain while the overall EQI was constructed from a second PCA from these individual domains (L. C. Messer et al., 2014). To account for differences in environment across rural and urban counties, the overall and domain-specific EQIs were stratified by rural urban continuum codes (RUCCs) (U.S. Department of Agriculture, 2015). This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Human health data are not available publicly. EQI data are available at: https://edg.epa.gov/data/Public/ORD/NHEERL/EQI. Format: Data are stored as csv files. This dataset is associated with the following publication: Jagai, J., A. Krajewski, S. Shaikh, D. Lobdell, and R. Sargis. Association between environmental quality and diabetes in the U.S.A.. Journal of Diabetes Investigation. John Wiley & Sons, Inc., Hoboken, NJ, USA, 11(2): 315-324, (2020).

  7. Diabetes, by age group

    • www150.statcan.gc.ca
    Updated Nov 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Canada, Statistics Canada (2023). Diabetes, by age group [Dataset]. http://doi.org/10.25318/1310009601-eng
    Explore at:
    Dataset updated
    Nov 6, 2023
    Dataset provided by
    Statistics Canadahttps://statcan.gc.ca/en
    Government of Canadahttp://www.gg.ca/
    Area covered
    Canada
    Description

    Number and percentage of persons having been diagnosed with diabetes, by age group and sex.

  8. United States US: Diabetes Prevalence: % of Population Aged 20-79

    • ceicdata.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com, United States US: Diabetes Prevalence: % of Population Aged 20-79 [Dataset]. https://www.ceicdata.com/en/united-states/health-statistics/us-diabetes-prevalence--of-population-aged-2079
    Explore at:
    Dataset provided by
    CEIC Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 1, 2017
    Area covered
    United States
    Description

    United States US: Diabetes Prevalence: % of Population Aged 20-79 data was reported at 10.790 % in 2017. United States US: Diabetes Prevalence: % of Population Aged 20-79 data is updated yearly, averaging 10.790 % from Dec 2017 (Median) to 2017, with 1 observations. United States US: Diabetes Prevalence: % of Population Aged 20-79 data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s USA – Table US.World Bank: Health Statistics. Diabetes prevalence refers to the percentage of people ages 20-79 who have type 1 or type 2 diabetes.; ; International Diabetes Federation, Diabetes Atlas.; Weighted average;

  9. m

    HUPA-UCM Diabetes Dataset

    • data.mendeley.com
    Updated Apr 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    J. Ignacio Hidalgo (2024). HUPA-UCM Diabetes Dataset [Dataset]. http://doi.org/10.17632/3hbcscwz44.1
    Explore at:
    Dataset updated
    Apr 25, 2024
    Authors
    J. Ignacio Hidalgo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset provides a collection of Continuous Glucose Monitoring (CGM) data, insulin dose administration, meal ingestion counted in carbohydrate grams, steps, calories burned, heart rate, and sleep quality and quantity assessment acquired from 25 people with type 1 diabetes mellitus (T1DM). CGM data was acquired by FreeStyle Libre 2 CGMs, and Fitbit Ionic smartwatches were used to obtain steps, calories, heart rate, and sleep data for at least 14 days. This dataset could be utilized to obtain glucose prediction models, hypoglycemia and hyperglycemia prediction models, and research on the relationships among sleep, CGM values, and the rest of the mentioned variables. This dataset could be used directly from the preprocessed version or customized from raw data.

  10. c

    Early Classification of Diabetes Dataset

    • cubig.ai
    Updated May 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Early Classification of Diabetes Dataset [Dataset]. https://cubig.ai/store/products/225/early-classification-of-diabetes-dataset
    Explore at:
    Dataset updated
    May 2, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data introduction • Early-diabetes dataset contains 520 observations with 17 characteristics collected using direct questionnaires and diagnostic results from patients at Sylhet Diabetes Hospital, Sylhet, Bangladesh.

    2) Data utilization (1) Early-diabetes data has characteristics that: • Predict diabetes risk using 16 tabular features, including age, gender, and polyuria. (2) Early-diabetes data can be used to: • Medical research: Researchers can use these data sets to study factors that contribute to the development of diabetes, identify key risk factors and help improve understanding of the disease. • Preventive healthcare: Using predictive models trained based on this data, healthcare providers can identify at-risk individuals early so they can intervene at the right time and create personalized treatment plans.

  11. PIMA_Diabities

    • figshare.com
    txt
    Updated Sep 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chidananda K (2022). PIMA_Diabities [Dataset]. http://doi.org/10.6084/m9.figshare.21063187.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Sep 8, 2022
    Dataset provided by
    figshare
    Authors
    Chidananda K
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    PIMA Diabetes dataset has 768 rows with 9 features among them BMI is one feature which is considered as one of the important features in indicating the presence of diabetesas shown in figure 7 and figure 8. The high and low levels of BMI will have a profound effect on the health. Heavy BMI multiple health complexities in diabetic and heart problem related patients. BMI values will be different with different age group people and well as on gender.

  12. Projected number of people with diabetes Indonesia 2017-2024

    • statista.com
    Updated Jan 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2020). Projected number of people with diabetes Indonesia 2017-2024 [Dataset]. https://www.statista.com/statistics/1052625/indonesia-diabetes-projection/
    Explore at:
    Dataset updated
    Jan 3, 2020
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Indonesia
    Description

    The projected number of patients suffering from diabetes in Indonesia is expected to reach about 9.5 million by 2024. In that year, the population growth will decline slightly but there will be an increase of the elderly over 65 years old. Alongside with the population structure, the prevalence of diseases changes proportionally.

  13. d

    Diabetes & Hypertension & Hyperlipidemia comorbidity

    • catalog.data.gov
    • data.wprdc.org
    • +3more
    Updated Mar 14, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allegheny County (2023). Diabetes & Hypertension & Hyperlipidemia comorbidity [Dataset]. https://catalog.data.gov/dataset/diabetes-hypertension-hyperlipidemia-comorbidity
    Explore at:
    Dataset updated
    Mar 14, 2023
    Dataset provided by
    Allegheny County
    Description

    This data set provides de-identified population data for diabetes & hypertension & hyperlipidemia comorbidity prevelance. The data is provided by three managed care organizations in Allegheny County (Gateway Health Plan, Highmark Health, and UPMC) and represents their insured population for the 2015 and 2016 calendar years. Disclaimer: Users should be cautious of using administrative claims data as a measure of disease prevalence and interpreting trends over time, as data provided were collected for purposes other than surveillance. Limitations of these data include but are not limited to: misclassification, duplicate individuals, exclusion of individuals who did not seek care in past two years and those who are: uninsured, enrolled in plans not represented in the dataset, or were not enrolled in one of the represented plans for at least 90 days.

  14. Data from: Diabetes Prediction

    • kaggle.com
    zip
    Updated May 19, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karanshah1910 (2021). Diabetes Prediction [Dataset]. https://www.kaggle.com/karanshah1910/diabetes-prediction
    Explore at:
    zip(9128 bytes)Available download formats
    Dataset updated
    May 19, 2021
    Authors
    Karanshah1910
    Description

    Overview

    We are trying to build a machine learning model to accurately predict whether the patients have diabetes or not.our objective is to prevent, cure and to improve the lives of all people affected by diabetes.

    Details about the Dataset

    The datasets consists of several medical predictor variables and one target variable, Outcome. Predictor vari- ables includes the number of pregnancies the patient has had, their BMI, insulin level, age, and so on.

    Pregnancies: Number of times pregnant Glucose: Plasma glucose concentration a 2 hours in an oral glucose tolerance test BloodPressure: Diastolic blood pressure (mm Hg) SkinThickness: Triceps skin fold thickness (mm) Insulin: 2-Hour serum insulin (mu U/ml) BMI: Body mass index (weight in kg/(height in m)2) DiabetesPedigreeFunction: Diabetes pedigree function Age:Age(years) Outcome: Classvariable(0 or 1)

    Motivation

    The motivation was to experiment with end to end machine learning project and get some idea about deployment platform like and offcourse this " Diabetes is an increasingly growing health issue due to our inactive lifestyle. If it is detected in time then through proper medical treatment, adverse effects can be prevented. To help in early detection, technology can be used very reliably and efficiently. Using machine learning we have built a predictive model that can predict whether the patient is diabetes positive or not.". This is also sort of fun to work on a project like this which could be beneficial for the society.

  15. Dataset from TrialNet Pathway to Prevention of T1D

    • data.niaid.nih.gov
    Updated Mar 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TrialNet Central Information Center general info; Kevan Herold, M.D. (2025). Dataset from TrialNet Pathway to Prevention of T1D [Dataset]. http://doi.org/10.25934/PR00008470
    Explore at:
    Dataset updated
    Mar 25, 2025
    Dataset provided by
    National Institute of Diabetes and Digestive and Kidney Diseaseshttp://niddk.nih.gov/
    Yale University
    Authors
    TrialNet Central Information Center general info; Kevan Herold, M.D.
    Area covered
    Canada, United States, United Kingdom, Italy, Finland, Australia
    Variables measured
    Annual Diabetic Blood Test
    Description

    Rationale:

    The accrual of data from the laboratory and from epidemiologic and prevention trials has improved the understanding of the etiology and pathogenesis of type 1 diabetes mellitus (T1DM). Genetic and immunologic factors play a key role in the development of T1DM, and characterization of the early metabolic abnormalities in T1DM is steadily increasing. However, information regarding the natural history of T1DM remains incomplete. The TrialNet Natural History Study of the Development of T1DM (Pathway to Prevention Study) has been designed to clarify this picture, and in so doing, will contribute to the development and implementation of studies aimed at prevention of and early treatment in T1DM.

    Purpose:

    TrialNet is an international network dedicated to the study, prevention, and early treatment of type 1 diabetes. TrialNet sites are located throughout the United States, Canada, Finland, United Kingdom, Italy, Germany, Sweden, Australia, and New Zealand. TrialNet is dedicated to testing new approaches to the prevention of and early intervention for type 1 diabetes.

    The goal of the TrialNet Natural History Study of the Development of Type 1 Diabetes is to enhance our understanding of the demographic, immunologic, and metabolic characteristics of individuals at risk for developing type 1 diabetes.

    The Natural History Study will screen relatives of people with type 1 diabetes to identify those at risk for developing the disease. Relatives of people with type 1 diabetes have about a 5% percent chance of being positive for the antibodies associated with diabetes. TrialNet will identify adults and children at risk for developing diabetes by testing for the presence of these antibodies in the blood. A positive antibody test is an early indication that damage to insulin-secreting cells may have begun. If this test is positive, additional testing will be offered to determine the likelihood that a person may develop diabetes. Individuals with antibodies will be offered the opportunity for further testing to determine their risk of developing diabetes over the next 5 years and to receive close monitoring for the development of diabetes.

  16. i

    Dataset for People for their Blood Glucose Level with their Superficial body...

    • ieee-dataport.org
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deepali Javale, Dataset for People for their Blood Glucose Level with their Superficial body feature readings. [Dataset]. https://ieee-dataport.org/open-access/dataset-people-their-blood-glucose-level-their-superficial-body-feature-readings
    Explore at:
    Authors
    Deepali Javale
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    heart rate

  17. C

    Diabetes + Hypertension (comorbidity)

    • data.wprdc.org
    • catalog.data.gov
    • +1more
    csv, html
    Updated Jun 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allegheny County (2024). Diabetes + Hypertension (comorbidity) [Dataset]. https://data.wprdc.org/dataset/diabetes-hypertension-comorbidity
    Explore at:
    html, csv(11966), csvAvailable download formats
    Dataset updated
    Jun 3, 2024
    Dataset provided by
    Allegheny County
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This data set provides de-identified population data for diabetes and hypertension comorbidity prevalence in Allegheny County. The data is provided by three managed care organizations in Allegheny County (Gateway Health Plan, Highmark Health, and UPMC) and represents their insured population for the 2015 and 2016 calendar years.

    Disclaimer: Users should be cautious of using administrative claims data as a measure of disease prevalence and interpreting trends over time, as data provided were collected for purposes other than surveillance. Limitations of these data include but are not limited to: misclassification, duplicate individuals, exclusion of individuals who did not seek care in past two years and those who are: uninsured, enrolled in plans not represented in the dataset, or were not enrolled in one of the represented plans for at least 90 days.

    Support for Health Equity datasets and tools provided by Amazon Web Services (AWS) through their Health Equity Initiative.

  18. m

    Type 2 Diabetes Mellitus Tongue Dataset

    • data.mendeley.com
    Updated Oct 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Saddam Zikri Dalimunthe (2024). Type 2 Diabetes Mellitus Tongue Dataset [Dataset]. http://doi.org/10.17632/hyb44jf936.2
    Explore at:
    Dataset updated
    Oct 7, 2024
    Authors
    Muhammad Saddam Zikri Dalimunthe
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset comprises tongue images categorized into two groups: non-diabetic tongue images and tongue images from patients with Type 2 Diabetes Mellitus (T2DM). The dataset is designed to support the development of deep learning models for detecting T2DM based on tongue image characteristics. These images capture various features such as tongue texture, color, and coating, which are known to differ between healthy individuals and those with T2DM. The dataset can be employed for training, validating, and testing machine learning algorithms aimed at automating the early screening of Type 2 Diabetes based on tongue analysis.

  19. Table_1_Diabetes websites lack information on dietary causes, risk factors,...

    • frontiersin.figshare.com
    • figshare.com
    docx
    Updated Jul 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lisa T. Crummett; Muhammad H. Aslam (2023). Table_1_Diabetes websites lack information on dietary causes, risk factors, and preventions for type 2 diabetes.docx [Dataset]. http://doi.org/10.3389/fpubh.2023.1159024.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jul 13, 2023
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    Lisa T. Crummett; Muhammad H. Aslam
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IntroductionType 2 diabetes (T2D) is a growing public health burden throughout the world. Many people looking for information on how to prevent T2D will search on diabetes websites. Multiple dietary factors have a significant association with T2D risk, such as high intake of added sugars, refined carbohydrates, saturated fat, and red meat or processed meat; and decreased intake of dietary fiber, and fruits/vegetables. Despite this dietary information being available in the scientific literature, it is unclear whether this information is available in gray literature (websites).ObjectiveIn this study, we evaluate the use of specific terms from diabetes websites that are significantly associated with causes/risk factors and preventions for T2D from three term categories: (A) dietary factors, (B) nondietary nongenetic (lifestyle-associated) factors, and (C) genetic (non-modifiable) factors. We also evaluate the effect of website type (business, government, nonprofit) on term usage among websites.MethodsWe used web scraping and coding tools to quantify the use of specific terms from 73 diabetes websites. To determine the effect of term category and website type on the usage of specific terms among 73 websites, a repeated measures general linear model was performed.ResultsWe found that dietary risk factors that are significantly associated with T2D (e.g., sugar, processed carbohydrates, dietary fat, fruits/vegetables, fiber, processed meat/red meat) were mentioned in significantly fewer websites than either nondietary nongenetic factors (e.g., obesity, physical activity, dyslipidemia, blood pressure) or genetic factors (age, family history, ethnicity). Among websites that provided “eat healthy” guidance, one third provided zero dietary factors associated with type 2 diabetes, and only 30% provided more than two specific dietary factors associates with type 2 diabetes. We also observed that mean percent usage of all terms associated with T2D causes/risk factors and preventions was significantly lower among government websites compared to business websites and nonprofit websites.ConclusionDiabetes websites need to increase their usage of dietary factors when discussing causes/risk factors and preventions for T2D; as dietary factors are modifiable and strongly associated with all nondietary nongenetic risk factors, in addition to T2D risk.

  20. P

    Kaggle EyePACS Dataset

    • paperswithcode.com
    Updated Oct 28, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Kaggle EyePACS Dataset [Dataset]. https://paperswithcode.com/dataset/kaggle-eyepacs
    Explore at:
    Dataset updated
    Oct 28, 2020
    Description

    Diabetic retinopathy is the leading cause of blindness in the working-age population of the developed world. It is estimated to affect over 93 million people.

    retina

    The US Center for Disease Control and Prevention estimates that 29.1 million people in the US have diabetes and the World Health Organization estimates that 347 million people have the disease worldwide. Diabetic Retinopathy (DR) is an eye disease associated with long-standing diabetes. Around 40% to 45% of Americans with diabetes have some stage of the disease. Progression to vision impairment can be slowed or averted if DR is detected in time, however this can be difficult as the disease often shows few symptoms until it is too late to provide effective treatment.

    Currently, detecting DR is a time-consuming and manual process that requires a trained clinician to examine and evaluate digital color fundus photographs of the retina. By the time human readers submit their reviews, often a day or two later, the delayed results lead to lost follow up, miscommunication, and delayed treatment.

    Clinicians can identify DR by the presence of lesions associated with the vascular abnormalities caused by the disease. While this approach is effective, its resource demands are high. The expertise and equipment required are often lacking in areas where the rate of diabetes in local populations is high and DR detection is most needed. As the number of individuals with diabetes continues to grow, the infrastructure needed to prevent blindness due to DR will become even more insufficient.

    The need for a comprehensive and automated method of DR screening has long been recognized, and previous efforts have made good progress using image classification, pattern recognition, and machine learning. With color fundus photography as input, the goal of this competition is to push an automated detection system to the limit of what is possible – ideally resulting in models with realistic clinical potential. The winning models will be open sourced to maximize the impact such a model can have on improving DR detection.

    Acknowledgements This competition is sponsored by the California Healthcare Foundation.

    Retinal images were provided by EyePACS, a free platform for retinopathy screening.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
U.S. EPA Office of Research and Development (ORD) (2022). Diabetes control is associated with environmental quality in the U.S. [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/diabetes-control-is-associated-with-environmental-quality-in-the-u-s
Organization logo

Diabetes control is associated with environmental quality in the U.S.

Explore at:
5 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jul 21, 2022
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Area covered
United States
Description

Population-based county-level estimates for prevalence of DC were obtained from the Institute for Health Metrics and Evaluation (IHME) for the years 2004-2012 (16). DC prevalence rate was defined as the propor-tion of people within a county who had previously been diagnosed with diabetes (high fasting plasma glu-cose 126 mg/dL, hemoglobin A1c (HbA1c) of 6.5%, or diabetes diagnosis) but do not currently have high fasting plasma glucose or HbA1c for the period 2004-2012. DC prevalence estimates were calculated using a two-stage approach. The first stage used National Health and Nutrition Examination Survey (NHANES) data to predict high fasting plasma glucose (FPG) levels (≥126 mg/dL) and/or HbA1C levels (≥6.5% [48 mmol/mol]) based on self-reported demographic and behavioral characteristics (16). This model was then applied to Behavioral Risk Factor Surveillance System (BRFSS) data to impute high FPG and/or HbA1C status for each BRFSS respondent (16). The second stage used the imputed BRFSS data to fit a series of small area models, which were used to predict county-level prevalence of diabetes-related outcomes, including DC (16). The EQI was constructed for 2006-2010 for all US counties and is composed of five domains (air, water, built, land, and sociodemographic), each composed of variables to represent the environmental quality of that _domain. Domain-specific EQIs were developed using principal components analysis (PCA) to reduce these variables within each _domain while the overall EQI was constructed from a second PCA from these individual domains (L. C. Messer et al., 2014). To account for differences in environment across rural and urban counties, the overall and _domain-specific EQIs were stratified by rural urban continuum codes (RUCCs) (U.S. Department of Agriculture, 2015). Results are reported as prevalence rate differences (PRD) with 95% confidence intervals (CIs) comparing the highest quintile/worst environmental quality to the lowest quintile/best environmental quality expo-sure metrics. PRDs are representative of the entire period of interest, 2004-2012. Due to availability of DC data and covariate data, not all counties were captured, however, the majority, 3134 of 3142 were utilized in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Human health data are not available publicly. EQI data are available at: https://edg.epa.gov/data/Public/ORD/NHEERL/EQI. Format: Data are stored as csv files. This dataset is associated with the following publication: Jagai, J., A. Krajewski, K. Price, D. Lobdell, and R. Sargis. Diabetes control is associated with environmental quality in the USA. Endocrine Connections. BioScientifica Ltd., Bristol, UK, 10(9): 1018-1026, (2021).

Search
Clear search
Close search
Google apps
Main menu