79 datasets found
  1. 💀Deaths And Obesity - 🎀Health

    • kaggle.com
    zip
    Updated May 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    waticson (2024). 💀Deaths And Obesity - 🎀Health [Dataset]. https://www.kaggle.com/datasets/yutodennou/death-and-obesity
    Explore at:
    zip(224551 bytes)Available download formats
    Dataset updated
    May 24, 2024
    Authors
    waticson
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This data set summarizes obesity and the number of deaths caused by it in each country

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2993575%2Fb55c8c53db1eb6809cc0fb6b5a081195%2F2024-05-25%20093352.png?generation=1716597253375211&alt=media" alt="">

    💡I have already divided these into TRAIN data, TEST data, and ANSWER data so you guys can start working on the regression problem right away.

    • train.csv: Obesity and deaths data from 1990 to 2013
    • test.csv: The explanatory variable in 2014
    • answer.csv: The objective variable in 2014

    These data were created with the assumption that the number of deaths due to obesity in 2014 will be estimated from data from 1990 to 2013.

    There is also something called HINT data(hint.csv). This is data for 2015 and beyond. I have left it out of the train or test data because it has many missing values, but it may be useful for forecasting and for those who are interested in more recent data.

    VariablesDiscription
    Country205 country names
    CodeCountry code like AFG for Afghanistan
    YearYear of collecting data
    PopulationPopulation in a country
    Percentage-OverweightPercentage of defined as overweight, BMI >= 25(age-standardized estimate)(%),Sex: both sexes, Age group:18+
    Mean-Daily-Caloric-SupplyMean of daily supply of calories among overweight or obesity, BMI >= 25(age-standardized). Only about men
    Mean-BMIBMI, Age group:18+ years. 2 columns for both male and female
    Percentage-Overweighted-MalePercentage of adults who are overweight (age-standardized) - Age group: 18+ years. 2 columns for both male and female
    Prevalence-Hypertension-MalePrevalence of hypertension among adults aged 30-79 years(age-standardized). 2 columns for both male and female
    Prevalence-ObesityPrevalence of obesity among adults, BMI >= 30(age-standardized estimate)(%),Sex: both sexes, Age group:18+
    Death-By-High-BMIDeaths that are from all causes attributed to high body-mass index per 100,000 people, in both sexes aged age-standarized
  2. Obesity Levels

    • kaggle.com
    zip
    Updated Apr 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fatemeh Mehrparvar (2024). Obesity Levels [Dataset]. https://www.kaggle.com/datasets/fatemehmehrparvar/obesity-levels
    Explore at:
    zip(58968 bytes)Available download formats
    Dataset updated
    Apr 7, 2024
    Authors
    Fatemeh Mehrparvar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Obesity

    Obesity, which causes physical and mental problems, is a global health problem with serious consequences. The prevalence of obesity is increasing steadily, and therefore, new research is needed that examines the influencing factors of obesity and how to predict the occurrence of the condition according to these factors.

    " https://www.semanticscholar.org/paper/Estimation-of-Obesity-Levels-with-a-Trained-Neural-Ya%C4%9F%C4%B1n-G%C3%BCl%C3%BC/2c1eab51db154493d225c8b86ba885bbaf147a2c "

    Dataset Information

    This dataset include data for the estimation of obesity levels in individuals from the countries of Mexico, Peru and Colombia, based on their eating habits and physical condition. The data contains 17 attributes and 2111 records, the records are labeled with the class variable NObesity (Obesity Level), that allows classification of the data using the values of Insufficient Weight, Normal Weight, Overweight Level I, Overweight Level II, Obesity Type I, Obesity Type II and Obesity Type III. 77% of the data was generated synthetically using the Weka tool and the SMOTE filter, 23% of the data was collected directly from users through a web platform.

    Gender: Feature, Categorical, "Gender" Age : Feature, Continuous, "Age"
    Height: Feature, Continuous
    Weight: Feature Continuous
    family_history_with_overweight: Feature, Binary, " Has a family member suffered or suffers from overweight? "

    FAVC : Feature, Binary, " Do you eat high caloric food frequently? "
    FCVC : Feature, Integer, " Do you usually eat vegetables in your meals? "
    NCP : Feature, Continuous, " How many main meals do you have daily? "
    CAEC : Feature, Categorical, " Do you eat any food between meals? "
    SMOKE : Feature, Binary, " Do you smoke? "
    CH2O: Feature, Continuous, " How much water do you drink daily? "
    SCC: Feature, Binary, " Do you monitor the calories you eat daily? "
    FAF: Feature, Continuous, " How often do you have physical activity? "
    TUE : Feature, Integer, " How much time do you use technological devices such as cell phone, videogames, television, computer and others? "

    CALC : Feature, Categorical, " How often do you drink alcohol? "
    MTRANS : Feature, Categorical, " Which transportation do you usually use? "
    NObeyesdad : Target, Categorical, "Obesity level"

  3. d

    National Obesity By State

    • catalog.data.gov
    • gimi9.com
    • +3more
    Updated Nov 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lake County Illinois GIS (2024). National Obesity By State [Dataset]. https://catalog.data.gov/dataset/national-obesity-by-state-d765a
    Explore at:
    Dataset updated
    Nov 22, 2024
    Dataset provided by
    Lake County Illinois GIS
    Description

    National Obesity Percentages by State. Explanation of Field Attributes:Obesity - The percent of the state population that is considered obese from the 2015 CDC BRFSS Survey.

  4. Obesity Prediction Dataset

    • kaggle.com
    Updated Jan 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RK (2025). Obesity Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/ruchikakumbhar/obesity-prediction
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 14, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    RK
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Overview: This dataset include data for the estimation of obesity levels in individuals from the countries of Mexico, Peru and Colombia, based on their eating habits and physical condition. The data contains 17 attributes and 2111 records, the records are labeled with the class variable NObesity (Obesity Level), that allows classification of the data using the values of Insufficient Weight, Normal Weight, Overweight Level I, Overweight Level II, Obesity Type I, Obesity Type II and Obesity Type III.

    Data Details: - Gender: Gender
    - Age: Age
    - Height : in metres
    - Weight : in kgs
    - family_history : Has a family member suffered or suffers from overweight?
    - FAVC : Do you eat high caloric food frequently?
    - FCVC : Do you usually eat vegetables in your meals?
    - NCP : How many main meals do you have daily? - CAEC : Do you eat any food between meals?
    - SMOKE : Do you smoke?
    - CH2O : How much water do you drink daily?
    - SCC : Do you monitor the calories you eat daily?
    - FAF: How often do you have physical activity?
    - TUE : How much time do you use technological devices such as cell phone, videogames, television, computer and others? - CALC : How often do you drink alcohol?
    - MTRANS : Which transportation do you usually use? - Obesity_level (Target Column) : Obesity level

    https://www.semanticscholar.org/paper/Dataset-for-estimation-of-obesity-levels-based-on-Palechor-Manotas/35b40bacd2ffa9370885b7a3004d88995fd1d011

  5. d

    Statistics on Obesity, Physical Activity and Diet (replaced by Statistics on...

    • digital.nhs.uk
    Updated May 5, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Statistics on Obesity, Physical Activity and Diet (replaced by Statistics on Public Health) [Dataset]. https://digital.nhs.uk/data-and-information/publications/statistical/statistics-on-obesity-physical-activity-and-diet
    Explore at:
    Dataset updated
    May 5, 2020
    License

    https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions

    Time period covered
    Apr 1, 2018 - Dec 31, 2019
    Description

    This report presents information on obesity, physical activity and diet drawn together from a variety of sources for England. More information can be found in the source publications which contain a wider range of data and analysis. Each section provides an overview of key findings, as well as providing links to relevant documents and sources. Some of the data have been published previously by NHS Digital. A data visualisation tool (link provided within the key facts) allows users to select obesity related hospital admissions data for any Local Authority (as contained in the data tables), along with time series data from 2013/14. Regional and national comparisons are also provided. The report includes information on: Obesity related hospital admissions, including obesity related bariatric surgery. Obesity prevalence. Physical activity levels. Walking and cycling rates. Prescriptions items for the treatment of obesity. Perception of weight and weight management. Food and drink purchases and expenditure. Fruit and vegetable consumption. Key facts cover the latest year of data available: Hospital admissions: 2018/19 Adult obesity: 2018 Childhood obesity: 2018/19 Adult physical activity: 12 months to November 2019 Children and young people's physical activity: 2018/19 academic year

  6. a

    Obesity Percentages

    • data-lakecountyil.opendata.arcgis.com
    • catalog.data.gov
    • +2more
    Updated Dec 9, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lake County Illinois GIS (2016). Obesity Percentages [Dataset]. https://data-lakecountyil.opendata.arcgis.com/datasets/obesity-percentages/about
    Explore at:
    Dataset updated
    Dec 9, 2016
    Dataset authored and provided by
    Lake County Illinois GIS
    License

    https://www.arcgis.com/sharing/rest/content/items/89679671cfa64832ac2399a0ef52e414/datahttps://www.arcgis.com/sharing/rest/content/items/89679671cfa64832ac2399a0ef52e414/data

    Area covered
    Description

    Obesity percentages for Lake County, Illinois. Explanation of field attributes:

    Pct_Obese – The percent of people in the zip code who are considered obese, defined as having a BMI greater than or equal to 30.

    ObsOrOvrwt –The percent of people in the zip code who are considered overweight (defined as having a BMI greater than or equal to 25 but less than 30) or obese (defined as having a BMI greater than or equal to 30).

  7. c

    Obesity in adults (ages 18 plus): England

    • data.catchmentbasedapproach.org
    Updated May 25, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Obesity in adults (ages 18 plus): England [Dataset]. https://data.catchmentbasedapproach.org/datasets/obesity-in-adults-ages-18-plus-england
    Explore at:
    Dataset updated
    May 25, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of obesity in adults (aged 18+). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to obesity in adults (aged 18+).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s adult population (aged 18+) that are obese was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s adult population that are obese was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA that are obese, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the adult population within that MSOA who are estimated to be obeseB) the NUMBER of adults within that MSOA who are estimated to be obeseAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to be obese compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people are obese, and where those people make up a large percentage of the population, indicating there is a real issue with obesity within the adult population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. This dataset also shows rural areas (with little or no population) that do not officially fall into any GP catchment area and for which there were no statistics regarding adult obesity (although this will not affect the results of this analysis if there are no people living in those areas).2. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of adult obesity, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of adult obesity.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  8. Obesity in California, 2012 and 2013

    • data.chhs.ca.gov
    • data.ca.gov
    • +4more
    csv, xlsx, zip
    Updated Nov 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2025). Obesity in California, 2012 and 2013 [Dataset]. https://data.chhs.ca.gov/dataset/obesity-in-california-2012-and-2013
    Explore at:
    xlsx, csv, zipAvailable download formats
    Dataset updated
    Nov 7, 2025
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    Area covered
    California
    Description

    These data are from the 2013 California Dietary Practices Surveys (CDPS), 2012 California Teen Eating, Exercise and Nutrition Survey (CalTEENS), and 2013 California Children’s Healthy Eating and Exercise Practices Surveys (CalCHEEPS). These surveys have been discontinued. Adults, adolescents, and children (with parental assistance) were asked for their current height and weight, from which, body mass index (BMI) was calculated. For adults, a BMI of 30.0 and above is considered obese. For adolescents and children, obesity is defined as having a BMI at or above the 95th percentile, according to CDC growth charts.

    The California Dietary Practices Surveys (CDPS), the California Teen Eating, Exercise and Nutrition Survey (CalTEENS), and the California Children’s Healthy Eating and Exercise Practices Surveys (CalCHEEPS) (now discontinued) were the most extensive dietary and physical activity assessments of adults 18 years and older, adolescents 12 to 17, and children 6 to 11, respectively, in the state of California. CDPS and CalCHEEPS were administered biennially in odd years up through 2013 and CalTEENS was administered biennially in even years through 2014. The surveys were designed to monitor dietary trends, especially fruit and vegetable consumption, among Californias for evaluating their progress toward meeting the Dietary Guidelines for Americans and the Healthy People 2020 Objectives. All three surveys were conducted via telephone. Adult and adolescent data were collected using a list of participating CalFresh households and random digit dial, and child data were collected using only the list of CalFresh households. Older children (9-11) were the primary respondents with some parental assistance. For younger children (6-8), the primary respondent was parents. Data were oversampled for low-income and African American to provide greater sensitivity for analyzing trends among the target population. Wording of the question used for these analyses varied by survey (age group). The questions were worded are as follows: Adult:1) How tall are you without shoes?2) How much do you weigh?Adolescent:1) About how much do you weigh without shoes?2) About how tall are you without shoes? Child:1) How tall is [child's name] now without shoes on?2) How much does [child's name] weigh now without shoes on?

  9. a

    Levels of obesity and inactivity related illnesses (physical illnesses):...

    • hub.arcgis.com
    • data.catchmentbasedapproach.org
    Updated Apr 7, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Levels of obesity and inactivity related illnesses (physical illnesses): Summary (England) [Dataset]. https://hub.arcgis.com/datasets/76bef8a953c44f36b569c37d7bdec45e
    Explore at:
    Dataset updated
    Apr 7, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of physical illnesses that are linked with obesity and inactivity. Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to:- Asthma (in persons of all ages)- Cancer (in persons of all ages)- Chronic kidney disease (in adults aged 18+)- Coronary heart disease (in persons of all ages)- Diabetes mellitus (in persons aged 17+)- Hypertension (in persons of all ages)- Stroke and transient ischaemic attack (in persons of all ages)This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.For each of the above illnesses, the percentage of each MSOA’s population with that illness was estimated. This was achieved by calculating a weighted average based on:- The percentage of the MSOA area that was covered by each GP practice’s catchment area- Of the GPs that covered part of that MSOA: the percentage of patients registered with each GP that have that illnessThe estimated percentage of each MSOA’s population with each illness was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with each illness, within the relevant age range.For each illness, each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have that illnessB) the NUMBER of people within that MSOA who are estimated to have that illnessAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA predicted to have that illness, compared to other MSOAs. In other words, those are areas where a large number of people are predicted to suffer from an illness, and where those people make up a large percentage of the population, indicating there is a real issue with that illness within the population and the investment of resources to address that issue could have the greatest benefits.The scores for each of the 7 illnesses were added together then converted to a relative score between 1 – 0 (1 = worst, 0 = best), to give an overall score for each MSOA: a score close to 1 would indicate that an area has high predicted levels of all obesity/inactivity-related illnesses, and these are areas where the local population could benefit the most from interventions to address those illnesses. A score close to 0 would indicate very low predicted levels of obesity/inactivity-related illnesses and therefore interventions might not be required.LIMITATIONS1. GPs do not have catchments that are mutually exclusive from each other: they overlap, with some geographic areas being covered by 30+ practices. This dataset should be viewed in combination with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset to identify where there are areas that are covered by multiple GP practices but at least one of those GP practices did not provide data. Results of the analysis in these areas should be interpreted with caution, particularly if the levels of obesity/inactivity-related illnesses appear to be significantly lower than the immediate surrounding areas.2. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).3. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.4. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of obesity/inactivity-related illnesses, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of these illnesses. TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:- Health and wellbeing statistics (GP-level, England): Missing data and potential outliersDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  10. Percentage of obese U.S. adults by state 2023

    • statista.com
    Updated Nov 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Percentage of obese U.S. adults by state 2023 [Dataset]. https://www.statista.com/statistics/378988/us-obesity-rate-by-state/
    Explore at:
    Dataset updated
    Nov 19, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2023
    Area covered
    United States
    Description

    West Virginia, Mississippi, and Arkansas are the U.S. states with the highest percentage of their population who are obese. The states with the lowest percentage of their population who are obese include Colorado, Hawaii, and Massachusetts. Obesity in the United States Obesity is a growing problem in many countries around the world, but the United States has the highest rate of obesity among all OECD countries. The prevalence of obesity in the United States has risen steadily over the previous two decades, with no signs of declining. Obesity in the U.S. is more common among women than men, and overweight and obesity rates are higher among African Americans than any other race or ethnicity. Causes and health impacts Obesity is most commonly the result of a combination of poor diet, overeating, physical inactivity, and a genetic susceptibility. Obesity is associated with various negative health impacts, including an increased risk of cardiovascular diseases, certain types of cancer, and diabetes type 2. As of 2022, around 8.4 percent of the U.S. population had been diagnosed with diabetes. Diabetes is currently the eighth leading cause of death in the United States.

  11. f

    Validation data (obesity, diabetes)

    • figshare.com
    txt
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luca Maria Aiello (2023). Validation data (obesity, diabetes) [Dataset]. http://doi.org/10.6084/m9.figshare.7796672.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Authors
    Luca Maria Aiello
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This set of files contains public data used to validate the grocery data. All references to the original sources are provided below.CHILD OBESITYPeriodically, the English National Health Service (NHS) publishes statistics about various aspects of the health and habits of people living in England, including obesity. The NHS National Child Measurement (NCMP) measures the height and weight of children in Reception class (aged 4 to 5) and year 6 (aged 10 to 11), to assess overweight and obesity levels in children within primary schools. The program is carried out every year in England and statistics are produced at the level of Local Authority (that corresponds to Boroughs in London). We report the data for the school year 2015-2016 (file: child_obesity_london_borough_2015-2016.csv). For the school year 2013-2014, statistics in London are also available at ward-level (file: child_obesity_london_ward_2013-2014.csv)The files are comma-separated and contain the following fields: area_id: the id of the boroughnumber_reception_measured: number of children in reception year measurednumber_y6_measured: number of children in reception year measuredprevalence_overweight_reception: the prevalence (percentage) of overweight children in reception year prevalence_overweight_y6: the prevalence (percentage) of overweight children in year 6prevalence_obese_reception: the prevalence (percentage) of obese children in reception yearprevalence_obese_y6: the prevalence (percentage) of obese children in year 6ADULT OBESITYThe Active People Survey (APS) was a survey used to measure the number of adults taking part in sport across England and included two questions about the height and weight of participants. We report the results of the APS for the year 2012. Prevalence of underweight, healthy weight, overweight, and obese people at borough level are provided in the file london_obesity_borough_2012.csv.The file is comma-separated and contains the following fields: area_id: the id of the boroughnumber_measured: number of people who participated in the surveyprevalence_healthy_weight: the prevalence (percentage) of healthy-weight peopleprevalence_overweight: the prevalence (percentage) of overweight peopleprevalence_obese: the prevalence (percentage) of obese peopleBARIATRIC HOSPITALIZATIONThe NHS records and publishes an annual compendium report about the number of hospital admissions attributable to obesity or bariatric surgery (i.e., weight loss surgery used as a treatment for people who are very obese), and the number of prescription items provided in primary care for the treatment of obesity. The NHS provides both raw counts at the Local Authority level and numbers normalized by population living in those areas. In the file obesity_hospitalization_borough_2016.csv, we report the statistics for the year 2015 (measurements made between Jan 2015 and March 2016).The file is comma-separated and contains the following fields:area_id: the id of the boroughtotal_hospitalizations: total number of obesity-related hospitalizationstotal_bariatric: total number of hospitalizations for bariatric surgeryprevalence_hospitalizations: prevalence (percentage) of obesity-related hospitalizations prevalence_bariatric: prevalence (percentage) of bariatric surgery hospitalizations DIABETESThrough the Quality and Outcomes Framework, NHS Digital publishes annually the number of people aged 17+ on a register for diabetes at each GP practice in England. NHS also publishes the number of people living in a census area who are registered to any of the GP in England. Based on these two sources, an estimate is produced about the prevalence of diabetes in each area. The data (file diabetes_estimates_osward_2016.csv) was collected in 2016 at LSOA-level and published at ward-level.The file is comma-separated and contains the following fields:area_id: the id of the wardgp_patients: total number of GP patients gp_patients_diabetes: total number of GP patients with a diabetes diagnosisestimated_diabetes_prevalence: prevalence (percentage) of diabetesAREA MAPPINGMapping of Greater London postcodes into larger geographical aggregations. The file is comma-separated and contains the following fields:pcd: postcodelat: latitudelong: longitudeoa11: output arealsoa11: lower super output areamsoa11: medium super output areaosward: wardoslaua: borough

  12. c

    Coronary heart disease (in persons of all ages): England

    • data.catchmentbasedapproach.org
    • hub.arcgis.com
    Updated Apr 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Coronary heart disease (in persons of all ages): England [Dataset]. https://data.catchmentbasedapproach.org/items/832de0122e4b4bba9ff69cadc1bf53c4
    Explore at:
    Dataset updated
    Apr 7, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of coronary heart disease (in persons of all ages). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to coronary heart disease (in persons of all ages).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (all ages) with coronary heart disease was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with coronary heart disease was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with coronary heart disease, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have coronary heart diseaseB) the NUMBER of people within that MSOA who are estimated to have coronary heart diseaseAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have coronary heart disease, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from coronary heart disease, and where those people make up a large percentage of the population, indicating there is a real issue with coronary heart disease within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of coronary heart disease, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of coronary heart disease.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  13. E

    World Obesity levels 2002-10

    • dtechtive.com
    • find.data.gov.scot
    xml, zip
    Updated Feb 22, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Edinburgh (2017). World Obesity levels 2002-10 [Dataset]. http://doi.org/10.7488/ds/1941
    Explore at:
    zip(4.643 MB), xml(0.0038 MB)Available download formats
    Dataset updated
    Feb 22, 2017
    Dataset provided by
    University of Edinburgh
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Area covered
    Global
    Description

    This dataset shows the levels of overweight and obese people by country. Data is provided for 2002 and 2010 as a percentage of the total population and is also broken down by sex. Rates of change between 2002 and 2010 are also provided. The data was collated by the World Health Organisation (WHO)(http://www.who.int/gho/ncd/risk_factors/overweight/en/index.html) and was downloaded via the Guardian website (http://www.theguardian.com/news/datablog/interactive/2013/feb/19/obesity-map-of-world-weight). GIS vector data. This dataset was first accessioned in the EDINA ShareGeo Open repository on 2014-01-03 and migrated to Edinburgh DataShare on 2017-02-22.

  14. Obesity in Adults - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Jun 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2025). Obesity in Adults - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/obesity-in-adults
    Explore at:
    Dataset updated
    Jun 9, 2025
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    The spreadsheet contains regional level obesity trend data from the the HSE, BMI data from Understanding Society, and adjusted prevalence of underweight, healthy weight, overweight, and obesity by local authority from the Active People Survey. Understanding Society data shows the percentage of the population aged 10 and over by their Body Mass Index Classification, covering underweight, normal weight, overweight, and three classes of obesity. Questions on self-reported height and weight were added to the Sport England Active People Survey (APS) in January 2012 to provide data for monitoring excess weight (overweight including obesity, BMI ≥25kg/m2) in adults (age 16 and over) at local authority level for the Public Health Outcomes Framework (PHOF). Health Survey for England (HSE) results at a national level are available on the NHS Information Centre website. Other NHS indicators on obesity are available for Strategic Health Authorities (SHA). Relevant links: http://discover.ukdataservice.ac.uk/series/?sn=2000053 http://www.noo.org.uk/visualisation/adult_obesity

  15. Obesity Dataset Cleaned and Data Sinthetic

    • kaggle.com
    zip
    Updated Mar 3, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mandy Sia (2022). Obesity Dataset Cleaned and Data Sinthetic [Dataset]. https://www.kaggle.com/datasets/mandysia/obesity-dataset-cleaned-and-data-sinthetic
    Explore at:
    zip(43113 bytes)Available download formats
    Dataset updated
    Mar 3, 2022
    Authors
    Mandy Sia
    Description

    Context

    This dataset is open-source data collected from ScienceDirect under a Creative Commons license. This dataset had collected some information of residents in Mexico, Peru and Colombia about their lifestyle. The original file type is off so I transformed it into a CSV file then did some clean processes and transformations.

    Content

    id: unique id for each row

    Gender: sex - male or female

    Age: age

    Height: height

    Weight: weight

    family_history_with_overweight: Has a family member suffered or suffers f from overweight? - yes or no

    FAVC: Frequent consumption of high caloric food - yes or no

    FCVC: Frequency of consumption of vegetables - Never, Sometimes, Always

    NCP: Number of main meals - 1, 2, 3, 4

    CAEC: Consumption of food between meals - No, Sometimes, Frequently, Always

    SMOKE: Do you smoke - yes o no

    CH2O: Consumption of water daily - Less than a litter, between 1 and 2 l, more than 2 l

    SCC: Calories consumption monitoring - yes or no

    FAF: Physical activity frequency - 0, 1 to 2, 2 to 4, 4 to 5

    TUE: Time using technology devices - 0 to 2, 3 to 5, >5

    CALC: Consumption of alcohol - no, sometimes, frequently, always

    MTRANS: Transportation used - automobile, motorbike, bike, public_transportation, walking

    NObeyesdad: Type of obesity - insufficient_weight, normal_weight, overweight-level_i, overweight-level_ii, obesity_type_i, obesity_type_ii, obesity_type_iii BMI: Body mass index

  16. a

    Cancer (in persons of all ages): England

    • hub.arcgis.com
    • data.catchmentbasedapproach.org
    Updated Apr 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Cancer (in persons of all ages): England [Dataset]. https://hub.arcgis.com/datasets/c5c07229db684a65822fdc9a29388b0b
    Explore at:
    Dataset updated
    Apr 6, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of cancer (in persons of all ages). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to cancer (in persons of all ages).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (all ages) with cancer was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with cancer was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with cancer, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have cancerB) the NUMBER of people within that MSOA who are estimated to have cancerAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have cancer, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from cancer, and where those people make up a large percentage of the population, indicating there is a real issue with cancer within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of cancer, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of cancer.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.MSOA boundaries: © Office for National Statistics licensed under the Open Government Licence v3.0. Contains OS data © Crown copyright and database right 2021.Population data: Mid-2019 (June 30) Population Estimates for Middle Layer Super Output Areas in England and Wales. © Office for National Statistics licensed under the Open Government Licence v3.0. © Crown Copyright 2020.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital; © Office for National Statistics licensed under the Open Government Licence v3.0. Contains OS data © Crown copyright and database right 2021. © Crown Copyright 2020.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  17. c

    Hypertension (in persons of all ages): England

    • data.catchmentbasedapproach.org
    Updated Apr 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Hypertension (in persons of all ages): England [Dataset]. https://data.catchmentbasedapproach.org/datasets/hypertension-in-persons-of-all-ages-england
    Explore at:
    Dataset updated
    Apr 7, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of hypertension (in persons of all ages). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to hypertension (in persons of all ages).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (all ages) with hypertension was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with hypertension was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with hypertension , within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have hypertension B) the NUMBER of people within that MSOA who are estimated to have hypertension An average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have hypertension , compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from hypertension, and where those people make up a large percentage of the population, indicating there is a real issue with hypertension within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of hypertension, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of hypertension .TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  18. c

    Diabetes mellitus (in persons aged 17 and over): England

    • data.catchmentbasedapproach.org
    Updated Apr 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Diabetes mellitus (in persons aged 17 and over): England [Dataset]. https://data.catchmentbasedapproach.org/datasets/diabetes-mellitus-in-persons-aged-17-and-over-england
    Explore at:
    Dataset updated
    Apr 7, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of diabetes mellitus in persons (aged 17+). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to diabetes mellitus in persons (aged 17+).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (aged 17+) with diabetes mellitus was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with diabetes mellitus was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with depression, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have diabetes mellitusB) the NUMBER of people within that MSOA who are estimated to have diabetes mellitusAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have diabetes mellitus, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from diabetes mellitus, and where those people make up a large percentage of the population, indicating there is a real issue with diabetes mellitus within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of diabetes mellitus, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of diabetes mellitus.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  19. a

    Depression (in adults aged 18 and over): England

    • hub.arcgis.com
    • data.catchmentbasedapproach.org
    Updated Apr 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Depression (in adults aged 18 and over): England [Dataset]. https://hub.arcgis.com/maps/theriverstrust::depression-in-adults-aged-18-and-over-england
    Explore at:
    Dataset updated
    Apr 6, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of depression in adults (aged 18+). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to depression in adults (aged 18+).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (aged 18+) with depression was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with depression was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with depression, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have depressionB) the NUMBER of people within that MSOA who are estimated to have depressionAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have depression, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from depression, and where those people make up a large percentage of the population, indicating there is a real issue with depression within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of depression, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of depression.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

  20. c

    Levels of obesity, inactivity and associated illnesses (England): Missing...

    • data.catchmentbasedapproach.org
    • hub.arcgis.com
    Updated Apr 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Rivers Trust (2021). Levels of obesity, inactivity and associated illnesses (England): Missing data [Dataset]. https://data.catchmentbasedapproach.org/datasets/theriverstrust::levels-of-obesity-inactivity-and-associated-illnesses-england-missing-data/about
    Explore at:
    Dataset updated
    Apr 8, 2021
    Dataset authored and provided by
    The Rivers Trust
    Area covered
    Description

    SUMMARYTo be viewed in combination with the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.This dataset shows where there was no data* relating to one of more of the following factors:Obesity/inactivity-related illnesses (recorded at the GP practice catchment area level*)Adult obesity (recorded at the GP practice catchment area level*)Inactivity in children (recorded at the district level)Excess weight in children (recorded at the Middle Layer Super Output Area level)* GPs do not have catchments that are mutually exclusive from each other: they overlap, with some geographic areas being covered by 30+ practices.GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. This dataset identifies areas where data from 2019/20 was used, where one or more GPs did not submit data in either year (this could be because there are rural areas that aren’t officially covered by any GP practices), or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution.Results of the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ analysis in these areas should be interpreted with caution, particularly if the levels of obesity, inactivity and associated illnesses appear to be significantly lower than in their immediate surrounding areas.Really small areas with ‘missing’ data were deleted, where it was deemed that missing data will not have impacted the overall analysis (i.e. where GP data was missing from really small countryside areas where no people live).See also Health and wellbeing statistics (GP-level, England): Missing data and potential outliers dataDATA SOURCESThis dataset was produced using:- Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.- National Child Measurement Programme: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. - Active Lives Survey 2019: Sport and Physical Activity Levels amongst children and young people in school years 1-11 (aged 5-16). © Sport England 2020.- Active Lives Survey 2019: Sport and Physical Activity Levels amongst adults aged 16+. © Sport England 2020.- GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.- Administrative boundaries: Boundary-LineTM: Contains Ordnance Survey data © Crown copyright and database right 2021. Contains public sector information licensed under the Open Government Licence v3.0.- MSOA boundaries: © Office for National Statistics licensed under the Open Government Licence v3.0. Contains OS data © Crown copyright and database right 2021.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital; © Sport England 2020; © Office for National Statistics licensed under the Open Government Licence v3.0. Contains Ordnance Survey data © Crown copyright and database right 2021. Contains public sector information licensed under the Open Government Licence v3.0.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
waticson (2024). 💀Deaths And Obesity - 🎀Health [Dataset]. https://www.kaggle.com/datasets/yutodennou/death-and-obesity
Organization logo

💀Deaths And Obesity - 🎀Health

Projection of deaths due to obesity and overweight in each country

Explore at:
zip(224551 bytes)Available download formats
Dataset updated
May 24, 2024
Authors
waticson
License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

This data set summarizes obesity and the number of deaths caused by it in each country

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2993575%2Fb55c8c53db1eb6809cc0fb6b5a081195%2F2024-05-25%20093352.png?generation=1716597253375211&alt=media" alt="">

💡I have already divided these into TRAIN data, TEST data, and ANSWER data so you guys can start working on the regression problem right away.

  • train.csv: Obesity and deaths data from 1990 to 2013
  • test.csv: The explanatory variable in 2014
  • answer.csv: The objective variable in 2014

These data were created with the assumption that the number of deaths due to obesity in 2014 will be estimated from data from 1990 to 2013.

There is also something called HINT data(hint.csv). This is data for 2015 and beyond. I have left it out of the train or test data because it has many missing values, but it may be useful for forecasting and for those who are interested in more recent data.

VariablesDiscription
Country205 country names
CodeCountry code like AFG for Afghanistan
YearYear of collecting data
PopulationPopulation in a country
Percentage-OverweightPercentage of defined as overweight, BMI >= 25(age-standardized estimate)(%),Sex: both sexes, Age group:18+
Mean-Daily-Caloric-SupplyMean of daily supply of calories among overweight or obesity, BMI >= 25(age-standardized). Only about men
Mean-BMIBMI, Age group:18+ years. 2 columns for both male and female
Percentage-Overweighted-MalePercentage of adults who are overweight (age-standardized) - Age group: 18+ years. 2 columns for both male and female
Prevalence-Hypertension-MalePrevalence of hypertension among adults aged 30-79 years(age-standardized). 2 columns for both male and female
Prevalence-ObesityPrevalence of obesity among adults, BMI >= 30(age-standardized estimate)(%),Sex: both sexes, Age group:18+
Death-By-High-BMIDeaths that are from all causes attributed to high body-mass index per 100,000 people, in both sexes aged age-standarized
Search
Clear search
Close search
Google apps
Main menu