Facebook
TwitterNHIS Adult Summary Health Statistics
Description
Interactive Summary Health Statistics for Adults provide annual estimates of selected health topics for adults aged 18 years and over based on final data from the National Health Interview Survey.
Dataset Details
Publisher: Centers for Disease Control and Prevention Temporal Coverage: 2019/2023 Last Modified: 2025-04-21 Contact: National Center for Health Statistics (cdcinfo@cdc.gov)
Source
Original… See the full description on the dataset page: https://huggingface.co/datasets/HHS-Official/nhis-adult-summary-health-statistics.
Facebook
Twitter2019–present. The National Health Interview Survey (NHIS) is a nationally representative household health survey of the U.S. civilian noninstitutionalized population. The NHIS data are used to monitor trends in illness and disability, track progress toward achieving national health objectives, for epidemiologic and policy analysis of various health problems, determining barriers to accessing and using appropriate health care, and evaluating Federal health programs. NHIS is conducted continuously throughout the year by the National Center for Health Statistics (NCHS). Public-use data files on adults and children with corresponding imputed income data files, and survey paradata are released annually. The NHIS data website (https://www.cdc.gov/nchs/nhis/documentation/index.html) features the most up-to-date public-use data files and documentation for downloading including questionnaire, codebooks, CSV and ASCII data files, programs and sample code, and in-depth survey description. Most of the NHIS data are included in the public use files. NHIS is protected by Federal confidentiality laws that state the data collected by NCHS may be used only for statistical reporting and analysis. Some NHIS variables have been suppressed or edited in the public use files to protect confidentiality. Analysts interested in using data that has been suppressed or edited may apply for access through the NCHS Research Data Center at https://www.cdc.gov/rdc/. In 2019, NHIS launched a redesigned content and structure that differs from its previous questionnaire designs. NHIS has been conducted continuously since 1957.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
a Covariates: Age group per 5 years, socioeconomic status group (decile), nickel allergy.CI, Confidence interval; OR, Odds ratioMultivariate logistic regression analysis of endometriosis and nickel allergy.
Facebook
Twitterhttps://www.icpsr.umich.edu/web/ICPSR/studies/9703/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/9703/terms
The basic purpose of the National Health Interview Survey (NHIS) is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. The 1989 Health Insurance Supplement provides 93 variables from the core Person File (see NATIONAL HEALTH INTERVIEW SURVEY, 1989 [ICPSR 9583]) including sex, age, race, marital status, veteran status, education, income, industry and occupation codes, and limits on activity. It also contains summary variables from the Health Conditions, Doctor Visits, and Hospital Episode files of the 1989 NHIS. The 53 variables unique to this supplement include the type of health insurance program the respondent was covered by (including public assistance programs), extent of coverage, reasons for not carrying health insurance, and effects of job loss on coverage.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This series of code and corresponding data files are intended for use in cognitive decline and Alzheimer’s disease and related dementias (ADRD) research. The files include twelve years of cleaned datasets derived from the 2007-2018 years of the National Health Interview Survey (NHIS). NHIS is a nationally representative study aimed at monitoring the health of the non-institutionalized United States population. The provided datasets include sociodemographic information on respondents’ age, sex, race, and marital status from the Sample Adult Files, cognition variables from the Sample Adult files and, in applicable years, merged cognition data from the Adult Functioning and Disability (AFD) supplement. The files were constructed to allow for users to append multiple years of data for longitudinal analysis. Brief and detailed summaries of the variables available in these datasets along with more detailed descriptions of performed calculations can be found in the provided data dictionaries. Users may also refer to the provided “Overview of variables across years” document to see which variables are available each year. SAS, Stata, and CSV data file formats are provided as are the full coding scripts used in Stata.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Odds ratios (ORs) for dementia according to prevalence of systemic rheumatic diseases.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundIdentifying acute ischemic stroke (AIS) among potential stroke cases is crucial for stroke research based on claims data. However, the accuracy of using the diagnostic codes of the International Classification of Diseases 10th revision was less than expected.MethodsFrom the National Health Insurance Service (NHIS) claims data, stroke cases admitted to the hospitals participating in the multicenter stroke registry (Clinical Research Collaboration for Stroke in Korea, CRCS-K) during the study period with principal or additional diagnosis codes of I60-I64 on the 10th revision of International Classification of Diseases were extracted. The datasets were randomly divided into development and validation sets with a ratio of 7:3. A stroke identification algorithm using the claims data was developed and validated through the linkage between the extracted datasets and the registry database.ResultsAltogether, 40,443 potential cases were extracted from the NHIS claims data, of which 31.7% were certified as AIS through linkage with the CRCS-K database. We selected 17 key identifiers from the claims data and developed 37 conditions through combinations of those key identifiers. The key identifiers comprised brain CT, MRI, use of tissue plasminogen activator, endovascular treatment, carotid endarterectomy or stenting, antithrombotics, anticoagulants, etc. The sensitivity, specificity, and diagnostic accuracy of the algorithm were 81.2%, 82.9%, and 82.4% in the development set, and 80.2%, 82.0%, and 81.4% in the validation set, respectively.ConclusionsOur stroke identification algorithm may be useful to grasp stroke burden in Korea. However, further efforts to refine the algorithm are necessary.
Facebook
TwitterIn Korea, everyone is compulsorily required to join the National Health Insurance. The National Health Insurance Service (NHIS), which manages national health insurance, provides basic health checkups to subscribers every year.
This dataset is the result of a random sampling of 1 million people per year from 2002 to 2021 among those who underwent basic health checkups provided by the NHIS.
Missing values are those that have not been selectively tested by individuals.
This dataset consists of a total of 19 csv files for each year, and each csv file contains only the health checkup results for that year. There are differences in the features of the dataset by year.
There are features that have been excluded or added by year.

Difference in AREA_CODE
After 2012, a new area, 'SEAJONG' was named and a new area code, 36, was added.

Categorization differences in AGE_GROUP
There is a difference in age categorization criteria between 2002 and 2013 and the dataset after 2014.
A description of each column is as follows.
| feature name | description | form of expression | range |
|---|---|---|---|
| YEAR | Base year of the information | YYYY | 2002~2020 |
| IDV_ID | Serial number assigned to subscriber | N | 1~1,000,000 |
| AREA_CODE | Residency code of examinee | N | |
| SEX | Gender | N | 1: male, 2:female |
| AGE_GROUP | A code that categorizes the examinee's age into 5-year-olds based on the year. Refer to the table below for details. | N | 2002~2013: 1~14, 2014~: 1~18 |
| HEIGHT | Examiner's height (in units of 5 cm) | N/cm | |
| WEIGHT | Examiner's weight (in units of 5 kg) | N/Kg | |
| WAIST | examiner's waist circumference | N/Kg | |
| SIGHT_LEFT | Eyesight of the examinee's left eye | N | (0.1~2.5, eyesight < 0.1 == 0.1, blind==9.9) |
| SIGHT_RIGHT | Eyesight of the examinee's right eye | N | (0.1~2.5, eyesight < 0.1 == 0.1, blind==9.9) |
| BP_HIGH | The examiner's systolic blood pressure | N/mmHg | |
| BP_LWST | Diastolic blood pressure of examinee | N/mmHg | |
| BLDS | Pre-meal blood glucose of the examinee. The concentration of glucose per 100 ml of blood | N/mg/dL | |
| TOT_CHOLE | Sum of ester and non-ester cholesterol in serum. Normal values are 150 to 250 mg/dL | mg/dL | |
| TRIGLYCERIDE | Amount of simple lipids or neutral lipids. Normal values are 30 to 135 mg/dL | mg/dL | |
| HDL_CHOLE | The amount of cholesterol contained in HDL. Normal values are 30 to 65 mg/dL. | mg/dL | |
| LDL_CHOLE | The amount of cholesterol contained in LDL. If it is 170 mg/dL or higher, hyper-LDLemia is diagnosed. | mg/dL | |
| CREATININE | Serum concentration of creatinine, the dehydration of creatine. Increases and decreases in creatinine are not related to food, but to muscle development and exercise. Normal values are 0.8 to 1.7 mg/dL. | mg/dL | |
| HMG | It is a pigment-protein present in blood and blood cells, composed of globin and heme, and plays a role as an oxygen carrier in the blood. | N/g/dL | |
| OLIG_PROTE_CD | excretion of protein in the urine | N | 1(-), 2(±), 3(+1), 4(+2), 5(+3), 6(+4) |
| SGOT_AST | Levels on blood tests that indicate liver function. Concentrations increase when liver cells, heart, kidney, brain, and muscle cells are damaged. Normal value is 0~40IU/L | N/IU/L | |
| SGPT_ALT | Levels in blood tests that indicate liver function. ALT mainly exists only in hepatocytes, and its concentration increases when hepatocytes are damaged. Normal values are 0 to 40 IU/L | N/IU/L | |
| GAMMA_GTP | Levels in blood tests that indicate liver function. Gamma GTP is an enzyme mainly present in the bile duct in the liver, and blood concentration increases when bile excretion disorder or hepatocellular disorder occurs. Normal values are 11 to 64 IU/L for men and 8 to 35 IU/L for women. | N/IU/L | |
| SMK_STAT | Whether or not the examinee's smoking status | N | 1 (don't smoke) / 2 (smoked before, but quit) / 3 (currently smokes) |
| DRK_YN | Whether or not the examinee's drinking status | N | 0,N (don't drink) / 1,Y (drinking) |
| HCHK_CE_IN | Whether or not the examinee chose oral examination. | N | 0,N (not tested)/1,Y (tested) |
| CRS_YN | Whether or not the examinee has dental caries | N | 0 (none) / 1 (present) |
| TTH_MSS_YN | Existence of missing teeth of the examinee | N | 0 (none) / 1 (present) |
| ODT_TRB_YN | Whether or not the examinee has denta... |
Facebook
TwitterThe purpose of the National Health Interview Survey (NHIS) is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. The National Health Interview Survey on Disability (NHIS-D), Phase I, was administered at the same time as the NHIS 1995 core, NATIONAL HEALTH INTERVIEW SURVEY, 1995 (ICPSR 2533). This questionnaire collected basic data on disability and was used as a screening device to determine eligibility for the second phase of the survey, the NHIS-D Disability Followback Survey. This supplement contains four fields: record type, year, person ID, and outcome code for adult and child. (Source: downloaded from ICPSR 7/13/10)
Please Note: This dataset is part of the historical CISER Data Archive Collection and is also available at ICPSR at https://doi.org/10.3886/ICPSR02540.v1. We highly recommend using the ICPSR version as they may make this dataset available in multiple data formats in the future.
Facebook
Twitterhttps://www.icpsr.umich.edu/web/ICPSR/studies/9839/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/9839/terms
The basic purpose of the National Health Interview Survey (NHIS) is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. Household variables in this data collection include type of living quarters, size of family, number of families in household, and geographic region. Person variables include sex, age, race, marital status, veteran status, education, income, industry and occupation codes, and limits on activity. The Condition, Doctor Visit, and Hospital files contain information on each reported health condition, doctor visit within a two-week period, and hospitalization (12-month recall), respectively.
Facebook
TwitterThe purpose of the National Health Interview Survey (NHIS) is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. This supplement contains edited and imputed data for the Income and Assets portion (Part D) of the 1994 NHIS Family Resources questionnaire. Other components of the Family Resources questionnaire cover Access to Care (Part A), Health Care Coverage (Part B), and Private Plan and Coverage Detail (Part C). The Income and Assets supplement contains variables from the NHIS core Person File (see NATIONAL HEALTH INTERVIEW SURVEY, 1994 (ICPSR 2533)), including sex, age, race, marital status, veteran status, education, income, industry and occupation codes, and limits on activity. Other items focus on employment, income from employment and businesses, other income sources including retirement and Social Security, and asset holdings such as cars, houses, businesses, and investment properties. Additional information on the receipt of income from public programs like Aid to Families with Dependent Children (AFDC), Supplemental Security Income (SSI), and food stamps is also included. (Source: downloaded from ICPSR 7/13/10)
Please Note: This dataset is part of the historical CISER Data Archive Collection and is also available at ICPSR at https://doi.org/10.3886/ICPSR02656.v1. We highly recommend using the ICPSR version as they may make this dataset available in multiple data formats in the future.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We provide the master data, working data, and codes to replicate the paper "The Rise in American Pain: The Importance of the Great Recession". Data was downloaded from IPUMS Health Surveys: NHIS (https://nhis.ipums.org/nhis/).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Odds ratios (ORs) for dementia according to systemic lupus erythematosus by dementia type, CCI and age group.
Facebook
TwitterThe purpose of the National Health Interview Survey (NHIS) is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. The 1992 Cancer Epidemiology Supplement includes variables from the NHIS core Person File (see NATIONAL HEALTH INTERVIEW SURVEY, 1992 (ICPSR 6343)), including sex, age, race, marital status, veteran status, education, income, industry and occupation codes, and limits on activity. Variables unique to this supplement include information on acculturation (in terms of language, ethnicity, and place of birth), dietary knowledge, smoking and tobacco use, and occupational history. The supplement also covers frequency and portion sizes of food and drink items, vitamin and mineral intake, hormone use, family's and respondent's history of cancer, and respondent's height and weight. (Source: downloaded from ICPSR 7/13/10)
Please Note: This dataset is part of the historical CISER Data Archive Collection and is also available at ICPSR at https://doi.org/10.3886/ICPSR06349.v1. We highly recommend using the ICPSR version as they may make this dataset available in multiple data formats in the future.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Characteristics in the seropositive RA and control groups.
Facebook
Twitterhttps://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de456959https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de456959
Abstract (en): The basic purpose of the National Health Interview Survey is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. There are five types of records in the core survey, each in a separate data file. The variables in the Household File (Part 1) include type of living quarters, size of family, number of families in household, and geographic region. The variables in the Person File (Part 2) include sex, age, race, marital status, veteran status, education, income, industry and occupation codes, and limits on activity. These variables are found in the Condition, Doctor Visit, and Hospital Episode Files as well. The Person File also supplies data on height, weight, bed days, doctor visits, hospital stays, years at residence, and region variables. The Condition (Part 3), Doctor Visit (Part 4), and the Hospital Episode (Part 5) Files contain information on each reported condition, two-week doctor visit, or hospitalization (twelve-month recall), respectively. A sixth, seventh, and eighth file have been added along with the five core files. The Health Promotions and Disease Prevention Supplement is separated into three categories as follows: Child Safety/Infant Feeding (Part 6), Sample Person (Part 7), and Smoking (Part 8). These data files include questions on health and fitness awareness, general health habits, injury control, child safety and health, high blood pressure, stress, exercise, smoking, alcohol use, dental care, and occupational safety and health. Detailed information regarding the use of weights is located within the documentation. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Created online analysis version with question text.. Civilian, non-institutionalized population of the United States and the District of Columbia from 1,900 geographically defined Primary Sampling Units (PSU). Starting in 1985, the NHIS multi-stage probability sampling design incorporates several major changes which facilitate linkages with other National Center for Health Statistics surveys, improve precision of estimates, and reduce costs. Starting with an all-area frame, a reduced number of 198 Primary Sampling Units (PSU) were selected, including two from each non-self representing stratum. Black persons were oversampled. Four independent representative samples were drawn which may be used in any combination. 2011-05-25 SAS, SPSS, and Stata setup files have been added. Some corresponding documentation has been updated and pre-existing data files have been replaced. The Health Promotion and Disease Prevention Supplement has been broken into three separate data files.2006-01-18 File CB8668.ALL.PDF was removed from any previous datasets and flagged as a study-level file, so that it will accompany all downloads. face-to-face interviewThe data contain amp (&), dash (-), and blank codes. In conjunction with the changes made to the core of NHIS in 1982, all 5 types of files have revised tape layouts. In general, identification items are at the beginning of each record followed by household and person information in the same location on each tape. Limitation of activity, acute conditions, disability days, and doctor visits should not be compared to data gathered before 1982. For the Health Promotions and Disease Prevention Supplement, one person was selected from each family interviewed and this sample person was a self-respondent to the supplement's questions.
Facebook
TwitterThe purpose of the National Health Interview Survey (NHIS) is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. This supplement includes variables from the NHIS core Person File (see NATIONAL HEALTH INTERVIEW SURVEY, 1993 [ICPSR 6534]), including sex, age, race, marital status, veteran status, education, income, industry and occupation codes, and limits on activity. The Year 2000 Supplement also contains items on nine selected topics that relate to the Department of Health and Human Services (HHS) Year 2000 Health Objectives: environmental health, tobacco use, nutrition, occupational safety and health, heart disease, other chronic and disabling conditions, clinical and preventive services, mental health, and oral health. Among the variables included in this supplement are type of residence, whether the home was built before 1950, whether household air was tested for radon, current smoking status, current activities to control weight, employer-sponsored exercise programs, amount of stress in the past year, and the effect of stress on health in the past year. Variables included in the sections on mental health and oral health include the amount of stress in the past two weeks and in the past year, total number of dental visits in the past 12 months, loss of teeth, and general health status.
Facebook
Twitterhttps://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de445957https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de445957
Abstract (en): The basic purpose of the National Health Interview Survey (NHIS) is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. The purpose of this supplement was to obtain current data concerning problems with the feet, access to foot care, and the amount, frequency, and type of foot care obtained. The file contains approximately 90 variables from the core file (see NATIONAL HEALTH INTERVIEW SURVEY, 1990 [ICPSR 9839]), including sex, age, race, marital status, veteran status, education, income, industry and occupation codes, and limits on activity. Variables unique to this supplement include toenail, infection, corn, and callus problems, flat feet or fallen arches, deformities, sprains, strains, or fractures, arthritis, length of time with and prevalence of the problem, health insurance that covered foot problems, the types of health professional seen for foot problems, and where the health professionals were seen. Civilian noninstitutionalized population of the 50 states and the District of Columbia. The NHIS uses a multistage probability sampling design. Four independent representative samples, which may be used in any combination, were drawn. Black persons were oversampled. Per agreement with NCHS, ICPSR distributes the data file(s) and technical documentation in this collection in their original form as prepared by NCHS. The age distribution is under 6 years (N = 11,685), 6-16 years (N = 19,793), 17-24 years (N = 12,995), 25-34 years (N = 20,020), 35-44 years (N = 17,995), 45-54 years (N = 12,266), 55-64 years (N = 10,299), 65-74 years (N = 8,769), and 75+ years (N = 5,809). The racial/ethnic distribution is White (N = 94,935), Black (N = 17,967), Asian/Pacific Islander (N = 2,815), Native American (N = 959), multiple (N = 131), and other or unknown (N = 2,824).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
EMDS 7 For Detectron2 is a dataset for object detection tasks - it contains Microorganisms annotations for 4,027 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Preview Dataset_detectron2 is a dataset for object detection tasks - it contains Pathogens annotations for 1,441 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterNHIS Adult Summary Health Statistics
Description
Interactive Summary Health Statistics for Adults provide annual estimates of selected health topics for adults aged 18 years and over based on final data from the National Health Interview Survey.
Dataset Details
Publisher: Centers for Disease Control and Prevention Temporal Coverage: 2019/2023 Last Modified: 2025-04-21 Contact: National Center for Health Statistics (cdcinfo@cdc.gov)
Source
Original… See the full description on the dataset page: https://huggingface.co/datasets/HHS-Official/nhis-adult-summary-health-statistics.