In Korea, everyone is required to join the National Health Insurance. The National Health Insurance Service (NHIS), which manages the national health insurance, provides basic health checkups to subscribers every year.
This dataset is a random sample of 1 million people per year, from 2002 to 2020, drawn from those who underwent the basic health checkups provided by the NHIS.
Missing values correspond to optional tests that the individual chose not to take.
This dataset consists of 19 CSV files, one per year, and each CSV file contains only the health checkup results for that year.
The features differ by year: some were excluded or added in particular years.
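Because the columns differ between yearly files, combining them requires taking the union of the column names. The following is a minimal sketch using only the Python standard library. The function name `merge_yearly_rows`, the in-memory sample rows, and the choice of which column is absent in the earlier file are all illustrative assumptions, not properties of the actual files.

```python
import csv
import io


def merge_yearly_rows(sources):
    """Combine per-year CSV tables whose columns may differ.

    `sources` is an iterable of open text file objects, one per year.
    Returns (columns, rows): the union of all column names in first-seen
    order, and a list of dicts in which absent or empty fields are None.
    """
    columns = []
    rows = []
    for handle in sources:
        reader = csv.DictReader(handle)
        for name in reader.fieldnames or []:
            if name not in columns:
                columns.append(name)
        rows.extend(dict(row) for row in reader)
    # Give every row every column; empty strings become None.
    merged = [
        {name: (row.get(name) or None) for name in columns}
        for row in rows
    ]
    return columns, merged


# Tiny in-memory stand-ins for two yearly files (all values are made up).
early = io.StringIO("YEAR,IDV_ID,SEX,HEIGHT\n2002,1,1,170\n")
later = io.StringIO("YEAR,IDV_ID,SEX,HEIGHT,LDL_CHOLE\n2014,1,2,160,110\n")
columns, merged = merge_yearly_rows([early, later])
print(columns)
print(merged[0]["LDL_CHOLE"], merged[1]["LDL_CHOLE"])
```

With real files, the same function could be passed handles opened with `open(path, newline="")`. Values are kept as strings here so that type conversion remains an explicit later step.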
Difference in AREA_CODE
After 2012, a new area, 'SEAJONG', was designated and a new area code, 36, was added.
Categorization differences in AGE_GROUP
The age categorization criteria differ between the 2002~2013 data and the data from 2014 onward.
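Since the same AGE_GROUP code can mean different age bands before and after the change, it may help to check codes against the range valid for their base year before comparing across years. The sketch below uses only the code ranges given in the column description (1~14 through 2013, 1~18 from 2014). The function names are hypothetical, and the mapping from codes to actual ages is not reproduced here.

```python
def age_group_code_range(year):
    """Return the inclusive (low, high) AGE_GROUP code range for a base year."""
    if year <= 2013:
        return (1, 14)
    return (1, 18)


def is_valid_age_group(year, code):
    """Check that an AGE_GROUP code is within the range used in that year."""
    low, high = age_group_code_range(year)
    return low <= code <= high


print(is_valid_age_group(2013, 15))
print(is_valid_age_group(2014, 15))
```

A code that passes this check is only valid within its own period. Codes from the two periods should not be pooled without first mapping them to a common set of age bands.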
A description of each column is as follows.
Feature name | Description | Format / unit | Range |
---|---|---|---|
YEAR | Base year of the information | YYYY | 2002~2020 |
IDV_ID | Serial number assigned to subscriber | N | 1~1,000,000 |
AREA_CODE | Residence area code of the examinee | N | |
SEX | Gender | N | 1: male, 2: female |
AGE_GROUP | A code that groups the examinee's age into 5-year bands as of the base year. Refer to the table below for details. | N | 2002~2013: 1~14, 2014~: 1~18 |
HEIGHT | Examinee's height (in units of 5 cm) | N/cm | |
WEIGHT | Examinee's weight (in units of 5 kg) | N/kg | |
WAIST | Examinee's waist circumference | N/cm | |
SIGHT_LEFT | Eyesight of the examinee's left eye | N | 0.1~2.5 (eyesight below 0.1 is recorded as 0.1; blindness is recorded as 9.9) |
SIGHT_RIGHT | Eyesight of the examinee's right eye | N | 0.1~2.5 (eyesight below 0.1 is recorded as 0.1; blindness is recorded as 9.9) |
BP_HIGH | Systolic blood pressure of the examinee | N/mmHg | |
BP_LWST | Diastolic blood pressure of the examinee | N/mmHg | |
BLDS | Fasting blood glucose of the examinee: the concentration of glucose per 100 ml of blood | N/mg/dL | |
TOT_CHOLE | Sum of ester and non-ester cholesterol in serum. Normal values are 150 to 250 mg/dL. | N/mg/dL | |
TRIGLYCERIDE | Amount of simple lipids or neutral lipids. Normal values are 30 to 135 mg/dL. | N/mg/dL | |
HDL_CHOLE | The amount of cholesterol contained in HDL. Normal values are 30 to 65 mg/dL. | N/mg/dL | |
LDL_CHOLE | The amount of cholesterol contained in LDL. A value of 170 mg/dL or higher is diagnosed as hyper-LDL cholesterolemia. | N/mg/dL | |
CREATININE | Serum concentration of creatinine, which is formed by the dehydration of creatine. Increases and decreases in creatinine are related not to food but to muscle development and exercise. Normal values are 0.8 to 1.7 mg/dL. | N/mg/dL | |
HMG | Hemoglobin, a pigment protein present in blood and blood cells, composed of globin and heme, which acts as an oxygen carrier in the blood. | N/g/dL | |
OLIG_PROTE_CD | Excretion of protein in the urine | N | 1(-), 2(±), 3(+1), 4(+2), 5(+3), 6(+4) |
SGOT_AST | AST level in blood tests, an indicator of liver function. The concentration increases when liver, heart, kidney, brain, or muscle cells are damaged. Normal values are 0 to 40 IU/L. | N/IU/L | |
SGPT_ALT | ALT level in blood tests, an indicator of liver function. ALT exists mainly in hepatocytes, and its concentration increases when hepatocytes are damaged. Normal values are 0 to 40 IU/L. | N/IU/L | |
GAMMA_GTP | Gamma GTP level in blood tests, an indicator of liver function. Gamma GTP is an enzyme mainly present in the bile ducts of the liver, and its blood concentration increases when a bile excretion disorder or hepatocellular disorder occurs. Normal values are 11 to 64 IU/L for men and 8 to 35 IU/L for women. | N/IU/L | |
SMK_STAT | The examinee's smoking status | N | 1 (don't smoke) / 2 (smoked before, but quit) / 3 (currently smokes) |
DRK_YN | Whether or not the examinee drinks | N | 0,N (don't drink) / 1,Y (drinking) |
HCHK_CE_IN | Whether or not the examinee chose the oral examination | N | 0,N (not tested) / 1,Y (tested) |
CRS_YN | Whether or not the examinee has dental caries | N | 0 (none) / 1 (present) |
TTH_MSS_YN | Whether or not the examinee has missing teeth | N | 0 (none) / 1 (present) |
ODT_TRB_YN | Whether or not the examinee has denta... |
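
The coded columns in the table above can be decoded with small lookup tables. The following is a sketch under stated assumptions: values arrive as strings read from CSV, codes are stored as plain integers such as "1", and the helper names (`parse_flag`, `parse_sight`, `outside_normal_range`) are hypothetical. The normal ranges are the ones quoted in the table. GAMMA_GTP is omitted because its range depends on sex.

```python
# Code tables copied from the column description above.
SEX = {"1": "male", "2": "female"}
SMK_STAT = {"1": "does not smoke", "2": "smoked before, but quit", "3": "currently smokes"}
OLIG_PROTE_CD = {"1": "-", "2": "±", "3": "+1", "4": "+2", "5": "+3", "6": "+4"}

# Normal ranges quoted in the column description (inclusive bounds).
NORMAL_RANGES = {
    "TOT_CHOLE": (150, 250),
    "TRIGLYCERIDE": (30, 135),
    "HDL_CHOLE": (30, 65),
    "CREATININE": (0.8, 1.7),
    "SGOT_AST": (0, 40),
    "SGPT_ALT": (0, 40),
}


def parse_flag(value):
    """Map '0'/'N' to False and '1'/'Y' to True; empty means not recorded."""
    if value is None or value == "":
        return None
    if value in ("0", "N"):
        return False
    if value in ("1", "Y"):
        return True
    raise ValueError(f"unexpected flag value: {value!r}")


def parse_sight(value):
    """Return (acuity, is_blind); the sentinel 9.9 marks blindness."""
    if value is None or value == "":
        return (None, False)
    acuity = float(value)
    if acuity == 9.9:
        return (None, True)
    return (acuity, False)


def outside_normal_range(column, value):
    """True if the value falls outside the quoted normal range for the column."""
    low, high = NORMAL_RANGES[column]
    return not (low <= float(value) <= high)


# A made-up row for illustration only.
row = {"SEX": "2", "SMK_STAT": "1", "DRK_YN": "Y", "SIGHT_LEFT": "9.9", "TOT_CHOLE": "262"}
print(SEX[row["SEX"]], SMK_STAT[row["SMK_STAT"]], parse_flag(row["DRK_YN"]))
print(parse_sight(row["SIGHT_LEFT"]), outside_normal_range("TOT_CHOLE", row["TOT_CHOLE"]))
```

Treating 9.9 as a separate blindness flag keeps the sentinel from distorting averages of the eyesight columns, and accepting both `0`/`1` and `N`/`Y` covers the two spellings listed for the yes/no columns.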
The global number of Facebook users was forecast to increase continuously between 2023 and 2027 by a total of 391 million users (+14.36 percent). After the fourth consecutive year of growth, the Facebook user base is estimated to reach 3.1 billion users in 2027, a new peak. The number of Facebook users had also increased continuously over the preceding years. User figures, shown here for the platform Facebook, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts held by one person only once. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press, and they are processed to generate comparable data sets (see supplementary notes under details for more information).
The number of YouTube users in Africa was forecast to increase continuously between 2024 and 2029 by a total of 0.03 million users (+3.95 percent). The YouTube user base is estimated to amount to 0.79 million users in 2029. User figures, shown here for the platform YouTube, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press, and they are processed to generate comparable data sets (see supplementary notes under details for more information).
Facebook received 73,390 user data requests from federal agencies and courts in the United States during the second half of 2023. The social network produced some user data in 88.84 percent of requests from U.S. federal authorities. The United States accounts for the largest share of Facebook user data requests worldwide.
The global number of KakaoTalk users was forecast to decrease between 2024 and 2028 by a total of 0.7 million users. The decrease is not continuous; notably, it does not occur in 2026 and 2027. The KakaoTalk user base is estimated to amount to 48.7 million users in 2028. Notably, the number of KakaoTalk users had been increasing continuously over the past years. User figures, shown here for the platform KakaoTalk, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts held by one person only once. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press, and they are processed to generate comparable data sets (see supplementary notes under details for more information).