Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This heart disease dataset is acquired from one of the multispecialty hospitals in India. Over 14 common features make it one of the heart disease datasets available so far for research purposes. This dataset consists of 1000 subjects with 12 features. This dataset will be useful for building early-stage heart disease detection as well as for generating predictive machine-learning models.
| S.No | Attribute | Explain | Unit | Type of Data |
|---|---|---|---|---|
| 1 | Patient Identification Number | patientid | Numeric | Number |
| 2 | Age | age | Numeric | In Years |
| 3 | Gender | gender | Binary | 0 (female) / 1 (male) |
| 4 | Resting blood pressure | restingBP | Numeric | 94-200 (in mm HG) |
| 5 | Serum cholesterol | serumcholestrol | Numeric | 126-564 (in mg/dl) |
| 6 | Fasting blood sugar | fastingbloodsugar | Binary | 0 (false) / 1 (true) > 120 mg/dl |
| 7 | Chest pain type | chestpain | Nominal | 0 (typical angina), 1 (atypical angina), 2 (non-anginal pain), 3 (asymptomatic) |
| 8 | Resting electrocardiogram results | restingelectro | Nominal | 0 (normal), 1 (ST-T wave abnormality), 2 (probable or definite left ventricular hypertrophy) |
| 9 | Maximum heart rate achieved | maxheartrate | Numeric | 71-202 |
| 10 | Exercise induced angina | exerciseangina | Binary | 0 (no) / 1 (yes) |
| 11 | Oldpeak = ST | oldpeak | Numeric | 0-6.2 |
| 12 | Slope of the peak exercise ST segment | slope | Nominal | 1 (upsloping), 2 (flat), 3 (downsloping) |
| 13 | Number of major vessels | noofmajorvessels | Numeric | 0, 1, 2, 3 |
| 14 | Classification (target) | target | Binary | 0 (Absence of Heart Disease), 1 (Presence of Heart Disease) |
Health Sciences
Bhanu Prakash Doppala, Debnath Bhattacharyya
Institutions:Lincoln University College
Facebook
TwitterThe Heart Attack Risk Prediction Dataset serves as a valuable resource for delving into the intricate dynamics of heart health and its predictors. Heart attacks, or myocardial infarctions, continue to be a significant global health issue, necessitating a deeper comprehension of their precursors and potential mitigating factors. This dataset encapsulates a diverse range of attributes including age, cholesterol levels, blood pressure, smoking habits, exercise patterns, dietary preferences, and more, aiming to elucidate the complex interplay of these variables in determining the likelihood of a heart attack. By employing predictive analytics and machine learning on this dataset, researchers and healthcare professionals can work towards proactive strategies for heart disease prevention and management. The dataset stands as a testament to collective efforts to enhance our understanding of cardiovascular health and pave the way for a healthier future.
This synthetic dataset provides a comprehensive array of features relevant to heart health and lifestyle choices, encompassing patient-specific details such as age, gender, cholesterol levels, blood pressure, heart rate, and indicators like diabetes, family history, smoking habits, obesity, and alcohol consumption. Additionally, lifestyle factors like exercise hours, dietary habits, stress levels, and sedentary hours are included. Medical aspects comprising previous heart problems, medication usage, and triglyceride levels are considered. Socioeconomic aspects such as income and geographical attributes like country, continent, and hemisphere are incorporated. The dataset, consisting of 8763 records from patients around the globe, culminates in a crucial binary classification feature denoting the presence or absence of a heart attack risk, providing a comprehensive resource for predictive analysis and research in cardiovascular health.
https://i.imgur.com/5cTusqA.png" alt="">
This dataset is a synthetic creation generated using ChatGPT to simulate a realistic experience. Its purpose is to provide a platform for beginners and data enthusiasts, allowing them to create, enjoy, practice, and learn from a dataset that mirrors real-world scenarios. The aim is to foster learning and experimentation in a simulated environment, encouraging a deeper understanding of data analysis and interpretation.
Cover Photo by: brgfx on Freepik
Thumbnail by: vectorjuice on Freepik
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Cardiovascular diseases (CVDs) are the leading cause of death in India, with heart attacks (myocardial infarctions) accounting for a significant portion. India has a higher heart disease burden than many other nations, with cases occurring at younger ages compared to Western countries. This dataset incorporates key medical and lifestyle risk factors such as diabetes, hypertension, obesity, smoking, air pollution exposure, and healthcare access.
With a diverse representation across India's states, the dataset reflects the urban-rural disparity in healthcare, lifestyle patterns, and emergency response times. It can be used for predictive modeling, machine learning applications, epidemiological research, and policy analysis to improve early detection and intervention strategies for heart disease.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This heart disease dataset is curated by combining 3 popular heart disease datasets. The first dataset (Collected from Kaggle) contains 70000 records with 11 independent features which makes it the largest heart disease dataset available so far for research purposes. These data were collected at the moment of medical examination and information given by the patient. Second and third datasets contain 303 and 293 intstances respectively with 13 common features. The three datasets used for its curation are:Cardio Data (Kaggle Dataset)
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This synthetic dataset is designed to predict the risk of heart disease based on a combination of symptoms, lifestyle factors, and medical history. Each row in the dataset represents a patient, with binary (Yes/No) indicators for symptoms and risk factors, along with a computed risk label indicating whether the patient is at high or low risk of developing heart disease.
The dataset contains 70,000 samples, making it suitable for training machine learning models for classification tasks. The goal is to provide researchers, data scientists, and healthcare professionals with a clean and structured dataset to explore predictive modeling for cardiovascular health.
This dataset is a side project of EarlyMed, developed by students of Vellore Institute of Technology (VIT-AP). EarlyMed aims to leverage data science and machine learning for early detection and prevention of chronic diseases.
chest_pain): Presence of chest pain, a common symptom of heart disease.shortness_of_breath): Difficulty breathing, often associated with heart conditions.fatigue): Persistent tiredness without an obvious cause.palpitations): Irregular or rapid heartbeat.dizziness): Episodes of lightheadedness or fainting.swelling): Swelling due to fluid retention, often linked to heart failure.radiating_pain): Radiating pain, a hallmark of angina or heart attacks.cold_sweats): Symptoms commonly associated with acute cardiac events.age): Patient's age in years (continuous variable).hypertension): History of hypertension (Yes/No).cholesterol_high): Elevated cholesterol levels (Yes/No).diabetes): Diagnosis of diabetes (Yes/No).smoker): Whether the patient is a smoker (Yes/No).obesity): Obesity status (Yes/No).family_history): Family history of cardiovascular conditions (Yes/No).risk_label): Binary label indicating the risk of heart disease:
0: Low risk1: High riskThis dataset was synthetically generated using Python libraries such as numpy and pandas. The generation process ensured a balanced distribution of high-risk and low-risk cases while maintaining realistic correlations between features. For example:
- Patients with multiple risk factors (e.g., smoking, hypertension, and diabetes) were more likely to be labeled as high risk.
- Symptom patterns were modeled after clinical guidelines and research studies on heart disease.
The design of this dataset was inspired by the following resources:
This dataset can be used for a variety of purposes:
Machine Learning Research:
Healthcare Analytics:
Educational Purposes:
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This heart disease dataset is curated by combining 5 popular heart disease datasets already available independently but not combined before. In this dataset
Facebook
TwitterThe "Framingham" heart disease dataset includes over 4,240 records,16 columns and 15 attributes. The goal of the dataset is to predict whether the patient has 10-year risk of future (CHD) coronary heart disease
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Adaptation of http://archive.ics.uci.edu/ml/datasets/Heart+Disease
Ready for usage with ehrapy
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description:
This dataset contains information related to individuals and their risk factors for heart disease. The data includes demographic information such as age and gender, as well as medical history, lifestyle factors, and symptoms associated with heart disease. The target variable indicates whether an individual has been diagnosed with heart disease or not.
Column Descriptions:
Age: Age of the individual (years). Gender: Gender of the individual (Male/Female). Cholesterol: Cholesterol level in mg/dL. Blood Pressure: Systolic blood pressure in mmHg. Heart Rate: Heart rate in beats per minute. Smoking: Smoking status (Never/Former/Current). Alcohol Intake: Alcohol intake frequency (None/Moderate/Heavy). Exercise Hours: Hours of exercise per week. Family History: Family history of heart disease (Yes/No). Diabetes: Diabetes status (Yes/No). Obesity: Obesity status (Yes/No). Stress Level: Stress level on a scale of 1 to 10. Blood Sugar: Fasting blood sugar level in mg/dL. Exercise Induced Angina: Presence of exercise-induced angina (Yes/No). Chest Pain Type: Type of chest pain experienced (Typical Angina/Atypical Angina/Non-anginal Pain/Asymptomatic). Heart Disease: Target variable indicating presence of heart disease (0: No, 1: Yes).
Facebook
TwitterBy Kuzak Dempsy [source]
This dataset contains detailed information on the risk factors for cardiovascular disease. It includes information on age, gender, height, weight, blood pressure values, cholesterol levels, glucose levels, smoking habits and alcohol consumption of over 70 thousand individuals. Additionally it outlines if the person is active or not and if he or she has any cardiovascular diseases. This dataset provides a great resource for researchers to apply modern machine learning techniques to explore the potential relations between risk factors and cardiovascular disease that can ultimately lead to improved understanding of this serious health issue and design better preventive measures
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset can be used to explore the risk factors of cardiovascular disease in adults. The aim is to understand how certain demographic factors, health behaviors and biological markers affect the development of heart disease.
To start, look through the columns of data and familiarize yourself with each one. Understand what each field means and how it relates to heart health: - Age: Age of participant (integer) - Gender: Gender of participant (male/female). - Height: Height measured in centimeters (integer) - Weight: Weight measured in kilograms (integer) - Ap_hi: Systolic blood pressure reading taken from patient (integer) - Ap_lo : Diastolic blood pressure reading taken from patient (integer) - Cholesterol : Total cholesterol level read as mg/dl on a scale 0 - 5+ units( integer). Each unit denoting increase/decrease by 20 mg/dL respectively.
‐ Gluc : Glucose level read as mmol/l on a scale 0 - 16+ units( integer). Each unit denoting increase Decreaseby 1 mmol/L respectively. ‐ Smoke : Whether person smokes or not(binary; 0= No , 1=Yes). ‐ Alco : Whether person drinks alcohol or not(binary; 0 =No ,1 =Yes ). • Active : whether person physically active or not( Binary ;0 =No,1 = Yes ). . Cardio : whether person suffers from cardiovascular diseases or not(Binary ;0 – no , 1 ‑yes ).Identify any trends between the different values for each attribute and the developmetn for cardiovascular disease among individuals represented by this dataset . Age, gender, weight, lifestyle practices like smoking & drinking alcohol are all key influences when analyzing this problem set. You can always modify pieces of your analysis until you're able to find patterns that will enable you make conclusions based on your understanding & exploration. You can further enrich your understanding using couple mopdeling technique like Regressions & Classification models over this dataset alongwith latest Deep Learning approach! Have Fun!
- Analyzing the effect of lifestyle and environmental factors on the risk of cardiovascular disease.
- Predicting the risks of different age groups based on their demographic characteristics such as gender, height, weight and smoking status.
- Detecting patterns between levels of physical activity, blood pressure and cholesterol levels with likelihood of developing cardiovascular disease among individuals
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: heart_data.csv | Column name | Description | |:----------------|:---------------------------------------------------------| | age | Age of the individual. (Integer) | | gender | Gender of the individual. (String) | | height | Height of the individual in centimeters. (Integer) | | weight | Weight of the individual in kilograms. (Integer) | | ap_hi | Systolic blood pressure reading. (Integer) | | ap_lo | Diastolic blood pressure reading. (Integer) | | cholesterol | Cholesterol level of the individual. (Integer) | | gluc |...
Facebook
TwitterOpen Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Cardiovascular diseases (CVDs) are the number 1 cause of death globally, taking an estimated 17.9 million lives each year, which accounts for 31% of all deaths worldwide. Four out of 5CVD deaths are due to heart attacks and strokes, and one-third of these deaths occur prematurely in people under 70 years of age. Heart failure is a common event caused by CVDs and this dataset contains 11 features that can be used to predict a possible heart disease.
People with cardiovascular disease or who are at high cardiovascular risk (due to the presence of one or more risk factors such as hypertension, diabetes, hyperlipidaemia or already established disease) need early detection and management wherein a machine learning model can be of great help.
This dataset was created by combining different datasets already available independently but not combined before. In this dataset, 5 heart datasets are combined over 11 common features which makes it the largest heart disease dataset available so far for research purposes. The five datasets used for its curation are:
Total: 1190 observations Duplicated: 272 observations
Final dataset: 918 observations
Every dataset used can be found under the Index of heart disease datasets from UCI Machine Learning Repository on the following link: https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
fedesoriano. (September 2021). Heart Failure Prediction Dataset. Retrieved [Date Retrieved] from https://www.kaggle.com/fedesoriano/heart-failure-prediction.
Creators:
Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains medical data used for predicting heart disease. The data includes various attributes such as age, sex, chest pain type (cp), resting blood pressure (trestbps), cholesterol (chol), fasting blood sugar (fbs), resting electrocardiographic results (restecg), maximum heart rate achieved (thalach), exercise-induced angina (exang), and ST depression induced by exercise relative to rest (oldpeak).
age: Age of the patient (in years) sex: Sex of the patient (1 = male, 0 = female) cp: Chest pain type (1-4) trestbps: Resting blood pressure (in mm Hg on admission to the hospital) chol: Serum cholesterol in mg/dl fbs: Fasting blood sugar > 120 mg/dl (1 = true; 0 = false) restecg: Resting electrocardiographic results (0-2) thalach: Maximum heart rate achieved exang: Exercise-induced angina (1 = yes; 0 = no) oldpeak: ST depression induced by exercise relative to rest
Facebook
TwitterThis dataset was created by Rithik Kotha
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The Heart Attack Analysis & Prediction Dataset is a dataset used for research and analysis in the field of cardiovascular health. It typically contains various attributes related to an individual's health and lifestyle, along with an indicator of whether the individual experienced a heart attack or not.
Here are some common attributes found in such datasets:
Age: The age of the individual. **Sex: **The gender of the individual. Chest Pain Type: The type of chest pain experienced by the individual. Resting Blood Pressure: The resting blood pressure of the individual (in mm Hg). **Cholesterol: **The cholesterol levels of the individual (in mg/dL). **Fasting Blood Sugar: **The fasting blood sugar level of the individual (> 120 mg/dL is considered high). **Resting Electrocardiographic Results (ECG): **Results of the resting electrocardiogram. Maximum Heart Rate Achieved: The maximum heart rate achieved by the individual during exercise. **Exercise Induced Angina: **Whether the individual experienced angina during exercise. **ST Depression Induced by Exercise: **The ST depression induced by exercise relative to rest. **Slope of the Peak Exercise ST Segment: **The slope of the peak exercise ST segment. **Number of Major Vessels Colored by Fluoroscopy: **The number of major blood vessels colored by fluoroscopy. **Thalassemia: **A blood disorder; different types of thalassemia might be represented in the dataset. **Target: **Whether the individual had a heart attack or not (typically represented as binary: 0 for no heart attack, 1 for heart attack).
This dataset is often used for building machine learning models to predict the likelihood of a person having a heart attack based on their health attributes. Researchers and healthcare professionals use such models to better understand risk factors and develop preventive strategies for cardiovascular diseases.
Hone your analytical and ML skills by participating in tasks of my other dataset's. Given below. Data Science Job Posting on Glassdoor
Groceries dataset for Market Basket Analysis(MBA)
Dataset for Facial recognition using ML approach
Covid_w/wo_Pneumonia Chest Xray
Disney Movies 1937-2016 Gross Income
Bollywood Movie data from 2000 to 2019
17.7K English song data from 2008-2017
About this dataset Age : Age of the patient
Sex : Sex of the patient
exang: exercise induced angina (1 = yes; 0 = no)
ca: number of major vessels (0-3)
cp : Chest Pain type chest pain type
Value 1: typical angina Value 2: atypical angina Value 3: non-anginal pain Value 4: asymptomatic trtbps : resting blood pressure (in mm Hg)
chol : cholestoral in mg/dl fetched via BMI sensor
fbs : (fasting blood sugar > 120 mg/dl) (1 = true; 0 = false)
rest_ecg : resting electrocardiographic results
Value 0: normal Value 1: having ST-T wave abnormality (T wave inversions and/or ST elevation or depression of > 0.05 mV) Value 2: showing probable or definite left ventricular hypertrophy by Estes' criteria thalach : maximum heart rate achieved
target : 0= less chance of heart attack 1= more chance of heart attack
n
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains records related to the diagnosis of heart disease in patients. It includes various clinical and non-clinical attributes used to determine the presence or absence of heart disease. The data was collected from Kaggle, and it consists of 918 entries with 11 features. Columns cover aspects like patient demographics, vital signs, symptoms, and potential risk factors associated with heart conditions. Prior to analysis, preprocessing steps. This dataset can be utilized for exploratory data analysis, predictive modeling, and uncovering patterns contributing to heart disease diagnosis.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
PLEASE UPVOTE Try some basic EDA on this dataset and try simpler model on this dataset and post your work.
Features:
Age | Objective Feature | age | int (days)
Height | Objective Feature | height | int (cm) |
Weight | Objective Feature | weight | float (kg) |
Gender | Objective Feature | gender | categorical code |
Systolic blood pressure | Examination Feature | ap_hi | int |
Diastolic blood pressure | Examination Feature | ap_lo | int |
Cholesterol | Examination Feature | cholesterol | 1: normal, 2: above normal, 3: well above normal |
Glucose | Examination Feature | gluc | 1: normal, 2: above normal, 3: well above normal |
Smoking | Subjective Feature | smoke | binary |
Alcohol intake | Subjective Feature | alco | binary |
Physical activity | Subjective Feature | active | binary |
Presence or absence of cardiovascular disease | Target Variable | cardio | binary |
Facebook
TwitterThis dataset was created by Abel Tesfaye Sisaye
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains information related to cardiac health, with 1,025 data entries. Each entry represents a patient and includes the following attributes:
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The heart attack datasets were collected at Zheen hospital in Erbil, Iraq, from January 2019 to May 2019. The attributes of this dataset are: age, gender, heart rate, systolic blood pressure, diastolic blood pressure, blood sugar, ck-mb and troponin with negative or positive output.
Feature Engineering, EDA, and Prediction in this Notebook: EDA and Heart Attack Prediction | 99% Accuracy
According to the provided information, the medical dataset classifies either heart attack or none. The gender column in the data is normalized: the male is set to 1 and the female to 0. The glucose column is set to 1 if it is > 120; otherwise, 0. As for the output, positive is set to 1 and negative to 0.
The dataset has 9 column: Age: The patient's age
Gender: Biological sex of the patient (The male is set to 1 and the female to 0)
Heart Rate: The number of heartbeats per minute
Systolic Blood Pressure: The pressure in arteries when the heart contracts
Diastolic Blood Pressure: The pressure in arteries between heartbeats
Blood Sugar: The patient's blood glucose level
Ck-mb: A cardiac enzyme released during heart muscle damage
Troponin:A highly specific protein biomarker for heart muscle injury
Result: The outcome label indicating whether or not the patient experienced a heart attack
Institutions University of Kurdistan Hewler
Categories Medicine, Heart Disease
DOI: Rashid, Tarik A.; Hassan, Bryar (2022), “Heart Attack Dataset”, Mendeley Data, V1, doi: 10.17632/wmhctcrt5v.1
Facebook
TwitterThe Data set consists of 1200 records of Cardiovascular ECGs where each of the 300 records belongs to one ailment, in such a way 4 ailments have been considered. The original signals are taken from the MIT-BIH physio-net Database. One ailment is the MIT-BIH Arrhythmia Database, the other is BIDMC Congestive Heart Failure Database and MIT-BIH Atrial Fibrillation Database and finally MIT-BIH Normal Sinus Rhythm Database. From these four databases, ECG records have been segmented at 4120 samples each forming 300 signals. They are normalized with mentioned gain for each database and are preprocessed with bandpass filters. MODWPT technique was used to obtain 54 features that are given as columns in .csv file that is uploaded here. So the file has 1200 x 54 size records. Note:: Missing values have to be handled according to your application.
ACKNOWLEDGEMENT Please credit the authors if you use this dataset file in your research.
Citation:
Alekhya, L., and P. Rajesh Kumar, "A new approach to detect cardiovascular diseases using ECG scalograms and ML-based CNN algorithm." Mar 20, 2023. International Journal of Computational Vision and Robotics/Inderscience publishers. DOI: 10.1504/IJCVR.2022.10051429 Link: https://www.inderscience.com/info/ingeneral/forthcoming.php?jcode=IJCVR
Alekhya, L., and P. Rajesh Kumar. "A Novel Application for Autonomous Detection of Cardiac Ailments using ECG Scalograms with Alex Net Convolution Neural Network." Design Engineering (2021): 13176-13189. Link: http://www.thedesignengineering.com/index.php/DE/article/view/6434
Autonomous Detection of Cardiac Ailments using Long-short term Memory Model based on Electrocardiogram signals, L. Alekhya, P. Rajesh Kumar, A. Venkata Sriram DOI: 10.14704/nq.2022.20.7.NQ33431. Pages: 3509 - 3518. Link: https://www.neuroquantology.com/open-access/Autonomous+Detection+of+Cardiac+Ailments+using+Longshort+term+Memory+Model+based+on+Electrocardiogram+signals_5781/
Autonomous Detection of Cardia Ailments diagnosed by Electrocardiogram using various Supervised Machine Learning AlgorithmsAutonomous Detection of Cardia Ailments diagnosed by Electrocardiogram using various Supervised Machine Learning Algorithms AMA, Agricultural Mechanization in Asia, Africa and Latin America (ISSN: 00845841) · Sep 18, 2021. Link: https://www.shin-norinco.com/article/autonomous-detection-of-cardia-ailments-diagnosed-by-electrocardiogram-using-various-supervised-machine-learning-algorithms
L Alekhya, P Rajesh Kumar, “Maximal Overlap Discrete Wavelet Packet Transform Based Characteristic waves detection in Electrocardiogram of Cardiovascular Diseases”, INTERNATIONAL JOURNAL OF SPECIAL EDUCATION, vol 36 (1), pp 51-61, 2021.
License License was not specified at the source, yet access to the data is public and a citation was requested.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This heart disease dataset is acquired from one of the multispecialty hospitals in India. Over 14 common features make it one of the heart disease datasets available so far for research purposes. This dataset consists of 1000 subjects with 12 features. This dataset will be useful for building early-stage heart disease detection as well as for generating predictive machine-learning models.
| S.No | Attribute | Explain | Unit | Type of Data |
|---|---|---|---|---|
| 1 | Patient Identification Number | patientid | Numeric | Number |
| 2 | Age | age | Numeric | In Years |
| 3 | Gender | gender | Binary | 0 (female) / 1 (male) |
| 4 | Resting blood pressure | restingBP | Numeric | 94-200 (in mm HG) |
| 5 | Serum cholesterol | serumcholestrol | Numeric | 126-564 (in mg/dl) |
| 6 | Fasting blood sugar | fastingbloodsugar | Binary | 0 (false) / 1 (true) > 120 mg/dl |
| 7 | Chest pain type | chestpain | Nominal | 0 (typical angina), 1 (atypical angina), 2 (non-anginal pain), 3 (asymptomatic) |
| 8 | Resting electrocardiogram results | restingelectro | Nominal | 0 (normal), 1 (ST-T wave abnormality), 2 (probable or definite left ventricular hypertrophy) |
| 9 | Maximum heart rate achieved | maxheartrate | Numeric | 71-202 |
| 10 | Exercise induced angina | exerciseangina | Binary | 0 (no) / 1 (yes) |
| 11 | Oldpeak = ST | oldpeak | Numeric | 0-6.2 |
| 12 | Slope of the peak exercise ST segment | slope | Nominal | 1 (upsloping), 2 (flat), 3 (downsloping) |
| 13 | Number of major vessels | noofmajorvessels | Numeric | 0, 1, 2, 3 |
| 14 | Classification (target) | target | Binary | 0 (Absence of Heart Disease), 1 (Presence of Heart Disease) |
Health Sciences
Bhanu Prakash Doppala, Debnath Bhattacharyya
Institutions:Lincoln University College