100+ datasets found

Heart Disease Prediction Dataset
kaggle.com
Updated Sep 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
M. Farhaan Nazirkhan (2024). Heart Disease Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/mfarhaannazirkhan/heart-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 27, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
M. Farhaan Nazirkhan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Heart Disease Prediction Dataset

This dataset contains 1,888 records merged from five publicly available heart disease datasets. It includes 14 features that are crucial for predicting heart attack and stroke risks, covering both medical and demographic factors. Below is a detailed description of each feature.

Feature Descriptions:

age: Age of the patient (Numeric).

sex: Gender of the patient. Values: 1 = male, 0 = female.

cp: Chest pain type. Values: 0 = Typical angina, 1 = Atypical angina, 2 = Non-anginal pain, 3 = Asymptomatic.

trestbps: Resting Blood Pressure (in mm Hg) (Numeric).

chol: Serum Cholesterol level (in mg/dl) (Numeric).

fbs: Fasting blood sugar > 120 mg/dl. Values: 1 = true, 0 = false.

restecg: Resting electrocardiographic results. Values: 0 = Normal, 1 = ST-T wave abnormality, 2 = Left ventricular hypertrophy.

thalach: Maximum heart rate achieved (Numeric).

exang: Exercise-induced angina. Values: 1 = yes, 0 = no.

oldpeak: ST depression induced by exercise relative to rest (Numeric).

slope: Slope of the peak exercise ST segment. Values: 0 = Upsloping, 1 = Flat, 2 = Downsloping.

ca: Number of major vessels (0-3) colored by fluoroscopy. Values: 0, 1, 2, 3.

thal: Thalassemia types. Values: 1 = Normal, 2 = Fixed defect, 3 = Reversible defect.

target: Outcome variable (heart attack risk). Values: 1 = more chance of heart attack, 0 = less chance of heart attack.

Dataset Details:

This dataset is a combination of five publicly available heart disease datasets, with a total of 1,888 records. Merging these datasets provides a more robust foundation for training machine learning models aimed at predicting heart attack risk.

Datasets Used:

Heart Attack Analysis & Prediction Dataset
Number of Records: 304
Reference: Rahman, 2021

Heart Disease Dataset
Number of Records: 1,026
Reference: Lapp, 2019

Heart Attack Prediction (Dataset 3)
Number of Records: 295
Reference: Damarla, 2020

Heart Attack Prediction (Dataset 4)
Number of Records: 271
Reference: Anand, 2018

Heart CSV Dataset
Number of Records: 290
Reference: Nandal, 2022

Description:

This dataset includes 14 features known to contribute to heart attack risk. It is ideal for training machine learning models aimed at early detection and prevention of heart disease. The records have been cleaned by removing missing data to ensure data integrity. This dataset can be applied to various machine learning algorithms, including classification models such as Decision Trees, Neural Networks, and others.
i
Heart Disease Dataset (Comprehensive)
ieee-dataport.org
Updated Jan 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MANU SIDDHARTHA (2025). Heart Disease Dataset (Comprehensive) [Dataset]. https://ieee-dataport.org/open-access/heart-disease-dataset-comprehensive
Explore at:
Dataset updated
Jan 1, 2025
Authors
MANU SIDDHARTHA
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This heart disease dataset is curated by combining 5 popular heart disease datasets already available independently but not combined before. In this dataset
i
Cardiovascular Disease Dataset
ieee-dataport.org
Updated Oct 25, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rajib Kumar Halder Halder (2022). Cardiovascular Disease Dataset [Dataset]. https://ieee-dataport.org/documents/cardiovascular-disease-dataset
Explore at:
Dataset updated
Oct 25, 2022
Authors
Rajib Kumar Halder Halder
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This heart disease dataset is curated by combining 3 popular heart disease datasets. The first dataset (Collected from Kaggle) contains 70000 records with 11 independent features which makes it the largest heart disease dataset available so far for research purposes. These data were collected at the moment of medical examination and information given by the patient. Second and third datasets contain 303 and 293 intstances respectively with 13 common features. The three datasets used for its curation are:Cardio Data (Kaggle Dataset)
m
Cardiovascular_Disease_Dataset
data.mendeley.com
Updated Apr 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bhanu Prakash Doppala (2021). Cardiovascular_Disease_Dataset [Dataset]. http://doi.org/10.17632/dzz48mvjht.1
Explore at:
Unique identifier
https://doi.org/10.17632/dzz48mvjht.1
Dataset updated
Apr 16, 2021
Authors
Bhanu Prakash Doppala
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This heart disease dataset is acquired from one o f the multispecialty hospitals in India. Over 14 common features which makes it one of the heart disease dataset available so far for research purposes. This dataset consists of 1000 subjects with 12 features. This dataset will be useful for building a early-stage heart disease detection as well as to generate predictive machine learning models.
Predicting Heart Failure
kaggle.com
Updated Sep 13, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aman Chauhan (2022). Predicting Heart Failure [Dataset]. https://www.kaggle.com/datasets/whenamancodes/heart-failure-clinical-records
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 13, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aman Chauhan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Cardiovascular diseases (CVDs) are the number 1 cause of death globally, taking an estimated 17.9 million lives each year, which accounts for 31% of all deaths worlwide. Heart failure is a common event caused by CVDs and this dataset contains 12 features that can be used to predict mortality by heart failure.

Most cardiovascular diseases can be prevented by addressing behavioural risk factors such as tobacco use, unhealthy diet and obesity, physical inactivity and harmful use of alcohol using population-wide strategies.

People with cardiovascular disease or who are at high cardiovascular risk (due to the presence of one or more risk factors such as hypertension, diabetes, hyperlipidaemia or already established disease) need early detection and management wherein a machine learning model can be of great help.

Attribute Information:

Thirteen (13) clinical features: - age: age of the patient (years) - anaemia: decrease of red blood cells or hemoglobin (boolean) - high blood pressure: if the patient has hypertension (boolean) - creatinine phosphokinase (CPK): level of the CPK enzyme in the blood (mcg/L) - diabetes: if the patient has diabetes (boolean) - ejection fraction: percentage of blood leaving the heart at each contraction (percentage) - platelets: platelets in the blood (kiloplatelets/mL) - sex: woman or man (binary) - serum creatinine: level of serum creatinine in the blood (mg/dL) - serum sodium: level of serum sodium in the blood (mEq/L) - smoking: if the patient smokes or not (boolean) - time: follow-up period (days) - [target] death event: if the patient deceased during the follow-up period (boolean)

More - Find More Exciting🙀 Datasets Here - An Upvote👍 A Dayᕙ(`▿´)ᕗ , Keeps Aman Hurray Hurray..... ٩(˘◡˘)۶Haha
ECG Images Dataset of Cardiac Patients
kaggle.com
Updated Aug 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Evil Spirit05 (2024). ECG Images Dataset of Cardiac Patients [Dataset]. https://www.kaggle.com/datasets/evilspirit05/ecg-analysis
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 28, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Evil Spirit05
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
The ECG Images Dataset of Cardiac Patients is an extensive collection of electrocardiogram (ECG) images designed to aid research and advancements in the field of cardiovascular medicine. This dataset provides a wealth of data that can be utilized for various analyses, including the development of diagnostic tools and the study of different cardiac conditions.

Dataset Overview

The dataset is organized into four main categories, each representing different cardiac conditions:

ECG Images of Myocardial Infarction Patients

Number of Images: 240

Total Dimensions: 240x12 (total of 2880 images)

Description: These images are from patients diagnosed with myocardial infarction (MI), commonly known as a heart attack. The images reflect the ECG patterns typically associated with this critical condition.

ECG Images of Patients with Abnormal Heartbeat

Number of Images: 233

Total Dimensions: 233x12 (total of 2796 images)

Description: This category includes ECG images from patients exhibiting abnormal heartbeat patterns. Such patterns may indicate a range of arrhythmias or other cardiac issues, providing crucial data for diagnostic and research purposes.

ECG Images of Patients with a History of Myocardial Infarction

Number of Images: 172

Total Dimensions: 172x12 (total of 2064 images)

Description: These images come from patients who have a documented history of myocardial infarction. They offer insights into the long-term effects and recovery patterns associated with heart attacks.

Normal Person ECG Images

Number of Images: 284

Total Dimensions: 284x12 (total of 3408 images)

Description: This category features ECG images from individuals with no known cardiac issues, serving as a baseline for comparison with pathological cases.

Certainly! Here’s a revised and enhanced description for your Kaggle dataset post, with the requested information removed:

ECG Images Dataset of Cardiac Patients Description The ECG Images Dataset of Cardiac Patients is an extensive collection of electrocardiogram (ECG) images designed to aid research and advancements in the field of cardiovascular medicine. This dataset provides a wealth of data that can be utilized for various analyses, including the development of diagnostic tools and the study of different cardiac conditions.

Dataset Overview The dataset is organized into four main categories, each representing different cardiac conditions:

ECG Images of Myocardial Infarction Patients

Number of Images: 240 Total Dimensions: 240x12 (total of 2880 images) Description: These images are from patients diagnosed with myocardial infarction (MI), commonly known as a heart attack. The images reflect the ECG patterns typically associated with this critical condition. ECG Images of Patients with Abnormal Heartbeat

Number of Images: 233 Total Dimensions: 233x12 (total of 2796 images) Description: This category includes ECG images from patients exhibiting abnormal heartbeat patterns. Such patterns may indicate a range of arrhythmias or other cardiac issues, providing crucial data for diagnostic and research purposes. ECG Images of Patients with a History of Myocardial Infarction

Number of Images: 172 Total Dimensions: 172x12 (total of 2064 images) Description: These images come from patients who have a documented history of myocardial infarction. They offer insights into the long-term effects and recovery patterns associated with heart attacks. Normal Person ECG Images

Number of Images: 284 Total Dimensions: 284x12 (total of 3408 images) Description: This category features ECG images from individuals with no known cardiac issues, serving as a baseline for comparison with pathological cases.

Applications

The ECG Images Dataset is a valuable resource for various applications, including:

Machine Learning and AI Models: Train and validate models for ECG classification, anomaly detection, and predictive analytics.

Cardiac Research: Investigate patterns and features of different cardiac conditions to improve diagnostic methods and patient outcomes.

Diagnostic Tool Development: Create automated systems for detecting and interpreting ECG abnormalities.

Download

The dataset is available for download from Kaggle and is provided in a compressed file of approximately 194 MB.
Data from: Heart Failure Prediction Dataset
kaggle.com
Updated Sep 10, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
fedesoriano (2021). Heart Failure Prediction Dataset [Dataset]. https://www.kaggle.com/fedesoriano/heart-failure-prediction/activity
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 10, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
fedesoriano
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
Similar Datasets

Hepatitis C Dataset: LINK

Body Fat Prediction Dataset: LINK

Cirrhosis Prediction Dataset: LINK

Stroke Prediction Dataset: LINK

Stellar Classification Dataset - SDSS17: LINK

Wind Speed Prediction Dataset: LINK

Spanish Wine Quality Dataset: LINK

Context

Cardiovascular diseases (CVDs) are the number 1 cause of death globally, taking an estimated 17.9 million lives each year, which accounts for 31% of all deaths worldwide. Four out of 5CVD deaths are due to heart attacks and strokes, and one-third of these deaths occur prematurely in people under 70 years of age. Heart failure is a common event caused by CVDs and this dataset contains 11 features that can be used to predict a possible heart disease.

People with cardiovascular disease or who are at high cardiovascular risk (due to the presence of one or more risk factors such as hypertension, diabetes, hyperlipidaemia or already established disease) need early detection and management wherein a machine learning model can be of great help.

Attribute Information

Age: age of the patient [years]

Sex: sex of the patient [M: Male, F: Female]

ChestPainType: chest pain type [TA: Typical Angina, ATA: Atypical Angina, NAP: Non-Anginal Pain, ASY: Asymptomatic]

RestingBP: resting blood pressure [mm Hg]

Cholesterol: serum cholesterol [mm/dl]

FastingBS: fasting blood sugar [1: if FastingBS > 120 mg/dl, 0: otherwise]

RestingECG: resting electrocardiogram results [Normal: Normal, ST: having ST-T wave abnormality (T wave inversions and/or ST elevation or depression of > 0.05 mV), LVH: showing probable or definite left ventricular hypertrophy by Estes' criteria]

MaxHR: maximum heart rate achieved [Numeric value between 60 and 202]

ExerciseAngina: exercise-induced angina [Y: Yes, N: No]

Oldpeak: oldpeak = ST [Numeric value measured in depression]

ST_Slope: the slope of the peak exercise ST segment [Up: upsloping, Flat: flat, Down: downsloping]

HeartDisease: output class [1: heart disease, 0: Normal]

Source

This dataset was created by combining different datasets already available independently but not combined before. In this dataset, 5 heart datasets are combined over 11 common features which makes it the largest heart disease dataset available so far for research purposes. The five datasets used for its curation are:

Cleveland: 303 observations

Hungarian: 294 observations

Switzerland: 123 observations

Long Beach VA: 200 observations

Stalog (Heart) Data Set: 270 observations

Total: 1190 observations Duplicated: 272 observations

Final dataset: 918 observations

Every dataset used can be found under the Index of heart disease datasets from UCI Machine Learning Repository on the following link: https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/

Citation

fedesoriano. (September 2021). Heart Failure Prediction Dataset. Retrieved [Date Retrieved] from https://www.kaggle.com/fedesoriano/heart-failure-prediction.

Acknowledgements

Creators:

Hungarian Institute of Cardiology. Budapest: Andras Janosi, M.D.

University Hospital, Zurich, Switzerland: William Steinbrunn, M.D.

University Hospital, Basel, Switzerland: Matthias Pfisterer, M.D.

V.A. Medical Center, Long Beach and Cleveland Clinic Foundation: Robert Detrano, M.D., Ph.D.

Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779
cardiovascular-disease-dataset
kaggle.com
Updated May 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Long NGzzz (2025). cardiovascular-disease-dataset [Dataset]. https://www.kaggle.com/datasets/longngzzz/cardiovascular-disease-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 9, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Long NGzzz
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
Dataset

This dataset was created by Long NGzzz

Released under Database: Open Database, Contents: Database Contents

Contents
m
ECG Images dataset of Cardiac Patients
data.mendeley.com
Updated Mar 19, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ali Haider Khan (2021). ECG Images dataset of Cardiac Patients [Dataset]. http://doi.org/10.17632/gwbz3fsgp8.2
Explore at:
Unique identifier
https://doi.org/10.17632/gwbz3fsgp8.2
Dataset updated
Mar 19, 2021
Authors
Ali Haider Khan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ECG images dataset of Cardiac Patients created under the auspices of Ch. Pervaiz Elahi Institute of Cardiology Multan, Pakistan that aims to help the scientific community for conducting the research for Cardiovascular diseases.
A
‘Heart Disease Prediction using DifferentTechniques’ analyzed by Analyst-2
analyst-2.ai
Updated Nov 13, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Heart Disease Prediction using DifferentTechniques’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-heart-disease-prediction-using-differenttechniques-9270/8b0472e8/?iid=041-227&v=presentation
Explore at:
Dataset updated
Nov 13, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Heart Disease Prediction using DifferentTechniques’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/jillanisofttech/heart-disease-prediction-using-differenttechniques on 13 November 2021.

--- Dataset description provided by original source is as follows ---

Context: The leading cause of death in the developed world is heart disease. Therefore there needs to be work done to help prevent the risks of having a heart attack or stroke.

Content: Use this dataset to predict which patients are most likely to suffer from heart disease in the near future using the features given.

Acknowledgment: This data comes from the UCI at https://archive.ics.uci.edu/ml/datasets/Heart+Disease.

--- Original source retains full ownership of the source dataset ---
m
Heart Attack Dataset
data.mendeley.com
Updated Nov 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tarik A. Rashid (2022). Heart Attack Dataset [Dataset]. http://doi.org/10.17632/wmhctcrt5v.1
Explore at:
Unique identifier
https://doi.org/10.17632/wmhctcrt5v.1
Dataset updated
Nov 23, 2022
Authors
Tarik A. Rashid
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The heart attack datasets were collected at Zheen hospital in Erbil, Iraq, from January 2019 to May 2019. The attributes of this dataset are: age, gender, heart rate, systolic blood pressure, diastolic blood pressure, blood sugar, ck-mb and troponin with negative or positive output. According to the provided information, the medical dataset classifies either heart attack or none. The gender column in the data is normalized: the male is set to 1 and the female to 0. The glucose column is set to 1 if it is > 120; otherwise, 0. As for the output, positive is set to 1 and negative to 0.
Heart Disease Dataset
kaggle.com
Updated Feb 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
George Williams77555 (2023). Heart Disease Dataset [Dataset]. https://www.kaggle.com/datasets/georgewilliams77555/heart-disease-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 16, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
George Williams77555
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
This data set dates from 1988 and consists of four databases: Cleveland, Hungary, Switzerland, and Long Beach V. It contains 9 attributes and is a shorter version of the original model. The "target" field refers to the presence of heart disease in the patient. It is integer valued 0 = no disease and 1 = disease. Source of the original data can be found here: https://archive.ics.uci.edu/ml/datasets/heart+Disease

age

sex

chest pain type (4 values)

resting blood pressure

serum cholestoral in mg/dl

fasting blood sugar > 120 mg/dl

heart rate max- maximum heart rate achieved

angina - exercise induced angina 0 no, 1 yes

target - 1 = heart disease, 0 = no heart disease
A
‘Heart Disease Cleveland UCI’ analyzed by Analyst-2
analyst-2.ai
Updated Sep 30, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Heart Disease Cleveland UCI’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-heart-disease-cleveland-uci-dc64/6c64d19e/?iid=042-708&v=presentation
Explore at:
Dataset updated
Sep 30, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Cleveland
Description
Analysis of ‘Heart Disease Cleveland UCI’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/cherngs/heart-disease-cleveland-uci on 30 September 2021.

--- Dataset description provided by original source is as follows ---

Context

The data is already presented in https://www.kaggle.com/ronitf/heart-disease-uci but there are some descriptions and values that are wrong as discussed in https://www.kaggle.com/ronitf/heart-disease-uci/discussion/105877. So, here is re-processed dataset that was cross-checked with the original data https://archive.ics.uci.edu/ml/datasets/Heart+Disease.

Content

There are 13 attributes 1. age: age in years 2. sex: sex (1 = male; 0 = female) 3. cp: chest pain type -- Value 0: typical angina -- Value 1: atypical angina -- Value 2: non-anginal pain -- Value 3: asymptomatic 4. trestbps: resting blood pressure (in mm Hg on admission to the hospital) 5. chol: serum cholestoral in mg/dl 6. fbs: (fasting blood sugar > 120 mg/dl) (1 = true; 0 = false) 7. restecg: resting electrocardiographic results -- Value 0: normal -- Value 1: having ST-T wave abnormality (T wave inversions and/or ST elevation or depression of > 0.05 mV) -- Value 2: showing probable or definite left ventricular hypertrophy by Estes' criteria 8. thalach: maximum heart rate achieved 9. exang: exercise induced angina (1 = yes; 0 = no) 10. oldpeak = ST depression induced by exercise relative to rest 11. slope: the slope of the peak exercise ST segment -- Value 0: upsloping -- Value 1: flat -- Value 2: downsloping 12. ca: number of major vessels (0-3) colored by flourosopy 13. thal: 0 = normal; 1 = fixed defect; 2 = reversable defect and the label 14. condition: 0 = no disease, 1 = disease

Acknowledgements

Data posted on Kaggle: https://www.kaggle.com/ronitf/heart-disease-uci Description of the data above: https://www.kaggle.com/ronitf/heart-disease-uci/discussion/105877 Original data https://archive.ics.uci.edu/ml/datasets/Heart+Disease

Creators: Hungarian Institute of Cardiology. Budapest: Andras Janosi, M.D. University Hospital, Zurich, Switzerland: William Steinbr Creators: Hungarian Institute of Cardiology. Budapest: Andras Janosi, M.D. University Hospital, Zurich, Switzerland: William Steinbrunn, M.D. University Hospital, Basel, Switzerland: Matthias Pfisterer, M.D. V.A. Medical Center, Long Beach and Cleveland Clinic Foundation: Robert Detrano, M.D., Ph.D. Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779

Inspiration

With the attributes described above, can you predict if a patient has heart disease?

--- Original source retains full ownership of the source dataset ---
m
Data from: Classification of Heart Failure Using Machine Learning: A...
data.mendeley.com
Updated Oct 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bryan Chulde (2024). Classification of Heart Failure Using Machine Learning: A Comparative Study [Dataset]. http://doi.org/10.17632/959dxmgj8d.1
Explore at:
Unique identifier
https://doi.org/10.17632/959dxmgj8d.1
Dataset updated
Oct 29, 2024
Authors
Bryan Chulde
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Our research demonstrates that machine learning algorithms can effectively predict heart failure, highlighting high-accuracy models that improve detection and treatment. The Kaggle “Heart Failure” dataset, with 918 instances and 12 key features, was preprocessed to remove outliers and features a distribution of cases with and without heart disease (508 and 410). Five models were evaluated: the random forest achieved the highest accuracy (92%) and was consolidated as the most effective at classifying cases. Logistic regression and multilayer perceptron were also quite accurate (89%), while decision tree and k-nearest neighbors performed less well, showing that k-neighbors is less suitable for this data. F1 scores confirmed the random forest as the optimal one, benefiting from preprocessing and hyperparameter tuning. The data analysis revealed that age, blood pressure and cholesterol correlate with disease risk, suggesting that these models may help prioritize patients at risk and improve their preventive management. The research underscores the potential of these models in clinical practice to improve diagnostic accuracy and reduce costs, supporting informed medical decisions and improving health outcomes.
A
‘Heart Failure Prediction’ analyzed by Analyst-2
analyst-2.ai
Updated Nov 21, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Heart Failure Prediction’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-heart-failure-prediction-e809/6cc020ab/?iid=025-855&v=presentation
Explore at:
Dataset updated
Nov 21, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Heart Failure Prediction’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/andrewmvd/heart-failure-clinical-data on 21 November 2021.

--- Dataset description provided by original source is as follows ---

About this dataset

Cardiovascular diseases (CVDs) are the number 1 cause of death globally, taking an estimated 17.9 million lives each year, which accounts for 31% of all deaths worlwide. Heart failure is a common event caused by CVDs and this dataset contains 12 features that can be used to predict mortality by heart failure.

Most cardiovascular diseases can be prevented by addressing behavioural risk factors such as tobacco use, unhealthy diet and obesity, physical inactivity and harmful use of alcohol using population-wide strategies.

People with cardiovascular disease or who are at high cardiovascular risk (due to the presence of one or more risk factors such as hypertension, diabetes, hyperlipidaemia or already established disease) need early detection and management wherein a machine learning model can be of great help.

How to use this dataset

Create a model for predicting mortality caused by Heart Failure.

Your kernel can be featured here!

More datasets

Acknowledgements

If you use this dataset in your research, please credit the authors

Citation

Davide Chicco, Giuseppe Jurman: Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone. BMC Medical Informatics and Decision Making 20, 16 (2020). (link)

License

CC BY 4.0

Splash icon

Icon by Freepik, available on Flaticon.

Splash banner

Wallpaper by jcomp, available on Freepik.

--- Original source retains full ownership of the source dataset ---
Heart Disease Prediction
kaggle.com
Updated Aug 23, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rishi Damarla (2020). Heart Disease Prediction [Dataset]. https://www.kaggle.com/rishidamarla/heart-disease-prediction/notebooks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 23, 2020
Dataset provided by
Kaggle
Authors
Rishi Damarla
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context: The leading cause of death in the developed world is heart disease. Therefore there needs to be work done to help prevent the risks of of having a heart attack or stroke.

Content: Use this dataset to predict which patients are most likely to suffer from a heart disease in the near future using the features given.

Acknowledgement: This data comes from the University of California Irvine's Machine Learning Repository at https://archive.ics.uci.edu/ml/datasets/Heart+Disease.
h
heart-failure-dataset
huggingface.co
Updated Apr 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Krish Kumar (2025). heart-failure-dataset [Dataset]. https://huggingface.co/datasets/imkrish/heart-failure-dataset
Explore at:
Dataset updated
Apr 26, 2025
Authors
Krish Kumar
Description
Heart Failure Prediction Dataset

Downloaded from Kaggle’s fedesoriano/heart-failure-prediction.
A
‘Heart_Diseases’ analyzed by Analyst-2
analyst-2.ai
Updated Nov 13, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Heart_Diseases’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-heart-diseases-1f4c/2f0b4aa2/?iid=042-044&v=presentation
Explore at:
Dataset updated
Nov 13, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Heart_Diseases’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/charankakaraparthi/heart-dissease on 30 September 2021.

--- Dataset description provided by original source is as follows ---

Hello There , Greetings of the day !!

Here I am Providing a Small Description About the Dataset

This Dataset may be carried out in Python language to discover the accuracy based at the algorithm.This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them.The "goal" field refers to the presence of heart disease in the patient. It is integer valued from 0 (no presence) to 4.

Thanks & Regards,

Team Head

--- Original source retains full ownership of the source dataset ---
A
‘Heart Failure Clinical Records Dataset’ analyzed by Analyst-2
analyst-2.ai
Updated Sep 30, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Heart Failure Clinical Records Dataset’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-heart-failure-clinical-records-dataset-0308/6bc9d5d1/?iid=022-671&v=presentation
Explore at:
Dataset updated
Sep 30, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Heart Failure Clinical Records Dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/rithikkotha/heart-failure-clinical-records-dataset on 30 September 2021.

--- No further description of dataset provided by original source ---

--- Original source retains full ownership of the source dataset ---
Heart Disease Prediction Dataset
kaggle.com
Updated Oct 6, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The citation is currently not available for this dataset.
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 6, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sivagurunathan28
Description
Dataset

This dataset was created by Sivagurunathan28

Contents

Facebook

Twitter

Click to copy link

Link copied

Cite

M. Farhaan Nazirkhan (2024). Heart Disease Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/mfarhaannazirkhan/heart-dataset

Heart Disease Prediction Dataset

A Comprehensive Dataset for Machine Learning-Based Heart Disease Prediction

Explore at:

2 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Sep 27, 2024

Dataset provided by

Kagglehttp://kaggle.com/

Authors

M. Farhaan Nazirkhan

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Heart Disease Prediction Dataset

This dataset contains 1,888 records merged from five publicly available heart disease datasets. It includes 14 features that are crucial for predicting heart attack and stroke risks, covering both medical and demographic factors. Below is a detailed description of each feature.

Feature Descriptions:

age: Age of the patient (Numeric).
sex: Gender of the patient. Values: 1 = male, 0 = female.
cp: Chest pain type. Values: 0 = Typical angina, 1 = Atypical angina, 2 = Non-anginal pain, 3 = Asymptomatic.
trestbps: Resting Blood Pressure (in mm Hg) (Numeric).
chol: Serum Cholesterol level (in mg/dl) (Numeric).
fbs: Fasting blood sugar > 120 mg/dl. Values: 1 = true, 0 = false.
restecg: Resting electrocardiographic results. Values: 0 = Normal, 1 = ST-T wave abnormality, 2 = Left ventricular hypertrophy.
thalach: Maximum heart rate achieved (Numeric).
exang: Exercise-induced angina. Values: 1 = yes, 0 = no.
oldpeak: ST depression induced by exercise relative to rest (Numeric).
slope: Slope of the peak exercise ST segment. Values: 0 = Upsloping, 1 = Flat, 2 = Downsloping.
ca: Number of major vessels (0-3) colored by fluoroscopy. Values: 0, 1, 2, 3.
thal: Thalassemia types. Values: 1 = Normal, 2 = Fixed defect, 3 = Reversible defect.
target: Outcome variable (heart attack risk). Values: 1 = more chance of heart attack, 0 = less chance of heart attack.

Dataset Details:

This dataset is a combination of five publicly available heart disease datasets, with a total of 1,888 records. Merging these datasets provides a more robust foundation for training machine learning models aimed at predicting heart attack risk.

Datasets Used:

Heart Attack Analysis & Prediction Dataset
Number of Records: 304
Reference: Rahman, 2021
Heart Disease Dataset
Number of Records: 1,026
Reference: Lapp, 2019
Heart Attack Prediction (Dataset 3)
Number of Records: 295
Reference: Damarla, 2020
Heart Attack Prediction (Dataset 4)
Number of Records: 271
Reference: Anand, 2018
Heart CSV Dataset
Number of Records: 290
Reference: Nandal, 2022

Description:

This dataset includes 14 features known to contribute to heart attack risk. It is ideal for training machine learning models aimed at early detection and prevention of heart disease. The records have been cleaned by removing missing data to ensure data integrity. This dataset can be applied to various machine learning algorithms, including classification models such as Decision Trees, Neural Networks, and others.

Clear search

Close search

Google apps

Main menu

Heart Disease Prediction Dataset

Heart Disease Prediction Dataset

Feature Descriptions:

Dataset Details:

Datasets Used:

Description:

Heart Disease Dataset (Comprehensive)

Cardiovascular Disease Dataset

Cardiovascular_Disease_Dataset

Predicting Heart Failure

Attribute Information:

ECG Images Dataset of Cardiac Patients

Dataset Overview

ECG Images of Myocardial Infarction Patients

ECG Images of Patients with Abnormal Heartbeat

ECG Images of Patients with a History of Myocardial Infarction

Normal Person ECG Images

Applications

The ECG Images Dataset is a valuable resource for various applications, including:

Download

Data from: Heart Failure Prediction Dataset

Similar Datasets

Context

Attribute Information

Source

Citation

Acknowledgements

cardiovascular-disease-dataset

Dataset

Contents

ECG Images dataset of Cardiac Patients

‘Heart Disease Prediction using DifferentTechniques’ analyzed by Analyst-2

Heart Attack Dataset

Heart Disease Dataset

‘Heart Disease Cleveland UCI’ analyzed by Analyst-2

Context

Content

Acknowledgements

Inspiration

Data from: Classification of Heart Failure Using Machine Learning: A...

‘Heart Failure Prediction’ analyzed by Analyst-2

About this dataset

How to use this dataset

Acknowledgements

Citation

License

Splash icon

Splash banner

Heart Disease Prediction

heart-failure-dataset

‘Heart_Diseases’ analyzed by Analyst-2

‘Heart Failure Clinical Records Dataset’ analyzed by Analyst-2

Heart Disease Prediction Dataset

Dataset

Contents

Heart Disease Prediction Dataset

A Comprehensive Dataset for Machine Learning-Based Heart Disease Prediction

Heart Disease Prediction Dataset

Feature Descriptions:

Dataset Details:

Datasets Used:

Description: