55 datasets found

heart.csv.xls
kaggle.com
zip
Updated Dec 20, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
LIU Meiting Avril (2024). heart.csv.xls [Dataset]. https://www.kaggle.com/liumeitingavril/heart-csv-xls
Explore at:
zip(3478 bytes)Available download formats
Dataset updated
Dec 20, 2024
Authors
LIU Meiting Avril
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by LIU Meiting Avril

Released under Apache 2.0

Contents
Heart Disease Data.csv
kaggle.com
zip
Updated Jan 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Srikanth Chitteti (2025). Heart Disease Data.csv [Dataset]. https://www.kaggle.com/datasets/srikanthchitteti/heart-disease-data-csv
Explore at:
zip(8762 bytes)Available download formats
Dataset updated
Jan 24, 2025
Authors
Srikanth Chitteti
Description
Dataset

This dataset was created by Srikanth Chitteti

Contents
m
Cardiovascular_Disease_Dataset
data.mendeley.com
Updated Apr 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bhanu Prakash Doppala (2021). Cardiovascular_Disease_Dataset [Dataset]. http://doi.org/10.17632/dzz48mvjht.1
Explore at:
Unique identifier
https://doi.org/10.17632/dzz48mvjht.1
Dataset updated
Apr 16, 2021
Authors
Bhanu Prakash Doppala
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This heart disease dataset is acquired from one o f the multispecialty hospitals in India. Over 14 common features which makes it one of the heart disease dataset available so far for research purposes. This dataset consists of 1000 subjects with 12 features. This dataset will be useful for building a early-stage heart disease detection as well as to generate predictive machine learning models.
Heart Disease Mortality Data Among US Adults (35+) by State/Territory and...
catalog.data.gov
data.virginia.gov
+2more
Updated Feb 13, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2025). Heart Disease Mortality Data Among US Adults (35+) by State/Territory and County – 2018-2020 [Dataset]. https://catalog.data.gov/dataset/heart-disease-mortality-data-among-us-adults-35-by-state-territory-and-county-2018-2020-3a2b0
Explore at:
Dataset updated
Feb 13, 2025
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Description
2018 to 2020, 3-year average. Rates are age-standardized. County rates are spatially smoothed. The data can be viewed by sex and race/ethnicity. Data source: National Vital Statistics System. Additional data, maps, and methodology can be viewed on the Interactive Atlas of Heart Disease and Stroke https://www.cdc.gov/heart-disease-stroke-atlas/about/index.html
p
Coronary Heart Disease - Dataset - CKAN
data.poltekkes-smg.ac.id
Updated Oct 7, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Coronary Heart Disease - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/coronary-heart-disease
Explore at:
Dataset updated
Oct 7, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data set CHDdata.csv contains cases of coronary heart disease (CHD) and variables associated with the patient's condition: systolic blood pressure, yearly tobacco use (in kg), low density lipoprotein (Idl), adiposity, family history (0 or 1), type A personality score (typea), obesity (body mass index), alcohol use, age, and the diagnosis of CHD (0 or 1).
Heart Disease Dataset
kaggle.com
zip
Updated Nov 11, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rischan Mafrur (2020). Heart Disease Dataset [Dataset]. https://www.kaggle.com/datasets/rischan/heart-disease-dataset
Explore at:
zip(4370 bytes)Available download formats
Dataset updated
Nov 11, 2020
Authors
Rischan Mafrur
Description
Dataset

This dataset was created by Rischan Mafrur

Contents
Heart disease uci dataset
kaggle.com
Updated Aug 23, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muneer Iqbal24 (2024). Heart disease uci dataset [Dataset]. https://www.kaggle.com/datasets/muneeriqbal24/heart-disease-uci-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 23, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Muneer Iqbal24
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by Muneer Iqbal24

Released under CC0: Public Domain

Contents
f
Data_Sheet_2_Cross-Sectional Transcriptional Analysis of the Aging Murine...
frontiersin.figshare.com
txt
Updated Jun 6, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Matthew Greenig; Andrew Melville; Derek Huntley; Mark Isalan; Michal Mielcarek (2023). Data_Sheet_2_Cross-Sectional Transcriptional Analysis of the Aging Murine Heart.CSV [Dataset]. http://doi.org/10.3389/fmolb.2020.565530.s002
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.3389/fmolb.2020.565530.s002
Dataset updated
Jun 6, 2023
Dataset provided by
Frontiers
Authors
Matthew Greenig; Andrew Melville; Derek Huntley; Mark Isalan; Michal Mielcarek
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Cardiovascular disease accounts for millions of deaths each year and is currently the leading cause of mortality worldwide. The aging process is clearly linked to cardiovascular disease, however, the exact relationship between aging and heart function is not fully understood. Furthermore, a holistic view of cardiac aging, linking features of early life development to changes observed in old age, has not been synthesized. Here, we re-purpose RNA-sequencing data previously-collected by our group, investigating gene expression differences between wild-type mice of different age groups that represent key developmental milestones in the murine lifespan. DESeq2's generalized linear model was applied with two hypothesis testing approaches to identify differentially-expressed (DE) genes, both between pairs of age groups and across mice of all ages. Pairwise comparisons identified genes associated with specific age transitions, while comparisons across all age groups identified a large set of genes associated with the aging process more broadly. An unsupervised machine learning approach was then applied to extract common expression patterns from this set of age-associated genes. Sets of genes with both linear and non-linear expression trajectories were identified, suggesting that aging not only involves the activation of gene expression programs unique to different age groups, but also the re-activation of gene expression programs from earlier ages. Overall, we present a comprehensive transcriptomic analysis of cardiac gene expression patterns across the entirety of the murine lifespan.
Rates and Trends in Heart Disease and Stroke Mortality Among US Adults (35+)...
catalog.data.gov
healthdata.gov
+3more
Updated Aug 26, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2023). Rates and Trends in Heart Disease and Stroke Mortality Among US Adults (35+) by County, Age Group, Race/Ethnicity, and Sex – 2000-2019 [Dataset]. https://catalog.data.gov/dataset/rates-and-trends-in-heart-disease-and-stroke-mortality-among-us-adults-35-by-county-a-2000-45659
Explore at:
Dataset updated
Aug 26, 2023
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Description
This dataset documents rates and trends in heart disease and stroke mortality. Specifically, this report presents county (or county equivalent) estimates of heart disease and stroke death rates in 2000-2019 and trends during two intervals (2000-2010, 2010-2019) by age group (ages 35–64 years, ages 65 years and older), race/ethnicity (non-Hispanic American Indian/Alaska Native, non-Hispanic Asian/Pacific Islander, non-Hispanic Black, Hispanic, non-Hispanic White), and sex (women, men). The rates and trends were estimated using a Bayesian spatiotemporal model and a smoothed over space, time, and demographic group. Rates are age-standardized in 10-year age groups using the 2010 US population. Data source: National Vital Statistics System.
Heart_csv
kaggle.com
Updated Dec 6, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anna Tshngryan (2018). Heart_csv [Dataset]. https://www.kaggle.com/annatshngryan/heart-csv/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 6, 2018
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Anna Tshngryan
Description
Dataset

This dataset was created by Anna Tshngryan

Contents
Heart Disease Dataset
kaggle.com
Updated Oct 27, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deesya Lovely Susanto (2023). Heart Disease Dataset [Dataset]. https://www.kaggle.com/datasets/deesyalovely/heart-disease-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 27, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Deesya Lovely Susanto
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Deesya Lovely Susanto

Released under Apache 2.0

Contents
heart-diseases
kaggle.com
zip
Updated Nov 4, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Esha Asif005 (2024). heart-diseases [Dataset]. https://www.kaggle.com/datasets/eshaasif005/heart-diseases
Explore at:
zip(6342 bytes)Available download formats
Dataset updated
Nov 4, 2024
Authors
Esha Asif005
Description
Dataset

This dataset was created by Esha Asif005

Contents
f
DataSheet_3_Pro-Inflammatory Derangement of the Immuno-Interactome in Heart...
frontiersin.figshare.com
txt
Updated Jun 14, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pavanish Kumar; Amanda Lim; Su Li Poh; Sharifah Nur Hazirah; Camillus Jian Hui Chua; Nursyuhadah Binte Sutamam; Thaschawee Arkachaisri; Joo Guan Yeo; Theo Kofidis; Vitaly Sorokin; Carolyn S. P. Lam; Arthur Mark Richards; Salvatore Albani (2023). DataSheet_3_Pro-Inflammatory Derangement of the Immuno-Interactome in Heart Failure.csv [Dataset]. http://doi.org/10.3389/fimmu.2022.817514.s003
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.3389/fimmu.2022.817514.s003
Dataset updated
Jun 14, 2023
Dataset provided by
Frontiers
Authors
Pavanish Kumar; Amanda Lim; Su Li Poh; Sharifah Nur Hazirah; Camillus Jian Hui Chua; Nursyuhadah Binte Sutamam; Thaschawee Arkachaisri; Joo Guan Yeo; Theo Kofidis; Vitaly Sorokin; Carolyn S. P. Lam; Arthur Mark Richards; Salvatore Albani
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Chronic heart failure (HF) is a syndrome of heterogeneous etiology associated with multiple co-morbidities. Inflammation is increasingly recognized as a key contributor to the pathophysiology of HF. Heterogeneity and lack of data on the immune mechanism(s) contributing to HF may partially underlie the failure of clinical trials targeting inflammatory mediators. We studied the Immunome in HF cohort using mass cytometry and used data-driven systems immunology approach to discover and characterize modulated immune cell subsets from peripheral blood. We showed cytotoxic and inflammatory innate lymphoid and myeloid cells were expanded in HF patients compared to healthy controls. Network analysis showed highly modular and centralized immune cell architecture in healthy control immune cell network. In contrast, the HF immune cell network showed greater inter-cellular communication and less modular structure. Furthermore, we found, as an immune mechanism specific to HF with preserved ejection fraction (HFpEF), an increase in inflammatory MAIT and CD4 T cell subsets.
ECG in High Intensity Exercise Dataset
zenodo.org
opendatalab.com
+3more
zip
Updated Dec 26, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Elisabetta De Giovanni; Elisabetta De Giovanni; Tomas Teijeiro; Tomas Teijeiro; David Meier; Grégoire Millet; Grégoire Millet; David Atienza; David Atienza; David Meier (2021). ECG in High Intensity Exercise Dataset [Dataset]. http://doi.org/10.5281/zenodo.5727800
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5727800
Dataset updated
Dec 26, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Elisabetta De Giovanni; Elisabetta De Giovanni; Tomas Teijeiro; Tomas Teijeiro; David Meier; Grégoire Millet; Grégoire Millet; David Atienza; David Atienza; David Meier
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data presented here was extracted from a larger dataset collected through a collaboration between the Embedded Systems Laboratory (ESL) of the Swiss Federal Institute of Technology in Lausanne (EPFL), Switzerland and the Institute of Sports Sciences of the University of Lausanne (ISSUL). In this dataset, we report the extracted segments used for an analysis of R peak detection algorithms during high intensity exercise.

Protocol of the experiments
The protocol of the experiment was the following.

22 subjects performing a cardio-pulmonary maximal exercise test on a cycle ergometer, using a gas mask. A single-lead electrocardiogram (ECG) was measured using the BIOPAC system.

An initial 3 min of rest were recorded.

After this baseline, the subjects started cycling at a power of 60W or 90W depending on their fitness level.

Then, the power of the cycle ergometer was increased by 30W every 3 min till exhaustion (in terms of maximum oxygen uptake or VO2max).

Finally, physiology experts assessed the so-called ventilatory thresholds and the VO2max based on the pulmonary data (volume of oxygen and CO2).

Description of the extracted dataset

The characteristics of the dataset are the following:

We report only 20 out of 22 subjects that were used for the analysis, because for two subjects the signals were too corrupted or not complete. Specifically, subjects 5 and 12 were discarded.

The ECG signal was sampled at 500 Hz and then downsampled at 250 Hz. The original ECG signal were measured at maximum 10 mV. Then, they were scaled down by a factor of 1000, hence the data is represented in uV.

For each subject, 5 segments of 20 s were extracted from the ECG recordings and chosen based on different phases of the maximal exercise test (i.e., before and after the so-called second ventilatory threshold or VT2, before and in the middle of VO2max, and during the recovery after exhaustion) to represent different intensities of physical activity.

seg1 --> [VT2-50,VT2-30]
seg2 --> [VT2+60,VT2+80]
seg3 --> [VO2max-50,VO2max-30]
seg4 --> [VO2max-10,VO2max+10]
seg5 --> [VO2max+60,VO2max+80]

The R peak locations were manually annotated in all segments and reviewed by a physician of the Lausanne University Hospital, CHUV. Only segment 5 of subject 9 could not be annotated since there was a problem with the input signal. So, the total number of segments extracted were 20 * 5 - 1 = 99.

Format of the extracted dataset

The dataset is divided in two main folders:

The folder `ecg_segments/` contains the ECG signals saved in two formats, `.csv` and `.mat`. This folder includes both raw (`ecg_raw`) and processed (`ecg`) signals. The processing consists of a morphological filtering and a relative energy non filtering method to enhance the R peaks. The `.csv` files contain only the signal, while the `.mat` files include the signal, the time vector within the maximal stress test, the sampling frequency and the unit of the signal amplitude (uV, as we mentioned before).

The folder `manual_annotations/` contains the sample indices of the annotated R peaks in `.csv` format. The annotation was done on the processed signals.
Z
Virtual cohort of 1000 synthetic heart meshes from adult human healthy...
data.niaid.nih.gov
zenodo.org
Updated May 11, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Longobardi, Stefano (2022). Virtual cohort of 1000 synthetic heart meshes from adult human healthy population [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4506929
Explore at:
Dataset updated
May 11, 2022
Dataset provided by
Rodero, Cristobal
Vigmond, Edward J.
Longobardi, Stefano
NIederer, Steven
Plank, Gernot
Gillette, Karli
Whitaker, John
Augustin, Christoph
Strocchi, Marina
Lamata, Pablo
Marciniak, Maciej
O'Neill, Mark D.
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Description: We present a database of four-chamber heart models derived from a statistical shape model (SSM) suitable for electro-mechanical (EM) simulations. Our database consists of 1000 four-chamber heart models generated from end-diastolic CT-derived meshes (available in the repository called ("Virtual cohort of adult healthy four-chamber heart meshes from CT images"). These meshes were used for EM simulations. The weights of the SSM are also provided.

Cardiac meshes: To build the SSM, we rigidly aligned the CT cohort and extracted the surfaces, representing them asdeRham currents. The registration between meshes and computation of the average shape was done using a Large Deformation Diffeomorphic Metric Mapping method. The deformation functions depend on a set of uniformly distributed control points in which the shapes are embedded, and on the deformation vectors attached to these points. It is in this spatial field of deformation vectors (one per each control point) where the Principal Component Analysis (PCA) is applied. Case #20 of the CT cohort was not included. More information on the details can be found in Supplement 3 of the reference paper. We created this cohort by modifying the weight of the modes explaining 90%of the variance in shape (corresponding to modes 1 to 9) within 2 standard deviations (SD) of each mode added to the average mesh. The elements of all the meshes are labelled as follows:

Left ventricle myocardium

Right ventricle myocardium

Left atrium myocardium

Right atrium myocardium

Aorta wall

Pulmonary artery wall

Mitral valve plane

Tricuspid valve plane

Aortic valve plane

Pulmonary valve plane

Left atrium appendage "inlet"

Left superior pulmonary vein inlet

Left inferior pulmonary vein inlet

Right inferior pulmonary vein inlet

Right superior pulmonary vein inlet

Superior vena cava inlet

Inferior vena cava inlet

Left atrial appendage border

Right inferior pulmonary vein border

Left inferior pulmonary vein border

Left superior pulmonary vein border

Right superior pulmonary vein border

Superior vena cava border

Inferior vena cava border

Each zipped folder contains 25 meshes and the weights of modes used to construct them for each mesh, A VTK file for each mesh (in ASCII) contains an UNSTRUCTURED GRID with the following fields:

POINTS, with the coordinates of the points in mm.

CELL_TYPES, having all of the points the value 10 since they are tetrahedra.

CELLS, with the indices of the vertices of every element.

CELL_DATA, corresponding to the meshing tags.

In addition, three descriptive files are included:

Normalized_explained_variance.csv contains the percentages of variance explained by each of the 18 modes generated from PCA.

Mode_standard_deviation.csv contains absolute standard deviations of each of the 18 modes.

Eigenvectors.csv contains the directions of maximum shape variability within the shape population.
vinbigdata Detect the heart and lungs csv
kaggle.com
zip
Updated Mar 9, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alien (2021). vinbigdata Detect the heart and lungs csv [Dataset]. https://www.kaggle.com/h053473666/vinbigdata-detect-the-heart-and-lungs-csv
Explore at:
zip(249385 bytes)Available download formats
Dataset updated
Mar 9, 2021
Authors
Alien
Description
Dataset

This dataset was created by Alien

Contents
Heart Disease Health Indicators Dataset
kaggle.com
Updated Mar 10, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alex Teboul (2022). Heart Disease Health Indicators Dataset [Dataset]. https://www.kaggle.com/datasets/alexteboul/heart-disease-health-indicators-dataset/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 10, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Alex Teboul
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

Heart Disease is among the most prevalent chronic diseases in the United States, impacting millions of Americans each year and exerting a significant financial burden on the economy. In the United States alone, heart disease claims roughly 647,000 lives each year — making it the leading cause of death. The buildup of plaques inside larger coronary arteries, molecular changes associated with aging, chronic inflammation, high blood pressure, and diabetes are all causes of and risk factors for heart disease.

While there are different types of coronary heart disease, the majority of individuals only learn they have the disease following symptoms such as chest pain, a heart attack, or sudden cardiac arrest. This fact highlights the importance of preventative measures and tests that can accurately predict heart disease in the population prior to negative outcomes like myocardial infarctions (heart attacks) taking place.

The Centers for Disease Control and Prevention has identified high blood pressure, high blood cholesterol, and smoking as three key risk factors for heart disease. Roughly half of Americans have at least one of these three risk factors. The National Heart, Lung, and Blood Institute highlights a wider array of factors such as Age, Environment and Occupation, Family History and Genetics, Lifestyle Habits, Other Medical Conditions, Race or Ethnicity, and Sex for clinicians to use in diagnosing coronary heart disease. Diagnosis tends to be driven by an initial survey of these common risk factors followed by bloodwork and other tests.

Content

The Behavioral Risk Factor Surveillance System (BRFSS) is a health-related telephone survey that is collected annually by the CDC. Each year, the survey collects responses from over 400,000 Americans on health-related risk behaviors, chronic health conditions, and the use of preventative services. It has been conducted every year since 1984. For this project, I downloaded a csv of the dataset available on Kaggle for the year 2015. This original dataset contains responses from 441,455 individuals and has 330 features. These features are either questions directly asked of participants, or calculated variables based on individual participant responses.

This dataset contains 253,680 survey responses from cleaned BRFSS 2015 to be used primarily for the binary classification of heart disease. Not that there is strong class imbalance in this dataset. 229,787 respondents do not have/have not had heart disease while 23,893 have had heart disease. The question to be explored is:

1. To what extend can survey responses from the BRFSS be used for predicting heart disease risk?

and

2. Can a subset of questions from the BRFSS be used for preventative health screening for diseases like heart disease?

Acknowledgements

It it important to reiterate that I did not create this dataset, it is just a cleaned and consolidated dataset created from the BRFSS 2015 dataset already on Kaggle. That dataset can be found here and the notebook I used for the data cleaning can be found here.

Inspiration

Let's build some predictive models for for heart disease.
Z
Interstage single ventricle heart disease infants show dysregulation in...
data.niaid.nih.gov
zenodo.org
Updated Jun 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Frank, Benjamin (2024). Interstage single ventricle heart disease infants show dysregulation in multiple metabolic pathways: targeted metabolomics analysis - Data [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11999624
Explore at:
Dataset updated
Jun 17, 2024
Dataset provided by
Frank, Benjamin
Davidson, Jesse
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data in this Zenodo entry corresponds to the data used to produce the results in https://www.jacc.org/doi/full/10.1016/j.jacadv.2022.100169. The zipped folder contains three files

Metabolite Data.csv - The meatobilte measurements for all the samples

Clinical Data.csv - Values for the clinical variables

Clinical Data Descriptions.csv - More in depth explanation of clinical variables as well as possible values of the variables

This study was supported by the American Heart Association (AHA20CDA35310498 and AHA18IPA34170070) and the National Institutes of Health (NIH/NCATS Colorado CTSA, No. UL1 TR001082 and NIH/NHLBI K23HL12363
Stroke Mortality Data Among US Adults (35+) by State/Territory and County –...
catalog.data.gov
data.virginia.gov
+1more
Updated Feb 13, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2025). Stroke Mortality Data Among US Adults (35+) by State/Territory and County – 2016-2018 [Dataset]. https://catalog.data.gov/dataset/stroke-mortality-data-among-us-adults-35-by-state-territory-and-county-2016-2018-3aa59
Explore at:
Dataset updated
Feb 13, 2025
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Description
2016 to 2018, 3-year average. Rates are age-standardized. County rates are spatially smoothed. The data can be viewed by sex and race/ethnicity. Data source: National Vital Statistics System. Additional data, maps, and methodology can be viewed on the Interactive Atlas of Heart Disease and Stroke https://www.cdc.gov/heart-disease-stroke-atlas/about/index.html
HRV-ACC: a dataset with R-R intervals and accelerometer data for the...
zenodo.org
data.niaid.nih.gov
csv, txt, zip
Updated Aug 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kamil Książek; Kamil Książek; Wilhelm Masarczyk; Wilhelm Masarczyk; Przemysław Głomb; Przemysław Głomb; Michał Romaszewski; Michał Romaszewski; Iga Stokłosa; Iga Stokłosa; Piotr Ścisło; Piotr Ścisło; Paweł Dębski; Paweł Dębski; Robert Pudlo; Robert Pudlo; Piotr Gorczyca; Piotr Gorczyca; Magdalena Piegza; Magdalena Piegza (2023). HRV-ACC: a dataset with R-R intervals and accelerometer data for the diagnosis of psychotic disorders using a Polar H10 wearable sensor [Dataset]. http://doi.org/10.5281/zenodo.8171266
Explore at:
txt, zip, csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.8171266
Dataset updated
Aug 9, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Kamil Książek; Kamil Książek; Wilhelm Masarczyk; Wilhelm Masarczyk; Przemysław Głomb; Przemysław Głomb; Michał Romaszewski; Michał Romaszewski; Iga Stokłosa; Iga Stokłosa; Piotr Ścisło; Piotr Ścisło; Paweł Dębski; Paweł Dębski; Robert Pudlo; Robert Pudlo; Piotr Gorczyca; Piotr Gorczyca; Magdalena Piegza; Magdalena Piegza
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ABSTRACT

The issue of diagnosing psychotic diseases, including schizophrenia and bipolar disorder, in particular, the objectification of symptom severity assessment, is still a problem requiring the attention of researchers. Two measures that can be helpful in patient diagnosis are heart rate variability calculated based on electrocardiographic signal and accelerometer mobility data. The following dataset contains data from 30 psychiatric ward patients having schizophrenia or bipolar disorder and 30 healthy persons. The duration of the measurements for individuals was usually between 1.5 and 2 hours. R-R intervals necessary for heart rate variability calculation were collected simultaneously with accelerometer data using a wearable Polar H10 device. The Positive and Negative Syndrome Scale (PANSS) test was performed for each patient participating in the experiment, and its results were attached to the dataset. Furthermore, the code for loading and preprocessing data, as well as for statistical analysis, was included on the corresponding GitHub repository.

BACKGROUND

Heart rate variability (HRV), calculated based on electrocardiographic (ECG) recordings of R-R intervals stemming from the heart's electrical activity, may be used as a biomarker of mental illnesses, including schizophrenia and bipolar disorder (BD) [Benjamin et al]. The variations of R-R interval values correspond to the heart's autonomic regulation changes [Berntson et al, Stogios et al]. Moreover, the HRV measure reflects the activity of the sympathetic and parasympathetic parts of the autonomous nervous system (ANS) [Task Force of the European Society of Cardiology the North American Society of Pacing Electrophysiology, Matusik et al]. Patients with psychotic mental disorders show a tendency for a change in the centrally regulated ANS balance in the direction of less dynamic changes in the ANS activity in response to different environmental conditions [Stogios et al]. Larger sympathetic activity relative to the parasympathetic one leads to lower HRV, while, on the other hand, higher parasympathetic activity translates to higher HRV. This loss of dynamic response may be an indicator of mental health. Additional benefits may come from measuring the daily activity of patients using accelerometry. This may be used to register periods of physical activity and inactivity or withdrawal for further correlation with HRV values recorded at the same time.

EXPERIMENTS

In our experiment, the participants were 30 psychiatric ward patients with schizophrenia or BD and 30 healthy people. All measurements were performed using a Polar H10 wearable device. The sensor collects ECG recordings and accelerometer data and, additionally, prepares a detection of R wave peaks. Participants of the experiment had to wear the sensor for a given time. Basically, it was between 1.5 and 2 hours, but the shortest recording was 70 minutes. During this time, evaluated persons could perform any activity a few minutes after starting the measurement. Participants were encouraged to undertake physical activity and, more specifically, to take a walk. Due to patients being in the medical ward, they received instruction to take a walk in the corridors at the beginning of the experiment. They were to repeat the walk 30 minutes and 1 hour after the first walk. The subsequent walks were to be slightly longer (about 3, 5 and 7 minutes, respectively). We did not remind or supervise the command during the experiment, both in the treatment and the control group. Seven persons from the control group did not receive this order and their measurements correspond to freely selected activities with rest periods but at least three of them performed physical activities during this time. Nevertheless, at the start of the experiment, all participants were requested to rest in a sitting position for 5 minutes. Moreover, for each patient, the disease severity was assessed using the PANSS test and its scores are attached to the dataset.

The data from sensors were collected using Polar Sensor Logger application [Happonen]. Such extracted measurements were then preprocessed and analyzed using the code prepared by the authors of the experiment. It is publicly available on the GitHub repository [Książek et al].

Firstly, we performed a manual artifact detection to remove abnormal heartbeats due to non-sinus beats and technical issues of the device (e.g. temporary disconnections and inappropriate electrode readings). We also performed anomaly detection using Daubechies wavelet transform. Nevertheless, the dataset includes raw data, while a full code necessary to reproduce our anomaly detection approach is available in the repository. Optionally, it is also possible to perform cubic spline data interpolation. After that step, rolling windows of a particular size and time intervals between them are created. Then, a statistical analysis is prepared, e.g. mean HRV calculation using the RMSSD (Root Mean Square of Successive Differences) approach, measuring a relationship between mean HRV and PANSS scores, mobility coefficient calculation based on accelerometer data and verification of dependencies between HRV and mobility scores.

DATA DESCRIPTION

The structure of the dataset is as follows. One folder, called HRV_anonymized_data contains values of R-R intervals together with timestamps for each experiment participant. The data was properly anonymized, i.e. the day of the measurement was removed to prevent person identification. Files concerned with patients have the name treatment_X.csv, where X is the number of the person, while files related to the healthy controls are named control_Y.csv, where Y is the identification number of the person. Furthermore, for visualization purposes, an image of the raw RR intervals for each participant is presented. Its name is raw_RR_{control,treatment}_N.png, where N is the number of the person from the control/treatment group. The collected data are raw, i.e. before the anomaly removal. The code enabling reproducing the anomaly detection stage and removing suspicious heartbeats is publicly available in the repository [Książek et al]. The structure of consecutive files collecting R-R intervals is following:

Phone timestamp RR-interval [ms]
12:43:26.538000 651
12:43:27.189000 632
12:43:27.821000 618
12:43:28.439000 621
12:43:29.060000 661
... ...

The first column contains the timestamp for which the distance between two consecutive R peaks was registered. The corresponding R-R interval is presented in the second column of the file and is expressed in milliseconds.
The second folder, called accelerometer_anonymized_data contains values of accelerometer data collected at the same time as R-R intervals. The naming convention is similar to that of the R-R interval data: treatment_X.csv and control_X.csv represent the data coming from the persons from the treatment and control group, respectively, while X is the identification number of the selected participant. The numbers are exactly the same as for R-R intervals. The structure of the files with accelerometer recordings is as follows:

Phone timestamp X [mg] Y [mg] Z [mg]
13:00:17.196000 -961 -23 182
13:00:17.205000 -965 -21 181
13:00:17.215000 -966 -22 187
13:00:17.225000 -967 -26 193
13:00:17.235000 -965 -27 191
... ... ... ...

The first column contains a timestamp, while the next three columns correspond to the currently registered acceleration in three axes: X, Y and Z, in milli-g unit.

We also attached a file with the PANSS test scores (PANSS.csv) for all patients participating in the measurement. The structure of this file is as follows:

no_of_person PANSS_P PANSS_N PANSS_G PANSS_total
1 8 13 22 43
2 11 7 18 36
3 14 30 44 88
4 18 13 27 58
... ... ... ... ..

The first column contains the identification number of the patient, while the three following columns refer to the PANSS scores related to positive, negative and general symptoms, respectively.

USAGE NOTES

All the files necessary to run the HRV and/or accelerometer data analysis are available on the GitHub repository [Książek et al]. HRV data loading, preprocessing (i.e. anomaly detection and removal), as well as the

Facebook

Twitter

Click to copy link

Link copied

Cite

LIU Meiting Avril (2024). heart.csv.xls [Dataset]. https://www.kaggle.com/liumeitingavril/heart-csv-xls

heart.csv.xls

Explore at:

zip(3478 bytes)Available download formats

Dataset updated

Dec 20, 2024

Authors

LIU Meiting Avril

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Dataset

This dataset was created by LIU Meiting Avril

Released under Apache 2.0

Clear search

Close search

Google apps

Main menu

Phone timestamp	RR-interval [ms]
12:43:26.538000	651
12:43:27.189000	632
12:43:27.821000	618
12:43:28.439000	621
12:43:29.060000	661
...	...

Phone timestamp	X [mg]	Y [mg]	Z [mg]
13:00:17.196000	-961	-23	182
13:00:17.205000	-965	-21	181
13:00:17.215000	-966	-22	187
13:00:17.225000	-967	-26	193
13:00:17.235000	-965	-27	191
...	...	...	...

no_of_person	PANSS_P	PANSS_N	PANSS_G	PANSS_total
1	8	13	22	43
2	11	7	18	36
3	14	30	44	88
4	18	13	27	58
...	...	...	...	..

heart.csv.xls

Dataset

Contents

Heart Disease Data.csv

Dataset

Contents

Cardiovascular_Disease_Dataset

Heart Disease Mortality Data Among US Adults (35+) by State/Territory and...

Coronary Heart Disease - Dataset - CKAN

Heart Disease Dataset

Dataset

Contents

Heart disease uci dataset

Dataset

Contents

Data_Sheet_2_Cross-Sectional Transcriptional Analysis of the Aging Murine...

Rates and Trends in Heart Disease and Stroke Mortality Among US Adults (35+)...

Heart_csv

Dataset

Contents

Heart Disease Dataset

Dataset

Contents

heart-diseases

Dataset

Contents

DataSheet_3_Pro-Inflammatory Derangement of the Immuno-Interactome in Heart...

ECG in High Intensity Exercise Dataset

Virtual cohort of 1000 synthetic heart meshes from adult human healthy...

vinbigdata Detect the heart and lungs csv

Dataset

Contents

Heart Disease Health Indicators Dataset

Context

Content

1. To what extend can survey responses from the BRFSS be used for predicting heart disease risk?

2. Can a subset of questions from the BRFSS be used for preventative health screening for diseases like heart disease?

Acknowledgements

Inspiration

Interstage single ventricle heart disease infants show dysregulation in...

Stroke Mortality Data Among US Adults (35+) by State/Territory and County –...

HRV-ACC: a dataset with R-R intervals and accelerometer data for the...

heart.csv.xls

Dataset

Contents