55 datasets found
  1. heart.csv.xls

    • kaggle.com
    zip
    Updated Dec 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LIU Meiting Avril (2024). heart.csv.xls [Dataset]. https://www.kaggle.com/liumeitingavril/heart-csv-xls
    Explore at:
    zip(3478 bytes)Available download formats
    Dataset updated
    Dec 20, 2024
    Authors
    LIU Meiting Avril
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by LIU Meiting Avril

    Released under Apache 2.0

    Contents

  2. Heart Disease Data.csv

    • kaggle.com
    zip
    Updated Jan 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Srikanth Chitteti (2025). Heart Disease Data.csv [Dataset]. https://www.kaggle.com/datasets/srikanthchitteti/heart-disease-data-csv
    Explore at:
    zip(8762 bytes)Available download formats
    Dataset updated
    Jan 24, 2025
    Authors
    Srikanth Chitteti
    Description

    Dataset

    This dataset was created by Srikanth Chitteti

    Contents

  3. m

    Cardiovascular_Disease_Dataset

    • data.mendeley.com
    Updated Apr 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhanu Prakash Doppala (2021). Cardiovascular_Disease_Dataset [Dataset]. http://doi.org/10.17632/dzz48mvjht.1
    Explore at:
    Dataset updated
    Apr 16, 2021
    Authors
    Bhanu Prakash Doppala
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This heart disease dataset is acquired from one o f the multispecialty hospitals in India. Over 14 common features which makes it one of the heart disease dataset available so far for research purposes. This dataset consists of 1000 subjects with 12 features. This dataset will be useful for building a early-stage heart disease detection as well as to generate predictive machine learning models.

  4. Heart Disease Mortality Data Among US Adults (35+) by State/Territory and...

    • catalog.data.gov
    • data.virginia.gov
    • +2more
    Updated Feb 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2025). Heart Disease Mortality Data Among US Adults (35+) by State/Territory and County – 2018-2020 [Dataset]. https://catalog.data.gov/dataset/heart-disease-mortality-data-among-us-adults-35-by-state-territory-and-county-2018-2020-3a2b0
    Explore at:
    Dataset updated
    Feb 13, 2025
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Description

    2018 to 2020, 3-year average. Rates are age-standardized. County rates are spatially smoothed. The data can be viewed by sex and race/ethnicity. Data source: National Vital Statistics System. Additional data, maps, and methodology can be viewed on the Interactive Atlas of Heart Disease and Stroke https://www.cdc.gov/heart-disease-stroke-atlas/about/index.html

  5. p

    Coronary Heart Disease - Dataset - CKAN

    • data.poltekkes-smg.ac.id
    Updated Oct 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Coronary Heart Disease - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/coronary-heart-disease
    Explore at:
    Dataset updated
    Oct 7, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data set CHDdata.csv contains cases of coronary heart disease (CHD) and variables associated with the patient's condition: systolic blood pressure, yearly tobacco use (in kg), low density lipoprotein (Idl), adiposity, family history (0 or 1), type A personality score (typea), obesity (body mass index), alcohol use, age, and the diagnosis of CHD (0 or 1).

  6. Heart Disease Dataset

    • kaggle.com
    zip
    Updated Nov 11, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rischan Mafrur (2020). Heart Disease Dataset [Dataset]. https://www.kaggle.com/datasets/rischan/heart-disease-dataset
    Explore at:
    zip(4370 bytes)Available download formats
    Dataset updated
    Nov 11, 2020
    Authors
    Rischan Mafrur
    Description

    Dataset

    This dataset was created by Rischan Mafrur

    Contents

  7. Heart disease uci dataset

    • kaggle.com
    Updated Aug 23, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muneer Iqbal24 (2024). Heart disease uci dataset [Dataset]. https://www.kaggle.com/datasets/muneeriqbal24/heart-disease-uci-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 23, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muneer Iqbal24
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Muneer Iqbal24

    Released under CC0: Public Domain

    Contents

  8. f

    Data_Sheet_2_Cross-Sectional Transcriptional Analysis of the Aging Murine...

    • frontiersin.figshare.com
    txt
    Updated Jun 6, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Greenig; Andrew Melville; Derek Huntley; Mark Isalan; Michal Mielcarek (2023). Data_Sheet_2_Cross-Sectional Transcriptional Analysis of the Aging Murine Heart.CSV [Dataset]. http://doi.org/10.3389/fmolb.2020.565530.s002
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    Frontiers
    Authors
    Matthew Greenig; Andrew Melville; Derek Huntley; Mark Isalan; Michal Mielcarek
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cardiovascular disease accounts for millions of deaths each year and is currently the leading cause of mortality worldwide. The aging process is clearly linked to cardiovascular disease, however, the exact relationship between aging and heart function is not fully understood. Furthermore, a holistic view of cardiac aging, linking features of early life development to changes observed in old age, has not been synthesized. Here, we re-purpose RNA-sequencing data previously-collected by our group, investigating gene expression differences between wild-type mice of different age groups that represent key developmental milestones in the murine lifespan. DESeq2's generalized linear model was applied with two hypothesis testing approaches to identify differentially-expressed (DE) genes, both between pairs of age groups and across mice of all ages. Pairwise comparisons identified genes associated with specific age transitions, while comparisons across all age groups identified a large set of genes associated with the aging process more broadly. An unsupervised machine learning approach was then applied to extract common expression patterns from this set of age-associated genes. Sets of genes with both linear and non-linear expression trajectories were identified, suggesting that aging not only involves the activation of gene expression programs unique to different age groups, but also the re-activation of gene expression programs from earlier ages. Overall, we present a comprehensive transcriptomic analysis of cardiac gene expression patterns across the entirety of the murine lifespan.

  9. Rates and Trends in Heart Disease and Stroke Mortality Among US Adults (35+)...

    • catalog.data.gov
    • healthdata.gov
    • +3more
    Updated Aug 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2023). Rates and Trends in Heart Disease and Stroke Mortality Among US Adults (35+) by County, Age Group, Race/Ethnicity, and Sex – 2000-2019 [Dataset]. https://catalog.data.gov/dataset/rates-and-trends-in-heart-disease-and-stroke-mortality-among-us-adults-35-by-county-a-2000-45659
    Explore at:
    Dataset updated
    Aug 26, 2023
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Description

    This dataset documents rates and trends in heart disease and stroke mortality. Specifically, this report presents county (or county equivalent) estimates of heart disease and stroke death rates in 2000-2019 and trends during two intervals (2000-2010, 2010-2019) by age group (ages 35–64 years, ages 65 years and older), race/ethnicity (non-Hispanic American Indian/Alaska Native, non-Hispanic Asian/Pacific Islander, non-Hispanic Black, Hispanic, non-Hispanic White), and sex (women, men). The rates and trends were estimated using a Bayesian spatiotemporal model and a smoothed over space, time, and demographic group. Rates are age-standardized in 10-year age groups using the 2010 US population. Data source: National Vital Statistics System.

  10. Heart_csv

    • kaggle.com
    Updated Dec 6, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anna Tshngryan (2018). Heart_csv [Dataset]. https://www.kaggle.com/annatshngryan/heart-csv/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 6, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Anna Tshngryan
    Description

    Dataset

    This dataset was created by Anna Tshngryan

    Contents

  11. Heart Disease Dataset

    • kaggle.com
    Updated Oct 27, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deesya Lovely Susanto (2023). Heart Disease Dataset [Dataset]. https://www.kaggle.com/datasets/deesyalovely/heart-disease-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 27, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Deesya Lovely Susanto
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Deesya Lovely Susanto

    Released under Apache 2.0

    Contents

  12. heart-diseases

    • kaggle.com
    zip
    Updated Nov 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esha Asif005 (2024). heart-diseases [Dataset]. https://www.kaggle.com/datasets/eshaasif005/heart-diseases
    Explore at:
    zip(6342 bytes)Available download formats
    Dataset updated
    Nov 4, 2024
    Authors
    Esha Asif005
    Description

    Dataset

    This dataset was created by Esha Asif005

    Contents

  13. f

    DataSheet_3_Pro-Inflammatory Derangement of the Immuno-Interactome in Heart...

    • frontiersin.figshare.com
    txt
    Updated Jun 14, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pavanish Kumar; Amanda Lim; Su Li Poh; Sharifah Nur Hazirah; Camillus Jian Hui Chua; Nursyuhadah Binte Sutamam; Thaschawee Arkachaisri; Joo Guan Yeo; Theo Kofidis; Vitaly Sorokin; Carolyn S. P. Lam; Arthur Mark Richards; Salvatore Albani (2023). DataSheet_3_Pro-Inflammatory Derangement of the Immuno-Interactome in Heart Failure.csv [Dataset]. http://doi.org/10.3389/fimmu.2022.817514.s003
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 14, 2023
    Dataset provided by
    Frontiers
    Authors
    Pavanish Kumar; Amanda Lim; Su Li Poh; Sharifah Nur Hazirah; Camillus Jian Hui Chua; Nursyuhadah Binte Sutamam; Thaschawee Arkachaisri; Joo Guan Yeo; Theo Kofidis; Vitaly Sorokin; Carolyn S. P. Lam; Arthur Mark Richards; Salvatore Albani
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Chronic heart failure (HF) is a syndrome of heterogeneous etiology associated with multiple co-morbidities. Inflammation is increasingly recognized as a key contributor to the pathophysiology of HF. Heterogeneity and lack of data on the immune mechanism(s) contributing to HF may partially underlie the failure of clinical trials targeting inflammatory mediators. We studied the Immunome in HF cohort using mass cytometry and used data-driven systems immunology approach to discover and characterize modulated immune cell subsets from peripheral blood. We showed cytotoxic and inflammatory innate lymphoid and myeloid cells were expanded in HF patients compared to healthy controls. Network analysis showed highly modular and centralized immune cell architecture in healthy control immune cell network. In contrast, the HF immune cell network showed greater inter-cellular communication and less modular structure. Furthermore, we found, as an immune mechanism specific to HF with preserved ejection fraction (HFpEF), an increase in inflammatory MAIT and CD4 T cell subsets.

  14. ECG in High Intensity Exercise Dataset

    • zenodo.org
    • opendatalab.com
    • +3more
    zip
    Updated Dec 26, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elisabetta De Giovanni; Elisabetta De Giovanni; Tomas Teijeiro; Tomas Teijeiro; David Meier; Grégoire Millet; Grégoire Millet; David Atienza; David Atienza; David Meier (2021). ECG in High Intensity Exercise Dataset [Dataset]. http://doi.org/10.5281/zenodo.5727800
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 26, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Elisabetta De Giovanni; Elisabetta De Giovanni; Tomas Teijeiro; Tomas Teijeiro; David Meier; Grégoire Millet; Grégoire Millet; David Atienza; David Atienza; David Meier
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data presented here was extracted from a larger dataset collected through a collaboration between the Embedded Systems Laboratory (ESL) of the Swiss Federal Institute of Technology in Lausanne (EPFL), Switzerland and the Institute of Sports Sciences of the University of Lausanne (ISSUL). In this dataset, we report the extracted segments used for an analysis of R peak detection algorithms during high intensity exercise.

    Protocol of the experiments
    The protocol of the experiment was the following.

    • 22 subjects performing a cardio-pulmonary maximal exercise test on a cycle ergometer, using a gas mask. A single-lead electrocardiogram (ECG) was measured using the BIOPAC system.
    • An initial 3 min of rest were recorded.
    • After this baseline, the subjects started cycling at a power of 60W or 90W depending on their fitness level.
    • Then, the power of the cycle ergometer was increased by 30W every 3 min till exhaustion (in terms of maximum oxygen uptake or VO2max).
    • Finally, physiology experts assessed the so-called ventilatory thresholds and the VO2max based on the pulmonary data (volume of oxygen and CO2).

    Description of the extracted dataset

    The characteristics of the dataset are the following:

    • We report only 20 out of 22 subjects that were used for the analysis, because for two subjects the signals were too corrupted or not complete. Specifically, subjects 5 and 12 were discarded.
    • The ECG signal was sampled at 500 Hz and then downsampled at 250 Hz. The original ECG signal were measured at maximum 10 mV. Then, they were scaled down by a factor of 1000, hence the data is represented in uV.
    • For each subject, 5 segments of 20 s were extracted from the ECG recordings and chosen based on different phases of the maximal exercise test (i.e., before and after the so-called second ventilatory threshold or VT2, before and in the middle of VO2max, and during the recovery after exhaustion) to represent different intensities of physical activity.

    seg1 --> [VT2-50,VT2-30]
    seg2 --> [VT2+60,VT2+80]
    seg3 --> [VO2max-50,VO2max-30]
    seg4 --> [VO2max-10,VO2max+10]
    seg5 --> [VO2max+60,VO2max+80]

    • The R peak locations were manually annotated in all segments and reviewed by a physician of the Lausanne University Hospital, CHUV. Only segment 5 of subject 9 could not be annotated since there was a problem with the input signal. So, the total number of segments extracted were 20 * 5 - 1 = 99.

    Format of the extracted dataset

    The dataset is divided in two main folders:

    • The folder `ecg_segments/` contains the ECG signals saved in two formats, `.csv` and `.mat`. This folder includes both raw (`ecg_raw`) and processed (`ecg`) signals. The processing consists of a morphological filtering and a relative energy non filtering method to enhance the R peaks. The `.csv` files contain only the signal, while the `.mat` files include the signal, the time vector within the maximal stress test, the sampling frequency and the unit of the signal amplitude (uV, as we mentioned before).
    • The folder `manual_annotations/` contains the sample indices of the annotated R peaks in `.csv` format. The annotation was done on the processed signals.
  15. Z

    Virtual cohort of 1000 synthetic heart meshes from adult human healthy...

    • data.niaid.nih.gov
    • zenodo.org
    Updated May 11, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Longobardi, Stefano (2022). Virtual cohort of 1000 synthetic heart meshes from adult human healthy population [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4506929
    Explore at:
    Dataset updated
    May 11, 2022
    Dataset provided by
    Rodero, Cristobal
    Vigmond, Edward J.
    Longobardi, Stefano
    NIederer, Steven
    Plank, Gernot
    Gillette, Karli
    Whitaker, John
    Augustin, Christoph
    Strocchi, Marina
    Lamata, Pablo
    Marciniak, Maciej
    O'Neill, Mark D.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Description: We present a database of four-chamber heart models derived from a statistical shape model (SSM) suitable for electro-mechanical (EM) simulations. Our database consists of 1000 four-chamber heart models generated from end-diastolic CT-derived meshes (available in the repository called ("Virtual cohort of adult healthy four-chamber heart meshes from CT images"). These meshes were used for EM simulations. The weights of the SSM are also provided.

    Cardiac meshes: To build the SSM, we rigidly aligned the CT cohort and extracted the surfaces, representing them asdeRham currents. The registration between meshes and computation of the average shape was done using a Large Deformation Diffeomorphic Metric Mapping method. The deformation functions depend on a set of uniformly distributed control points in which the shapes are embedded, and on the deformation vectors attached to these points. It is in this spatial field of deformation vectors (one per each control point) where the Principal Component Analysis (PCA) is applied. Case #20 of the CT cohort was not included. More information on the details can be found in Supplement 3 of the reference paper. We created this cohort by modifying the weight of the modes explaining 90%of the variance in shape (corresponding to modes 1 to 9) within 2 standard deviations (SD) of each mode added to the average mesh. The elements of all the meshes are labelled as follows:

    Left ventricle myocardium

    Right ventricle myocardium

    Left atrium myocardium

    Right atrium myocardium

    Aorta wall

    Pulmonary artery wall

    Mitral valve plane

    Tricuspid valve plane

    Aortic valve plane

    Pulmonary valve plane

    Left atrium appendage "inlet"

    Left superior pulmonary vein inlet

    Left inferior pulmonary vein inlet

    Right inferior pulmonary vein inlet

    Right superior pulmonary vein inlet

    Superior vena cava inlet

    Inferior vena cava inlet

    Left atrial appendage border

    Right inferior pulmonary vein border

    Left inferior pulmonary vein border

    Left superior pulmonary vein border

    Right superior pulmonary vein border

    Superior vena cava border

    Inferior vena cava border

    Each zipped folder contains 25 meshes and the weights of modes used to construct them for each mesh, A VTK file for each mesh (in ASCII) contains an UNSTRUCTURED GRID with the following fields:

    POINTS, with the coordinates of the points in mm.

    CELL_TYPES, having all of the points the value 10 since they are tetrahedra.

    CELLS, with the indices of the vertices of every element.

    CELL_DATA, corresponding to the meshing tags.

    In addition, three descriptive files are included:

    Normalized_explained_variance.csv contains the percentages of variance explained by each of the 18 modes generated from PCA.

    Mode_standard_deviation.csv contains absolute standard deviations of each of the 18 modes.

    Eigenvectors.csv contains the directions of maximum shape variability within the shape population.

  16. vinbigdata Detect the heart and lungs csv

    • kaggle.com
    zip
    Updated Mar 9, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alien (2021). vinbigdata Detect the heart and lungs csv [Dataset]. https://www.kaggle.com/h053473666/vinbigdata-detect-the-heart-and-lungs-csv
    Explore at:
    zip(249385 bytes)Available download formats
    Dataset updated
    Mar 9, 2021
    Authors
    Alien
    Description

    Dataset

    This dataset was created by Alien

    Contents

  17. Heart Disease Health Indicators Dataset

    • kaggle.com
    Updated Mar 10, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alex Teboul (2022). Heart Disease Health Indicators Dataset [Dataset]. https://www.kaggle.com/datasets/alexteboul/heart-disease-health-indicators-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 10, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Alex Teboul
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Heart Disease is among the most prevalent chronic diseases in the United States, impacting millions of Americans each year and exerting a significant financial burden on the economy. In the United States alone, heart disease claims roughly 647,000 lives each year — making it the leading cause of death. The buildup of plaques inside larger coronary arteries, molecular changes associated with aging, chronic inflammation, high blood pressure, and diabetes are all causes of and risk factors for heart disease.

    While there are different types of coronary heart disease, the majority of individuals only learn they have the disease following symptoms such as chest pain, a heart attack, or sudden cardiac arrest. This fact highlights the importance of preventative measures and tests that can accurately predict heart disease in the population prior to negative outcomes like myocardial infarctions (heart attacks) taking place.

    The Centers for Disease Control and Prevention has identified high blood pressure, high blood cholesterol, and smoking as three key risk factors for heart disease. Roughly half of Americans have at least one of these three risk factors. The National Heart, Lung, and Blood Institute highlights a wider array of factors such as Age, Environment and Occupation, Family History and Genetics, Lifestyle Habits, Other Medical Conditions, Race or Ethnicity, and Sex for clinicians to use in diagnosing coronary heart disease. Diagnosis tends to be driven by an initial survey of these common risk factors followed by bloodwork and other tests.

    Content

    The Behavioral Risk Factor Surveillance System (BRFSS) is a health-related telephone survey that is collected annually by the CDC. Each year, the survey collects responses from over 400,000 Americans on health-related risk behaviors, chronic health conditions, and the use of preventative services. It has been conducted every year since 1984. For this project, I downloaded a csv of the dataset available on Kaggle for the year 2015. This original dataset contains responses from 441,455 individuals and has 330 features. These features are either questions directly asked of participants, or calculated variables based on individual participant responses.

    This dataset contains 253,680 survey responses from cleaned BRFSS 2015 to be used primarily for the binary classification of heart disease. Not that there is strong class imbalance in this dataset. 229,787 respondents do not have/have not had heart disease while 23,893 have had heart disease. The question to be explored is:

    1. To what extend can survey responses from the BRFSS be used for predicting heart disease risk?

    and

    2. Can a subset of questions from the BRFSS be used for preventative health screening for diseases like heart disease?

    Acknowledgements

    It it important to reiterate that I did not create this dataset, it is just a cleaned and consolidated dataset created from the BRFSS 2015 dataset already on Kaggle. That dataset can be found here and the notebook I used for the data cleaning can be found here.

    Inspiration

    Let's build some predictive models for for heart disease.

  18. Z

    Interstage single ventricle heart disease infants show dysregulation in...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Frank, Benjamin (2024). Interstage single ventricle heart disease infants show dysregulation in multiple metabolic pathways: targeted metabolomics analysis - Data [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11999624
    Explore at:
    Dataset updated
    Jun 17, 2024
    Dataset provided by
    Frank, Benjamin
    Davidson, Jesse
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data in this Zenodo entry corresponds to the data used to produce the results in https://www.jacc.org/doi/full/10.1016/j.jacadv.2022.100169. The zipped folder contains three files

    Metabolite Data.csv - The meatobilte measurements for all the samples

    Clinical Data.csv - Values for the clinical variables

    Clinical Data Descriptions.csv - More in depth explanation of clinical variables as well as possible values of the variables

    This study was supported by the American Heart Association (AHA20CDA35310498 and AHA18IPA34170070) and the National Institutes of Health (NIH/NCATS Colorado CTSA, No. UL1 TR001082 and NIH/NHLBI K23HL12363

  19. Stroke Mortality Data Among US Adults (35+) by State/Territory and County –...

    • catalog.data.gov
    • data.virginia.gov
    • +1more
    Updated Feb 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2025). Stroke Mortality Data Among US Adults (35+) by State/Territory and County – 2016-2018 [Dataset]. https://catalog.data.gov/dataset/stroke-mortality-data-among-us-adults-35-by-state-territory-and-county-2016-2018-3aa59
    Explore at:
    Dataset updated
    Feb 13, 2025
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Description

    2016 to 2018, 3-year average. Rates are age-standardized. County rates are spatially smoothed. The data can be viewed by sex and race/ethnicity. Data source: National Vital Statistics System. Additional data, maps, and methodology can be viewed on the Interactive Atlas of Heart Disease and Stroke https://www.cdc.gov/heart-disease-stroke-atlas/about/index.html

  20. HRV-ACC: a dataset with R-R intervals and accelerometer data for the...

    • zenodo.org
    • data.niaid.nih.gov
    csv, txt, zip
    Updated Aug 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kamil Książek; Kamil Książek; Wilhelm Masarczyk; Wilhelm Masarczyk; Przemysław Głomb; Przemysław Głomb; Michał Romaszewski; Michał Romaszewski; Iga Stokłosa; Iga Stokłosa; Piotr Ścisło; Piotr Ścisło; Paweł Dębski; Paweł Dębski; Robert Pudlo; Robert Pudlo; Piotr Gorczyca; Piotr Gorczyca; Magdalena Piegza; Magdalena Piegza (2023). HRV-ACC: a dataset with R-R intervals and accelerometer data for the diagnosis of psychotic disorders using a Polar H10 wearable sensor [Dataset]. http://doi.org/10.5281/zenodo.8171266
    Explore at:
    txt, zip, csvAvailable download formats
    Dataset updated
    Aug 9, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Kamil Książek; Kamil Książek; Wilhelm Masarczyk; Wilhelm Masarczyk; Przemysław Głomb; Przemysław Głomb; Michał Romaszewski; Michał Romaszewski; Iga Stokłosa; Iga Stokłosa; Piotr Ścisło; Piotr Ścisło; Paweł Dębski; Paweł Dębski; Robert Pudlo; Robert Pudlo; Piotr Gorczyca; Piotr Gorczyca; Magdalena Piegza; Magdalena Piegza
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT

    The issue of diagnosing psychotic diseases, including schizophrenia and bipolar disorder, in particular, the objectification of symptom severity assessment, is still a problem requiring the attention of researchers. Two measures that can be helpful in patient diagnosis are heart rate variability calculated based on electrocardiographic signal and accelerometer mobility data. The following dataset contains data from 30 psychiatric ward patients having schizophrenia or bipolar disorder and 30 healthy persons. The duration of the measurements for individuals was usually between 1.5 and 2 hours. R-R intervals necessary for heart rate variability calculation were collected simultaneously with accelerometer data using a wearable Polar H10 device. The Positive and Negative Syndrome Scale (PANSS) test was performed for each patient participating in the experiment, and its results were attached to the dataset. Furthermore, the code for loading and preprocessing data, as well as for statistical analysis, was included on the corresponding GitHub repository.

    BACKGROUND

    Heart rate variability (HRV), calculated based on electrocardiographic (ECG) recordings of R-R intervals stemming from the heart's electrical activity, may be used as a biomarker of mental illnesses, including schizophrenia and bipolar disorder (BD) [Benjamin et al]. The variations of R-R interval values correspond to the heart's autonomic regulation changes [Berntson et al, Stogios et al]. Moreover, the HRV measure reflects the activity of the sympathetic and parasympathetic parts of the autonomous nervous system (ANS) [Task Force of the European Society of Cardiology the North American Society of Pacing Electrophysiology, Matusik et al]. Patients with psychotic mental disorders show a tendency for a change in the centrally regulated ANS balance in the direction of less dynamic changes in the ANS activity in response to different environmental conditions [Stogios et al]. Larger sympathetic activity relative to the parasympathetic one leads to lower HRV, while, on the other hand, higher parasympathetic activity translates to higher HRV. This loss of dynamic response may be an indicator of mental health. Additional benefits may come from measuring the daily activity of patients using accelerometry. This may be used to register periods of physical activity and inactivity or withdrawal for further correlation with HRV values recorded at the same time.

    EXPERIMENTS

    In our experiment, the participants were 30 psychiatric ward patients with schizophrenia or BD and 30 healthy people. All measurements were performed using a Polar H10 wearable device. The sensor collects ECG recordings and accelerometer data and, additionally, prepares a detection of R wave peaks. Participants of the experiment had to wear the sensor for a given time. Basically, it was between 1.5 and 2 hours, but the shortest recording was 70 minutes. During this time, evaluated persons could perform any activity a few minutes after starting the measurement. Participants were encouraged to undertake physical activity and, more specifically, to take a walk. Due to patients being in the medical ward, they received instruction to take a walk in the corridors at the beginning of the experiment. They were to repeat the walk 30 minutes and 1 hour after the first walk. The subsequent walks were to be slightly longer (about 3, 5 and 7 minutes, respectively). We did not remind or supervise the command during the experiment, both in the treatment and the control group. Seven persons from the control group did not receive this order and their measurements correspond to freely selected activities with rest periods but at least three of them performed physical activities during this time. Nevertheless, at the start of the experiment, all participants were requested to rest in a sitting position for 5 minutes. Moreover, for each patient, the disease severity was assessed using the PANSS test and its scores are attached to the dataset.

    The data from sensors were collected using Polar Sensor Logger application [Happonen]. Such extracted measurements were then preprocessed and analyzed using the code prepared by the authors of the experiment. It is publicly available on the GitHub repository [Książek et al].

    Firstly, we performed a manual artifact detection to remove abnormal heartbeats due to non-sinus beats and technical issues of the device (e.g. temporary disconnections and inappropriate electrode readings). We also performed anomaly detection using Daubechies wavelet transform. Nevertheless, the dataset includes raw data, while a full code necessary to reproduce our anomaly detection approach is available in the repository. Optionally, it is also possible to perform cubic spline data interpolation. After that step, rolling windows of a particular size and time intervals between them are created. Then, a statistical analysis is prepared, e.g. mean HRV calculation using the RMSSD (Root Mean Square of Successive Differences) approach, measuring a relationship between mean HRV and PANSS scores, mobility coefficient calculation based on accelerometer data and verification of dependencies between HRV and mobility scores.

    DATA DESCRIPTION

    The structure of the dataset is as follows. One folder, called HRV_anonymized_data contains values of R-R intervals together with timestamps for each experiment participant. The data was properly anonymized, i.e. the day of the measurement was removed to prevent person identification. Files concerned with patients have the name treatment_X.csv, where X is the number of the person, while files related to the healthy controls are named control_Y.csv, where Y is the identification number of the person. Furthermore, for visualization purposes, an image of the raw RR intervals for each participant is presented. Its name is raw_RR_{control,treatment}_N.png, where N is the number of the person from the control/treatment group. The collected data are raw, i.e. before the anomaly removal. The code enabling reproducing the anomaly detection stage and removing suspicious heartbeats is publicly available in the repository [Książek et al]. The structure of consecutive files collecting R-R intervals is following:

    Phone timestampRR-interval [ms]
    12:43:26.538000651
    12:43:27.189000632
    12:43:27.821000618
    12:43:28.439000621
    12:43:29.060000661
    ......

    The first column contains the timestamp for which the distance between two consecutive R peaks was registered. The corresponding R-R interval is presented in the second column of the file and is expressed in milliseconds.
    The second folder, called accelerometer_anonymized_data contains values of accelerometer data collected at the same time as R-R intervals. The naming convention is similar to that of the R-R interval data: treatment_X.csv and control_X.csv represent the data coming from the persons from the treatment and control group, respectively, while X is the identification number of the selected participant. The numbers are exactly the same as for R-R intervals. The structure of the files with accelerometer recordings is as follows:

    Phone timestampX [mg]Y [mg]Z [mg]
    13:00:17.196000-961-23182
    13:00:17.205000-965-21181
    13:00:17.215000-966-22187
    13:00:17.225000-967-26193
    13:00:17.235000-965-27191
    ............

    The first column contains a timestamp, while the next three columns correspond to the currently registered acceleration in three axes: X, Y and Z, in milli-g unit.

    We also attached a file with the PANSS test scores (PANSS.csv) for all patients participating in the measurement. The structure of this file is as follows:

    no_of_personPANSS_PPANSS_NPANSS_GPANSS_total
    18132243
    21171836
    314304488
    418132758
    ..............


    The first column contains the identification number of the patient, while the three following columns refer to the PANSS scores related to positive, negative and general symptoms, respectively.

    USAGE NOTES

    All the files necessary to run the HRV and/or accelerometer data analysis are available on the GitHub repository [Książek et al]. HRV data loading, preprocessing (i.e. anomaly detection and removal), as well as the

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
LIU Meiting Avril (2024). heart.csv.xls [Dataset]. https://www.kaggle.com/liumeitingavril/heart-csv-xls
Organization logo

heart.csv.xls

Explore at:
zip(3478 bytes)Available download formats
Dataset updated
Dec 20, 2024
Authors
LIU Meiting Avril
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Dataset

This dataset was created by LIU Meiting Avril

Released under Apache 2.0

Contents

Search
Clear search
Close search
Google apps
Main menu