100+ datasets found
  1. p

    MIMIC-IV

    • physionet.org
    Updated Oct 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alistair Johnson; Lucas Bulgarelli; Tom Pollard; Brian Gow; Benjamin Moody; Steven Horng; Leo Anthony Celi; Roger Mark (2024). MIMIC-IV [Dataset]. http://doi.org/10.13026/kpb9-mt58
    Explore at:
    Dataset updated
    Oct 11, 2024
    Authors
    Alistair Johnson; Lucas Bulgarelli; Tom Pollard; Brian Gow; Benjamin Moody; Steven Horng; Leo Anthony Celi; Roger Mark
    License

    https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts

    Description

    Retrospectively collected medical data has the opportunity to improve patient care through knowledge discovery and algorithm development. Broad reuse of medical data is desirable for the greatest public good, but data sharing must be done in a manner which protects patient privacy. Here we present Medical Information Mart for Intensive Care (MIMIC)-IV, a large deidentified dataset of patients admitted to the emergency department or an intensive care unit at the Beth Israel Deaconess Medical Center in Boston, MA. MIMIC-IV contains data for over 65,000 patients admitted to an ICU and over 200,000 patients admitted to the emergency department. MIMIC-IV incorporates contemporary data and adopts a modular approach to data organization, highlighting data provenance and facilitating both individual and combined use of disparate data sources. MIMIC-IV is intended to carry on the success of MIMIC-III and support a broad set of applications within healthcare.

  2. p

    MIMIC-III Clinical Database

    • physionet.org
    Updated Sep 4, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alistair Johnson; Tom Pollard; Roger Mark (2016). MIMIC-III Clinical Database [Dataset]. http://doi.org/10.13026/C2XW26
    Explore at:
    Dataset updated
    Sep 4, 2016
    Authors
    Alistair Johnson; Tom Pollard; Roger Mark
    License

    https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts

    Description

    MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The database includes information such as demographics, vital sign measurements made at the bedside (~1 data point per hour), laboratory test results, procedures, medications, caregiver notes, imaging reports, and mortality (including post-hospital discharge).MIMIC supports a diverse range of analytic studies spanning epidemiology, clinical decision-rule improvement, and electronic tool development. It is notable for three factors: it is freely available to researchers worldwide; it encompasses a diverse and very large population of ICU patients; and it contains highly granular data, including vital signs, laboratory results, and medications.

  3. mimic-iii-clinical-database-demo-1.4

    • kaggle.com
    Updated Apr 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Montassar bellah (2025). mimic-iii-clinical-database-demo-1.4 [Dataset]. https://www.kaggle.com/datasets/montassarba/mimic-iii-clinical-database-demo-1-4
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Montassar bellah
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Abstract MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over 40,000 patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012 [1]. The MIMIC-III Clinical Database is available on PhysioNet (doi: 10.13026/C2XW26). Though deidentified, MIMIC-III contains detailed information regarding the care of real patients, and as such requires credentialing before access. To allow researchers to ascertain whether the database is suitable for their work, we have manually curated a demo subset, which contains information for 100 patients also present in the MIMIC-III Clinical Database. Notably, the demo dataset does not include free-text notes.

    Background In recent years there has been a concerted move towards the adoption of digital health record systems in hospitals. Despite this advance, interoperability of digital systems remains an open issue, leading to challenges in data integration. As a result, the potential that hospital data offers in terms of understanding and improving care is yet to be fully realized.

    MIMIC-III integrates deidentified, comprehensive clinical data of patients admitted to the Beth Israel Deaconess Medical Center in Boston, Massachusetts, and makes it widely accessible to researchers internationally under a data use agreement. The open nature of the data allows clinical studies to be reproduced and improved in ways that would not otherwise be possible.

    The MIMIC-III database was populated with data that had been acquired during routine hospital care, so there was no associated burden on caregivers and no interference with their workflow. For more information on the collection of the data, see the MIMIC-III Clinical Database page.

    Methods The demo dataset contains all intensive care unit (ICU) stays for 100 patients. These patients were selected randomly from the subset of patients in the dataset who eventually die. Consequently, all patients will have a date of death (DOD). However, patients do not necessarily die during an individual hospital admission or ICU stay.

    This project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.

    Data Description MIMIC-III is a relational database consisting of 26 tables. For a detailed description of the database structure, see the MIMIC-III Clinical Database page. The demo shares an identical schema, except all rows in the NOTEEVENTS table have been removed.

    The data files are distributed in comma separated value (CSV) format following the RFC 4180 standard. Notably, string fields which contain commas, newlines, and/or double quotes are encapsulated by double quotes ("). Actual double quotes in the data are escaped using an additional double quote. For example, the string she said "the patient was notified at 6pm" would be stored in the CSV as "she said ""the patient was notified at 6pm""". More detail is provided on the RFC 4180 description page: https://tools.ietf.org/html/rfc4180

    Usage Notes The MIMIC-III demo provides researchers with an opportunity to review the structure and content of MIMIC-III before deciding whether or not to carry out an analysis on the full dataset.

    CSV files can be opened natively using any text editor or spreadsheet program. However, some tables are large, and it may be preferable to navigate the data stored in a relational database. One alternative is to create an SQLite database using the CSV files. SQLite is a lightweight database format which stores all constituent tables in a single file, and SQLite databases interoperate well with a number software tools.

    DB Browser for SQLite is a high quality, visual, open source tool to create, design, and edit database files compatible with SQLite. We have found this tool to be useful for navigating SQLite files. Information regarding installation of the software and creation of the database can be found online: https://sqlitebrowser.org/

    Release Notes Release notes for the demo follow the release notes for the MIMIC-III database.

    Acknowledgements This research and development was supported by grants NIH-R01-EB017205, NIH-R01-EB001659, and NIH-R01-GM104987 from the National Institutes of Health. The authors would also like to thank Philips Healthcare and staff at the Beth Israel Deaconess Medical Center, Boston, for supporting database development, and Ken Pierce for providing ongoing support for the MIMIC research community.

    Conflicts of Interest The authors declare no competing financial interests.

    References Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Mo...

  4. o

    MIMIC-IV Clinical Database Demo

    • registry.opendata.aws
    • physionet.org
    Updated Nov 25, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PhysioNet (2024). MIMIC-IV Clinical Database Demo [Dataset]. https://registry.opendata.aws/mimic-iv-demo/
    Explore at:
    Dataset updated
    Nov 25, 2024
    Dataset provided by
    <a href="https://physionet.org/">PhysioNet</a>
    Description

    The Medical Information Mart for Intensive Care (MIMIC)-IV database is comprised of deidentified electronic health records for patients admitted to the Beth Israel Deaconess Medical Center. Access to MIMIC-IV is limited to credentialed users. Here, we have provided an openly-available demo of MIMIC-IV containing a subset of 100 patients. The dataset includes similar content to MIMIC-IV, but excludes free-text clinical notes. The demo may be useful for running workshops and for assessing whether the MIMIC-IV is appropriate for a study before making an access request.

  5. P

    MIMIC-IV v2.2 Dataset

    • paperswithcode.com
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). MIMIC-IV v2.2 Dataset [Dataset]. https://paperswithcode.com/dataset/mimic-iv-v2-2
    Explore at:
    Dataset updated
    Feb 24, 2025
    Description

    Retrospectively collected medical data has the opportunity to improve patient care through knowledge discovery and algorithm development. Broad reuse of medical data is desirable for the greatest public good, but data sharing must be done in a manner which protects patient privacy. The Medical Information Mart for Intensive Care (MIMIC)-III database provided critical care data for over 40,000 patients admitted to intensive care units at the Beth Israel Deaconess Medical Center (BIDMC). Importantly, MIMIC-III was deidentified, and patient identifiers were removed according to the Health Insurance Portability and Accountability Act (HIPAA) Safe Harbor provision. MIMIC-III has been integral in driving large amounts of research in clinical informatics, epidemiology, and machine learning. Here we present MIMIC-IV, an update to MIMIC-III, which incorporates contemporary data and improves on numerous aspects of MIMIC-III. MIMIC-IV adopts a modular approach to data organization, highlighting data provenance and facilitating both individual and combined use of disparate data sources. MIMIC-IV is intended to carry on the success of MIMIC-III and support a broad set of applications within healthcare.

  6. MIMIC_III_IPI - Discharge Summaries from MIMIC-III with Indirect Personal...

    • zenodo.org
    Updated Mar 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ibrahim Baroud; Ibrahim Baroud; Lisa Raithel; Lisa Raithel; Sebastian Möller; Sebastian Möller; Roland Roller; Roland Roller (2025). MIMIC_III_IPI - Discharge Summaries from MIMIC-III with Indirect Personal Identifiers Annotations [Dataset]. http://doi.org/10.5281/zenodo.15044596
    Explore at:
    Dataset updated
    Mar 19, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Ibrahim Baroud; Ibrahim Baroud; Lisa Raithel; Lisa Raithel; Sebastian Möller; Sebastian Möller; Roland Roller; Roland Roller
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    MIMIC_III_IPI - Discharge Summaries from Medical Information Mart for Intensive Care-III with Indirect Personal Identifiers Annotations

    The discharge summaries we use for demonstrating our Indirect Personal Identifiers (IPI) schema are randomly sampled from the Medical Information Mart for Intensive Care (MIMIC-III) dataset. MIMIC-III comprises health-related data from over 40,000 patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. Among other types of data, such as patient demographics, the database also includes various types of textual data, such as diagnostic reports and discharge summaries. We chose discharge summaries for our study, since these are richer in information than other notes in MIMIC-III. Details:

    • Johnson, A., Pollard, T., & Mark, R. (2016). MIMIC-III Clinical Database (version 1.4). PhysioNet. https://doi.org/10.13026/C2XW26.
    • Johnson, A. E., Pollard, T. J., Shen, L., Lehman, L. W., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific data, 3, 160035. https://doi.org/10.1038/sdata.2016.35

    This is the Discharge Summaries from MIMIC-III with Indirect Personal Identifiers Annotations as an external source of the paper accepted at the PrivateNLP workshop at NAACL 2025, a preprint can be found in:

    • Baroud, I., Raithel, L., Möller, S., & Roller, R. (2025). Beyond De-Identification: A Structured Approach for Defining and Detecting Indirect Identifiers in Medical Texts. arXiv preprint arXiv:2502.13342.

    This repository contains the annotations in a CSV file and the annotation guidelines document. Inspecting the exact annotation texts requires access to the MIMIC-III Clinical Database, see https://physionet.org/content/mimiciii/1.4/. Each row in the CSV file has an ID together with a list of the IPI annotated spans, each in the format {"start": ,"end": ,"label": }. The ID in the ipi_annotations.csv table corresponds to the same ROW_ID in the MIMIC-III NOTEEVENTS.csv table and can be used for merging the tables to inspect the original documents and reconstruct the annotations using the offsets.

    Please note that only authenticated users can request access to review and download the annotations and guidelines. If you encounter any issues, feel free to reach out to the contact person.

  7. d

    Data from: Assessing the use of HL7 FHIR for implementing the FAIR guiding...

    • search.dataone.org
    • data.niaid.nih.gov
    • +2more
    Updated Jan 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Philip van Damme; Matthias Löbe; Nirupama Benis; Nicolette de Keizer; Ronald Cornet (2024). Assessing the use of HL7 FHIR for implementing the FAIR guiding principles: A case study of the MIMIC-IV emergency department module [Dataset]. http://doi.org/10.5061/dryad.1jwstqk10
    Explore at:
    Dataset updated
    Jan 18, 2024
    Dataset provided by
    Dryad Digital Repository
    Authors
    Philip van Damme; Matthias Löbe; Nirupama Benis; Nicolette de Keizer; Ronald Cornet
    Time period covered
    Jan 1, 2023
    Description

    Objective To assess the use of Health Level Seven Fast Healthcare Interoperability Resources (FHIR®) for implementing the Findable, Accessible, Interoperable, and Reusable guiding principles for scientific data (FAIR). Additionally, present a list of FAIR implementation choices for supporting future FAIR implementations that use FHIR. Material and Methods A case study was conducted on the Medical Information Mart for Intensive Care-IV Emergency Department dataset (MIMIC-ED), a deidentified clinical dataset converted into FHIR. The FAIRness of this dataset was assessed using a set of common FAIR assessment indicators. Results The FHIR distribution of MIMIC-ED, comprising an implementation guide and demo data, was more FAIR compared to the non-FHIR distribution. The FAIRness score increased from 60 to 82 out of 95 points, a relative improvement of 37%. The most notable improvements were observed in interoperability, with a score increase from 5 to 19 out of 19 points, and reusability, wit..., The authors of the paper collected the dataset. , Microsoft Word (.docx files) or Microsoft Excel (.csv files) (Open-source alternatives: LibreOffice, OpenOffice) The data files (.csv) can also be opened using any text editor, R, etc., # FAIR Indicator Scores and Qualitative Comments

    This dataset belongs as supplementary material to the paper entitled "Assessing the Use of HL7 FHIR for Implementing the FAIR Guiding Principles: A Case Study of the MIMIC-IV Emergency Department Module".

    Description of the data and file structure

    This dataset describes the indicator scores and qualitative comments of the FAIR data assessment of the Medical Information Mart for Intensive Care (MIMIC)-IV Emergency Department Module. Two distributions of the Emergency Department module were assessed, the PhysioNet distribution and the Fast Healthcare Interoperability Resources (FHIR) distribution. This dataset consists of two files: (1) PhysioNet.csv containing the data of the PhysioNet distribution; and (2) FHIR.csv containing the data of the FHIR distribution. Both files share the same structure and fields.

    • Indicator ID: an ID corresponding to the IDs listed in Table 1 of the paper, which refer to a Research Data Alliance FAIR ...
  8. f

    Data Sheet 1_Association between statin administration and Clostridium...

    • frontiersin.figshare.com
    docx
    Updated Feb 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Renli Wang; Rongjun Liu; Hua Wang; Zhaojun Xu (2025). Data Sheet 1_Association between statin administration and Clostridium difficile-induced enteritis: a retrospective analysis of the MIMIC-IV database.docx [Dataset]. http://doi.org/10.3389/fphar.2025.1550378.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset provided by
    Frontiers
    Authors
    Renli Wang; Rongjun Liu; Hua Wang; Zhaojun Xu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundExisting research suggests that using statins may reduce the incidence of enteritis caused by C. difficile and improve the prognosis of patients. This study aimed to explore the relation between Clostridium difficile-induced enteritis (CDE) and statin use.MethodsData were collected from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) database. Multivariate logistic regression analysis was employed to assess the impact of statin use on CDE incidence in patients in intensive care units (ICUs) and its effect on in-hospital mortality among them. The research findings were validated by performing propensity score matching (PSM), inverse probability of treatment weighting (IPTW), and subgroup analyses.ResultsThe study enrolled the data of 51,978 individuals to assess the effect of statin usage on the occurrence of CDE in patients admitted to the ICU. The results indicate that statins can decrease the prevalence of CDE in patients in ICU (odds ratio (OR): 0.758, 95% confidence interval (CI): 0.666–0.873, P < 0.05), which was further confirmed through PSM (OR: 0.760, 95% CI: 0.661–0.873, P < 0.05) and IPTW (OR: 0.818, 95% CI: 0.754–0.888, P < 0.05) analyses. For most subgroups, statins’ favorable effect in reducing CDE remained constant. A total of 1,208 patients were included in the study to evaluate whether statins could lower the risk of death in patients in ICU with enteritis caused by C. difficile. Statins did not reduce in-hospital mortality of patients in ICU with CDE (OR: 0.911, 95% CI: 0.667–1.235, P = 0.553). The results were validated following PSM (OR: 0.877, 95% CI: 0.599–1.282, P = 0.499) and IPTW (OR: 0.781, 95% CI: 0.632–1.062, P = 0.071) analyses, and all subgroups demonstrated consistent results.ConclusionStatin administration can reduce the incidence of CDE in patients in the ICU; however, it does not decrease the in-hospital mortality rate for individuals with CDE.

  9. P

    Clinical Admission Notes from MIMIC-III Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated Feb 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Betty van Aken; Jens-Michalis Papaioannou; Manuel Mayrdorfer; Klemens Budde; Felix A. Gers; Alexander Löser (2021). Clinical Admission Notes from MIMIC-III Dataset [Dataset]. https://paperswithcode.com/dataset/hospital-admission-notes-from-mimic-iii
    Explore at:
    Dataset updated
    Feb 7, 2021
    Authors
    Betty van Aken; Jens-Michalis Papaioannou; Manuel Mayrdorfer; Klemens Budde; Felix A. Gers; Alexander Löser
    Description

    This dataset is created from MIMIC-III (Medical Information Mart for Intensive Care III) and contains simulated patient admission notes. The clinical notes contain information about a patient at admission time to the ICU and are labelled for four outcome prediction tasks: Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay.

    To obtain the data one first has to gain access to the MIMIC-III dataset and then run the scripts introduced in the linked repository.

  10. m

    MIMIC Research

    • data.mendeley.com
    Updated Sep 28, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Olivia Bernas (2017). MIMIC Research [Dataset]. http://doi.org/10.17632/3jbxrzrrsv.1
    Explore at:
    Dataset updated
    Sep 28, 2017
    Authors
    Olivia Bernas
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Published studies using the MIMIC database

  11. o

    Data from: MIMIC-IV-ECG: Diagnostic Electrocardiogram Matched Subset

    • registry.opendata.aws
    • physionet.org
    Updated Dec 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PhysioNet (2024). MIMIC-IV-ECG: Diagnostic Electrocardiogram Matched Subset [Dataset]. https://registry.opendata.aws/mimic-iv-ecg/
    Explore at:
    Dataset updated
    Dec 19, 2024
    Dataset provided by
    <a href="https://physionet.org/">PhysioNet</a>
    Description

    The MIMIC-IV-ECG module contains approximately 800,000 diagnostic electrocardiograms across nearly 160,000 unique patients. These diagnostic ECGs use 12 leads and are 10 seconds in length. They are sampled at 500 Hz. This subset contains all of the ECGs for patients who appear in the MIMIC-IV Clinical Database. When a cardiologist report is available for a given ECG, we provide the needed information to link the waveform to the report. The patients in MIMIC-IV-ECG have been matched against the MIMIC-IV Clinical Database, making it possible to link to information across the MIMIC-IV modules.

  12. p

    MIMIC-II Clinical Database

    • physionet.org
    Updated Apr 24, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammed Saeed; Mauricio Villarroel; Andrew Reisner; Gari Clifford; Li-wei Lehman; George Moody; Thomas Heldt; Tin Kyaw; Benjamin Moody; Roger Mark (2011). MIMIC-II Clinical Database [Dataset]. http://doi.org/10.13026/fxn0-mk84
    Explore at:
    Dataset updated
    Apr 24, 2011
    Authors
    Mohammed Saeed; Mauricio Villarroel; Andrew Reisner; Gari Clifford; Li-wei Lehman; George Moody; Thomas Heldt; Tin Kyaw; Benjamin Moody; Roger Mark
    License

    https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts

    Description

    MIMIC-II documents a diverse and large population of intensive care unit patient stays and contains comprehensive and detailed clinical data, including physiological waveforms and minute-by-minute trends for a subset of records. It establishes a unique public-access resource for critical care research, supporting a diverse range of analytic studies spanning epidemiology, clinical decision-rule development, and electronic tool development. The MIMIC-II Clinical Database, although de-identified, still contains detailed information regarding the clinical care of patients, and must be treated with appropriate care and respect.

  13. Atrial Fibrillation annotations of electrocardiogram from MIMIC III matched...

    • figshare.com
    xlsx
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Syed Khairul Bashar (2023). Atrial Fibrillation annotations of electrocardiogram from MIMIC III matched subset [Dataset]. http://doi.org/10.6084/m9.figshare.12149091.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Authors
    Syed Khairul Bashar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We provide some annotations of the Medical Information Mart for Intensive Care (MIMIC) III waveform database matched Subset. The annotations are for the electrocardiogram recordings and denote atrial fibrillation status.More annotations will be added in future.Details about MIMIC III matched subset can be found at Physionet.https://archive.physionet.org/physiobank/database/mimic3wdb/matched/If you use the annotations, please cite the following paper:Bashar, S.K., Ding, E., Walkey, A.J., McManus, D.D. and Chon, K.H., 2019. Noise Detection in Electrocardiogram Signals for Intensive Care Unit Patients. IEEE Access, 7, pp.88357-88368

  14. Data from: Assessment of Non-Invasive Blood Pressure Prediction from PPG and...

    • zenodo.org
    • explore.openaire.eu
    bin
    Updated Oct 22, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fabian Schrumpf; Patrick Frenzel; Christoph Aust; Georg Osterhoff; Mirco Fuch; Fabian Schrumpf; Patrick Frenzel; Christoph Aust; Georg Osterhoff; Mirco Fuch (2021). Assessment of Non-Invasive Blood Pressure Prediction from PPG and rPPG Signals Using Deep Learning [Dataset]. http://doi.org/10.5281/zenodo.5590603
    Explore at:
    binAvailable download formats
    Dataset updated
    Oct 22, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Fabian Schrumpf; Patrick Frenzel; Christoph Aust; Georg Osterhoff; Mirco Fuch; Fabian Schrumpf; Patrick Frenzel; Christoph Aust; Georg Osterhoff; Mirco Fuch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is a subset of the MIMIC-III dataset used for non-invasive blood pressure prediction. PPG and ABP data were divided into windows of 7s length (875 data points). Systolic and diastolic blood pressure values were derived from the ABP windows. Each sample of the dataset consists of a PPG signal and blood pressure values as well as a unique subject identifier. The file consists of three datasets:

    • PPG: PPG data of size 905,400 x 875
    • label: BP data of size 905,400 x 2
    • subject_idx: subject affiliation of each sample (size 905,400 x 1)

    Furthermore, this submission contains the following models:

    • AlexNet
    • ResNet50
    • LSTM
    • Architecture published by Slapnicar et al. 2019

    The architectures were trained using a non-mixed dataset derived from the MIMIC-III waveform database. Samples were divided between training, validation and test set based on their subject affiliation preventing contamination of validation and test sets with samples from subjects used for training.

  15. f

    Table_1_Machine Learning Approach to Predict Positive Screening of...

    • frontiersin.figshare.com
    • figshare.com
    docx
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yohei Hirano; Keito Shinmoto; Yohei Okada; Kazuhiro Suga; Jeffrey Bombard; Shogo Murahata; Manoj Shrestha; Patrick Ocheja; Aiko Tanaka (2023). Table_1_Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database.DOCX [Dataset]. http://doi.org/10.3389/fmed.2021.694520.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    Frontiers
    Authors
    Yohei Hirano; Keito Shinmoto; Yohei Okada; Kazuhiro Suga; Jeffrey Bombard; Shogo Murahata; Manoj Shrestha; Patrick Ocheja; Aiko Tanaka
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Background: Mechanically ventilated patients are susceptible to nosocomial infections such as ventilator-associated pneumonia. To treat ventilated patients with suspected infection, clinicians select appropriate antibiotics. However, decision-making regarding the use of antibiotics for methicillin-resistant Staphylococcus aureus (MRSA) is challenging, because of the lack of evidence-supported criteria. This study aims to derive a machine learning model to predict MRSA as a possible pathogen responsible for infection in mechanically ventilated patients.Methods: Data were collected from the Medical Information Mart for Intensive Care (MIMIC)-IV database (an openly available database of patients treated at the Beth Israel Deaconess Medical Center in the period 2008–2019). Of 26,409 mechanically ventilated patients, 809 were screened for MRSA during the mechanical ventilation period and included in the study. The outcome was positivity to MRSA on screening, which was highly imbalanced in the dataset, with 93.9% positive outcomes. Therefore, after dividing the dataset into a training set (n = 566) and a test set (n = 243) for validation by stratified random sampling with a 7:3 allocation ratio, synthetic datasets with 50% positive outcomes were created by synthetic minority over-sampling for both sets individually (synthetic training set: n = 1,064; synthetic test set: n = 456). Using these synthetic datasets, we trained and validated an XGBoost machine learning model using 28 predictor variables for outcome prediction. Model performance was evaluated by area under the receiver operating characteristic (AUROC), sensitivity, specificity, and other statistical measurements. Feature importance was computed by the Gini method.Results: In validation, the XGBoost model demonstrated reliable outcome prediction with an AUROC value of 0.89 [95% confidence interval (CI): 0.83–0.95]. The model showed a high sensitivity of 0.98 [CI: 0.95–0.99], but a low specificity of 0.47 [CI: 0.41–0.54] and a positive predictive value of 0.65 [CI: 0.62–0.68]. Important predictor variables included admission from the emergency department, insertion of arterial lines, prior quinolone use, hemodialysis, and admission to a surgical intensive care unit.Conclusions: We were able to develop an effective machine learning model to predict positive MRSA screening during mechanical ventilation using synthetic datasets, thus encouraging further research to develop a clinically relevant machine learning model for antibiotics stewardship.

  16. S

    EHR data from MIMIC-III

    • scidb.cn
    Updated Aug 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tingyi Wanyan; Hossein Honarvar; Ariful Azad; Ying Ding; Benjamin S. Glicksberg (2021). EHR data from MIMIC-III [Dataset]. http://doi.org/10.11922/sciencedb.j00104.00094
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 24, 2021
    Dataset provided by
    Science Data Bank
    Authors
    Tingyi Wanyan; Hossein Honarvar; Ariful Azad; Ying Ding; Benjamin S. Glicksberg
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We conducted our experiments on de-identified EHR data from MIMIC-III. This data set contains various clinical data relating to patient admission to ICU, such as disease diagnoses in the form of International Classification of Diseases (ICD)-9 codes, and lab test results as detailed in Supplementary Materials. We collected data for 5,956 patients, extracting lab tests every hour from admission. There are a total of 409 unique lab tests and 3,387 unique disease diagnoses observed. The diagnoses were obtained as ICD-9 codes and they were represented using one-hot encoding where one represents patients with disease and zero indicates those without. We binned the lab test events into 6, 12, 24, and 48 hours prior to patient death or discharge from ICU. From these data, we performed mortality predictions that are 10-fold, cross validated.

  17. P

    MIMIC-IV ICD-10 Dataset

    • paperswithcode.com
    Updated Apr 20, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joakim Edin; Alexander Junge; Jakob D. Havtorn; Lasse Borgholt; Maria Maistro; Tuukka Ruotsalo; Lars Maaløe (2023). MIMIC-IV ICD-10 Dataset [Dataset]. https://paperswithcode.com/dataset/mimic-iv-icd-10
    Explore at:
    Dataset updated
    Apr 20, 2023
    Authors
    Joakim Edin; Alexander Junge; Jakob D. Havtorn; Lasse Borgholt; Maria Maistro; Tuukka Ruotsalo; Lars Maaløe
    Description

    MIMIC-IV ICD-10 contains 122,279 discharge summaries—free-text medical documents—annotated with ICD-10 diagnosis and procedure codes. It contains data for patients admitted to the Beth Israel Deaconess Medical Center emergency department or ICU between 2008-2019. All codes with fewer than ten examples have been removed, and the train-val-test split was created using multi-label stratified sampling. The dataset is described further in Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study, and the code to use the dataset is found here.

    The dataset is intended for medical code prediction and was created using MIMIC-IV v2.2 and MIMIC-IV-NOTE v2.2. Using the two datasets requires a license obtained in Physionet; this can take a couple of days.

  18. P

    MIMIC-IV-Note Dataset

    • paperswithcode.com
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). MIMIC-IV-Note Dataset [Dataset]. https://paperswithcode.com/dataset/mimic-iv-note
    Explore at:
    Dataset updated
    Feb 24, 2025
    Description

    The advent of large, open access text databases has driven advances in state-of-the-art model performance in natural language processing (NLP). The relatively limited amount of clinical data available for NLP has been cited as a significant barrier to the field's progress. Here we describe MIMIC-IV-Note: a collection of deidentified free-text clinical notes for patients included in the MIMIC-IV clinical database. MIMIC-IV-Note contains 331,794 deidentified discharge summaries from 145,915 patients admitted to the hospital and emergency department at the Beth Israel Deaconess Medical Center in Boston, MA, USA. The database also contains 2,321,355 deidentified radiology reports for 237,427 patients. All notes have had protected health information removed in accordance with the Health Insurance Portability and Accountability Act (HIPAA) Safe Harbor provision. All notes are linkable to MIMIC-IV providing important context to the clinical data therein. The database is intended to stimulate research in clinical natural language processing and associated areas.

  19. d

    Data from: Prediction model of in-hospital mortality in intensive care unit...

    • search.dataone.org
    • zenodo.org
    • +1more
    Updated May 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jingmin Zhou; Fuhai Li; Yu Song; Mingqiang Fu; Xueting Han; Junbo Ge (2025). Prediction model of in-hospital mortality in intensive care unit patients with heart failure: machine learning-based, retrospective analysis of the MIMIC-III database [Dataset]. http://doi.org/10.5061/dryad.0p2ngf1zd
    Explore at:
    Dataset updated
    May 4, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Jingmin Zhou; Fuhai Li; Yu Song; Mingqiang Fu; Xueting Han; Junbo Ge
    Time period covered
    Jun 25, 2021
    Description

    Objective: The predictors of in-hospital mortality for intensive care units (ICU)-admitted HF patients remain poorly characterized.We aimed to develop and validate a prediction model for all-cause in-hospital mortality among ICU-admitted HF patients.

    Design: A retrospective cohort study.

    Setting and Participants: Data were extracted from the MIMIC-III database. Data on 1,177 heart failure patients were analysed.

    Methods: Patients meeting the inclusion criteria were identified from the MIMIC-III database and randomly divided into derivation and validation groups. Independent risk factors for in-hospital mortality were screened using XGBoost and LASSO regression models in the derivation sample. Multivariable logistic regression analysis was used to build prediction models. Discrimination, calibration, and clinical usefulness of the predicting model were assessed using the C-index, calibration plot, and decision curve analysis. After pairwise comparison, the best performing model ...

  20. p

    EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM)

    • physionet.org
    Updated Feb 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gloria Hyunjung Kwak; Dana Moukheiber; Mira Moukheiber; Lama Moukheiber; Sulaiman Moukheiber; Neel Butala; Leo Anthony Celi; Christina Chen (2024). EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM) [Dataset]. http://doi.org/10.13026/xhrz-ht59
    Explore at:
    Dataset updated
    Feb 23, 2024
    Authors
    Gloria Hyunjung Kwak; Dana Moukheiber; Mira Moukheiber; Lama Moukheiber; Sulaiman Moukheiber; Neel Butala; Leo Anthony Celi; Christina Chen
    License

    https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts

    Description

    The EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM) is a structured echocardiogram database derived from 43,472 observational notes obtained during echocardiogram studies conducted in the intensive care unit at the Beth Israel Deaconess Medical Center between 2001 and 2012. The database encompasses various aspects of cardiac structure and function, including cavity size, wall thickness, systolic and diastolic function, valve regurgitation and stenosis, as well as pulmonary pressures. To facilitate extensive data analysis, the clinical notes were transformed into a structured numerical format. Within each echocardiogram report sentence, specific words or phrases were identified to describe abnormal findings, and a severity staging system using numeric categories was established. This large publicly-accessible database of structured echocardiogram data holds significant potential as a tool to investigate cardiovascular disease in the intensive care unit.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Alistair Johnson; Lucas Bulgarelli; Tom Pollard; Brian Gow; Benjamin Moody; Steven Horng; Leo Anthony Celi; Roger Mark (2024). MIMIC-IV [Dataset]. http://doi.org/10.13026/kpb9-mt58

MIMIC-IV

Explore at:
Dataset updated
Oct 11, 2024
Authors
Alistair Johnson; Lucas Bulgarelli; Tom Pollard; Brian Gow; Benjamin Moody; Steven Horng; Leo Anthony Celi; Roger Mark
License

https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts

Description

Retrospectively collected medical data has the opportunity to improve patient care through knowledge discovery and algorithm development. Broad reuse of medical data is desirable for the greatest public good, but data sharing must be done in a manner which protects patient privacy. Here we present Medical Information Mart for Intensive Care (MIMIC)-IV, a large deidentified dataset of patients admitted to the emergency department or an intensive care unit at the Beth Israel Deaconess Medical Center in Boston, MA. MIMIC-IV contains data for over 65,000 patients admitted to an ICU and over 200,000 patients admitted to the emergency department. MIMIC-IV incorporates contemporary data and adopts a modular approach to data organization, highlighting data provenance and facilitating both individual and combined use of disparate data sources. MIMIC-IV is intended to carry on the success of MIMIC-III and support a broad set of applications within healthcare.

Search
Clear search
Close search
Google apps
Main menu