100+ datasets found
  1. Synthetic Healthcare Database for Research (SyH-DR)

    • catalog.data.gov
    • healthdata.gov
    • +2more
    Updated Sep 16, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agency for Healthcare Research and Quality (2023). Synthetic Healthcare Database for Research (SyH-DR) [Dataset]. https://catalog.data.gov/dataset/synthetic-healthcare-database-for-research-syh-dr
    Explore at:
    Dataset updated
    Sep 16, 2023
    Dataset provided by
    Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
    Description

    The Agency for Healthcare Research and Quality (AHRQ) created SyH-DR from eligibility and claims files for Medicare, Medicaid, and commercial insurance plans in calendar year 2016. SyH-DR contains data from a nationally representative sample of insured individuals for the 2016 calendar year. SyH-DR uses synthetic data elements at the claim level to resemble the marginal distribution of the original data elements. SyH-DR person-level data elements are not synthetic, but identifying information is aggregated or masked.

  2. G

    Open Database of Healthcare Facilities

    • open.canada.ca
    • catalogue.arctic-sdi.org
    csv, esri rest +4
    Updated Mar 2, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statistics Canada (2022). Open Database of Healthcare Facilities [Dataset]. https://open.canada.ca/data/en/dataset/a1bcd4ee-8e57-499b-9c6f-94f6902fdf32
    Explore at:
    fgdb/gdb, esri rest, csv, html, pdf, wmsAvailable download formats
    Dataset updated
    Mar 2, 2022
    Dataset provided by
    Statistics Canada
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Description

    The Open Database of Healthcare Facilities (ODHF) is a collection of open data containing the names, types, and locations of health facilities across Canada. It is released under the Open Government License - Canada. The ODHF compiles open, publicly available, and directly-provided data on health facilities across Canada. Data sources include regional health authorities, provincial, territorial and municipal governments, and public health and professional healthcare bodies. This database aims to provide enhanced access to a harmonized listing of health facilities across Canada by making them available as open data. This database is a component of the Linkable Open Data Environment (LODE).

  3. d

    Healthcare Professional Email List (1.2 million contacts) by Infotanks Media...

    • datarade.ai
    Updated Jun 22, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Infotanks Media (2021). Healthcare Professional Email List (1.2 million contacts) by Infotanks Media [Dataset]. https://datarade.ai/data-products/healthcare-professional-email-list-infotanks-media
    Explore at:
    Dataset updated
    Jun 22, 2021
    Dataset authored and provided by
    Infotanks Media
    Area covered
    Cabo Verde, Belgium, Brunei Darussalam, Bahrain, Seychelles, Honduras, Burundi, American Samoa, Bulgaria, Aruba
    Description

    Facilitate marketing campaigns with the healthcare email list from Infotanks Media, including doctors, healthcare professionals, NPI numbers, physician specialties, and more. Buy targeted email lists of healthcare professionals and connect with doctors, specialists, and other healthcare professionals to promote your products and services. Hyper personalize campaigns to increase engagement for better chances of conversion. Reach out to our data experts today! Access 1.2 million physician contact database with 150+ specialties, including chiropractors, cardiologists, psychiatrists, and radiologists, among others. Get ready to integrate healthcare email lists from Infotanks Media to start email marketing campaigns through CRM and ESP. Contact us right now! Ensure guaranteed lead generation with segmented email marketing strategies for specialists, departments, and more. Make the best use of target marketing to progress and move closer to your business goals with email listing services for healthcare professionals. Infotanks Media provides 100% verified healthcare email lists with the highest email deliverability guarantee of 95%. Get a custom quote today as per your requirements. Enhance your marketing campaigns with healthcare email lists from 170+ countries to build your global outreach. Request your free sample today! Personalize your business communication and interactions to maximize conversion rates with high-quality contact data. Grow your business network in your target markets from anywhere globally with a guaranteed 95% contact accuracy of the healthcare email lists from Infotanks Media. Contact data experts at Infotanks Media from the healthcare industry to get a quick sample for free. Please write to us or call today!

    Hyper target within and outside your desired markets with GDPR and CAN-SPAM compliant healthcare email lists that get integrated into your CRM and ESPs. Balance out the sales and marketing efforts by aligning goals using email lists from the healthcare industry. Build strong business relationships with potential clients through personalized campaigns. Call Infotanks Media for a free consultation. Explore new geographies and target markets with a focused approach using healthcare email lists. Align your sales teams and marketing teams through personalized email marketing campaigns to ensure they accomplish business goals together. Add value and grow revenue to take your business to the next level of success. Double up your business and revenue growth with email lists of healthcare professionals. Send segmented campaigns to monitor behaviors and understand the purchasing habits of your potential clients. Send follow-up nurturing email marketing campaigns to attract your potential clients to become converted customers. Close deals sooner with detailed information of your prospects using the healthcare email list from Infotanks Media. Reach healthcare professionals on their preferred platform of communication with the email list of healthcare professionals. Identify, capture, explore, and grow in your target markets anywhere globally with a fully verified, validated, and compliant email database of healthcare professionals. Move beyond the traditional approach and automate sales cycles with buying triggers sent through email marketing campaigns. Use the healthcare email list from Infotanks Media to engage with your targeted potential clients and get them to respond. Increase email marketing campaign response rate to convert better! Reach out to Infotanks Media to customize your healthcare email lists. Call today!

  4. mimic-iii-clinical-database-demo-1.4

    • kaggle.com
    zip
    Updated Apr 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Montassar bellah (2025). mimic-iii-clinical-database-demo-1.4 [Dataset]. https://www.kaggle.com/datasets/montassarba/mimic-iii-clinical-database-demo-1-4
    Explore at:
    zip(11100065 bytes)Available download formats
    Dataset updated
    Apr 1, 2025
    Authors
    Montassar bellah
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Abstract MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over 40,000 patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012 [1]. The MIMIC-III Clinical Database is available on PhysioNet (doi: 10.13026/C2XW26). Though deidentified, MIMIC-III contains detailed information regarding the care of real patients, and as such requires credentialing before access. To allow researchers to ascertain whether the database is suitable for their work, we have manually curated a demo subset, which contains information for 100 patients also present in the MIMIC-III Clinical Database. Notably, the demo dataset does not include free-text notes.

    Background In recent years there has been a concerted move towards the adoption of digital health record systems in hospitals. Despite this advance, interoperability of digital systems remains an open issue, leading to challenges in data integration. As a result, the potential that hospital data offers in terms of understanding and improving care is yet to be fully realized.

    MIMIC-III integrates deidentified, comprehensive clinical data of patients admitted to the Beth Israel Deaconess Medical Center in Boston, Massachusetts, and makes it widely accessible to researchers internationally under a data use agreement. The open nature of the data allows clinical studies to be reproduced and improved in ways that would not otherwise be possible.

    The MIMIC-III database was populated with data that had been acquired during routine hospital care, so there was no associated burden on caregivers and no interference with their workflow. For more information on the collection of the data, see the MIMIC-III Clinical Database page.

    Methods The demo dataset contains all intensive care unit (ICU) stays for 100 patients. These patients were selected randomly from the subset of patients in the dataset who eventually die. Consequently, all patients will have a date of death (DOD). However, patients do not necessarily die during an individual hospital admission or ICU stay.

    This project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.

    Data Description MIMIC-III is a relational database consisting of 26 tables. For a detailed description of the database structure, see the MIMIC-III Clinical Database page. The demo shares an identical schema, except all rows in the NOTEEVENTS table have been removed.

    The data files are distributed in comma separated value (CSV) format following the RFC 4180 standard. Notably, string fields which contain commas, newlines, and/or double quotes are encapsulated by double quotes ("). Actual double quotes in the data are escaped using an additional double quote. For example, the string she said "the patient was notified at 6pm" would be stored in the CSV as "she said ""the patient was notified at 6pm""". More detail is provided on the RFC 4180 description page: https://tools.ietf.org/html/rfc4180

    Usage Notes The MIMIC-III demo provides researchers with an opportunity to review the structure and content of MIMIC-III before deciding whether or not to carry out an analysis on the full dataset.

    CSV files can be opened natively using any text editor or spreadsheet program. However, some tables are large, and it may be preferable to navigate the data stored in a relational database. One alternative is to create an SQLite database using the CSV files. SQLite is a lightweight database format which stores all constituent tables in a single file, and SQLite databases interoperate well with a number software tools.

    DB Browser for SQLite is a high quality, visual, open source tool to create, design, and edit database files compatible with SQLite. We have found this tool to be useful for navigating SQLite files. Information regarding installation of the software and creation of the database can be found online: https://sqlitebrowser.org/

    Release Notes Release notes for the demo follow the release notes for the MIMIC-III database.

    Acknowledgements This research and development was supported by grants NIH-R01-EB017205, NIH-R01-EB001659, and NIH-R01-GM104987 from the National Institutes of Health. The authors would also like to thank Philips Healthcare and staff at the Beth Israel Deaconess Medical Center, Boston, for supporting database development, and Ken Pierce for providing ongoing support for the MIMIC research community.

    Conflicts of Interest The authors declare no competing financial interests.

    References Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Mo...

  5. HCUP Nationwide Ambulatory Surgery Sample (NASS) Database – Restricted...

    • catalog.data.gov
    • healthdata.gov
    • +1more
    Updated Jul 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agency for Healthcare Research and Quality, Department of Health & Human Services (2025). HCUP Nationwide Ambulatory Surgery Sample (NASS) Database – Restricted Access [Dataset]. https://catalog.data.gov/dataset/hcup-nationwide-ambulatory-surgery-sample-nass-database-restricted-access
    Explore at:
    Dataset updated
    Jul 16, 2025
    Description

    The largest all-payer ambulatory surgery database in the United States, the Healthcare Cost and Utilization Project (HCUP) Nationwide Ambulatory Surgery Sample (NASS) produces national estimates of major ambulatory surgery encounters in hospital-owned facilities. Major ambulatory surgeries are defined as selected major therapeutic procedures that require the use of an operating room, penetrate or break the skin, and involve regional anesthesia, general anesthesia, or sedation to control pain (i.e., surgeries flagged as "narrow" in the HCUP Surgery Flag Software). Unweighted, the NASS contains approximately 9.0 million ambulatory surgery encounters each year and approximately 11.8 million ambulatory surgery procedures. Weighted, it estimates approximately 11.9 million ambulatory surgery encounters and 15.7 million ambulatory surgery procedures. Sampled from the HCUP State Ambulatory Surgery and Services Databases (SASD) and State Emergency Department Databases (SEDD) in order to capture both planned and emergent major ambulatory surgeries, the NASS can be used to examine selected ambulatory surgery utilization patterns. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality, HCUP data inform decision making at the national, State, and community levels. The NASS contains clinical and resource-use information that is included in a typical hospital-owned facility record, including patient characteristics, clinical diagnostic and surgical procedure codes, disposition of patients, total charges, facility characteristics, and expected source of payment, regardless of payer, including patients covered by Medicaid, private insurance, and the uninsured. The NASS excludes data elements that could directly or indirectly identify individuals, hospitals, or states. The NASS is limited to encounters with at least one in-scope major ambulatory surgery on the record, performed at hospital-owned facilities. Procedures intended primarily for diagnostic purposes are not considered in-scope. Restricted access data files are available with a data use agreement and brief online security training.

  6. Hospital Database Management System SQL Project

    • kaggle.com
    zip
    Updated May 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrew Dolcimascolo-Garrett (2024). Hospital Database Management System SQL Project [Dataset]. https://www.kaggle.com/datasets/andrewdolcigarrett/hospital-database-management-system-sql-project
    Explore at:
    zip(1487278 bytes)Available download formats
    Dataset updated
    May 9, 2024
    Authors
    Andrew Dolcimascolo-Garrett
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Andrew Dolcimascolo-Garrett

    Released under MIT

    Contents

  7. HCUP Nationwide Inpatient Sample

    • datacatalog.med.nyu.edu
    Updated Nov 3, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States - Agency for Healthcare Research and Quality (AHRQ) (2022). HCUP Nationwide Inpatient Sample [Dataset]. https://datacatalog.med.nyu.edu/dataset/10012
    Explore at:
    Dataset updated
    Nov 3, 2022
    Dataset provided by
    Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
    Authors
    United States - Agency for Healthcare Research and Quality (AHRQ)
    Time period covered
    Jan 1, 1988 - Present
    Area covered
    Virginia, West Virginia, Pennsylvania, Georgia, D.C., Washington, Missouri, Washington (State), South Carolina, Oklahoma, New Mexico
    Description

    The Nationwide Inpatient Sample (NIS) is part of a family of databases and software tools developed for the Healthcare Cost and Utilization Project (HCUP). The NIS is the largest all-payer inpatient health care database in the United States, yielding national estimates of hospital inpatient stays. The NIS can be used to identify, track, and analyze national trends in health care utilization, access, charges, quality, and outcomes. Data may not be available for all states across all years.

  8. HCUP Nationwide Emergency Department Sample

    • datacatalog.med.nyu.edu
    Updated Nov 3, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States - Agency for Healthcare Research and Quality (AHRQ) (2022). HCUP Nationwide Emergency Department Sample [Dataset]. https://datacatalog.med.nyu.edu/dataset/10014
    Explore at:
    Dataset updated
    Nov 3, 2022
    Dataset provided by
    Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
    Authors
    United States - Agency for Healthcare Research and Quality (AHRQ)
    Time period covered
    Jan 1, 2006 - Present
    Area covered
    Texas, North Carolina, Nevada, Nebraska, Michigan, Missouri, Washington, D.C., Hawaii, Oregon, Georgia
    Description

    The Nationwide Emergency Department Sample (NEDS) is part of a family of databases and software tools developed for the Healthcare Cost and Utilization Project (HCUP). The NEDS is the largest all-payer emergency department (ED) database in the United States, yielding national estimates of hospital-based ED visits. The NEDS enables analyses of ED utilization patterns and supports public health professionals, administrators, policymakers, and clinicians in their decisionmaking regarding this critical source of care.

  9. HCUP National Inpatient Database

    • redivis.com
    • stanford.redivis.com
    application/jsonl +7
    Updated Sep 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2025). HCUP National Inpatient Database [Dataset]. http://doi.org/10.57761/gr09-hq95
    Explore at:
    application/jsonl, csv, avro, arrow, parquet, stata, sas, spssAvailable download formats
    Dataset updated
    Sep 27, 2025
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford Center for Population Health Sciences
    Time period covered
    Jan 1, 2000 - Dec 31, 2022
    Description

    Abstract

    The NIS is the largest publicly available all-payer inpatient healthcare database designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from around 7 million hospital stays each year. Weighted, it estimates around 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels.

    Its large sample size is ideal for developing national and regional estimates and enables analyses of rare conditions, uncommon treatments, and special populations.

    Usage

    %3Cu%3EDO NOT%3C/u%3E

    use this data without referring to the NIS Database Documentation, which includes:

    • Description of NIS Database
    • Restrictions on Use

    %3C!-- --%3E

    • Data Elements
    • Additional Resources for Data Elements
    • ICD-10-CM/PCS Data Included in the NIS Starting with 2015 (More details about this transition available here.)
    • Known Data Issues
    • NIS Supplemental Files
    • HCUP Tools: Labels and Formats
    • Obtaining HCUP Data

    %3C!-- --%3E

    Before Manuscript Submission

    %3Cu%3E%3Cstrong%3EAll manuscripts%3C/strong%3E%3C/u%3E

    (and other items you'd like to publish) %3Cu%3E%3Cstrong%3Emust be submitted to%3C/strong%3E%3C/u%3E

    %3Cu%3E%3Cstrong%3Ephsdatacore@stanford.edu%3C/strong%3E%3C/u%3E

    for approval prior to journal submission.

    We will check your cell sizes and citations.

    For more information about how to cite PHS and PHS datasets, please visit:

    https:/phsdocs.developerhub.io/need-help/citing-phs-data-core

    You must also %3Cu%3E%3Cstrong%3Emake sure that your work meets all of the AHRQ (data owner) requirements for publishing%3C/strong%3E%3C/u%3E

    with HCUP data--listed at https://hcup-us.ahrq.gov/db/nation/nis/nischecklist.jsp

    HCUP Online Tutorials

    For additional assistance, AHRQ has created the HCUP Online Tutorial Series, a series of free, interactive courses which provide training on technical methods for conducting research with HCUP data. Topics include an HCUP Overview Course and these tutorials:

    • The HCUP Sampling Design tutorial is designed to help users learn how to account for sample design in their work with HCUP national (nationwide) databases. • The Producing National HCUP Estimates tutorial is designed to help users understand how the three national (nationwide) databases – the NIS, Nationwide Emergency Department Sample (NEDS), and Kids' Inpatient Database (KID) – can be used to produce national and regional estimates. HCUP 2020 NIS (8/22/22) 14 Introduction • The Calculating Standard Errors tutorial shows how to accurately determine the precision of the estimates produced from the HCUP nationwide databases. Users will learn two methods for calculating standard errors for estimates produced from the HCUP national (nationwide) databases. • The HCUP Multi-year Analysis tutorial presents solutions that may be necessary when conducting analyses that span multiple years of HCUP data. • The HCUP Software Tools Tutorial provides instructions on how to apply the AHRQ software tools to HCUP or other administrative databases.

    New tutorials are added periodically, and existing tutorials are updated when necessary. The Online Tutorial Series is located on the HCUP-US website at https://hcup-us.ahrq.gov/tech_assist/tutorials.jsp

    Important notes about the 2015 data

    In 2015, AHRQ restructured the data as described here:

    https://hcup-us.ahrq.gov/db/nation/nis/2015HCUPNationalInpatientSample.pdf

    Some key points:

    • For the 2015 data, all diagnosis and procedure data elements, including any data elements derived from diagnoses and procedures, were moved out of the Core File and into the Diagnosis and Procedure Groups Files.
    • Prior to 2015, and for Q1-3 of 2015, the DX1-30 and PR1-15 variables (which use ICD-9 codes) variables were used, but starting in Q4 of 2015, the I10_DX1-30 and I10_PR1-I10-15 (which use ICD-10 codes) were used. The best way to identify discharges for quarter 1-3 or quarter 4 is based on the value of the diagnosis version (DXVER); For quarters 1-3, DXVER has a value of 9; while for quarter 4, DXVER has a value of 10.
    • Some other variables also transitioned in Q4 of 2015. Please refer to the link above for more details.
    • Starting in 2016, the diagnosis and procedure information returned to the Core file. Additional detai
  10. Nationwide Emergency Department Sample

    • datacatalog.library.wayne.edu
    • fedoratest.lib.wayne.edu
    Updated Apr 4, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Agency for Healthcare Research and Quality (AHRQ) (2018). Nationwide Emergency Department Sample [Dataset]. https://datacatalog.library.wayne.edu/dataset/nationwide-emergency-department-sample
    Explore at:
    Dataset updated
    Apr 4, 2018
    Dataset provided by
    Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
    Description

    The Nationwide Emergency Department Sample (NEDS) is part of a family of databases and software tools developed for the Healthcare Cost and Utilization Project (HCUP). NEDS is the largest all-payer emergency department (ED) database in the United States, yielding national estimates of hospital-based ED visits. One of the most distinctive features of the NEDS is its large sample size, which allows for analysis across hospital types and the study of relatively uncommon disorders and procedures.

  11. Database Creation Description and Data Dictionaries

    • figshare.com
    txt
    Updated Aug 11, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jordan Kempker; John David Ike (2016). Database Creation Description and Data Dictionaries [Dataset]. http://doi.org/10.6084/m9.figshare.3569067.v3
    Explore at:
    txtAvailable download formats
    Dataset updated
    Aug 11, 2016
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Jordan Kempker; John David Ike
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    There are several Microsoft Word documents here detailing data creation methods and with various dictionaries describing the included and derived variables.The Database Creation Description is meant to walk a user through some of the steps detailed in the SAS code with this project.The alphabetical list of variables is intended for users as sometimes this makes some coding steps easier to copy and paste from this list instead of retyping.The NIS Data Dictionary contains some general dataset description as well as each variable's responses.

  12. Comprehensive Medical Q&A Dataset

    • kaggle.com
    zip
    Updated Nov 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Comprehensive Medical Q&A Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/comprehensive-medical-q-a-dataset
    Explore at:
    zip(5126941 bytes)Available download formats
    Dataset updated
    Nov 24, 2023
    Authors
    The Devastator
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Comprehensive Medical Q&A Dataset

    Unlocking Healthcare Data with Natural Language Processing

    By Huggingface Hub [source]

    About this dataset

    The MedQuad dataset provides a comprehensive source of medical questions and answers for natural language processing. With over 43,000 patient inquiries from real-life situations categorized into 31 distinct types of questions, the dataset offers an invaluable opportunity to research correlations between treatments, chronic diseases, medical protocols and more. Answers provided in this database come not only from doctors but also other healthcare professionals such as nurses and pharmacists, providing a more complete array of responses to help researchers unlock deeper insights within the realm of healthcare. This incredible trove of knowledge is just waiting to be mined - so grab your data mining equipment and get exploring!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    In order to make the most out of this dataset, start by having a look at the column names and understanding what information they offer: qtype (the type of medical question), Question (the question in itself), and Answer (the expert response). The qtype column will help you categorize the dataset according to your desired question topics. Once you have filtered down your criteria as much as possible using qtype, it is time to analyze the data. Start by asking yourself questions such as “What treatments do most patients search for?” or “Are there any correlations between chronic conditions and protocols?” Then use simple queries such as SELECT Answer FROM MedQuad WHERE qtype='Treatment' AND Question LIKE '%pain%' to get closer to answering those questions.

    Once you have obtained new insights about healthcare based on the answers provided in this dynmaic data set - now it’s time for action! Use all that newfound understanding about patient needs in order develop educational materials and implement any suggested changes necessary. If more criteria are needed for querying this data set see if MedQuad offers additional columns; sometimes extra columns may be added periodically that could further enhance analysis capabilities; look out for notifications if these happen.

    Finally once making an impact with the use case(s) - don't forget proper citation etiquette; give credit where credit is due!

    Research Ideas

    • Developing medical diagnostic tools that use natural language processing (NLP) to better identify and diagnose health conditions in patients.
    • Creating predictive models to anticipate treatment options for different medical conditions using machine learning techniques.
    • Leveraging the dataset to build chatbots and virtual assistants that are able to answer a broad range of questions about healthcare with expert-level accuracy

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

    Columns

    File: train.csv | Column name | Description | |:--------------|:------------------------------------------------------| | qtype | The type of medical question. (String) | | Question | The medical question posed by the patient. (String) | | Answer | The expert response to the medical question. (String) |

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Huggingface Hub.

  13. Healthcare Management System

    • kaggle.com
    zip
    Updated Dec 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anouska Abhisikta (2023). Healthcare Management System [Dataset]. https://www.kaggle.com/datasets/anouskaabhisikta/healthcare-management-system
    Explore at:
    zip(74279 bytes)Available download formats
    Dataset updated
    Dec 23, 2023
    Authors
    Anouska Abhisikta
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Patients Table:

    • PatientID: Unique identifier for each patient.
    • firstname: First name of the patient.
    • lastname: Last name of the patient.
    • email: Email address of the patient.

    This table stores information about individual patients, including their names and contact details.

    Doctors Table:

    • DoctorID: Unique identifier for each doctor.
    • DoctorName: Full name of the doctor.
    • Specialization: Area of medical specialization.
    • DoctorContact: Contact details of the doctor.

    This table contains details about healthcare providers, including their names, specializations, and contact information.

    Appointments Table:

    • AppointmentID: Unique identifier for each appointment.
    • Date: Date of the appointment.
    • Time: Time of the appointment.
    • PatientID: Foreign key referencing the Patients table, indicating the patient for the appointment.
    • DoctorID: Foreign key referencing the Doctors table, indicating the doctor for the appointment.

    This table records scheduled appointments, linking patients to doctors.

    MedicalProcedure Table:

    • ProcedureID: Unique identifier for each medical procedure.
    • ProcedureName: Name or description of the medical procedure.
    • AppointmentID: Foreign key referencing the Appointments table, indicating the appointment associated with the procedure.

    This table stores details about medical procedures associated with specific appointments.

    Billing Table:

    • InvoiceID: Unique identifier for each billing transaction.
    • PatientID: Foreign key referencing the Patients table, indicating the patient for the billing transaction.
    • Items: Description of items or services billed.
    • Amount: Amount charged for the billing transaction.

    This table maintains records of billing transactions, associating them with specific patients.

    demo Table:

    • ID: Primary key, serves as a unique identifier for each record.
    • Name: Name of the entity.
    • Hint: Additional information or hint about the entity.

    This table appears to be a demonstration or testing table, possibly unrelated to the healthcare management system.

    This dataset schema is designed to capture comprehensive information about patients, doctors, appointments, medical procedures, and billing transactions in a healthcare management system. Adjustments can be made based on specific requirements, and additional attributes can be included as needed.

  14. HCUP Nationwide Emergency Department Database (NEDS) Restricted Access File

    • data.virginia.gov
    • healthdata.gov
    • +2more
    Updated Jul 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agency for Healthcare Research and Quality, Department of Health & Human Services (2023). HCUP Nationwide Emergency Department Database (NEDS) Restricted Access File [Dataset]. https://data.virginia.gov/dataset/hcup-nationwide-emergency-department-database-neds-restricted-access-file
    Explore at:
    Dataset updated
    Jul 26, 2023
    Description

    The Healthcare Cost and Utilization Project (HCUP) Nationwide Emergency Department Sample (NEDS) is the largest all-payer emergency department (ED) database in the United States. yielding national estimates of hospital-owned ED visits. Unweighted, it contains data from over 30 million ED visits each year. Weighted, it estimates roughly 145 million ED visits nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality, HCUP data inform decision making at the national, State, and community levels.

    Sampled from the HCUP State Inpatient Databases (SID) and State Emergency Department Databases (SEDD), the HCUP NEDS can be used to create national and regional estimates of ED care. The SID contain information on patients initially seen in the ED and subsequently admitted to the same hospital. The SEDD capture information on ED visits that do not result in an admission (i.e., treat-and-release visits and transfers to another hospital). Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality, HCUP data inform decision making at the national, State, and community levels.

    The NEDS contain information about geographic characteristics, hospital characteristics, patient characteristics, and the nature of visits (e.g., common reasons for ED visits, including injuries). The NEDS contains clinical and resource use information included in a typical discharge abstract, with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). It includes ED charge information for over 85% of patients, regardless of expected payer, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. The NEDS excludes data elements that could directly or indirectly identify individuals, hospitals, or states.Restricted access data files are available with a data use agreement and brief online security training.

  15. HCUP Kids' Inpatient Database (KID) - Restricted Access File

    • catalog.data.gov
    • odgavaprod.ogopendata.com
    • +3more
    Updated Jul 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agency for Healthcare Research and Quality, Department of Health & Human Services (2025). HCUP Kids' Inpatient Database (KID) - Restricted Access File [Dataset]. https://catalog.data.gov/dataset/hcup-kids-inpatient-database-kid-restricted-access-file
    Explore at:
    Dataset updated
    Jul 16, 2025
    Description

    The Healthcare Cost and Utilization Project (HCUP) Kids' Inpatient Database (KID) is the largest publicly available all-payer pediatric inpatient care database in the United States, containing data from two to three million hospital stays each year. Its large sample size is ideal for developing national and regional estimates and enables analyses of rare conditions, such as congenital anomalies, as well as uncommon treatments, such as organ transplantation. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality, HCUP data inform decision making at the national, State, and community levels. The KID is a sample of pediatric discharges from 4,000 U.S. hospitals in the HCUP State Inpatient Databases yielding approximately two to three million unweighted hospital discharges for newborns, children, and adolescents per year. About 10 percent of normal newborns and 80 percent of other neonatal and pediatric stays are selected from each hospital that is sampled for patients younger than 21 years of age. The KID contains clinical and resource use information included in a typical discharge abstract, with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). It includes discharge status, diagnoses, procedures, patient demographics (e.g., sex, age), expected source of primary payment (e.g., Medicare, Medicaid, private insurance, self-pay, and other insurance types), and hospital charges and cost. Restricted access data files are available with a data use agreement and brief online security training.

  16. Demographics of patients seen at the ED with football injuries by admission...

    • plos.figshare.com
    xls
    Updated Jun 4, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael J. McGinity; Ramesh Grandhi; Joel E. Michalek; Jesse S. Rodriguez; Aron M. Trevino; Ashley C. McGinity; Ali Seifi (2023). Demographics of patients seen at the ED with football injuries by admission to hospital 2010 to 2013 ((N = 819,000). [Dataset]. http://doi.org/10.1371/journal.pone.0195827.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Michael J. McGinity; Ramesh Grandhi; Joel E. Michalek; Jesse S. Rodriguez; Aron M. Trevino; Ashley C. McGinity; Ali Seifi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Demographics of patients seen at the ED with football injuries by admission to hospital 2010 to 2013 ((N = 819,000).

  17. HCUP Nationwide Emergency Department Database (NEDS)

    • catalog.data.gov
    • s.cnmilf.com
    Updated Mar 14, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agency for Healthcare Research and Quality (2013). HCUP Nationwide Emergency Department Database (NEDS) [Dataset]. https://catalog.data.gov/dataset/hcup-nationwide-emergency-department-database-neds
    Explore at:
    Dataset updated
    Mar 14, 2013
    Dataset provided by
    Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
    Description

    The Nationwide Emergency Department Sample (NEDS) was created to enable analyses of emergency department (ED) utilization patterns and support public health professionals, administrators, policymakers, and clinicians in their decision-making regarding this critical source of care. The NEDS can be weighted to produce national estimates. The NEDS is the largest all-payer ED database in the United States. It was constructed using records from both the HCUP State Emergency Department Databases (SEDD) and the State Inpatient Databases (SID), both also described in healthdata.gov. The SEDD capture information on ED visits that do not result in an admission (i.e., treat-and-release visits and transfers to another hospital). The SID contain information on patients initially seen in the emergency room and then admitted to the same hospital. The NEDS contains 25-30 million (unweighted) records for ED visits for over 950 hospitals and approximates a 20-percent stratified sample of U.S. hospital-based EDs. The NEDS contains information about geographic characteristics, hospital characteristics, patient characteristics, and the nature of visits (e.g., common reasons for ED visits, including injuries). The NEDS contains clinical and resource use information included in a typical discharge abstract, with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). It includes ED charge information for over 75% of patients, regardless of payer, including patients covered by Medicaid, private insurance, and the uninsured. The NEDS excludes data elements that could directly or indirectly identify individuals, hospitals, or states.

  18. The Open Database of Healthcare Facilities-Canada

    • kaggle.com
    zip
    Updated May 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brandyn Sfetcu (2022). The Open Database of Healthcare Facilities-Canada [Dataset]. https://www.kaggle.com/datasets/alexandersfetcu/the-open-database-of-healthcare-facilitiescanada
    Explore at:
    zip(1191209 bytes)Available download formats
    Dataset updated
    May 24, 2022
    Authors
    Brandyn Sfetcu
    Area covered
    Canada
    Description

    Table 1 Healthcare facility type assignment criteria examples (based on keywords) Variable Condition Value Classification Facility type contains the keywords 'community health center', 'clinic' Ambulatory health care services Facility type contains the keywords 'hospital', 'cancer treatment', 'emergency', cancer centre', 'health centre' Hospitals Facility type contains the keywords 'senior active living', 'nursing home', 'long-term care' Nursing and residential care facilities

  19. d

    Data Management Plan Examples Database

    • search.dataone.org
    • borealisdata.ca
    Updated Sep 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Evering, Danica; Acharya, Shrey; Pratt, Isaac; Behal, Sarthak (2024). Data Management Plan Examples Database [Dataset]. http://doi.org/10.5683/SP3/SDITUG
    Explore at:
    Dataset updated
    Sep 4, 2024
    Dataset provided by
    Borealis
    Authors
    Evering, Danica; Acharya, Shrey; Pratt, Isaac; Behal, Sarthak
    Time period covered
    Jan 1, 2011 - Jan 1, 2023
    Description

    This dataset is comprised of a collection of example DMPs from a wide array of fields; obtained from a number of different sources outlined below. Data included/extracted from the examples include the discipline and field of study, author, institutional affiliation and funding information, location, date created, title, research and data-type, description of project, link to the DMP, and where possible external links to related publications or grant pages. This CSV document serves as the content for a McMaster Data Management Plan (DMP) Database as part of the Research Data Management (RDM) Services website, located at https://u.mcmaster.ca/dmps. Other universities and organizations are encouraged to link to the DMP Database or use this dataset as the content for their own DMP Database. This dataset will be updated regularly to include new additions and will be versioned as such. We are gathering submissions at https://u.mcmaster.ca/submit-a-dmp to continue to expand the collection.

  20. f

    An example of the Row Table.

    • figshare.com
    xls
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hsien-Tsung Chang; Tsai-Huei Lin (2023). An example of the Row Table. [Dataset]. http://doi.org/10.1371/journal.pone.0168935.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Hsien-Tsung Chang; Tsai-Huei Lin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    An example of the Row Table.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Agency for Healthcare Research and Quality (2023). Synthetic Healthcare Database for Research (SyH-DR) [Dataset]. https://catalog.data.gov/dataset/synthetic-healthcare-database-for-research-syh-dr
Organization logo

Synthetic Healthcare Database for Research (SyH-DR)

Explore at:
9 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Sep 16, 2023
Dataset provided by
Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
Description

The Agency for Healthcare Research and Quality (AHRQ) created SyH-DR from eligibility and claims files for Medicare, Medicaid, and commercial insurance plans in calendar year 2016. SyH-DR contains data from a nationally representative sample of insured individuals for the 2016 calendar year. SyH-DR uses synthetic data elements at the claim level to resemble the marginal distribution of the original data elements. SyH-DR person-level data elements are not synthetic, but identifying information is aggregated or masked.

Search
Clear search
Close search
Google apps
Main menu