53 datasets found

G
Patient Demographics
gomask.ai
csv, json
Updated Nov 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GoMask.ai (2025). Patient Demographics [Dataset]. https://gomask.ai/marketplace/datasets/patient-demographics
Explore at:
csv(10 MB), jsonAvailable download formats
Dataset updated
Nov 12, 2025
Dataset provided by
GoMask.ai
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
2024 - 2025
Area covered
Global
Variables measured
email, gender, last_name, first_name, patient_id, address_city, phone_number, address_state, date_of_birth, address_street, and 15 more
Description
This dataset provides comprehensive healthcare patient demographic records, including unique medical identifiers, insurance details, emergency contacts, and appointment histories. It enables efficient patient management, supports clinical workflows, and facilitates analytics for healthcare providers and administrators. The structured schema ensures data integrity and usability for operational and research applications.
f
Patient demographics and clinical data.
datasetcatalog.nlm.nih.gov
figshare.com
+1more
Updated Aug 24, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xia, Annie; Heckel, Andreas; Weiler, Markus; Schlemmer, Heinz-Peter; Bäumer, Philipp; Jäger, Dirk; Bendszus, Martin; Heiland, Sabine; Apostolidis, Leonidas; Schwarz, Daniel; Godel, Tim (2017). Patient demographics and clinical data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001772303
Explore at:
Dataset updated
Aug 24, 2017
Authors
Xia, Annie; Heckel, Andreas; Weiler, Markus; Schlemmer, Heinz-Peter; Bäumer, Philipp; Jäger, Dirk; Bendszus, Martin; Heiland, Sabine; Apostolidis, Leonidas; Schwarz, Daniel; Godel, Tim
Description
Patient demographics and clinical data.
f
Database containing demographic data of each patient and laboratory data of...
f1000.figshare.com
bin
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Freeha Arshad; Jelle Adelmeijer; Hans Blokzijl; Aad P. van den Berg; Robert J. Porte; Ton Lisman (2023). Database containing demographic data of each patient and laboratory data of each patient and control [Dataset]. http://doi.org/10.6084/m9.figshare.1002065.v1
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.1002065.v1
Dataset updated
May 30, 2023
Dataset provided by
f1000research.com
Authors
Freeha Arshad; Jelle Adelmeijer; Hans Blokzijl; Aad P. van den Berg; Robert J. Porte; Ton Lisman
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This file contains raw data of all laboratory measurements presented in the paper. In addition, the file contains raw demographic data of the patients as summarized in the paper in Table 1.
f
Patient demographics and baseline characteristics.
datasetcatalog.nlm.nih.gov
plos.figshare.com
+1more
Updated Aug 18, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tsutsué, Saaya; Tobinai, Kensei; Crawford, Bruce; Yi, Jingbo (2020). Patient demographics and baseline characteristics. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000470772
Explore at:
Dataset updated
Aug 18, 2020
Authors
Tsutsué, Saaya; Tobinai, Kensei; Crawford, Bruce; Yi, Jingbo
Description
Patient demographics and baseline characteristics.
g
Hospice Utilization - Patient Demographics | gimi9.com
gimi9.com
Updated Mar 5, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Hospice Utilization - Patient Demographics | gimi9.com [Dataset]. https://gimi9.com/dataset/california_hospice-utilization-patient-demographics/
Explore at:
Dataset updated
Mar 5, 2023
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The dataset contains counts of inpatient visits leading to a discharge to hospice care. Inpatient visits included in the counts consist of individuals aged 18 or over with a discharge disposition leading to home or facility hospice care. The total counts per each individual year can be viewed based on different patient characteristics, including patient age groups, individual counties of residence, primary payer type, diagnosis category, and patient sex/race/ethnicity. The disease categories include circulatory conditions, diabetes, malignant/benign neoplasms, malnutrition, neurodegenerative disease, renal failure or other kidney diagnoses, respiratory conditions and circulatory conditions. The categories represent common groupings of diagnoses seen in other studies related to hospice care and were created by grouping together relevant medical MSDRG codes in the HCAI inpatient data.
f
Patient demographics and summary findings.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Apr 20, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Flores, Elsa R.; Reed, Damon R.; Yoder, Sean J.; Cen, Ling; Brohl, Andrew S.; Cheng, Chia-Ho; Teer, Jamie K.; Jinesh, Goodwin G.; Arnold, Michael; Pettersson, Fredrik; Setty, Bhuvana A. (2020). Patient demographics and summary findings. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000458480
Explore at:
Dataset updated
Apr 20, 2020
Authors
Flores, Elsa R.; Reed, Damon R.; Yoder, Sean J.; Cen, Ling; Brohl, Andrew S.; Cheng, Chia-Ho; Teer, Jamie K.; Jinesh, Goodwin G.; Arnold, Michael; Pettersson, Fredrik; Setty, Bhuvana A.
Description
Patient demographics and summary findings.
f
Demographics of the patient population.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Mar 15, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gatta, Gianluca; Piscitelli, Valeria; Peluso, Silvio; Pezzullo, Giovanna; La Tessa, Giuseppe Maria Ernesto; D’Agostino, Vincenzo; Sarti, Giuseppe; Somma, Francesco; Fasano, Fabrizio; Caranci, Ferdinando; Negro, Alberto; Sicignano, Carmine; Villa, Alessandro; Tamburrini, Stefania; Pace, Gianvito (2022). Demographics of the patient population. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000222913
Explore at:
Dataset updated
Mar 15, 2022
Authors
Gatta, Gianluca; Piscitelli, Valeria; Peluso, Silvio; Pezzullo, Giovanna; La Tessa, Giuseppe Maria Ernesto; D’Agostino, Vincenzo; Sarti, Giuseppe; Somma, Francesco; Fasano, Fabrizio; Caranci, Ferdinando; Negro, Alberto; Sicignano, Carmine; Villa, Alessandro; Tamburrini, Stefania; Pace, Gianvito
Description
Demographics of the patient population.
Hospital Excel Dataset
kaggle.com
zip
Updated Apr 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Omolola Labiyi (2025). Hospital Excel Dataset [Dataset]. https://www.kaggle.com/datasets/t0ut0u/hospital-excel-dataset/code
Explore at:
zip(18209846 bytes)Available download formats
Dataset updated
Apr 17, 2025
Authors
Omolola Labiyi
License
https://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
Description
📌 Project Overview This project analyzes hospital admissions, patient stays, and cost trends using Excel. The dataset contains information on patient demographics, hospital names, insurance providers, and treatment costs. Key insights were derived using PivotTables, charts, and formulas.

📊 Key Insights & Visualizations ✅ Top Hospitals by Admissions → Bar Chart ✅ Insurance Provider with Most Patients → Pie Chart ✅ Cost per Day Trends → Line Chart ✅ Average Length of Stay per Hospital → Bar Chart

🛠 Excel Analysis Techniques Used PivotTables for summarizing patient data

Conditional Formatting to highlight cost trends

Bar, Pie, and Line Charts for visualization

Statistical Analysis (Average length of stay, cost trends)

📂 Files Included 📌 hospital_analysis.xlsx – The full Excel analysis file 📌 hospital_summary.pdf – Summary of key findings

Healthcare #HospitalData #ExcelAnalysis #DataVisualization #PivotTables #DataCleaning #MedicalAnalytics #PatientTrends #CostAnalysis #AdmissionsAnalysis #InsuranceData #DataAnalysis #ExcelDashboards #HealthTech
US Healthcare Readmissions and Mortality
kaggle.com
zip
Updated Jan 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2023). US Healthcare Readmissions and Mortality [Dataset]. https://www.kaggle.com/datasets/thedevastator/us-healthcare-readmissions-and-mortality/code
Explore at:
zip(1801458 bytes)Available download formats
Dataset updated
Jan 23, 2023
Authors
The Devastator
Area covered
United States
Description
US Healthcare Readmissions and Mortality

Evaluating Hospital Performance

By Health [source]

About this dataset

This dataset contains detailed information about 30-day readmission and mortality rates of U.S. hospitals. It is an essential tool for stakeholders aiming to identify opportunities for improving healthcare quality and performance across the country. Providers benefit by having access to comprehensive data regarding readmission, mortality rate, score, measure start/end dates, compared average to national as well as other pertinent metrics like zip codes, phone numbers and county names. Use this data set to conduct evaluations of how hospitals are meeting industry standards from a quality and outcomes perspective in order to make more informed decisions when designing patient care strategies and policies

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset provides data on 30-day readmission and mortality rates of U.S. hospitals, useful in understanding the quality of healthcare being provided. This data can provide insight into the effectiveness of treatments, patient care, and staff performance at different healthcare facilities throughout the country.

In order to use this dataset effectively, it is important to understand each column and how best to interpret them. The ‘Hospital Name’ column displays the name of the facility; ‘Address’ lists a street address for the hospital; ‘City’ indicates its geographic location; ‘State’ specifies a two-letter abbreviation for that state; ‘ZIP Code’ provides each facility's 5 digit zip code address; 'County Name' specifies what county that particular hospital resides in; 'Phone number' lists a phone contact for any given facility ;'Measure Name' identifies which measure is being recorded (for instance: Elective Delivery Before 39 Weeks); 'Score' value reflects an average score based on patient feedback surveys taken over time frame listed under ' Measure Start Date.' Then there are also columns tracking both lower estimates ('Lower Estimate') as well as higher estimates ('Higher Estimate'); these create variability that can be tracked by researchers seeking further answers or formulating future studies on this topic or field.; Lastly there is one more measure oissociated with this set: ' Footnote,' which may highlight any addional important details pertinent to analysis such as numbers outlying National averages etc..

This data set can be used by hospitals, research facilities and other interested parties in providing inciteful information when making decisions about patient care standards throughout America . It can help find patterns about readmitis/mortality along county lines or answer questions about preformance fluctuations between different hospital locations over an extended amount of time. So if you are ever curious about 30 days readmitted within US Hospitals don't hesitate to dive into this insightful dataset!

Research Ideas

Comparing hospitals on a regional or national basis to measure the quality of care provided for readmission and mortality rates.

Analyzing the effects of technological advancements such as telemedicine, virtual visits, and AI on readmission and mortality rates at different hospitals.

Using measures such as Lower Estimate Higher Estimate scores to identify systematic problems in readmissions or mortality rate management at hospitals and informing public health care policy

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

Columns

File: Readmissions_and_Deaths_-_Hospital.csv | Column name | Description | |:-------------------------|:---------------------------------------------------------------------------------------------------| | Hospital Name ...
h
Africa-stroke-prediction-dataset
huggingface.co
Updated Oct 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Electric Sheep (2025). Africa-stroke-prediction-dataset [Dataset]. https://huggingface.co/datasets/electricsheepafrica/Africa-stroke-prediction-dataset
Explore at:
Dataset updated
Oct 28, 2025
Dataset authored and provided by
Electric Sheep
License
https://choosealicense.com/licenses/gpl/https://choosealicense.com/licenses/gpl/
Description
This dataset contains clinical and demographic information of patients along with their stroke status. The dataset provides comprehensive medical data for stroke research and machine learning applications.

Dataset Summary

Total Samples: 5,110 Features: 12 (Patient ID, Demographic info, Clinical measurements, Stroke status) Task: Binary classification (Stroke vs No Stroke)

Features Patient Information

id: Unique identifier (integer) gender: Biological sex… See the full description on the dataset page: https://huggingface.co/datasets/electricsheepafrica/Africa-stroke-prediction-dataset.
H
BOSQUE Test set
dataverse.harvard.edu
search.dataone.org
Updated Jul 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alejandra Jaramillo Arboleda; Maria Juliana Sanchez Zapata; LILI JOHANA RUEDA JAIME; Andrés Morales-Forero; Samuel Bassetto (2025). BOSQUE Test set [Dataset]. http://doi.org/10.7910/DVN/AQEPIN
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/AQEPIN
Dataset updated
Jul 10, 2025
Dataset provided by
Harvard Dataverse
Authors
Alejandra Jaramillo Arboleda; Maria Juliana Sanchez Zapata; LILI JOHANA RUEDA JAIME; Andrés Morales-Forero; Samuel Bassetto
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
BOSQUE Test Set: A Dermoscopic Image Dataset from Colombian Patients with Diverse Skin Phototypes Description: The BOSQUE Test Set is a curated dataset of 151 dermoscopic images of pigmented skin lesions, collected from dermatology consultations and outreach campaigns in Bogotá, Colombia. Each image is accompanied by expert-verified metadata including histological diagnosis, patient demographic details, anatomical site, and skin phototype. The dataset is intended to support machine learning research in dermatology with a particular focus on skin tone diversity and fairness in diagnostic algorithms. The dataset was developed under the guidance of Universidad El Bosque, whose name inspired the acronym BOSQUE. It responds to the global underrepresentation of darker skin phototypes in existing dermoscopic image collections such as HAM10000, and aims to improve diagnostic equity through inclusive data curation. Key Features 151 dermoscopic images acquired in real-world clinical settings Captured using polarized light dermatoscopes (DermLite 4 + iPhone) Inclusive population: Sex: 97 Female, 54 Male Age groups: from 0–29 to 90+, categorized into clinically relevant bins Fitzpatrick skin phototypes: ranging from II to VI Type II (fair, burns easily): 11 patients Type III (light brown, mild burns): 94 patients Type IV (moderate brown, rarely burns): 34 patients Type V (dark brown, very rarely burns): 7 patients Type VI (deeply pigmented, never burns): 5 patients Lesion characteristics: Nature: benign or malignant (histopathologically confirmed) Size: categorized as ≤5mm, 6–10mm, 11–20mm, >20mm Evolution time: grouped into <1y, 1y, 2y, 3–4y, 5–9y, and 10y+ categories Anatomical site: head/neck, trunk, limbs, or acral areas Histopathological diagnosis: 7-class ISIC-style labels (akiec, bcc, bkl, df, mel, nv, vasc) Clinical label: melanocytic vs. non-melanocytic (from clinical diagnosis) Clinical context: includes personal history of NMSC and use of photosensitizing drugs Image naming: pseudonymized file names encode diagnosis label and image ID Ethics: all data anonymized and collected under IRB-approved protocol in Colombia Included Files BOSQUE_test_set.zip: Folder containing 151 dermoscopic image files (JPG) BOSQUE_metadata.csv: Metadata for each image, including: Patient sex, age group, skin phototype Anatomical site of the lesion Lesion nature (benign/malignant) Lesion size and evolution time (binned) Histological diagnosis (7-class) Clinical label (melanocytic / non-melanocytic) Use Cases This dataset is intended for: Benchmarking AI models for dermoscopic image classification Fairness analysis across skin tones, sex, and age groups Medical education and clinical training on diverse skin phototypes Comparison against HAM10000 or ISIC datasets in research Ethical Statement All patients provided informed consent for the capture and use of clinical and dermoscopic images, the collection of relevant clinical metadata, and the performance of skin biopsies for diagnostic confirmation. The study protocol was reviewed and approved by the Institutional Ethics Committee at Subred Integrada de Servicios de Salud Norte E.S.E and Universidad El Bosque (Bogotá, Colombia). All data were anonymized in compliance with Colombian health data privacy regulations and international ethical standards (e.g., Declaration of Helsinki). No personally identifiable information is included in the metadata or image files. Access to data was restricted to authorized investigators, and patients were informed about the research and educational use of their anonymized data. Suggested Citation [Author(s)]. (2025). BOSQUE Test Set: A Dermoscopic Image Dataset from Colombian Patients with Diverse Skin Phototypes [Data set]. Harvard Dataverse. https://doi.org/xxxxx
i
Hoosier Health and Well-being By County and Demographics - Dataset - The...
hub.mph.in.gov
Updated Sep 1, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2020). Hoosier Health and Well-being By County and Demographics - Dataset - The Indiana Data Hub [Dataset]. https://hub.mph.in.gov/dataset/hoosier-health-and-well-being-by-county-and-demographics
Explore at:
Dataset updated
Sep 1, 2020
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Indiana
Description
In August of 2018, FSSA’s Office of Healthy Opportunities deployed a social risk assessment survey. The 10-question survey was made available to anyone applying online through FSSA for health coverage, the Supplemental Nutritional Assistance Program or Temporary Assistance for Needy Families. The results of this survey are aggregated and presented below and can help communities better understand the social risk factors affecting the health of those applying for our services. Please read and review the following information regarding the use of this data prior to viewing the tool. This survey was made available to those individuals who applied online ONLY and does not represent anyone who applied in-person, by telephone, by mail or any other method. In 2018, online applications accounted for 79% of those who applied for SNAP, TANF or health coverage. Survey completion is voluntary and does not impact eligibility for SNAP, TANF or health coverage. Applications are filed at a household level and may represent several individuals. The application process identifies a primary contact person for the household, and that individual’s demographics are represented on the dashboard; for example, person’s gender, race and education level. An individual who completes more than one application and survey over any given time period is represented once for each instance, and the survey answers and demographic details are based on each application’s responses. For example, an applicant’s age, education level and survey answers can change over time, and the reporting reflects any such changes. All information is presented in aggregate to ensure personally identifiable information is protected. To protect the privacy of individuals, data representing 20 or less individuals in any county will not be displayed. I.e. it will show as blank
Brain Tumor Dataset
kaggle.com
zip
Updated Aug 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mahmoud Shaheen (2024). Brain Tumor Dataset [Dataset]. https://www.kaggle.com/datasets/mahmoudshaheen1134/brain-tumor-dataset
Explore at:
zip(7920966 bytes)Available download formats
Dataset updated
Aug 24, 2024
Authors
Mahmoud Shaheen
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
Purpose: This dataset is designed to aid in the development of computer vision models for the early detection and classification of brain tumors. It contains medical images, typically MRI or CT scans, of human brains with and without tumors.

Key Characteristics:

Image Types: MRI or CT scans Image Format: Commonly JPG, Labels: Associated with each image, indicating the presence or absence of a brain tumor, and possibly the tumor type (e.g., glioma, meningioma, pituitary tumor). Metadata: Additional information such as patient demographics, acquisition parameters, and clinical notes. Potential Use Cases:

Training and evaluation of deep learning models for brain tumor detection and classification. Research into medical image analysis techniques for early diagnosis and treatment planning. Development of AI-powered tools to assist radiologists in identifying and assessing brain tumors. Important Considerations:

Data Quality: Ensure the images are of high quality, well-annotated, and representative of a diverse patient population. Ethical Considerations: Adhere to privacy regulations and obtain informed consent from patients. Data Preprocessing: Images may require preprocessing steps such as normalization, augmentation, and noise reduction before being used for model training.
Digital Twin: EHR, Imaging & IoT Data
kaggle.com
zip
Updated Apr 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Python Developer (2025). Digital Twin: EHR, Imaging & IoT Data [Dataset]. https://www.kaggle.com/programmer3/digital-twin-ehr-imaging-and-iot-data
Explore at:
zip(194965975 bytes)Available download formats
Dataset updated
Apr 4, 2025
Authors
Python Developer
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset integrates multiple healthcare data sources. It combines:

Electronic Health Records (EHRs): Patient demographics, medical history, diagnoses, and treatment plans.

Medical Imaging (CT & MRI): Cross-modality imaging data for enhanced diagnostic analysis.

The UCI dataset contains 20 records and 16 attributes, likely representing features related to diabetes diagnosis or risk factors. All attributes are numerical (integer values: 0 or 1), indicating the presence or absence of a particular condition.

Wearable IoT Sensor Data: Real-time physiological monitoring, including heart rate, activity levels, and mental health indicators.
h
OMOP dataset: Hospital COVID patients: severity, acuity, therapies, outcomes...
healthdatagateway.org
unknown
Updated Oct 8, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
This publication uses data from PIONEER, an ethically approved database and analytical environment (East Midlands Derby Research Ethics 20/EM/0158) (2024). OMOP dataset: Hospital COVID patients: severity, acuity, therapies, outcomes [Dataset]. https://healthdatagateway.org/dataset/139
Explore at:
unknownAvailable download formats
Dataset updated
Oct 8, 2024
Dataset authored and provided by
This publication uses data from PIONEER, an ethically approved database and analytical environment (East Midlands Derby Research Ethics 20/EM/0158)
License
https://www.pioneerdatahub.co.uk/data/data-request-process/https://www.pioneerdatahub.co.uk/data/data-request-process/
Description
OMOP dataset: Hospital COVID patients: severity, acuity, therapies, outcomes Dataset number 2.0

Coronavirus disease 2019 (COVID-19) was identified in January 2020. Currently, there have been more than 6 million cases & more than 1.5 million deaths worldwide. Some individuals experience severe manifestations of infection, including viral pneumonia, adult respiratory distress syndrome (ARDS) & death. There is a pressing need for tools to stratify patients, to identify those at greatest risk. Acuity scores are composite scores which help identify patients who are more unwell to support & prioritise clinical care. There are no validated acuity scores for COVID-19 & it is unclear whether standard tools are accurate enough to provide this support. This secondary care COVID OMOP dataset contains granular demographic, morbidity, serial acuity and outcome data to inform risk prediction tools in COVID-19.

PIONEER geography The West Midlands (WM) has a population of 5.9 million & includes a diverse ethnic & socio-economic mix. There is a higher than average percentage of minority ethnic groups. WM has a large number of elderly residents but is the youngest population in the UK. Each day >100,000 people are treated in hospital, see their GP or are cared for by the NHS. The West Midlands was one of the hardest hit regions for COVID admissions in both wave 1 & 2.

EHR. University Hospitals Birmingham NHS Foundation Trust (UHB) is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & 100 ITU beds. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”. UHB has cared for >5000 COVID admissions to date. This is a subset of data in OMOP format.

Scope: All COVID swab confirmed hospitalised patients to UHB from January – August 2020. The dataset includes highly granular patient demographics & co-morbidities taken from ICD-10 & SNOMED-CT codes. Serial, structured data pertaining to care process (timings, staff grades, specialty review, wards), presenting complaint, acuity, all physiology readings (pulse, blood pressure, respiratory rate, oxygen saturations), all blood results, microbiology, all prescribed & administered treatments (fluids, antibiotics, inotropes, vasopressors, organ support), all outcomes.

Available supplementary data: Health data preceding & following admission event. Matched “non-COVID” controls; ambulance, 111, 999 data, synthetic data. Further OMOP data available as an additional service.

Available supplementary support: Analytics, Model build, validation & refinement; A.I.; Data partner support for ETL (extract, transform & load) process, Clinical expertise, Patient & end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
A
Neighborhood Demographics
data.boston.gov
pdf, xlsx
Updated Dec 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Planning Department (2025). Neighborhood Demographics [Dataset]. https://data.boston.gov/dataset/neighborhood-demographics
Explore at:
xlsx(15582925), pdf(476137), pdf(508811), xlsx(156459), xlsx(158232)Available download formats
Dataset updated
Dec 1, 2025
Dataset authored and provided by
Planning Department
License
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Description
Please note this page provides neighborhood demographic data using 2010 Census tracts. For updated Neighborhood Demographics using 2020 Census tracts consistently across historical years, please refer to the Planning Department Research Division's Exploring Neighborhood Change Tool. The tool visualizes demographic, economic, and housing data for Boston's tracts and neighborhoods from 1950 to 2025 (with projections to 2035) using the most up-to-date 2020 Census tract-based Neighborhood boundaries.

Boston is a city defined by the unique character of its many neighborhoods. The historical tables created by the BPDA Research Division from U.S. Census Decennial data describe demographic changes in Boston’s neighborhoods from 1950 through 2010 using consistent tract-based geographies. For more analysis of these data, please see Historical Trends in Boston's Neighborhoods. The most recent available neighborhood demographic data come from the 5-year American Community Survey (ACS). The ACS tables also present demographic data for Census-tract approximations of Boston’s neighborhoods. For pdf versions of the data presented here plus earlier versions of the analysis, please see Boston in Context.
UHB 100K Genomics patient clinical data including their acute care contacts
healthdatagateway.org
unknown
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
University Hospitals Birmingham NHS Foundation Trust, UHB 100K Genomics patient clinical data including their acute care contacts [Dataset]. https://healthdatagateway.org/en/dataset/140
Explore at:
unknownAvailable download formats
Dataset authored and provided by
University Hospitals Birmingham NHS Foundation Trusthttp://www.uhb.nhs.uk/
License
https://www.pioneerdatahub.co.uk/data/data-request-process/https://www.pioneerdatahub.co.uk/data/data-request-process/
Description
1 in 17 people are born with or develop a rare disease during their lifetime. 80% of rare diseases have an identified genetic component. However, there are usually significant diagnostic delays. The 100k Genome project was established to collect clinical data, genomic sequencing and samples from people with cancer and rare diseases, to better understand disease and find novel treatments and interventions. This includes rare cardiovascular, ciliopathy, endocrine, gastroenterological, haematological, metabolic, neurological, renal, respiratory skeletal and rheumatological disorders and cancers.

The PIONEER University Hospital Birmingham (UHB) secondary care 100k genomics dataset contains granular demographic, morbidity, treatment and outcome data, supplemented with acute care contacts with serial physiology, blood biomarker data from UHB patients recruited to this programme, to better understand the acute healthcare needs of this group of patients.

PIONEER geography: The West Midlands has a population of 5.9M and includes a diverse ethnic and socio-economic mix. There is a higher than average percentage of minority ethnic groups and a higher than average proportion of patients with rare diseases. Birmingham is home to the first Centre for Rare Diseases for adults and children, treating more than 500 rare diseases and 9000 patients per year.

Electronic Health Records: University Hospitals Birmingham NHS Foundation Trust (UHB) is one of the largest NHS Trusts in England, providing direct acute services and specialist care across four hospital sites, with 2.2M patient episodes per year, 2750 beds and 100 ITU beds.

Scope: All patients recruited to the 100K genome project from UHB. This includes all routinely collected health data for all these patients, but data is uniquely supplemented with all acute care contacts through UHB. The dataset includes highly granular patient demographics and co-morbidities taken from ICD-10 and SNOMED-CT codes. Serial, structured data pertaining to acute care process (timings, staff grades, specialty review, wards), presenting complaint, acuity, all physiology readings (pulse, blood pressure, respiratory rate, oxygen saturations), all blood results, microbiology, all prescribed and administered treatments (fluids, antibiotics, inotropes, vasopressors, organ support), all outcomes.

Available supplementary data: Matched controls; ambulance, synthetic data. Available supplementary support: Analytics, Model build, validation and refinement; A.I.; Data partner support for ETL process, Clinical expertise, Patient and end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
Synthetic Hospital Data
kaggle.com
zip
Updated Oct 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dimpi Mittal (2025). Synthetic Hospital Data [Dataset]. https://www.kaggle.com/datasets/dimpimittal/synthetic-hospital-data-cleaned-version
Explore at:
zip(212990 bytes)Available download formats
Dataset updated
Oct 23, 2025
Authors
Dimpi Mittal
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Summary:

This dataset is a synthetic hospital dataset designed for data science and machine learning practice. It contains detailed information about patient admissions, hospital departments, ward types, bed grades, and hospital resources, making it ideal for predictive modeling, exploratory data analysis, and feature engineering exercises.

Key highlights:

5000 patient records with 18 columns covering patient demographics, admission details, severity, and stay length.

Fully cleaned and preprocessed: duplicates removed, missing values handled, categorical features encoded, and numeric columns ready for ML.

Suitable for regression tasks (predicting length of stay or admission deposit) and classification tasks (predicting severity of illness, type of admission, or department).

Synthetic and safe: no real patient data included, perfect for learning and experimentation.

This dataset is perfect for students, beginners, and practitioners who want to practice healthcare analytics, ML model building, or data preprocessing.
f
Patient characteristics and demographics.
datasetcatalog.nlm.nih.gov
plos.figshare.com
+1more
Updated Dec 19, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ely, E. Wesley; Han, Jin H.; McNeil, J. Brennan; Hughes, Christopher G.; Ware, Lorraine B.; Chandrasekhar, Rameela; Girard, Timothy (2019). Patient characteristics and demographics. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000096273
Explore at:
Dataset updated
Dec 19, 2019
Authors
Ely, E. Wesley; Han, Jin H.; McNeil, J. Brennan; Hughes, Christopher G.; Ware, Lorraine B.; Chandrasekhar, Rameela; Girard, Timothy
Description
Patient characteristics and demographics.
Patient demographics at the time of the baseline PET/CT (total number of...
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lars Husmann; Martin W. Huellner; Hannes Gruenig; Nadia Eberhard; Carlos A. Mestres; Zoran Rancic; Barbara Hasse (2023). Patient demographics at the time of the baseline PET/CT (total number of PET/CT examinations n = 101). [Dataset]. http://doi.org/10.1371/journal.pone.0258702.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0258702.t001
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Lars Husmann; Martin W. Huellner; Hannes Gruenig; Nadia Eberhard; Carlos A. Mestres; Zoran Rancic; Barbara Hasse
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Patient demographics at the time of the baseline PET/CT (total number of PET/CT examinations n = 101).

Facebook

Twitter

Click to copy link

Link copied

Cite

GoMask.ai (2025). Patient Demographics [Dataset]. https://gomask.ai/marketplace/datasets/patient-demographics

Patient Demographics

Explore at:

csv(10 MB), jsonAvailable download formats

Dataset updated

Nov 12, 2025

Dataset provided by

GoMask.ai

License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Time period covered

2024 - 2025

Area covered

Global

Variables measured

email, gender, last_name, first_name, patient_id, address_city, phone_number, address_state, date_of_birth, address_street, and 15 more

Description

This dataset provides comprehensive healthcare patient demographic records, including unique medical identifiers, insurance details, emergency contacts, and appointment histories. It enables efficient patient management, supports clinical workflows, and facilitates analytics for healthcare providers and administrators. The structured schema ensures data integrity and usability for operational and research applications.

Clear search

Close search

Google apps

Main menu

Patient Demographics

Patient demographics and clinical data.

Database containing demographic data of each patient and laboratory data of...

Patient demographics and baseline characteristics.

Hospice Utilization - Patient Demographics | gimi9.com

Patient demographics and summary findings.

Demographics of the patient population.

Hospital Excel Dataset

Healthcare #HospitalData #ExcelAnalysis #DataVisualization #PivotTables #DataCleaning #MedicalAnalytics #PatientTrends #CostAnalysis #AdmissionsAnalysis #InsuranceData #DataAnalysis #ExcelDashboards #HealthTech

US Healthcare Readmissions and Mortality

US Healthcare Readmissions and Mortality

Evaluating Hospital Performance

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Research Ideas

Acknowledgements

License

Columns

Africa-stroke-prediction-dataset

BOSQUE Test set

Hoosier Health and Well-being By County and Demographics - Dataset - The...

Brain Tumor Dataset

Digital Twin: EHR, Imaging & IoT Data

OMOP dataset: Hospital COVID patients: severity, acuity, therapies, outcomes...

Neighborhood Demographics

UHB 100K Genomics patient clinical data including their acute care contacts

Synthetic Hospital Data

Summary:

Key highlights:

Patient characteristics and demographics.

Patient demographics at the time of the baseline PET/CT (total number of...

Patient Demographics