100+ datasets found

d
Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Contact...
datarade.ai
.csv
Updated Aug 29, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dataplex (2024). Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Contact Data | Perfect for Historical Analysis & Easy Ingestion [Dataset]. https://datarade.ai/data-products/dataplex-all-cms-data-feeds-access-1519-reports-26b-row-dataplex-3b76
Explore at:
.csvAvailable download formats
Dataset updated
Aug 29, 2024
Dataset authored and provided by
Dataplex
Area covered
United States of America
Description
The All CMS Data Feeds dataset is an expansive resource offering access to 119 unique report feeds, providing in-depth insights into various aspects of the U.S. healthcare system including nursing facility owners and accountable care organization participants contact data. With over 25.8 billion rows of data meticulously collected since 2007, this dataset is invaluable for healthcare professionals, analysts, researchers, and businesses seeking to understand and analyze healthcare trends, performance metrics, and demographic shifts over time. The dataset is updated monthly, ensuring that users always have access to the most current and relevant data available.

Dataset Overview:

118 Report Feeds: - The dataset includes a wide array of report feeds, each providing unique insights into different dimensions of healthcare. These topics range from Medicare and Medicaid service metrics, patient demographics, provider information, financial data, and much more. The breadth of information ensures that users can find relevant data for nearly any healthcare-related analysis. - As CMS releases new report feeds, they are automatically added to this dataset, keeping it current and expanding its utility for users.

25.8 Billion Rows of Data:

With over 25.8 billion rows of data, this dataset provides a comprehensive view of the U.S. healthcare system. This extensive volume of data allows for granular analysis, enabling users to uncover insights that might be missed in smaller datasets. The data is also meticulously cleaned and aligned, ensuring accuracy and ease of use.

Historical Data Since 2007: - The dataset spans from 2007 to the present, offering a rich historical perspective that is essential for tracking long-term trends and changes in healthcare delivery, policy impacts, and patient outcomes. This historical data is particularly valuable for conducting longitudinal studies and evaluating the effects of various healthcare interventions over time.

Monthly Updates:

To ensure that users have access to the most current information, the dataset is updated monthly. These updates include new reports as well as revisions to existing data, making the dataset a continuously evolving resource that stays relevant and accurate.

Data Sourced from CMS:

The data in this dataset is sourced directly from the Centers for Medicare & Medicaid Services (CMS). After collection, the data is meticulously cleaned and its attributes are aligned, ensuring consistency, accuracy, and ease of use for any application. Furthermore, any new updates or releases from CMS are automatically integrated into the dataset, keeping it comprehensive and current.

Use Cases:

Market Analysis:

The dataset is ideal for market analysts who need to understand the dynamics of the healthcare industry. The extensive historical data allows for detailed segmentation and analysis, helping users identify trends, market shifts, and growth opportunities. The comprehensive nature of the data enables users to perform in-depth analyses of specific market segments, making it a valuable tool for strategic decision-making.

Healthcare Research:

Researchers will find the All CMS Data Feeds dataset to be a robust foundation for academic and commercial research. The historical data, combined with the breadth of coverage across various healthcare metrics, supports rigorous, in-depth analysis. Researchers can explore the effects of healthcare policies, study patient outcomes, analyze provider performance, and more, all within a single, comprehensive dataset.

Performance Tracking:

Healthcare providers and organizations can use the dataset to track performance metrics over time. By comparing data across different periods, organizations can identify areas for improvement, monitor the effectiveness of initiatives, and ensure compliance with regulatory standards. The dataset provides the detailed, reliable data needed to track and analyze key performance indicators.

Compliance and Regulatory Reporting:

The dataset is also an essential tool for compliance officers and those involved in regulatory reporting. With detailed data on provider performance, patient outcomes, and healthcare utilization, the dataset helps organizations meet regulatory requirements, prepare for audits, and ensure adherence to best practices. The accuracy and comprehensiveness of the data make it a trusted resource for regulatory compliance.

Data Quality and Reliability:

The All CMS Data Feeds dataset is designed with a strong emphasis on data quality and reliability. Each row of data is meticulously cleaned and aligned, ensuring that it is both accurate and consistent. This attention to detail makes the dataset a trusted resource for high-stakes applications, where data quality is critical.

Integration and Usability:

Ease of Integration:

The dataset is provided in a CSV format, which is widely compatible with most data analysis too...
C
Patient Demographics
data.chhs.ca.gov
data.ca.gov
+3more
csv, pdf, zip
Updated Aug 29, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of State Hospitals (2024). Patient Demographics [Dataset]. https://data.chhs.ca.gov/dataset/patient-demographics
Explore at:
csv(176), zip, pdf(102502), pdf(104096), csv(896), csv(191), pdf(86902), csv(194), csv(1784), pdf(97992), pdf(93731), pdf(104586), csv(1209), pdf(107720), csv(187), csv(206), pdf(103183), pdf(106532), csv(182), csv(212), pdf(91406), csv(2016), csv(307), csv(1072), csv(553), csv(1144), csv(167), csv(862), csv(208), csv(834)Available download formats
Dataset updated
Aug 29, 2024
Dataset authored and provided by
Department of State Hospitals
Description
Department of State Hospitals Patient Population Demographic (Fiscal Effective Dates: 2010-2020)
N
Medical Lake, WA Age Group Population Dataset: A Complete Breakdown of...
neilsberg.com
csv, json
Updated Feb 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Medical Lake, WA Age Group Population Dataset: A Complete Breakdown of Medical Lake Age Demographics from 0 to 85 Years and Over, Distributed Across 18 Age Groups // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/4535d22d-f122-11ef-8c1b-3860777c1fe6/
Explore at:
json, csvAvailable download formats
Dataset updated
Feb 22, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Washington, Medical Lake
Variables measured
Population Under 5 Years, Population over 85 years, Population Between 5 and 9 years, Population Between 10 and 14 years, Population Between 15 and 19 years, Population Between 20 and 24 years, Population Between 25 and 29 years, Population Between 30 and 34 years, Population Between 35 and 39 years, Population Between 40 and 44 years, and 9 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Medical Lake population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Medical Lake. The dataset can be utilized to understand the population distribution of Medical Lake by age. For example, using this dataset, we can identify the largest age group in Medical Lake.

Key observations

The largest age group in Medical Lake, WA was for the group of age 30 to 34 years years with a population of 580 (11.77%), according to the ACS 2019-2023 5-Year Estimates. At the same time, the smallest age group in Medical Lake, WA was the 85 years and over years with a population of 24 (0.49%). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Variables / Data Columns

Age Group: This column displays the age group in consideration

Population: The population for the specific age group in the Medical Lake is shown in this column.

% of Total Population: This column displays the population of each age group as a proportion of Medical Lake total population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Medical Lake Population by Age. You can refer the same here
Z
A dataset of anonymised hospitalised COVID-19 patient data: outcomes,...
data.niaid.nih.gov
zenodo.org
Updated Jun 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lambert, Ben (2022). A dataset of anonymised hospitalised COVID-19 patient data: outcomes, demographics and biomarker measurements for two New York hospitals [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6771833
Explore at:
Dataset updated
Jun 29, 2022
Dataset provided by
Stopard, Isaac J
Zuretti, Alejandro
Momeni-Boroujeni
Mendoza, Rachelle
Lambert, Ben
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
New York
Description
These datasets are for a cohort of n=1540 anonymised hospitalised COVID-19 patients, and the data provide information on outcomes (i.e. patient death or discharge), demographics and biomarker measurements for two New York hospitals: State University of New York (SUNY) Downstate Health Sciences University and Maimonides Medical Center.

The file "demographics_both_hospitals.csv" contains the ultimate outcomes of hospitalisation (whether a patient was discharged or died), demographic information and known comorbidities for each of the patients.

The file "dynamics_clean_both_hospitals.csv" contains cleaned dynamic biomarker measurements for the n=1233 patients where this information was available and the data passed our various checks (see https://doi.org/10.1101/2021.11.12.21266248 for information of these checks and the cleaning process). Patients can be matched to demographic data via the "id" column.

Study approval and data collection

Study approval was obtained from the State University of New York (SUNY) Downstate Health Sciences University Institutional Review Board (IRB#1595271-1) and Maimonides Medical Center Institutional Review Board/Research Committee (IRB#2020-05-07). A retrospective query was performed among the patients who were admitted to SUNY Downstate Medical Center and Maimonides Medical Center with COVID-19-related symptoms, which was subsequently confirmed by RT PCR, from the beginning of February 2020 until the end of May 2020. Stratified randomization was used to select at least 500 patients who were discharged and 500 patients who died due to the complications of COVID-19. Patient outcome was recorded as a binary choice of “discharged” versus “COVID-19 related mortality”. Patients whose outcome was unknown were excluded. Demographic, clinical history and laboratory data was extracted from the hospital’s electronic health records.
c
Healthcare Dataset
cubig.ai
Updated May 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Healthcare Dataset [Dataset]. https://cubig.ai/store/products/176/healthcare-dataset
Explore at:
Dataset updated
May 7, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
Description
1) Data Introduction • The Healthcare Dataset is a synthetic dataset designed to mimic real-world healthcare data for data science, machine learning, and data analysis purposes. It includes patient information, medical conditions, admission details, and healthcare services provided. This dataset is ideal for developing and testing healthcare predictive models, practicing data manipulation techniques, and creating data visualizations.

2) Data Utilization (1) Healthcare data has characteristics that: • It includes detailed patient information such as age, gender, blood type, medical condition, and admission details. This information can be used to analyze healthcare trends, patient demographics, and the effectiveness of medical treatments. (2) Healthcare data can be used to: • Predictive Modeling: Helps in developing models to predict patient outcomes, treatment success rates, and disease progression. • Healthcare Analytics: Assists in analyzing patient data to identify patterns, improve patient care, and optimize resource allocation. • Educational Purposes: Supports learning and teaching data science concepts in a healthcare context, providing realistic data for experimentation and practice.
Hospital Management Dataset
kaggle.com
Updated May 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kanak Baghel (2025). Hospital Management Dataset [Dataset]. https://www.kaggle.com/datasets/kanakbaghel/hospital-management-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 30, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Kanak Baghel
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
This is a structured, multi-table dataset designed to simulate a hospital management system. It is ideal for practicing data analysis, SQL, machine learning, and healthcare analytics.

Dataset Overview

This dataset includes five CSV files:

patients.csv – Patient demographics, contact details, registration info, and insurance data

doctors.csv – Doctor profiles with specializations, experience, and contact information

appointments.csv – Appointment dates, times, visit reasons, and statuses

treatments.csv – Treatment types, descriptions, dates, and associated costs

billing.csv – Billing amounts, payment methods, and status linked to treatments

📁 Files & Column Descriptions

** patients.csv**

Contains patient demographic and registration details.

Column Description

patient_id -> Unique ID for each patient first_name -> Patient's first name last_name -> Patient's last name gender -> Gender (M/F) date_of_birth -> Date of birth contact_number -> Phone number address -> Address of the patient registration_date -> Date of first registration at the hospital insurance_provider -> Insurance company name insurance_number -> Policy number email -> Email address

** doctors.csv**

Details about the doctors working in the hospital.

Column Description

doctor_id -> Unique ID for each doctor first_name -> Doctor's first name last_name -> Doctor's last name specialization -> Medical field of expertise phone_number -> Contact number years_experience -> Total years of experience hospital_branch -> Branch of hospital where doctor is based email -> Official email address

appointments.csv

Records of scheduled and completed patient appointments.

Column Description

appointment_id -> Unique appointment ID patient_id -> ID of the patient doctor_id -> ID of the attending doctor appointment_date -> Date of the appointment appointment_time -> Time of the appointment reason_for_visit -> Purpose of visit (e.g., checkup) status -> Status (Scheduled, Completed, Cancelled)

treatments.csv

Information about the treatments given during appointments.

Column Description

treatment_id -> Unique ID for each treatment appointment_id -> Associated appointment ID treatment_type -> Type of treatment (e.g., MRI, X-ray) description -> Notes or procedure details cost -> Cost of treatment treatment_date -> Date when treatment was given

** billing.csv**

Billing and payment details for treatments.

Column Description

bill_id -> Unique billing ID patient_id -> ID of the billed patient treatment_id -> ID of the related treatment bill_date -> Date of billing amount -> Total amount billed payment_method -> Mode of payment (Cash, Card, Insurance) payment_status -> Status of payment (Paid, Pending, Failed)

Possible Use Cases

SQL queries and relational database design

Exploratory data analysis (EDA) and dashboarding

Machine learning projects (e.g., cost prediction, no-show analysis)

Feature engineering and data cleaning practice

End-to-end healthcare analytics workflows

Recommended Tools & Resources

SQL (joins, filters, window functions)

Pandas and Matplotlib/Seaborn for EDA

Scikit-learn for ML models

Pandas Profiling for automated EDA

Plotly for interactive visualizations

Please Note that :

All data is synthetically generated for educational and project use. No real patient information is included.

If you find this dataset helpful, consider upvoting or sharing your insights by creating a Kaggle notebook.
D
Washington Health Workforce Survey Data
data.wa.gov
data.virginia.gov
+3more
application/rdfxml +5
Updated Apr 25, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
WA-DOH (2024). Washington Health Workforce Survey Data [Dataset]. https://data.wa.gov/Health/Washington-Health-Workforce-Survey-Data/cvrw-ujje
Explore at:
application/rdfxml, xml, csv, json, application/rssxml, tsvAvailable download formats
Dataset updated
Apr 25, 2024
Dataset authored and provided by
WA-DOH
Area covered
Washington
Description
The Washington State Department of Health presents this information as a service to the public. This includes information on the work status, practice characteristics, education, and demographics of healthcare providers, provided in response to the Washington Health Workforce Survey.

This is a complete set of data across all of the responding professions. The data dictionary identifies questions that are specific to an individual profession and aren't common to all surveys. The dataset is provided without identifying information for the responding providers.

More information on the Washington Health Workforce Survey can be found at www.doh.wa.gov/workforcesurvey
h
A granular assessment of the day-to-day variation in emergency presentations...
healthdatagateway.org
unknown
Updated Mar 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
This publication uses data from PIONEER, an ethically approved database and analytical environment (East Midlands Derby Research Ethics 20/EM/0158) (2024). A granular assessment of the day-to-day variation in emergency presentations [Dataset]. https://healthdatagateway.org/en/dataset/175
Explore at:
unknownAvailable download formats
Dataset updated
Mar 13, 2024
Dataset authored and provided by
This publication uses data from PIONEER, an ethically approved database and analytical environment (East Midlands Derby Research Ethics 20/EM/0158)
License
https://www.pioneerdatahub.co.uk/data/data-request-process/https://www.pioneerdatahub.co.uk/data/data-request-process/
Description
The acute-care pathway (from the emergency department (ED) through acute medical units or ambulatory care and on to wards) is the most visible aspect of the hospital health-care system to most patients. Acute hospital admissions are increasing yearly and overcrowded emergency departments and high bed occupancy rates are associated with a range of adverse patient outcomes. Predicted growth in demand for acute care driven by an ageing population and increasing multimorbidity is likely to exacerbate these problems in the absence of innovation to improve the processes of care.

Key targets for Emergency Medicine services are changing, moving away from previous 4-hour targets. This will likely impact the assessment of patients admitted to hospital through Emergency Departments.

This data set provides highly granular patient level information, showing the day-to-day variation in case mix and acuity. The data includes detailed demography, co-morbidity, symptoms, longitudinal acuity scores, physiology and laboratory results, all investigations, prescriptions, diagnoses and outcomes. It could be used to develop new pathways or understand the prevalence or severity of specific disease presentations.

PIONEER geography: The West Midlands (WM) has a population of 5.9 million & includes a diverse ethnic & socio-economic mix.

Electronic Health Record: University Hospital Birmingham is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & an expanded 250 ITU bed capacity during COVID. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”.

Scope: All patients with a medical emergency admitted to hospital, flowing through the acute medical unit. Longitudinal & individually linked, so that the preceding & subsequent health journey can be mapped & healthcare utilisation prior to & after admission understood. The dataset includes patient demographics, co-morbidities taken from ICD-10 & SNOMED-CT codes. Serial, structured data pertaining to process of care (timings, admissions, wards and readmissions), physiology readings (NEWS2 score and clinical frailty scale), Charlson comorbidity index and time dimensions.

Available supplementary data: Matched controls; ambulance data, OMOP data, synthetic data.

Available supplementary support: Analytics, Model build, validation & refinement; A.I.; Data partner support for ETL (extract, transform & load) process, Clinical expertise, Patient & end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
Race/Ethnicity Health Workforce Trends
healthdata.gov
data.ca.gov
+3more
application/rdfxml +5
Updated Apr 8, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
chhs.data.ca.gov (2025). Race/Ethnicity Health Workforce Trends [Dataset]. https://healthdata.gov/State/Race-Ethnicity-Health-Workforce-Trends/qn52-3qif
Explore at:
json, tsv, csv, xml, application/rdfxml, application/rssxmlAvailable download formats
Dataset updated
Apr 8, 2025
Dataset provided by
chhs.data.ca.gov
Description
The dataset contains estimates for the number of healthcare professionals in 15 different healthcare categories (e.g., Registered Nurse, Dentist, License Clinical Social Worker, etc.) based on completion of license renewal by Race/Ethnicity. There are two timeframes: all current licenses and recent licenses (since 2017). California population estimates are also included to provide a marker for each Race/Ethnicity. Each healthcare professional category can be compared across Race/Ethnicity groups and compared to statewide population estimates, so Race/Ethnicity shortages can be identified for each healthcare professional category. For instance, a notable difference between healthcare professional category and statewide population would indicate either underrepresentation or overrepresentation for that Race/Ethnicity, depending on the direction of the difference.
o
Demographic, Healthcare and Education services - Dataset OD Mekong Datahub
data.opendevelopmentmekong.net
Updated Aug 3, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). Demographic, Healthcare and Education services - Dataset OD Mekong Datahub [Dataset]. https://data.opendevelopmentmekong.net/dataset/demographic-healthcare-and-education-services
Explore at:
Dataset updated
Aug 3, 2016
Description
This dataset describes demographic information, number of healthcare and education facilities and human resources/staff in States/Regions and Townships of Myanmar.
Healthcare Payments Data (HPD) Healthcare Measures
data.chhs.ca.gov
data.ca.gov
+3more
csv, pdf, zip
Updated Jul 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Health Care Access and Information (2025). Healthcare Payments Data (HPD) Healthcare Measures [Dataset]. https://data.chhs.ca.gov/dataset/healthcare-payments-data-hpd-healthcare-measures
Explore at:
csv(28599287), pdf(195607), pdf(235155), csv(41787067), pdf(203716), csv(2088), csv(11049), zipAvailable download formats
Dataset updated
Jul 10, 2025
Dataset authored and provided by
Department of Health Care Access and Information
Description
This dataset contains data for the Healthcare Payments Data (HPD) Healthcare Measures report. The data cover three measurement categories: Health conditions, Utilization, and Demographics. The health condition measurements quantify the prevalence of long-term illnesses and major medical events prominent in California’s communities like diabetes and heart failure. Utilization measures convey rates of healthcare system use through visits to the emergency department and different categories of inpatient stays, such as maternity or surgical stays. The demographic measures describe the health coverage and other characteristics (e.g., age) of the Californians included in the data and represented in the other measures. The data include both a count or sum of each measure and a count of the base population so that data users can calculate the percentages, rates, and averages in the visualization. Measures are grouped by year, age band, sex (assigned sex at birth), payer type, Covered California Region, and county.
E
IMA-AIM data set including Permanent Sample
healthinformationportal.eu
html
Updated Mar 2, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
IMA-AIM (2022). IMA-AIM data set including Permanent Sample [Dataset]. https://www.healthinformationportal.eu/health-information-sources/ima-aim-data-set-including-permanent-sample
Explore at:
htmlAvailable download formats
Dataset updated
Mar 2, 2022
Dataset authored and provided by
IMA-AIM
License
https://aim-ima.be/Donnees-individuelles-realiser-l?lang=frhttps://aim-ima.be/Donnees-individuelles-realiser-l?lang=fr
Variables measured
sex, title, topics, country, language, data_owners, description, contact_name, geo_coverage, contact_email, and 12 more
Measurement technique
Hospital resources & Healthcare resources
Description
IMA-AIM can provide you with detailed data on the health care system in Belgium. Their data collection includes information on the reimbursed care and medicines of the 11 million citizens insured in our country. The data is collected by the 7 health insurance funds and processed, analysed and made available for research by IMA-AIM.

The seven health insurance funds in Belgium collect a lot of data about their members in order to be able to carry out their tasks. IMA-AIM brings these data together in databases for the purpose of analysis and research. The databases contain three types of data: population data (demographic and socio-economic characteristics), information about reimbursed health care and information about reimbursed medicines.

The Permanent Sample (EPS) is a longitudinal dataset containing data from the Population, Health Care and Pharmanet databases, as well as data on hospitalisations. The data are available in separate datasets per calendar year. The aim of EPS is to make the administrative data of the health insurance funds permanently available to a number of federal and regional partners. More information about the EPS: https://metadata.ima-aim.be/nl/app/bdds/Ps
d
Africa Centre for Health and Population Studies
dknet.org
scicrunch.org
+1more
Updated Jul 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Africa Centre for Health and Population Studies [Dataset]. http://identifiers.org/RRID:SCR_008964
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_008964
Dataset updated
Jul 9, 2025
Description
Longitudinal datasets of demographic, social, medical and economic information from a rural demographic in northern KwaZulu-Natal, South Africa where HIV prevalence is extremely high. The data may be filtered by demographics, years, or by individuals questionnaires. The datasets may be used by other researchers but the Africa Centre requests notification that anyone contact them when downloading their data. The datasets are provided in three formats: Stata11 .dta; tables in a MS-Access .accdb database; and worksheets in a MS-Excel .xlsx workbook. Datasets are generated approximately every six months containing information spanning the whole period of surveillance from 1/1/2000 to present.
E
Health Statistic and Research Database
healthinformationportal.eu
html
Updated Feb 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Estonian National Institute for Health Development (2023). Health Statistic and Research Database [Dataset]. https://www.healthinformationportal.eu/health-information-sources/health-statistic-and-research-database
Explore at:
htmlAvailable download formats
Dataset updated
Feb 23, 2023
Dataset authored and provided by
Estonian National Institute for Health Development
Variables measured
sex, title, topics, country, language, data_owners, description, contact_name, geo_coverage, contact_email, and 10 more
Measurement technique
Multiple sources
Description
The Health Statistics and Health Research Database is Estonian largest set of health-related statistics and survey results administrated by National Institute for Health Development. Use of the database is free of charge.

The database consists of eight main areas divided into sub-areas. The data tables included in the sub-areas are assigned unique codes. The data tables presented in the database can be both viewed in the Internet environment, and downloaded using different file formats (.px, .xlsx, .csv, .json). You can download the detailed database user manual here (.pdf).

The database is constantly updated with new data. Dates of updating the existing data tables and adding new data are provided in the release calendar. The date of the last update to each table is provided after the title of the table in the list of data tables.

A contact person for each sub-area is provided under the "Definitions and Methodology" link of each sub-area, so you can ask additional information about the data published in the database. Contact this person for any further questions and data requests.

Read more about publication of health statistics by National Institute for Health Development in Health Statistics Dissemination Principles.
Figure 4.1 Percentage of population with a medical card by age group, 2007...
data.gov.ie
Updated Dec 13, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.gov.ie (2016). Figure 4.1 Percentage of population with a medical card by age group, 2007 and 2016 - Dataset - data.gov.ie [Dataset]. https://data.gov.ie/dataset/figure-41-percentage-of-population-with-a-medical-card-by-age-group-2007-and-2016
Explore at:
Dataset updated
Dec 13, 2016
Dataset provided by
data.gov.ie
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Figure 4.1 Percentage of population with a medical card by age group, 2007 and 2016
Family PACT Client Demographics by County
data.chhs.ca.gov
data.ca.gov
+3more
csv, zip
Updated Aug 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Health Care Services (2024). Family PACT Client Demographics by County [Dataset]. https://data.chhs.ca.gov/dataset/family-pact-client-demographics-by-county
Explore at:
csv(43590), csv(15063), csv(17905), csv(13589), csv(7847), csv(72648), zipAvailable download formats
Dataset updated
Aug 28, 2024
Dataset provided by
California Department of Health Care Serviceshttp://www.dhcs.ca.gov/
Authors
Department of Health Care Services
Description
This dataset includes the following variables: client county; number, percentage, average, and age of clients served, number and percentage of adolescent client served, number and percentage of male clients served , and clients served by race and ethnicity (Latino, White, African American, Asian and Pacific Islander, Other (including Native American); and clients served by primary language (Spanish, English, Other).
h
A synthetic dataset of 15,000 "patients" with Community Acquired Pneumonia...
healthdatagateway.org
unknown
Updated Feb 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Data is representative of the multi-ethnicity population within the West Midlands (42% non white). Data includes all patients admitted during this timeframe, with National data Opt Outs applied, and therefore is representative of admissions to secondary care. Data focuses on in-patient stay in hospital during the acute episode but can be supplemented on request to include previous and subsequent hospital contacts (including outpatient appointments) and ambulance, 111, 999 data. (2024). A synthetic dataset of 15,000 "patients" with Community Acquired Pneumonia (CAP) [Dataset]. https://healthdatagateway.org/en/dataset/197
Explore at:
unknownAvailable download formats
Dataset updated
Feb 13, 2024
Dataset authored and provided by
Data is representative of the multi-ethnicity population within the West Midlands (42% non white). Data includes all patients admitted during this timeframe, with National data Opt Outs applied, and therefore is representative of admissions to secondary care. Data focuses on in-patient stay in hospital during the acute episode but can be supplemented on request to include previous and subsequent hospital contacts (including outpatient appointments) and ambulance, 111, 999 data.
License
https://www.pioneerdatahub.co.uk/data/data-request-process/https://www.pioneerdatahub.co.uk/data/data-request-process/
Description
Community Acquired Pneumonia (CAP) is the leading cause of infectious death and the third leading cause of death globally. Disease severity and outcomes are highly variable, dependent on host factors (such as age, smoking history, frailty and comorbidities), microbial factors (the causative organism) and what treatments are given. Clinical decision pathways are complex and despite guidelines, there is significant national variability in how guidelines are adhered to and patient outcomes.

For clinicians treating pneumonia in the hospital setting, care of these patients can be challenging. Key decisions include the type of antibiotics (oral or intravenous), the appropriate place of care (home, hospital or intensive care), and when it is appropriate to stop antibiotics. Decision support tools to help inform clinical management would be highly valuable to the clinical community.

This dataset is synthetic, formed from statistical modelling using real patient data, and represents a population with significant diversity in terms of patient demography, socio-economic status, CAP severity, treatments and outcomes. It can be used to develop code for deployment on real data, train data analysts and increase familiarity with this disease and its management.

PIONEER geography: The West Midlands (WM) has a population of 5.9 million & includes a diverse ethnic & socio-economic mix.

EHR. UHB is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & an expanded 250 ITU bed capacity during COVID. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”. This synthetic dataset has been modelled to reflect data collected from this EHR.

Scope: A synthetic dataset which has been statistically modelled on all hospitalised patients admitted to UHB with Community Acquired Pneumonia. The dataset includes highly granular patient demographics & co-morbidities taken from ICD-10 & SNOMED-CT codes. Serial, structured data pertaining to process of care including timings, admissions, escalation of care to ITU, discharge outcomes, physiology readings (heart rate, blood pressure, AVPU score and others), blood results and drug prescribing and administration.

Available supplementary data: Matched synthetic controls; ambulance, OMOP data, real patient CAP data. Available supplementary support: Analytics, Model build, validation & refinement; A.I.; Data partner support for ETL (extract, transform & load) process, Clinical expertise, Patient & end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
Medical_cost_dataset
kaggle.com
Updated Aug 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nandita Pore (2023). Medical_cost_dataset [Dataset]. https://www.kaggle.com/datasets/nanditapore/medical-cost-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 19, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Nandita Pore
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Description:

Explore the intricacies of medical costs and healthcare expenses with our meticulously curated Medical Cost Dataset. This dataset offers valuable insights into the factors influencing medical charges, enabling researchers, analysts, and healthcare professionals to gain a deeper understanding of the dynamics within the healthcare industry.

Columns: 1. ID: A unique identifier assigned to each individual record, facilitating efficient data management and analysis. 2. Age: The age of the patient, providing a crucial demographic factor that often correlates with medical expenses. 3. Sex: The gender of the patient, offering insights into potential cost variations based on biological differences. 4. BMI: The Body Mass Index (BMI) of the patient, indicating the relative weight status and its potential impact on healthcare costs. 5. Children: The number of children or dependents covered under the medical insurance, influencing family-related medical expenses. 6. Smoker: A binary indicator of whether the patient is a smoker or not, as smoking habits can significantly impact healthcare costs. 7. Region: The geographic region of the patient, helping to understand regional disparities in healthcare expenditure. 8. Charges: The medical charges incurred by the patient, serving as the target variable for analysis and predictions.

Whether you're aiming to uncover patterns in medical billing, predict future healthcare costs, or explore the relationships between different variables and charges, our Medical Cost Dataset provides a robust foundation for your research. Researchers can utilize this dataset to develop data-driven models that enhance the efficiency of healthcare resource allocation, insurers can refine pricing strategies, and policymakers can make informed decisions to improve the overall healthcare system.

Unlock the potential of healthcare data with our comprehensive Medical Cost Dataset. Gain insights, make informed decisions, and contribute to the advancement of healthcare economics and policy. Start your analysis today and pave the way for a healthier future.
l
Data from: Population Health data collection for the City of Greater Bendigo...
opal.latrobe.edu.au
researchdata.edu.au
xlsx
Updated Mar 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sandra Leggat; Stephen Begg; Charles Ambrose; Greg D'Arcy (2024). Population Health data collection for the City of Greater Bendigo [Dataset]. http://doi.org/10.4225/22/55BAE9DBD9670
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.4225/22/55BAE9DBD9670
Dataset updated
Mar 7, 2024
Dataset provided by
La Trobe
Authors
Sandra Leggat; Stephen Begg; Charles Ambrose; Greg D'Arcy
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Greater Bendigo City
Description
This data collection contains de-identified clinical health service utilisation data from Bendigo Health and the General Practitioners Practices associated with the Loddon Mallee Murray Medicare Local. The collection also includes associated population health data from the ABS, AIHW and the Municipal Health Plans. Health researchers have a major interest in how clinical data can be used to monitor population health and health care in rural and regional Australia through analysing a broad range of factors shown to impact the health of different populations. The Population Health data collection provides students, managers, clinicians and researchers the opportunity to use clinical data in the study of population health, including the analysis of health risk factors, disease trends and health care utilisation and outcomes.Temporal range (data time period):2004 to 2014Spatial coverage:Bendigo Latitude -36.758711200000010000, Bendigo Longitude 144.283745899999990000
P
Medical Cost Personal Dataset Dataset
paperswithcode.com
Updated Jun 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Medical Cost Personal Dataset Dataset [Dataset]. https://paperswithcode.com/dataset/medical-cost-personal-dataset
Explore at:
Dataset updated
Jun 12, 2025
Description
This dataset contains demographic and personal health information for individuals, along with the corresponding medical insurance charges billed to them. It is commonly used to build predictive models for insurance costs and to explore relationships between factors such as age, BMI, smoking status, and region on medical expenses.

Features: - age: Age of the primary beneficiary (integer) - sex: Gender of the individual (male, female) - bmi: Body mass index, providing a measure of body fat based on height and weight (float) - children: Number of children/dependents covered by the insurance (integer) - smoker: Smoking status of the individual (yes, no) - region: Residential area in the US (northeast, northwest, southeast, southwest) - charges: Individual medical costs billed by health insurance (float, in USD)

Applications: This dataset is frequently used in regression modeling, cost prediction, and data visualization tasks. It is ideal for learning how lifestyle and demographic factors impact healthcare expenses and serves as a foundational dataset for applied machine learning in health economics.

Facebook

Twitter

Click to copy link

Link copied

Cite

Dataplex (2024). Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Contact Data | Perfect for Historical Analysis & Easy Ingestion [Dataset]. https://datarade.ai/data-products/dataplex-all-cms-data-feeds-access-1519-reports-26b-row-dataplex-3b76

Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Contact Data | Perfect for Historical Analysis & Easy Ingestion

Explore at:

.csvAvailable download formats

Dataset updated

Aug 29, 2024

Dataset authored and provided by

Dataplex

Area covered

United States of America

Description

The All CMS Data Feeds dataset is an expansive resource offering access to 119 unique report feeds, providing in-depth insights into various aspects of the U.S. healthcare system including nursing facility owners and accountable care organization participants contact data. With over 25.8 billion rows of data meticulously collected since 2007, this dataset is invaluable for healthcare professionals, analysts, researchers, and businesses seeking to understand and analyze healthcare trends, performance metrics, and demographic shifts over time. The dataset is updated monthly, ensuring that users always have access to the most current and relevant data available.

Dataset Overview:

118 Report Feeds: - The dataset includes a wide array of report feeds, each providing unique insights into different dimensions of healthcare. These topics range from Medicare and Medicaid service metrics, patient demographics, provider information, financial data, and much more. The breadth of information ensures that users can find relevant data for nearly any healthcare-related analysis. - As CMS releases new report feeds, they are automatically added to this dataset, keeping it current and expanding its utility for users.

25.8 Billion Rows of Data:

With over 25.8 billion rows of data, this dataset provides a comprehensive view of the U.S. healthcare system. This extensive volume of data allows for granular analysis, enabling users to uncover insights that might be missed in smaller datasets. The data is also meticulously cleaned and aligned, ensuring accuracy and ease of use.

Historical Data Since 2007: - The dataset spans from 2007 to the present, offering a rich historical perspective that is essential for tracking long-term trends and changes in healthcare delivery, policy impacts, and patient outcomes. This historical data is particularly valuable for conducting longitudinal studies and evaluating the effects of various healthcare interventions over time.

Monthly Updates:

To ensure that users have access to the most current information, the dataset is updated monthly. These updates include new reports as well as revisions to existing data, making the dataset a continuously evolving resource that stays relevant and accurate.

Data Sourced from CMS:

The data in this dataset is sourced directly from the Centers for Medicare & Medicaid Services (CMS). After collection, the data is meticulously cleaned and its attributes are aligned, ensuring consistency, accuracy, and ease of use for any application. Furthermore, any new updates or releases from CMS are automatically integrated into the dataset, keeping it comprehensive and current.

Use Cases:

Market Analysis:

The dataset is ideal for market analysts who need to understand the dynamics of the healthcare industry. The extensive historical data allows for detailed segmentation and analysis, helping users identify trends, market shifts, and growth opportunities. The comprehensive nature of the data enables users to perform in-depth analyses of specific market segments, making it a valuable tool for strategic decision-making.

Healthcare Research:

Researchers will find the All CMS Data Feeds dataset to be a robust foundation for academic and commercial research. The historical data, combined with the breadth of coverage across various healthcare metrics, supports rigorous, in-depth analysis. Researchers can explore the effects of healthcare policies, study patient outcomes, analyze provider performance, and more, all within a single, comprehensive dataset.

Performance Tracking:

Healthcare providers and organizations can use the dataset to track performance metrics over time. By comparing data across different periods, organizations can identify areas for improvement, monitor the effectiveness of initiatives, and ensure compliance with regulatory standards. The dataset provides the detailed, reliable data needed to track and analyze key performance indicators.

Compliance and Regulatory Reporting:

The dataset is also an essential tool for compliance officers and those involved in regulatory reporting. With detailed data on provider performance, patient outcomes, and healthcare utilization, the dataset helps organizations meet regulatory requirements, prepare for audits, and ensure adherence to best practices. The accuracy and comprehensiveness of the data make it a trusted resource for regulatory compliance.

Data Quality and Reliability:

The All CMS Data Feeds dataset is designed with a strong emphasis on data quality and reliability. Each row of data is meticulously cleaned and aligned, ensuring that it is both accurate and consistent. This attention to detail makes the dataset a trusted resource for high-stakes applications, where data quality is critical.

Integration and Usability:

Ease of Integration:

The dataset is provided in a CSV format, which is widely compatible with most data analysis too...

Clear search

Close search

Google apps

Main menu

Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Contact...

Patient Demographics

Medical Lake, WA Age Group Population Dataset: A Complete Breakdown of...

About this dataset

Content

Inspiration

Recommended for further research

A dataset of anonymised hospitalised COVID-19 patient data: outcomes,...

Healthcare Dataset

Hospital Management Dataset

Washington Health Workforce Survey Data

A granular assessment of the day-to-day variation in emergency presentations...

Race/Ethnicity Health Workforce Trends

Demographic, Healthcare and Education services - Dataset OD Mekong Datahub

Healthcare Payments Data (HPD) Healthcare Measures

IMA-AIM data set including Permanent Sample

Africa Centre for Health and Population Studies

Health Statistic and Research Database

Figure 4.1 Percentage of population with a medical card by age group, 2007...

Family PACT Client Demographics by County

A synthetic dataset of 15,000 "patients" with Community Acquired Pneumonia...

Medical_cost_dataset

Description:

Data from: Population Health data collection for the City of Greater Bendigo...

Medical Cost Personal Dataset Dataset

Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Contact Data | Perfect for Historical Analysis & Easy IngestionSee More Versions

Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Contact Data | Perfect for Historical Analysis & Easy Ingestion