More details about each file are in the individual file descriptions.
This is a dataset hosted by the Centers for Medicare & Medicaid Services (CMS). The organization has an open data platform and updates its information according to the amount of data that is brought in. Explore CMS's data using Kaggle and all of the data sources available through the CMS organization page!
This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.
This dataset is distributed under the following licenses: Public Domain, NA
This reference provides significant summary information about health expenditures and the Centers for Medicare & Medicaid Services' (CMS) programs. The information presented was the most current available at the time of publication. Significant time lags may occur between the end of a data year and aggregation of data for that year.
https://creativecommons.org/publicdomain/zero/1.0/
The CMS National Plan and Provider Enumeration System (NPPES) was developed as part of the Administrative Simplification provisions of the original HIPAA legislation. The primary purpose of NPPES was to develop a unique identifier for each physician who billed Medicare and Medicaid. This identifier is now known as the National Provider Identifier Standard (NPI), a required 10-digit number that is unique to an individual provider at the national level.
Once an NPI record is assigned to a healthcare provider, the parts of the record that have public relevance, including the provider's name, specialty, and practice address, are published on a searchable website. They are also published as a downloadable file of zipped data containing all of the FOIA-disclosable health care provider data in NPPES, along with a separate PDF file of code values that documents and lists the descriptions for all of the codes found in the data file.
The dataset contains the latest NPI downloadable file in an easy-to-query BigQuery table, npi_raw. In addition, there is a second table, npi_optimized, which harnesses the power of BigQuery's next-generation columnar storage format to provide an analytical view of the NPI data. It contains description fields for the codes, based on the mappings in the Data Dissemination Public File - Code Values documentation, as well as external lookups to the healthcare provider taxonomy codes. While this generates hundreds of columns, BigQuery makes it possible to process all this data effectively and have a convenient single lookup table for all provider information.
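As an illustration of what a 10-digit NPI looks like in practice, the sketch below checks the final digit of a candidate NPI. It assumes the commonly documented rule that the last digit is a Luhn check digit computed after prepending the prefix 80840; that rule is not stated in the description above, and the function name npi_is_valid is hypothetical.

```python
def npi_is_valid(npi: str) -> bool:
    """Return True if `npi` is 10 ASCII digits with a valid check digit.

    Assumption (not from the dataset description): the check digit follows
    the Luhn formula applied to the prefix "80840" plus the 10-digit NPI.
    """
    if len(npi) != 10 or not (npi.isascii() and npi.isdigit()):
        return False
    digits = [int(ch) for ch in "80840" + npi]
    total = 0
    # Walk from the rightmost digit; double every second digit, starting
    # with the digit immediately to the left of the check digit.
    for position, digit in enumerate(reversed(digits)):
        if position % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0
```

A check like this only catches malformed or mistyped identifiers; it says nothing about whether the NPI is actually assigned to a provider.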
Fork this kernel to get started.
https://console.cloud.google.com/marketplace/details/hhs/nppes?filter=category:science-research
Dataset Source: Centers for Medicare & Medicaid Services. This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source (http://www.data.gov/privacy-policy#data_policy) and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Banner Photo by @rawpixel from Unsplash.
What are the top ten most common types of physicians in Mountain View?
What are the names and phone numbers of dentists in California who studied public health?
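A question like the first one could be approached with a query along the following lines. This is a minimal sketch, not the dataset's reference query: the table path and both column names are assumptions that should be checked against the actual npi_optimized schema, and running it requires the google-cloud-bigquery package plus Google Cloud credentials.

```python
# Assumed identifiers; verify them against the npi_optimized schema.
NPI_TABLE = "bigquery-public-data.nppes.npi_optimized"
CITY_COLUMN = "provider_business_practice_location_address_city_name"
SPECIALTY_COLUMN = "healthcare_provider_taxonomy_1_classification"


def build_top_specialties_sql(limit: int = 10) -> str:
    """Build SQL that counts providers per specialty in one city (@city)."""
    return (
        f"SELECT {SPECIALTY_COLUMN} AS specialty, COUNT(*) AS providers\n"
        f"FROM `{NPI_TABLE}`\n"
        f"WHERE UPPER({CITY_COLUMN}) = UPPER(@city)\n"
        f"GROUP BY specialty\n"
        f"ORDER BY providers DESC\n"
        f"LIMIT {int(limit)}"
    )


def run_top_specialties(city: str, limit: int = 10):
    """Run the query; needs google-cloud-bigquery and valid credentials."""
    from google.cloud import bigquery  # imported lazily so the builder works offline

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(
        query_parameters=[bigquery.ScalarQueryParameter("city", "STRING", city)]
    )
    rows = client.query(build_top_specialties_sql(limit), job_config=job_config).result()
    return [(row["specialty"], row["providers"]) for row in rows]
```

Passing the city as a query parameter rather than formatting it into the SQL string avoids quoting problems; a call such as run_top_specialties("Mountain View") would then return up to ten (specialty, count) pairs.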
The Medicare Physician & Other Practitioners by Provider dataset provides information on use, payments, submitted charges and beneficiary demographic and health characteristics organized by National Provider Identifier (NPI). Note: This full dataset contains more records than most spreadsheet programs can handle, which will result in an incomplete load of data. Use of a database or statistical software is required.
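When a spreadsheet cannot hold the full file, one option is to stream it row by row and keep only running aggregates in memory. The sketch below uses Python's standard csv module; the column names npi and payment are hypothetical placeholders, not the dataset's actual headers.

```python
import csv
import io
from collections import defaultdict


def total_by_key(lines, key_column, value_column):
    """Sum `value_column` per `key_column`, reading one row at a time."""
    totals = defaultdict(float)
    for row in csv.DictReader(lines):
        # Treat empty cells as zero so a blank amount does not stop the run.
        totals[row[key_column]] += float(row[value_column] or 0)
    return dict(totals)


# Tiny in-memory stand-in for the real file (hypothetical columns and values).
sample = io.StringIO(
    "npi,payment\n"
    "1111111111,100.0\n"
    "2222222222,20.0\n"
    "1111111111,50.5\n"
)
totals = total_by_key(sample, "npi", "payment")
```

For the real download, the same function can be given an open file handle, for example `open(path, newline="")`, so that memory use depends on the number of distinct keys rather than the number of rows.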
NCHS has linked data from various surveys with Medicare program enrollment and health care utilization and expenditure data from the Centers for Medicare & Medicaid Services (CMS). Linkage of the NCHS survey participants with the CMS Medicare data provides the opportunity to study changes in health status, health care utilization and costs, and prescription drug use among Medicare enrollees. Medicare is the federal health insurance program for people who are 65 or older, certain younger people with disabilities, and people with End-Stage Renal Disease.
This file is a point-in-time snapshot of enrollment-level data for providers actively enrolled in Medicare.
Cover photo by Annie Spratt on Unsplash
Unsplash Images are distributed under a unique Unsplash License.
This dataset is distributed under NA
https://academictorrents.com/nolicensespecified
Corrected and updated version. Contains full records/datasets available through the CMS Open Data API, including:
- CMS Innovation Center Programs
- COVID-19 Resources
- Medicare Current Beneficiary Survey (MCBS)
- Medicare Shared Saving Program
- Medicare Value-Based Payment Modifier Program
- Provider Characteristics
- Provider Compliance
- Provider Summary By Type of Service
- Quality of Care
- Summary Statistics on Beneficiary Enrollment
- Summary Statistics on Provider Enrollment
- Summary Statistics on Use and Payments
About: This site gives you direct access to public data released by the Centers for Medicare & Medicaid Services (CMS). Our goal is to make our data readily available in open, accessible, and machine-readable formats. For most available data, you can:
- Download data in a variety of formats.
- View and analyze data using interactive tools.
- Access data through an Application Programming Interface, or API. An API lets developers connect other applications to data in real time.
The Centers
DE-SynPUF is provided here as 1,000-person (1k), 100,000-person (100k), and 2,300,000-person (2.3m) data sets in the OMOP Common Data Model format. The DE-SynPUF was created with the goal of providing a realistic set of claims data in the public domain while providing the very highest degree of protection to the Medicare beneficiaries’ protected health information. The purposes of the DE-SynPUF are to:
This document contains 100k dimuon events selected from the Mu dataset from Run2010B. Each line corresponds to an event. The main file contains all 100k events. Files with an underscore contain 10k events each.
These data were selected from the Mu primary dataset. The selection criteria may be different from that used in CMS physics results. They are not suitable for a full physics analysis.
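A typical first exercise with dimuon events is to compute the invariant mass of the muon pair, M = sqrt((E1 + E2)^2 - |p1 + p2|^2), in natural units. The sketch below assumes each event provides the energy and momentum components (E, px, py, pz) of both muons; the file's actual column layout is not described here, so the argument names are placeholders.

```python
import math


def invariant_mass(e1, px1, py1, pz1, e2, px2, py2, pz2):
    """Invariant mass of a two-particle system (same units as the inputs)."""
    energy = e1 + e2
    px, py, pz = px1 + px2, py1 + py2, pz1 + pz2
    mass_squared = energy * energy - (px * px + py * py + pz * pz)
    # Rounding can push a near-zero value slightly negative; clamp it.
    return math.sqrt(max(mass_squared, 0.0))


# Two back-to-back, effectively massless muons of 45 GeV each.
example_mass = invariant_mass(45.0, 0.0, 0.0, 45.0, 45.0, 0.0, 0.0, -45.0)
```

Applying the function to every line and histogramming the result is a common way to look for resonance peaks, keeping in mind that these events are not suitable for a full physics analysis.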
https://www.usa.gov/government-works
The Nursing Home COVID-19 Public File from the Centers for Medicare & Medicaid Services, filtered for Connecticut. View the full dataset and detailed metadata here.
The Nursing Home COVID-19 Public File includes data reported by nursing homes to the CDC’s National Healthcare Safety Network (NHSN) system COVID-19 Long Term Care Facility Module, including the Resident Impact, Facility Capacity, Staff & Personnel, Supplies & Personal Protective Equipment, and Ventilator Capacity and Supplies data elements.
This dataset tracks the updates made on the dataset "Center for Medicare & Medicaid Services (CMS), Medicare Claims data" as a repository for previous versions of the data and metadata.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These datasets are a subset of the CMS Open Data with 2021 data-taking conditions for education purposes. The files are in CSV and PKL formats (use only one of the two) and contain two datasets:
- Data files, starting with output_data_CMS_Run2012B, correspond to 4429.37 /pb of data collected by the CMS Experiment. They are a subset of the dataset in reference [1].
- Simulation files, starting with output_sim_CMS_MonteCarlo2012, are a subset of the dataset in reference [2]. The number of generated events in this case is 30458871, and the cross section is 3503.71.
All the files were processed with a modified version of the AOD2NanoAODOutreachTool [3]. The small modifications are related to the number of triggers stored, and some objects like taus were removed.
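The luminosity, cross section, and number of generated events quoted above are the usual ingredients for scaling simulated events to the data. The sketch below shows the standard per-event weight, luminosity times cross section divided by generated events. It assumes the cross section is given in pb, which the description does not state, and it is not taken from the dataset's own documentation.

```python
LUMINOSITY_INV_PB = 4429.37   # integrated luminosity of the data files, in /pb
CROSS_SECTION_PB = 3503.71    # simulation cross section; unit of pb is an assumption
N_GENERATED = 30458871        # number of generated simulation events


def simulation_event_weight(luminosity, cross_section, n_generated):
    """Weight that scales each simulated event to the data luminosity."""
    return luminosity * cross_section / n_generated


weight = simulation_event_weight(LUMINOSITY_INV_PB, CROSS_SECTION_PB, N_GENERATED)
```

Under these assumptions each simulated event counts for roughly half an event in data. If only a fraction of the simulation files is used, the number of generated events in the denominator would need to be reduced accordingly.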
--------------------------------------------------------
[1] CMS collaboration (2017). DoubleMuParked primary dataset in AOD format from Run of 2012 (/DoubleMuParked/Run2012B-22Jan2013-v1/AOD). CERN Open Data Portal. DOI:10.7483/OPENDATA.CMS.YLIC.86ZZ
[2] Wunsch, Stefan (2019). DYJetsToLL dataset in reduced NanoAOD format for education and outreach. CERN Open Data Portal. DOI:10.7483/OPENDATA.CMS.SRRA.2GON
[3] https://github.com/cms-opendata-analyses/AOD2NanoAODOutreachTool
The CMS Program Statistics - Medicare Physician, Non-Physician Practitioner and Supplier tables provide use and payment data for physicians, other practitioners, limited-licensed practitioners, and durable medical equipment, prosthetic, and orthotic (DMEPOS) suppliers.
For additional information on enrollment, providers, and Medicare use and payment, visit the CMS Program Statistics page.
These data do not exist in a machine-readable format, so the view data and API options are not available. Please use the download function to access the data.
Below is the list of tables:
MDCR PHYSSUPP 1. Medicare Physicians, Non-Physician Practitioners, and Suppliers: Utilization, Program Payments, Cost Sharing, and Balance Billing for Original Medicare Beneficiaries, by Type of Entitlement, Yearly Trend
MDCR PHYSSUPP 2. Medicare Physicians, Non-Physician Practitioners, and Suppliers: Utilization, Program Payments, Cost Sharing, and Balance Billing for Original Medicare Beneficiaries, by Demographic Characteristics and Medicare-Medicaid Enrollment Status
MDCR PHYSSUPP 3. Medicare Physicians, Non-Physician Practitioners, and Suppliers: Utilization, Program Payments, Cost Sharing, and Balance Billing for Original Medicare Beneficiaries, by Area of Residence
MDCR PHYSSUPP 4. Medicare Physicians, Non-Physician Practitioners, and Suppliers: Utilization, Program Payments, and Balance Billing for Original Medicare Beneficiaries, by Type of Service
MDCR PHYSSUPP 5. Medicare Physicians, Non-Physician Practitioners, and Suppliers: Utilization, Program Payments, and Balance Billing for Original Medicare Beneficiaries, by Place of Service
MDCR PHYSSUPP 6. Medicare Physicians, Non-Physician Practitioners, and Suppliers: Utilization, Program Payments, and Balance Billing for Original Medicare Beneficiaries, by Physician Specialty
MDCR PHYSSUPP 7. Medicare Physicians, Non-Physician Practitioners, and Suppliers: Utilization and Program Payments for Original Medicare Beneficiaries, by Berenson-Eggers Type of Service (BETOS) Classification
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Centers for Medicare & Medicaid Services (CMS) Special Terms and Conditions (STC) Datasets.
The CMS Program Statistics - Medicare Part A & Part B - All Types of Service tables provide use and payment data by type of coverage and type of service.
For additional information on enrollment, providers, and Medicare use and payment, visit the CMS Program Statistics page.
These data do not exist in a machine-readable format, so the view data and API options are not available. Please use the download function to access the data.
Below is the list of tables:
MDCR SUMMARY AB 1. Medicare Part A and Part B Summary: Utilization, Program Payments, and Cost Sharing for All Original Medicare Beneficiaries, by Type of Coverage and Type of Service, Yearly Trend
MDCR SUMMARY AB 2. Medicare Part A and Part B Summary: Utilization, Program Payments, and Cost Sharing for Aged Original Medicare Beneficiaries, by Type of Coverage and Type of Service, Yearly Trend
MDCR SUMMARY AB 3. Medicare Part A and Part B Summary: Utilization, Program Payments, and Cost Sharing for Disabled Original Medicare Beneficiaries by Type of Coverage and Type of Service, Yearly Trend
MDCR SUMMARY AB 4. Medicare Part A and Part B Summary: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Type of Coverage, Demographic Characteristics, and Medicare-Medicaid Enrollment Status
MDCR SUMMARY AB 5. Medicare Part A and Part B Summary: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Type of Coverage and by Area of Residence
MDCR SUMMARY AB 6. Medicare Part A and Part B Summary: Utilization and Program Payments for Original Medicare Beneficiaries, by Type of Entitlement, Amount of Program Payments, Type of Coverage, and Type of Service
The CMS Program Statistics - Medicare Providers summary tables provide data on institutional (i.e., hospitals, skilled nursing facilities, home health agencies, hospices, etc.) and non-institutional (i.e., physicians, nonphysicians, specialists, and suppliers) providers.
For additional information on enrollment, providers, and Medicare use and payment, visit the CMS Program Statistics page.
These data do not exist in a machine-readable format, so the view data and API options are not available. Please use the download function to access the data.
Below is the list of tables:
MDCR PROVIDERS 1. Medicare Providers: Number of Medicare Certified Institutional Providers, Yearly Trend
MDCR PROVIDERS 2. Medicare Providers: Number of Medicare Certified Inpatient Hospital and Skilled Nursing Facility Beds and Beds Per 1,000 Enrollees, Yearly Trend
MDCR PROVIDERS 3. Medicare Providers: Number of Medicare Certified Facilities, by Type of Control, Yearly Trend
MDCR PROVIDERS 4. Medicare Providers: Number of Skilled Nursing Facilities and Medicare Certified Hospitals, and Number of Beds, by State, Territories, Possessions and Other Areas
MDCR PROVIDERS 5. Medicare Providers: Number of Medicare Certified Providers, by Type of Provider, by State, Territories, Possessions, and Other Areas
MDCR PROVIDERS 6. Medicare Providers: Number of Medicare Non-Institutional Providers by Specialty, Yearly Trend
MDCR PROVIDERS 7. Medicare Providers: Number of Medicare Non-Institutional Providers, by State, Territories, Possessions, and Other Areas, Yearly Trend
The CMS Program Statistics - Medicare Part D tables provide use and Part D drug costs by type of Part D plan (stand-alone prescription drug plan and Medicare Advantage prescription drug plan).
For additional information on enrollment, providers, and Medicare use and payment, visit the CMS Program Statistics page.
These data do not exist in a machine-readable format, so the view data and API options are not available. Please use the download function to access the data.
Below is the list of tables:
MDCR UTLZN D 1. Medicare Part D Utilization: Average Annual Prescription Drug Fills by Type of Plan, Low Income Subsidy (LIS) Eligibility, and Generic Dispensing Rate, Yearly Trend
MDCR UTLZN D 2. Medicare Part D Utilization: Average Annual Gross Drug Costs Per Part D Enrollee, by Type of Plan, Low Income Subsidy (LIS) Eligibility, and Brand/Generic Drug Classification, Yearly Trend
MDCR UTLZN D 3. Medicare Part D Utilization: Average Annual Gross Drug Costs Per Part D Enrollee, by Type of Plan, Low Income Subsidy (LIS) Eligibility, and Brand/Generic Drug Classification, Yearly Trend
MDCR UTLZN D 4. Medicare Part D Utilization: Average Annual Prescription Drug Fills and Average Annual Gross Drug Cost Per Part D Enrollee, by Type of Plan and Demographic Characteristics
MDCR UTLZN D 5. Medicare Part D Utilization: Average Annual Prescription Drug Fills and Average Annual Gross Drug Cost Per Part D Utilizer, by Type of Plan and Demographic Characteristics
MDCR UTLZN D 6. Medicare Part D Utilization: Average Annual Prescription Drug Fills and Average Annual Gross Drug Cost Per Part D Enrollee, by Type of Plan, by Area of Residence
MDCR UTLZN D 7. Medicare Part D Utilization: Average Annual Prescription Drug Fills and Average Annual Gross Drug Cost Per Part D Utilizer, by Type of Plan, by Area of Residence
MDCR UTLZN D 8. Medicare Part D Utilization: Number of Part D Utilizers and Average Annual Prescription Drug Fills by Type of Part D Plan, Low Income Subsidy (LIS) Eligibility, and Part D Coverage Phase, Yearly Trend
MDCR UTLZN D 9. Medicare Part D Utilization: Number of Part D Utilizers and Drug Costs by Type of Part D Plan, Low Income Subsidy (LIS) Eligibility, and Part D Coverage Phase, Yearly Trend
MDCR UTLZN D 10. Medicare Part D Utilization: Number of Part D Utilizers, Average Annual Prescription Drug Events (Fills) and Average Annual Gross Drug Cost Per Part D Utilizer, by Part D Coverage Phase and Demographic Characteristics
MDCR UTLZN D 11. Medicare Part D Utilization: Number of Part D Utilizers, Average Annual Prescription Drug Fills and Average Annual Gross Drug Cost Per Part D Utilizer, by Part D Coverage Phase and Area of Residence
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A dataset of 1,785,625 jets from the Jet Primary Dataset of the CMS 2011A Open Data reprocessed into the MOD HDF5 format. Jets are selected from the hardest two anti-kT R=0.5 jets in events passing the Jet300 High Level Trigger and are required to have \(p_T^\text{jet}>375\) GeV, where \(p_T^\text{jet}\) includes a jet energy correction factor. Particle Flow Candidates (PFCs) for each jet are provided and include information about the PFC kinematics, PDG ID, and vertex. Additionally, jets have metadata describing their kinematics and provenance in the original CMS AOD files.
For additional details about the dataset, please see the accompanying paper, Exploring the Space of Jets with CMS Open Data. There, jets were further restricted to have \(|\eta^\text{jet}|<1.9\) to ensure tracking coverage and have "medium" quality to reject fake jets.
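The kinematic part of the selection described above can be sketched in a few lines. This is an illustrative stand-in that works on plain (pT, eta) tuples, not the EnergyFlow interface; the function name and the tuple layout are hypothetical, and the "medium" quality requirement is not modeled.

```python
def select_jets(event_jets, pt_min=375.0, abs_eta_max=1.9):
    """Keep jets among the two hardest in an event that pass the pT and eta cuts.

    `event_jets` is a list of (pt_in_gev, eta) tuples for a single event.
    """
    # Take the two highest-pT jets first, then apply the cuts to each of them.
    hardest_two = sorted(event_jets, key=lambda jet: jet[0], reverse=True)[:2]
    return [
        jet for jet in hardest_two
        if jet[0] > pt_min and abs(jet[1]) < abs_eta_max
    ]


# Hypothetical event: the 380 GeV jet is not among the two hardest.
selected = select_jets([(500.0, 0.3), (380.0, 2.4), (400.0, -1.0), (100.0, 0.0)])
```

Note the order of operations: a jet outside the two hardest is never promoted when one of the leading jets fails a cut, which matches the wording "selected from the hardest two" jets.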
The supported method for downloading, reading, and using this dataset is through the EnergyFlow Python package, which has additional documentation about how to read and use this and related datasets. Should any problems be encountered, please submit an issue on GitHub.
There are corresponding datasets of simulated jets organized by hard parton \(\hat p_T\) also available on Zenodo:
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Simulated QCD jets from the Simulated QCD 1400-1800 Dataset of the CMS 2011 Open Data reprocessed into the MOD HDF5 format. Jets are provided at generator (truth) level in the GEN files and after GEANT4 detector simulation in the SIM files (which also contain associated GEN jets to facilitate studies involving both types of jets). Jets are selected from the hardest two anti-kT R=0.5 jets in events passing the Jet300 High Level Trigger (only relevant for SIM) and are required to have \(p_T^\text{jet}>375\) GeV, where \(p_T^\text{jet}\) includes a jet energy correction factor (again, only relevant for SIM). GEN jets contain truth-level particles with kinematic and PDG ID information, and SIM jets contain Particle Flow Candidates (PFCs) with kinematic, PDG ID, and vertex information. Additionally, jets have metadata describing their kinematics and provenance in the original CMS AOD files.
For additional details about the dataset, please see the accompanying paper, Exploring the Space of Jets with CMS Open Data. There, jets were further restricted to have \(|\eta^\text{jet}|<1.9\) to ensure tracking coverage and (in the case of SIM) have "medium" quality to reject fake jets.
The supported method for downloading, reading, and using this dataset is through the EnergyFlow Python package, which has additional documentation about how to read and use this and related datasets. Should any problems be encountered, please submit an issue on GitHub.
For reference, the other corresponding datasets of simulated jets available on Zenodo are:
There is an associated dataset of jets recorded by the CMS detector available on Zenodo: