https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The CMS National Plan and Provider Enumeration System (NPPES) was developed as part of the Administrative Simplification provisions in the original HIPAA act. The primary purpose of NPPES was to develop a unique identifier for each physician that billed medicare and medicaid. This identifier is now known as the National Provider Identifier Standard (NPI) which is a required 10 digit number that is unique to an individual provider at the national level.
Once an NPI record is assigned to a healthcare provider, parts of the NPI record that have public relevance, including the provider’s name, speciality, and practice address are published in a searchable website as well as downloadable file of zipped data containing all of the FOIA disclosable health care provider data in NPPES and a separate PDF file of code values which documents and lists the descriptions for all of the codes found in the data file.
The dataset contains the latest NPI downloadable file in an easy to query BigQuery table, npi_raw. In addition, there is a second table, npi_optimized which harnesses the power of Big Query’s next-generation columnar storage format to provide an analytical view of the NPI data containing description fields for the codes based on the mappings in Data Dissemination Public File - Code Values documentation as well as external lookups to the healthcare provider taxonomy codes . While this generates hundreds of columns, BigQuery makes it possible to process all this data effectively and have a convenient single lookup table for all provider information.
Fork this kernel to get started.
https://console.cloud.google.com/marketplace/details/hhs/nppes?filter=category:science-research
Dataset Source: Center for Medicare and Medicaid Services. This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy — and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Banner Photo by @rawpixel from Unplash.
What are the top ten most common types of physicians in Mountain View?
What are the names and phone numbers of dentists in California who studied public health?
Verify the accuracy of SSNs of all individual Medicare providers, owners, managing/directing employees, authorized representatives, ambulance service medical directors, ambulance crew members, technicians, chain organization administrators, Independent Diagnostic Test Facility (IDTF), supervising/directing physicians, and IDTF interpretation service providers. Also included in this Agreement are individual health care providers who apply for a National Provider Identification Number (NPI).
The Medicare Physician & Other Practitioners by Provider and Service dataset provides information on use, payments, and submitted charges organized by National Provider Identifier (NPI), Healthcare Common Procedure Coding System (HCPCS) code, and place of service. Note: This full dataset contains more records than most spreadsheet programs can handle, which will result in an incomplete load of data. Use of a database or statistical software is required.
The Medicare Provider and Supplier Taxonomy Crosswalk dataset lists the providers and suppliers eligible to enroll in Medicare programs with the proper healthcare provider taxonomy code. This data includes the Medicare speciality codes, if available, provider/supplier type description, taxonomy code, and the taxonomy description.
Please be advised that as of Q4 2023 there is a new Provider of Service file (POS) that contains the provider and certification details for Home Health Agencies (HHAs), Hospices, and Ambulatory Surgical Centers (ASCs). Data contained in this file are extracted from the Internet Quality Improvement and Evaluation System (iQIES) environment and will be updated quarterly along with the other two POS files. The Provider of Services File - Hospital & Non-Hospital Facilities data provide critical resources for other federal regulator requirements as well as supports the ongoing quality & research efforts sponsored by CMS. In this file you will find provider certification, termination, accreditation, ownership, name, location and other characteristics organized by CMS Certification Number.
The All CMS Data Feeds dataset is an expansive resource offering access to 119 unique report feeds, providing in-depth insights into various aspects of the U.S. healthcare system including nursing facility owners and accountable care organization participants contact data. With over 25.8 billion rows of data meticulously collected since 2007, this dataset is invaluable for healthcare professionals, analysts, researchers, and businesses seeking to understand and analyze healthcare trends, performance metrics, and demographic shifts over time. The dataset is updated monthly, ensuring that users always have access to the most current and relevant data available.
Dataset Overview:
118 Report Feeds: - The dataset includes a wide array of report feeds, each providing unique insights into different dimensions of healthcare. These topics range from Medicare and Medicaid service metrics, patient demographics, provider information, financial data, and much more. The breadth of information ensures that users can find relevant data for nearly any healthcare-related analysis. - As CMS releases new report feeds, they are automatically added to this dataset, keeping it current and expanding its utility for users.
25.8 Billion Rows of Data:
Historical Data Since 2007: - The dataset spans from 2007 to the present, offering a rich historical perspective that is essential for tracking long-term trends and changes in healthcare delivery, policy impacts, and patient outcomes. This historical data is particularly valuable for conducting longitudinal studies and evaluating the effects of various healthcare interventions over time.
Monthly Updates:
Data Sourced from CMS:
Use Cases:
Market Analysis:
Healthcare Research:
Performance Tracking:
Compliance and Regulatory Reporting:
Data Quality and Reliability:
The All CMS Data Feeds dataset is designed with a strong emphasis on data quality and reliability. Each row of data is meticulously cleaned and aligned, ensuring that it is both accurate and consistent. This attention to detail makes the dataset a trusted resource for high-stakes applications, where data quality is critical.
Integration and Usability:
Ease of Integration:
According to our latest research, the global Provider Directory Software market size reached USD 1.74 billion in 2024, reflecting robust adoption across healthcare and insurance sectors. The market is experiencing strong momentum with a recorded CAGR of 10.2% from 2025 to 2033. By 2033, the market is forecasted to reach USD 4.18 billion, driven by regulatory mandates, increasing digitization in healthcare, and the need for accurate, real-time provider information. The growth is further fueled by the growing emphasis on interoperability, patient engagement, and the demand for cost-effective healthcare administration solutions.
The surge in the Provider Directory Software market can be primarily attributed to the escalating need for accurate, up-to-date, and easily accessible provider information. Healthcare organizations, insurance companies, and regulatory bodies worldwide are increasingly mandating the use of digital provider directories to reduce administrative errors, streamline provider-patient matching, and enhance member satisfaction. Inaccurate provider data has historically resulted in claim denials, compliance violations, and poor patient experiences. As a result, the adoption of comprehensive provider directory software is becoming a strategic imperative for organizations aiming to mitigate risk and improve operational efficiency. The integration of advanced technologies such as artificial intelligence, machine learning, and blockchain is further elevating the capabilities of these solutions, enabling real-time data synchronization, automated data validation, and enhanced security.
Another significant growth factor is the evolving regulatory landscape, particularly in regions like North America and Europe. Governments and healthcare authorities are implementing stringent regulations that require payers and providers to maintain accurate, accessible, and regularly updated provider directories. For instance, the Centers for Medicare & Medicaid Services (CMS) in the United States has established rigorous standards for provider data accuracy, with non-compliance resulting in substantial financial penalties. This regulatory pressure is compelling healthcare organizations to invest in robust provider directory software that ensures compliance, reduces administrative burdens, and improves transparency. Furthermore, the ongoing shift towards value-based care and patient-centric healthcare delivery models is amplifying the demand for efficient provider data management systems.
The rapid digital transformation within the healthcare ecosystem is also playing a pivotal role in market expansion. The proliferation of electronic health records (EHRs), telemedicine, and integrated care delivery models necessitates seamless interoperability between disparate systems and stakeholders. Provider directory software acts as a critical enabler by consolidating provider information from multiple sources, ensuring data consistency, and facilitating secure data exchange across networks. The increased focus on patient engagement and experience is driving healthcare organizations to leverage provider directory solutions that offer intuitive search functionalities, real-time availability, and personalized provider recommendations. As digital health initiatives continue to gain traction, the provider directory software market is poised for sustained growth over the forecast period.
From a regional perspective, North America currently holds the largest market share, underpinned by advanced healthcare IT infrastructure, high adoption rates of digital health solutions, and a favorable regulatory environment. Europe follows closely, with growing investments in healthcare digitization and interoperability initiatives. The Asia Pacific region is emerging as a lucrative market, driven by expanding healthcare access, rising investments in health IT, and increasing awareness of the importance of accurate provider data. Latin America and the Middle East & Africa are also witnessing gradual adoption, supported by government-led digital health programs and the modernization of healthcare systems. Overall, the global provider directory software market is characterized by dynamic growth, evolving regulatory requirements, and increasing technological innovation.
More details about each file are in the individual file descriptions.
This is a dataset hosted by the Centers for Medicare & Medicaid Services (CMS). The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore CMS's Data using Kaggle and all of the data sources available through the CMS organization page!
This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.
This dataset is distributed under the following licenses: NA, Public Domain
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
We are releasing data that compares the HHS Provider Relief Fund and the CMS Accelerated and Advance Payments by State and provider as of May 15, 2020. This data is already available on other websites, but this chart brings the information together into one view for comparison. You can find additional information on the Accelerated and Advance Payments at the following links:
Fact Sheet: https://www.cms.gov/files/document/Accelerated-and-Advanced-Payments-Fact-Sheet.pdf;
Zip file on providers in each state: https://www.cms.gov/files/zip/accelerated-payment-provider-details-state.zip
Medicare Accelerated and Advance Payments State-by-State information and by Provider Type: https://www.cms.gov/files/document/covid-accelerated-and-advance-payments-state.pdf.
This file was assembled by HHS via CMS, HRSA and reviewed by leadership and compares the HHS Provider Relief Fund and the CMS Accelerated and Advance Payments by State and provider as of December 4, 2020.
HHS Provider Relief Fund President Trump is providing support to healthcare providers fighting the coronavirus disease 2019 (COVID-19) pandemic through the bipartisan Coronavirus Aid, Relief, & Economic Security Act and the Paycheck Protection Program and Health Care Enhancement Act, which provide a total of $175 billion for relief funds to hospitals and other healthcare providers on the front lines of the COVID-19 response. This funding supports healthcare-related expenses or lost revenue attributable to COVID-19 and ensures uninsured Americans can get treatment for COVID-19. HHS is distributing this Provider Relief Fund money and these payments do not need to be repaid. The Department allocated $50 billion of the Provider Relief Fund for general distribution to Medicare facilities and providers impacted by COVID-19, based on eligible providers' net reimbursement. It allocated another $22 billion to providers in areas particularly impacted by the COVID-19 outbreak, rural providers, and providers who serve low-income populations and uninsured Americans. HHS will be allocating the remaining funds in the near future.
As part of the Provider Relief Fund distribution, all providers have 45 days to attest that they meet certain criteria to keep the funding they received, including public disclosure. As of May 15, 2020, there has been a total of $34 billion in attested payments. The chart only includes those providers that have attested to the payments by that date. We will continue to update this information and add the additional providers and payments once their attestation is complete.
CMS Accelerated and Advance Payments Program On March 28, 2020, to increase cash flow to providers of services and suppliers impacted by the coronavirus disease 2019 (COVID-19) pandemic, the Centers for Medicare & Medicaid Services (CMS) expanded the Accelerated and Advance Payment Program to a broader group of Medicare Part A providers and Part B suppliers. Beginning on April 26, 2020, CMS stopped accepting new applications for the Advance Payment Program, and CMS began reevaluating all pending and new applications for Accelerated Payments in light of the availability of direct payments made through HHS’s Provider Relief Fund.
Since expanding the AAP program on March 28, 2020, CMS approved over 21,000 applications totaling $59.6 billion in payments to Part A providers, which includes hospitals, through May 18, 2020. For Part B suppliers—including doctors, non-physician practitioners and durable medical equipment suppliers— during the same time period, CMS approved almost 24,000 applications advancing $40.4 billion in payments. The AAP program is not a grant, and providers and suppliers are required to repay the loan.
CMS has published AAP data, as required by the Continuing Appropriations and Other Extensions Act of 2021, on this website: https://www.cms.gov/files/document/covid-medicare-accelerated-and-advance-payments-program-covid-19-public-health-emergency-payment.pdf. Requests for additional data related to the program must be submitted through the CMS FOIA office. For more information on how to submit a FOIA request please visit our website at https://www.cms.gov/Regulations-and-Guidance/Legislation/FOIA. The PRF is administered by the Health Resources & Services Administration (HRSA). For more information on how to submit a request for unpublished program data from HRSA, please visit https://www.hrsa.gov/foia/index.html.
Provider Relief Fund Data - https://data.cdc.gov/Administrative/Provider-Relief-Fund-COVID-19-High-Impact-Payments/b58h-s9zx
The Hospital All Owners Information dataset provides information on all owners of the hospitals. This data includes ownership information such as ownership name, ownership type, ownership address and ownership effective date.
https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified
Contains full records from the CMS Open Payments Data API. About openpaymentsdata.cms.gov: The mission of the program is to provide the public with a more transparent health care system. Open Payments collects and publishes information about financial relationships between drug and medical device companies (referred to as "reporting entities") and certain health care providers (referred to as "covered recipients"). These relationships may involve payments to providers for things including but not limited to research, meals, travel, gifts or speaking fees. All information available on the Open Payments database is open to personal interpretation and if there are questions about the data, patients and their advocates should speak directly to the health care provider for a better understanding.
The CMS Program Statistics - Medicare Inpatient Hospital tables provide use and payment data for all inpatient hospitals, including short-stay hospitals, critical access hospitals, long term care hospitals, inpatient psychiatric facilities, inpatient rehabilitation facilities, religious nonmedical health care institutions, children’s hospitals, and other hospitals.
For additional information on enrollment, providers, and Medicare use and payment, visit the CMS Program Statistics page.
These data do not exist in a machine-readable format, so the view data and API options are not available. Please use the download function to access the data.
Below is the list of tables:
MDCR INPT HOSP 1. All Medicare Inpatient Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Type of Entitlement, Yearly Trend MDCR INPT HOSP 2. All Medicare Inpatient Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Demographic Characteristics and Medicare-Medicaid Enrollment Status MDCR INPT HOSP 3. All Medicare Inpatient Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Area of Residence MDCR INPT HOSP 4. All Medicare Inpatient Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Type of Hospital MDCR INPT HOSP 5. Medicare IPPS Short Stay Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Type of Entitlement, Yearly Trend MDCR INPT HOSP 6. Medicare IPPS Short Stay Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Demographic Characteristics and Medicare-Medicaid Enrollment MDCR INPT HOSP 7. Medicare IPPS Short Stay Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Area of Residence MDCR INPT HOSP 8. Medicare IPPS Short Stay Hospitals: Utilization and Program Payments for Original Medicare Beneficiaries, by Type of Entitlement and Total Days of Care MDCR INPT HOSP 9. Medicare IPPS Short Stay Hospitals: Utilization and Program Payments for Original Medicare Beneficiaries, by Location and Bedsize of Hospitals, by Medical School Affiliation, and Type of Control MDCR INPT HOSP 10. Special-Category Hospitals: Utilization, Program Payments, and Cost Sharing for Original Medicare Beneficiaries, by Type of Hospital
The table Referring durable medical equip is part of the dataset CMS 2014 Medicare Data, available at https://redivis.com/datasets/349j-7phs91amz. It contains 366242 rows across 54 variables.
The Hospital Enrollments dataset provides enrollment information of all Hospitals currently enrolled in Medicare. This data includes information on the Hospital's sub-group type, legal business name, doing business as name, organization type and address.
description:
The Physician and Other Supplier Public Use File (Physician and Other Supplier PUF) provides information on services and procedures provided to Medicare beneficiaries by physicians and other healthcare professionals. The Physician and Other Supplier PUF contains information on utilization, payment (allowed amount and Medicare payment), and submitted charges organized by National Provider Identifier (NPI), Healthcare Common Procedure Coding System (HCPCS) code, and place of service. This PUF is based on information from CMS administrative claims data for Medicare beneficiaries enrolled in the fee-for-service program. The data in the Physician and Other Supplier PUF covers calendar years 2012 through 2014 and contains 100 percent final-action physician-supplier Part B non-institutional line items for the Medicare fee-for-service population.
; abstract:The Physician and Other Supplier Public Use File (Physician and Other Supplier PUF) provides information on services and procedures provided to Medicare beneficiaries by physicians and other healthcare professionals. The Physician and Other Supplier PUF contains information on utilization, payment (allowed amount and Medicare payment), and submitted charges organized by National Provider Identifier (NPI), Healthcare Common Procedure Coding System (HCPCS) code, and place of service. This PUF is based on information from CMS administrative claims data for Medicare beneficiaries enrolled in the fee-for-service program. The data in the Physician and Other Supplier PUF covers calendar years 2012 through 2014 and contains 100 percent final-action physician-supplier Part B non-institutional line items for the Medicare fee-for-service population.
https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global Provider Data Management (PDM) Software market size was valued at approximately USD 1.5 billion in 2023 and is projected to reach around USD 3.8 billion by 2032, growing at a compound annual growth rate (CAGR) of 11.5% during the forecast period. This significant growth can be attributed to the increasing need for accurate provider information, the rising adoption of digital solutions in the healthcare sector, and the growing emphasis on regulatory compliance.
The growth of the PDM Software market is driven by several key factors. Firstly, the healthcare industry's shift towards digitalization necessitates accurate and up-to-date provider data. Accurate provider data is crucial for streamlining operations, improving patient outcomes, and ensuring regulatory compliance. As healthcare organizations strive to enhance operational efficiency and patient care, the demand for PDM software continues to rise. Furthermore, regulatory requirements, such as those mandated by the Centers for Medicare & Medicaid Services (CMS) in the United States, compel healthcare organizations to maintain accurate provider directories. Non-compliance can result in significant penalties, prompting healthcare providers to invest in robust PDM solutions.
Secondly, the growing complexity of healthcare networks and the increasing number of healthcare providers necessitate efficient data management solutions. As healthcare networks expand and new providers join the system, managing provider data manually becomes impractical and error-prone. PDM software automates data management processes, reduces administrative burdens, and ensures data accuracy. This automation not only saves time and resources but also minimizes the risk of errors and inconsistencies in provider data, thereby enhancing overall operational efficiency.
Another significant growth driver is the rising adoption of cloud-based solutions. Cloud-based PDM software offers several advantages, including scalability, flexibility, and cost-effectiveness. Healthcare organizations, particularly small and medium-sized enterprises (SMEs), are increasingly opting for cloud-based solutions to avoid the high upfront costs associated with on-premises deployments. Additionally, cloud-based solutions enable easy integration with other healthcare IT systems, facilitating seamless data exchange and interoperability. This interoperability is crucial for providing comprehensive patient care and improving coordination among healthcare providers.
Master Data Management plays a pivotal role in the healthcare sector by ensuring that provider data is accurate, consistent, and up-to-date across various systems. As healthcare organizations increasingly rely on digital solutions, the need for effective Master Data Management becomes more pronounced. This approach not only enhances data quality but also supports regulatory compliance and operational efficiency. By integrating Master Data Management into their data management strategies, healthcare providers can streamline processes, reduce errors, and improve patient care outcomes. The ability to maintain a single, reliable source of truth for provider data is essential for navigating the complexities of modern healthcare networks.
Regionally, North America dominates the PDM Software market due to the presence of advanced healthcare infrastructure, stringent regulatory requirements, and high adoption of digital health solutions. The United States, in particular, has a significant market share owing to its well-established healthcare system and continuous efforts to enhance data accuracy and provider information. Europe follows closely, with countries like the United Kingdom, Germany, and France investing in healthcare IT solutions to improve operational efficiency and patient care. The Asia Pacific region is expected to witness the highest growth rate during the forecast period, driven by increasing healthcare investments, expanding healthcare networks, and rising awareness about the benefits of digital solutions.
The Provider Data Management (PDM) Software market can be segmented based on components into software and services. The software segment dominates the market, as it includes core PDM applications that healthcare organizations use to manage provider data. This segment is expected to witness substantial growth due to the increasing adoption of advanced software solutions designed to streamline
The Medicare Durable Medical Equipment, Devices & Supplies by Referring Provider dataset contains information on usage, payments, submitted charges and beneficiary demographic and health characteristics organized by National Provider Identifier (NPI).
Note: This web page provides data on health facilities only. To file a complaint against a facility, please see: https://www.cdph.ca.gov/Programs/CHCQ/LCP/Pages/FileAComplaint.aspx
The California Department of Public Health (CDPH), Center for Health Care Quality, Licensing and Certification (L&C) Program licenses and certifies more than 30 types of healthcare facilities. The Electronic Licensing Management System (ELMS) is a CDPH data system created to manage state licensing-related data and enforcement actions. This file includes California healthcare facilities that are operational and have a current license issued by the CDPH and/or a current U.S. Department of Health and Human Services’ Centers for Medicare and Medicaid Services (CMS) certification.
To link the CDPH facility IDs with those from other Departments, like HCAI, please reference the "Licensed Facility Cross-Walk" Open Data table at https://data.chhs.ca.gov/dataset/licensed-facility-crosswalk. Facility geographic variables are updated monthly, if latitude/longitude information is missing at any point in time, it should be available when the next time the Open Data facility file is refreshed.
Please note that the file contains the data from ELMS as of the 11th business day of the month. See DATA_DATE variable for the specific date of when the data was extracted.
Map of all Health Care Facilities in California: https://go.cdii.ca.gov/cdph-facilities
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The Health Insurance Marketplace Public Use Files contain data on health and dental plans offered to individuals and small businesses through the US Health Insurance Marketplace.
To help get you started, here are some data exploration ideas:
See this forum thread for more ideas, and post there if you want to add your own ideas or answer some of the open questions!
This data was originally prepared and released by the Centers for Medicare & Medicaid Services (CMS). Please read the CMS Disclaimer-User Agreement before using this data.
Here, we've processed the data to facilitate analytics. This processed version has three components:
The original versions of the 2014, 2015, 2016 data are available in the "raw" directory of the download and "../input/raw" on Kaggle Scripts. Search for "dictionaries" on this page to find the data dictionaries describing the individual raw files.
In the top level directory of the download ("../input" on Kaggle Scripts), there are six CSV files that contain the combined at across all years:
Additionally, there are two CSV files that facilitate joining data across years:
The "database.sqlite" file contains tables corresponding to each of the processed CSV files.
The code to create the processed version of this data is available on GitHub.
Centers for Medicare & Medicaid Services - Nursing HomesThis feature layer, utilizing data from the Centers for Medicare & Medicaid Services (CMS), displays the locations of nursing homes in the U.S. Nursing homes provide a type of residential care. They are a place of residence for people who require constant nursing care and have significant deficiencies with activities of daily living. Per CMS, "Nursing homes, which include Skilled Nursing Facilities (SNFs) and Nursing Facilities (NFs), are required to be in compliance with Federal requirements to receive payment under the Medicare or Medicaid programs. The Secretary of the United States Department of Health & Human Services has delegated to the CMS and the State Medicaid Agency the authority to impose enforcement remedies against a nursing home that does not meet Federal requirements." This layer includes currently active nursing homes, including number of certified beds, address, and other information.Bridgepoint Sub-Acute and Rehab Capitol HillData downloaded: August 1, 2024Data source: Provider InformationData modification: This dataset includes only those facilities with addresses that were appropriately geocoded.For more information: Nursing homes including rehab servicesFor feedback, please contact: ArcGIScomNationalMaps@esri.comCenters for Medicare & Medicaid ServicesPer USA.gov, "The Centers for Medicare and Medicaid Services (CMS) provides health coverage to more than 100 million people through Medicare, Medicaid, the Children’s Health Insurance Program, and the Health Insurance Marketplace. The CMS seeks to strengthen and modernize the Nation’s health care system, to provide access to high quality care and improved health at lower costs."
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The CMS National Plan and Provider Enumeration System (NPPES) was developed as part of the Administrative Simplification provisions in the original HIPAA act. The primary purpose of NPPES was to develop a unique identifier for each physician that billed medicare and medicaid. This identifier is now known as the National Provider Identifier Standard (NPI) which is a required 10 digit number that is unique to an individual provider at the national level.
Once an NPI record is assigned to a healthcare provider, parts of the NPI record that have public relevance, including the provider’s name, speciality, and practice address are published in a searchable website as well as downloadable file of zipped data containing all of the FOIA disclosable health care provider data in NPPES and a separate PDF file of code values which documents and lists the descriptions for all of the codes found in the data file.
The dataset contains the latest NPI downloadable file in an easy to query BigQuery table, npi_raw. In addition, there is a second table, npi_optimized which harnesses the power of Big Query’s next-generation columnar storage format to provide an analytical view of the NPI data containing description fields for the codes based on the mappings in Data Dissemination Public File - Code Values documentation as well as external lookups to the healthcare provider taxonomy codes . While this generates hundreds of columns, BigQuery makes it possible to process all this data effectively and have a convenient single lookup table for all provider information.
Fork this kernel to get started.
https://console.cloud.google.com/marketplace/details/hhs/nppes?filter=category:science-research
Dataset Source: Center for Medicare and Medicaid Services. This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy — and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Banner Photo by @rawpixel from Unplash.
What are the top ten most common types of physicians in Mountain View?
What are the names and phone numbers of dentists in California who studied public health?