The Healthcare Cost and Utilization Project (HCUP) National Inpatient Sample (NIS) is the largest publicly available all-payer inpatient care database in the United States. The NIS is designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from more than 7 million hospital stays each year. Weighted, it estimates more than 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels. Starting with the 2012 data year, the NIS is a sample of discharges from all hospitals participating in HCUP, covering more than 97 percent of the U.S. population. For prior years, the NIS was a sample of hospitals. The NIS allows for weighted national estimates to identify, track, and analyze national trends in health care utilization, access, charges, quality, and outcomes. The NIS's large sample size enables analyses of rare conditions, such as congenital anomalies; uncommon treatments, such as organ transplantation; and special patient populations, such as the uninsured. NIS data are available since 1988, allowing analysis of trends over time. The NIS inpatient data include clinical and resource use information typically available from discharge abstracts with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). Data elements include but are not limited to: diagnoses, procedures, discharge status, patient demographics (e.g., sex, age), total charges, length of stay, and expected payment source, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. The NIS excludes data elements that could directly or indirectly identify individuals. Restricted access data files are available with a data use agreement and brief online security training.
The National (Nationwide) Inpatient Sample (NIS) is part of a family of databases and software tools developed for the Healthcare Cost and Utilization Project (HCUP). The NIS is the largest publicly available all-payer inpatient healthcare database designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from around 7 million hospital stays each year. Weighted, it estimates around 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels.
The NIS is the largest publicly available all-payer inpatient healthcare database designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from around 7 million hospital stays each year. Weighted, it estimates around 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels.
Its large sample size is ideal for developing national and regional estimates and enables analyses of rare conditions, uncommon treatments, and special populations.
IMPORTANT NOTE: Some records are missing from the Severity Measures table for 2017 & 2018, but none are missing from any of the other 2012-2020 data. We are in the process of trying to recover the missing records, and will update this note when we have done so.
Also %3Cu%3EDO NOT%3C/u%3E
use this data without referring to the NIS Database Documentation, which includes:
%3C!-- --%3E
%3C!-- --%3E
All manuscripts (and other items you'd like to publish) must be submitted to
phsdatacore@stanford.edu for approval prior to journal submission.
We will check your cell sizes and citations.
For more information about how to cite PHS and PHS datasets, please visit:
https:/phsdocs.developerhub.io/need-help/citing-phs-data-core
For additional assistance, AHRQ has created the HCUP Online Tutorial Series, a series of free, interactive courses which provide training on technical methods for conducting research with HCUP data. Topics include an HCUP Overview Course and these tutorials:
• The HCUP Sampling Design tutorial is designed to help users learn how to account for sample design in their work with HCUP national (nationwide) databases. • The Producing National HCUP Estimates tutorial is designed to help users understand how the three national (nationwide) databases – the NIS, Nationwide Emergency Department Sample (NEDS), and Kids' Inpatient Database (KID) – can be used to produce national and regional estimates. HCUP 2020 NIS (8/22/22) 14 Introduction • The Calculating Standard Errors tutorial shows how to accurately determine the precision of the estimates produced from the HCUP nationwide databases. Users will learn two methods for calculating standard errors for estimates produced from the HCUP national (nationwide) databases. • The HCUP Multi-year Analysis tutorial presents solutions that may be necessary when conducting analyses that span multiple years of HCUP data. • The HCUP Software Tools Tutorial provides instructions on how to apply the AHRQ software tools to HCUP or other administrative databases.
New tutorials are added periodically, and existing tutorials are updated when necessary. The Online Tutorial Series is located on the HCUP-US website at www.hcupus.ahrq.gov/tech_assist/tutorials.jsp.
In 2015, AHRQ restructured the data as described here:
https://hcup-us.ahrq.gov/db/nation/nis/2015HCUPNationalInpatientSample.pdf
Some key points:
%3C!-- --%3E
Inpatient core file showing all discharges from the sample of hospitals in participating states. Unit of observation: discharge-level
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Nationwide Inpatient Sample (NIS) is a database focused on hospital stay information. Users are able to use the NIS to identify, track, and analyze national trends in health care utilization, access, charges, quality, and outcomes. Background The Nationwide Inpatient Sample (NIS) is maintained by the Healthcare Cost and Utilization Project. The NIS is the largest all-payer inpatient care database in the United States. It contains data from approximately 8 million hospital stays each year. The 2009 NIS contains all discharge data from 1,050 hospitals located in 44 States, approximating a 20-percent stratified sample of U.S. community hospitals. The sampling frame for the 2009 NIS is a sample of hospitals that comprises approximately 95 percent of all hospital discharges in the United States. The NIS is the only national hospital database containing charge information on all patients, regardless of payer, including persons covered by Medicare, Medicaid, private insurance, and the uninsured. User functionality Users must pay to access the database. NIS releases for data years 1988-2009 are available from the HCUP Central Distributor. The 2009 NIS may be purchased for $50 for students and $350 for all others on a single DVD-ROM with accompanying documentation. . Data Notes NIS data are available from 1988 to 2009. The number of states in the NIS has grown from 8 in the first year to 44 at present. Beginning with the 2002 NIS, severity adjustment data elements including APR-DRGs, APS-DRGs, Disease Staging, and AHRQ Comorbidity Indicators, are available. Begi nning with the 2005 NIS, Diagnosis and Procedure Groups Files containing data elements from AHRQ software tools designed to facilitate the use of the ICD-9-CM diagnostic and procedure information are available. Beginning with the 2007 NIS, data elements describing hospital structural characteristics and provision of outpatient services are available in the Hospital Weights file. NIS Release 1 includes data from 8-11 States and spans the years 1988 to 1992. NIS Releases 2 and 3 contain data from 17 States for 1993 and 1994, respectively. NIS Releases 4 and 5 contain data from 19 States for 1995 and 1996. NIS Release 6 contains data from 22 States for 1997. NIS 1998 contains data from 22 States. NIS 1999 contains data from 24 States. NIS 2000 contains data from 28 States. NIS 2001 contains data from 33 States. NIS 2002 contains data from 35 States. NIS 2003 contains data from 37 States. NIS 2004 contains data from 37 States. NIS 2005 contains data from 37 States. NIS 2006 contains data from 38 States. NIS 2007 contains data from 40 States. NIS 2008 contains data from 42 States.
This dataset tracks the updates made on the dataset "HCUP National (Nationwide) Inpatient Sample (NIS) - Restricted Access File" as a repository for previous versions of the data and metadata.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
NIS 2002-2011 Within Year Merge
This dataset tracks the updates made on the dataset "Healthcare Cost and Utilization Project (HCUP) - National Inpatient Sample" as a repository for previous versions of the data and metadata.
The National (Nationwide) Kids' Inpatient Database (KID) is part of a family of databases and software tools developed for the Healthcare Cost and Utilization Project (HCUP). Only years 2003, 2006, 2009, 2012 are available on the PHS Data Portal.
The Kids' Inpatient Database (KID) is the largest publicly available all-payer pediatric inpatient care database in the United States, containing data from two to three million hospital stays. Its large sample size is ideal for developing national and regional estimates and enables analyses of rare conditions, such as congenital anomalies, as well as uncommon treatments, such as organ transplantation. KID releases for data years 1997, 2000, 2003, 2006, 2009, 2012, 2016, and 2019 are available for purchase online through the Online HCUP Central Distributor. The KID was not produced for 2015 because of the transition from ICD-9-CM to ICD-10-CM/PCS coding.
KID Database Documentation includes:
%3C!-- --%3E
Please visit the HCUP National KID page for more information.
The Agency for Healthcare Research and Quality (AHRQ, formerly the Agency for Health Care Policy and Research) maintains the Healthcare Cost and Utilization Project (HCUP). HCUP is a Federal-State-industry partnership to build a standardized, multi-State health data system. AHRQ has taken the lead in developing HCUP databases, Web-based products, and software tools and making them available for restricted access public release.
HCUP comprises a family of administrative longitudinal databases-including State-specific hospital-discharge databases and a national sample of discharges from community hospitals.
HCUP databases contain patient-level information compiled in a uniform format with privacy protections in place. * The Nationwide Inpatient Sample (NIS) includes inpatient data from a national sample (about 20% of U.S. community hospitals) including roughly 7 million discharges from about 1,000 hospitals. It is the largest all-payer inpatient database in the U.S.; data are now available from 1988-1998. The NIS is ideal for developing national estimates, for analyzing national trends, and for research that requires a large sample size. * The State Inpatient Databases (SID) cover individual data sets in community hospitals from 22 participating States that represent more than half of all U.S. hospital discharges. The data have been translated into a uniform format to facilitate cross-State comparisons. The SID are particularly well-suited for policy inquiries unique to a specific State, studies comparing two or more States, market area research, and small area variation analyses.
The project's newest restricted access public release is the Kids' Inpatient Database (KID), containing hospital inpatient stays for children 18 years of age and younger. Researchers and policymakers can use the KID to identify, track, and analyze national trends in health care utilization, access, charges, quality, and outcomes. The KID is the only all-payer inpatient care database for children in the U.S. It contains data from approximately 1.9 million hospital discharges for children. The data are drawn from 22 HCUP 1997 State Inpatient Databases and include a sample of pediatric general discharges from over 2,500 U.S. community hospitals (defined as short-term, non-Federal, general and specialty hospitals, excluding hospital units of other institutions). A key strength of the KID is that the large sample size enables analyses of both common and rare conditions; uncommon treatments, and organ transplantation. The KID also includes charge information on all patients, regardless of payer, including children covered by Medicaid, private insurance, and the uninsured.
HCUP also contains powerful, user-friendly software that can be used with both HCUP data and with other administrative databases. The AHRQ has developed three powerful software tools Quality Indicators (QIs), Clinical Classification Software (CCS) and HCUPnet. See more on the agency's webpages.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Summary Trend TablesThe HCUP Summary Trend Tables include information on hospital utilization derived from the HCUP State Inpatient Databases (SID), State Emergency Department Databases (SEDD), National Inpatient Sample (NIS), and Nationwide Emergency Department Sample (NEDS). State statistics are displayed by discharge month and national and regional statistics are displayed by discharge quarter. Information on emergency department (ED) utilization is dependent on availability of HCUP data; not all HCUP Partners participate in the SEDD.The HCUP Summary Trend Tables include downloadable Microsoft® Excel tables with information on the following topics:Overview of trends in inpatient and emergency department utilizationAll inpatient encounter typesInpatient encounter typeNormal newbornsDeliveriesNon-elective inpatient stays, admitted through the EDNon-elective inpatient stays, not admitted through the EDElective inpatient staysInpatient service lineMaternal and neonatal conditionsMental health and substance use disordersInjuriesSurgeriesOther medical conditionsED treat-and-release visitsDescription of the data source, methodology, and clinical criteria (Excel file, 43 KB)Change log (Excel file, 65 KB)For each type of inpatient stay, there is an Excel file for the number of discharges, the percent of discharges, the average length of stay, the in-hospital mortality rate per 100 discharges,1 and the population-based rate per 100,000 population.2 Each Excel file contains State-specific, region-specific, and national statistics. For most files, trends begin in January 2017. Also included in each Excel file is a description of the HCUP databases and methodology.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
There are several Microsoft Word documents here detailing data creation methods and with various dictionaries describing the included and derived variables.The Database Creation Description is meant to walk a user through some of the steps detailed in the SAS code with this project.The alphabetical list of variables is intended for users as sometimes this makes some coding steps easier to copy and paste from this list instead of retyping.The NIS Data Dictionary contains some general dataset description as well as each variable's responses.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Jordan A. Kempker, MD, MSc June 23, 2016
NIS Across-Year Concatenation
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Predictors of healthcare utilization for people with an OUD hospitalization in the U.S.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Jordan A. Kempker, MD, MSc August 4, 2016
NIS 2002-2011 Hospital-Level Variables
This Code will create the below hospital-level variables for the entire dataset and then merge these statistics back into each observation.
hosp_vol Number of Annual Hospital Discharges
hospmort Annual Hospital In-Hospital Mortality Rate
Average Annual Hospital Elixhauser Score
Of note, this code was first tested on a random 5% sample stratified by year before applying to entire dataset. Below, is the pared down functional version that resulted but does not contain any of the internal tests.
The National (Nationwide) Inpatient Sample (NIS) is part of a family of databases and software tools developed for the Healthcare Cost and Utilization Project (HCUP). The NIS is the largest publicly available all-payer inpatient healthcare database designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from around 7 million hospital stays each year. Weighted, it estimates around 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Users can use the online interactive tool (HCUP-net) to gather national, state and all-payer patient health care data or users can purchase the data from the HCUP databases. Background HCUP is sponsored by the Agency for Healthcare Research and Quality. It is a collection of databases that provide information on patient health care data, including information typically found on a hospital discharge form. Topics include: cost of care, access to care, and treatment outcomes. The HCUP databases include: The National Inpatient Sample, The Kids' Inpatient Database, The State Ambulatory Surgery Database, and The State Emergency Department Databases. Many of these are available for purchase. User Functionality Users can purchase any of the aforementioned databases from the HCUP Central Administrator. Directions and contact information are readily available from the HCUP website. Users can also access the HCUP interactive tool, HCUP-net. Please click the appropriate link in the keywords to access this tool. Data Notes HCUP began collecting health care data in 1988. Specific information about a particular database can be found on the website.
The National (Nationwide) Inpatient Sample (NIS) is part of a family of databases and software tools developed for the Healthcare Cost and Utilization Project (HCUP). The NIS is the largest publicly available all-payer inpatient healthcare database designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from around 7 million hospital stays each year. Weighted, it estimates around 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Jordan A. Kempker, MD, MSc July 12, 2016
Assignment of Procedural Classes
The HCUP creates and annually updates a categorization of the >3,900 ICD-9 Procedures into 4 class:
This is stored in pc2015.csv availabe for download from their website. This .CSV file contains a procedure code, a label and a procedure class assignment. In excell I deleted first 3 rows of description, added a first column with "if procedure[i] = ", replaced the third column with "then prclass [i] = " and added a fourth column with a semicolon. I then copy and pasted entire table into code below to create a massive IF-THEN code chunk for the two ARRAYS.
(Use 'Control + Page Down' to get to the next code chunk)
Then to create an OR Procedure Variable, I followed HCUP documentation that PRCLASS of 3 or 4 corresponds to an OR PROCEDURE. Used another ARRAY for this.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Demographics of patients seen at the ED with football injuries by admission to hospital 2010 to 2013 ((N = 819,000).
The Healthcare Cost and Utilization Project (HCUP) National Inpatient Sample (NIS) is the largest publicly available all-payer inpatient care database in the United States. The NIS is designed to produce U.S. regional and national estimates of inpatient utilization, access, cost, quality, and outcomes. Unweighted, it contains data from more than 7 million hospital stays each year. Weighted, it estimates more than 35 million hospitalizations nationally. Developed through a Federal-State-Industry partnership sponsored by the Agency for Healthcare Research and Quality (AHRQ), HCUP data inform decision making at the national, State, and community levels. Starting with the 2012 data year, the NIS is a sample of discharges from all hospitals participating in HCUP, covering more than 97 percent of the U.S. population. For prior years, the NIS was a sample of hospitals. The NIS allows for weighted national estimates to identify, track, and analyze national trends in health care utilization, access, charges, quality, and outcomes. The NIS's large sample size enables analyses of rare conditions, such as congenital anomalies; uncommon treatments, such as organ transplantation; and special patient populations, such as the uninsured. NIS data are available since 1988, allowing analysis of trends over time. The NIS inpatient data include clinical and resource use information typically available from discharge abstracts with safeguards to protect the privacy of individual patients, physicians, and hospitals (as required by data sources). Data elements include but are not limited to: diagnoses, procedures, discharge status, patient demographics (e.g., sex, age), total charges, length of stay, and expected payment source, including but not limited to Medicare, Medicaid, private insurance, self-pay, or those billed as ‘no charge’. The NIS excludes data elements that could directly or indirectly identify individuals. Restricted access data files are available with a data use agreement and brief online security training.