Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Nov 28;318(20):2011-2018.
doi: 10.1001/jama.2017.17653.

Adherence to Methodological Standards in Research Using the National Inpatient Sample

Affiliations

Adherence to Methodological Standards in Research Using the National Inpatient Sample

Rohan Khera et al. JAMA. .

Abstract

Importance: Publicly available data sets hold much potential, but their unique design may require specific analytic approaches.

Objective: To determine adherence to appropriate research practices for a frequently used large public database, the National Inpatient Sample (NIS) of the Agency for Healthcare Research and Quality (AHRQ).

Design, setting, and participants: In this observational study of the 1082 studies published using the NIS from January 2015 through December 2016, a representative sample of 120 studies was systematically evaluated for adherence to practices required by AHRQ for the design and conduct of research using the NIS.

Exposures: None.

Main outcomes and measures: All studies were evaluated on 7 required research practices based on AHRQ's recommendations and compiled under 3 domains: (1) data interpretation (interpreting data as hospitalization records rather than unique patients); (2) research design (avoiding use in performing state-, hospital-, and physician-level assessments where inappropriate; not using nonspecific administrative secondary diagnosis codes to study in-hospital events); and (3) data analysis (accounting for complex survey design of the NIS and changes in data structure over time).

Results: Of 120 published studies, 85% (n = 102) did not adhere to 1 or more required practices and 62% (n = 74) did not adhere to 2 or more required practices. An estimated 925 (95% CI, 852-998) NIS publications did not adhere to 1 or more required practices and 696 (95% CI, 596-796) NIS publications did not adhere to 2 or more required practices. A total of 79 sampled studies (68.3% [95% CI, 59.3%-77.3%]) among the 1082 NIS studies screened for eligibility did not account for the effects of sampling error, clustering, and stratification; 62 (54.4% [95% CI, 44.7%-64.0%]) extrapolated nonspecific secondary diagnoses to infer in-hospital events; 45 (40.4% [95% CI, 30.9%-50.0%]) miscategorized hospitalizations as individual patients; 10 (7.1% [95% CI, 2.1%-12.1%]) performed state-level analyses; and 3 (2.9% [95% CI, 0.0%-6.2%]) reported physician-level volume estimates. Of 27 studies (weighted; 218 studies [95% CI, 134-303]) spanning periods of major changes in the data structure of the NIS, 21 (79.7% [95% CI, 62.5%-97.0%]) did not account for the changes. Among the 24 studies published in journals with an impact factor of 10 or greater, 16 (67%) did not adhere to 1 or more practices, and 9 (38%) did not adhere to 2 or more practices.

Conclusions and relevance: In this study of 120 recent publications that used data from the NIS, the majority did not adhere to required practices. Further research is needed to identify strategies to improve the quality of research using the NIS and assess whether there are similar problems with use of other publicly available data sets.

PubMed Disclaimer

Conflict of interest statement

Disclosures: The other authors report no potential conflicts of interest.

Figures

Figure 1
Figure 1. Study Selection Flowsheet
A sample of 120 studies was evaluated from the 1082 publications using the NIS during 2015–2016.
Figure 2
Figure 2. Simulation of Data to Demonstrate Incorrect Assessment of Hospitalization-level Trends
National-level trends in the number of coronary artery bypass grafting (CABG) hospitalizations were evaluated using the National Inpatient Sample data from the years 2010 through 2013. Trends were assessed using discharge weights for 2010–2011 that accounted for changes in data structure over time (correct weights), compared to those without this adjustment (incorrect weights).

References

    1. Shah RU, Bairey Merz CN. Publicly Available Data: Crowd Sourcing to Identify and Reduce Disparities. J Am Coll Cardiol. 2015;66(18):1973–1975. - PubMed
    1. Khera R, Krumholz HM. With great power comes great responsibility: "Big data" research from the National Inpatient Sample. Circ Cardiovasc Qual Outcomes. 2017;10(7):e003846. - PMC - PubMed
    1. NIS Database Documentation Archive. Healthcare Cost and Utilization Project (HCUP) Agency for Healthcare Research and Quality; Rockville, MD: Jun, 2016. [Accessed September 25, 2017]. at www.hcup-us.ahrq.gov/db/nation/nis/nisarchive.jsp.
    1. HCUP Databases. Healthcare Cost and Utilization Project - Overview of the National (Nationwide) Inpatient Sample (NIS) Agency for Healthcare Research and Quality; Rockville, MD: Nov, 2016. [Accessed September 25, 2017]. at www.hcup-us.ahrq.gov/nisoverview.jsp.
    1. NIS Description of Data Elements. Healthcare Cost and Utilization Project (HCUP) Agency for Healthcare Research and Quality; Rockville, MD: Nov, 2016. [Accessed September 25, 2017]. at www.hcup-us.ahrq.gov/db/nation/nis/nisdde.jsp.

MeSH terms