Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 May 18;3(1):1171.
doi: 10.13063/2327-9214.1171. eCollection 2015.

Development of an Algorithm to Classify Colonoscopy Indication from Coded Health Care Data

Affiliations

Development of an Algorithm to Classify Colonoscopy Indication from Coded Health Care Data

Kenneth F Adams et al. EGEMS (Wash DC). .

Abstract

Introduction: Electronic health data are potentially valuable resources for evaluating colonoscopy screening utilization and effectiveness. The ability to distinguish screening colonoscopies from exams performed for other purposes is critical for research that examines factors related to screening uptake and adherence, and the impact of screening on patient outcomes, but distinguishing between these indications in secondary health data proves challenging. The objective of this study is to develop a new and more accurate algorithm for identification of screening colonoscopies using electronic health data.

Methods: Data from a case-control study of colorectal cancer with adjudicated colonoscopy indication was used to develop logistic regression-based algorithms. The proposed algorithms predict the probability that a colonoscopy was indicated for screening, with variables selected for inclusion in the models using the Least Absolute Shrinkage and Selection Operator (LASSO).

Results: The algorithms had excellent classification accuracy in internal validation. The primary, restricted model had AUC= 0.94, sensitivity=0.91, and specificity=0.82. The secondary, extended model had AUC=0.96, sensitivity=0.88, and specificity=0.90.

Discussion: The LASSO approach enabled estimation of parsimonious algorithms that identified screening colonoscopies with high accuracy in our study population. External validation is needed to replicate these results and to explore the performance of these algorithms in other settings.

Keywords: LASSO; ROC; classification; cohort identification; colonoscopy; data use and quality; health information technology; screening.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Legend: Area under the Receiver Operating Characteristics (ROC) Curves for the Restricted and Extended Algorithms. The primary, restricted algorithm did not include ICD-9 screening V-codes (V76.41, V76.50, V76.51) and the HCPCS preventative examination code (G0344) as candidate variables, whereas the secondary, extended algorithm included these variables.

References

    1. Siegel R, Desantis C, Jemal A. Colorectal cancer statistics, 2014. CA: A Cancer Journal for Clinicians. 2014;64(2):104–17. - PubMed
    1. Levin B, Lieberman DA, McFarland B, Andrews KS, Brooks D, Bond J, Dash C, Giardiello FM, Glick S, Johnson D, Johnson CD, Levin TR, Pickhardt PJ, Rex DK, Smith RA, Thorson A, Winawer SJ. Screening and surveillance for the early detection of colorectal cancer and adenomatous polyps, 2008: a joint guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology. Gastroenterology. 2008;134(5):1570–95. - PubMed
    1. Shapiro JA, Klabunde CN, Thompson TD, Nadel MR, Seeff LC, White A. Patterns of colorectal cancer test use, including CT colonography, in the 2010 National Health Interview Survey. Cancer Epidemiology, Biomarkers & Prevention. 2012;21(6):895–904. - PMC - PubMed
    1. Ross TR, NG D, Brown JS, Pardee R, Hornbrook MC, Hart G, Steiner JF. The HMO Research Network Virtual Data Warehouse: a public health model to support collaboration. eGEMS (Generating Evidence & Methods to improve patient outcomes) 2014;2(1) Article 2. - PMC - PubMed
    1. Schenck AP, Klabunde CN, Warren JL, Peacock S, Davis WW, Hawley ST, Pignone M, Ransohoff DF. Data sources for measuring colorectal endoscopy use among Medicare enrollees. Cancer Epidemiology, Biomarkers & Prevention. 2007;16(10):2118–27. - PubMed

LinkOut - more resources