2018 Apr 13;6(1):4.
doi: 10.5334/egems.198.

A Framework for Leveraging "Big Data" to Advance Epidemiology and Improve Quality: Design of the VA Colonoscopy Collaborative

Samir Gupta et al. EGEMS (Wash DC).

Abstract

Objective: To describe a framework for leveraging big data for research and quality improvement purposes and demonstrate implementation of the framework for design of the Department of Veterans Affairs (VA) Colonoscopy Collaborative.

Methods: We propose that research using large-scale electronic health records (EHRs) can be approached with a four-step framework: 1) identify the data sources required to answer the research question; 2) determine whether variables are available as structured or free-text data; 3) use a rigorous approach to refine variables and assess data quality; 4) create the analytic dataset and perform analyses. We describe implementation of the framework as part of the VA Colonoscopy Collaborative, which aims to leverage big data to 1) prospectively measure and report colonoscopy quality and 2) develop and validate a risk prediction model for colorectal cancer (CRC) and high-risk polyps.

Results: Examples of implementation of the four-step framework are provided. To date, we have identified 2,337,171 Veterans who underwent colonoscopy between 1999 and 2014. Median age was 62 years, and 4.6 percent (n = 106,860) were female. We estimated that 2.6 percent (n = 60,517) had CRC diagnosed at baseline. An additional 1 percent (n = 24,483) had a new ICD-9 code-based diagnosis of CRC on follow-up.
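
As a quick consistency check on the Results arithmetic, the reported percentages can be recomputed from the counts given above. This is only a minimal sketch: the variable and function names are illustrative and do not come from the study, and the only data used are the four numbers quoted in the abstract.

```python
# Counts quoted in the abstract; names are illustrative, not from the study.
TOTAL_VETERANS = 2_337_171

counts = {
    "female": 106_860,
    "crc_at_baseline": 60_517,
    "crc_on_follow_up": 24_483,
}


def percent(n: int, total: int) -> float:
    """Return n as a percentage of total."""
    return 100.0 * n / total


# Round to one decimal place, matching the precision used in the abstract.
percentages = {name: round(percent(n, TOTAL_VETERANS), 1) for name, n in counts.items()}

for name, pct in percentages.items():
    print(f"{name}: {pct}%")
```

Rounded to one decimal place, the three values agree with the 4.6, 2.6, and roughly 1 percent reported in the Results.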

Conclusion: We hope our framework may contribute to the dialogue on best practices to ensure high-quality epidemiologic and quality improvement work. As a result of implementing the framework, the VA Colonoscopy Collaborative holds great promise for 1) quantifying and providing novel understanding of colonoscopy outcomes, and 2) building a robust approach for nationwide VA colonoscopy quality reporting.

Keywords: big data; electronic health records; epidemiology; quality improvement; veterans.

Figures

Figure 1
Framework for leveraging “big data” for research and quality improvement purposes. Examples of implementation for each step proposed for the VA Colonoscopy Collaborative are provided. VA, Department of Veterans Affairs; NDI, National Death Index.

Figure 2
Risk Factors for CRC and High-Risk Polyps after Baseline Adenoma Removal. The VA Merit Review-supported cohort study will consider the potential influence of established and suspected risk factors for CRC and high-risk polyps. CRC, colorectal cancer.

Figure 3
Data Sources Available within VINCI. Data sources potentially accessible through VINCI include data created and maintained as part of usual healthcare, as well as external data sources that can be uploaded to VINCI. VA, Department of Veterans Affairs; VINCI, VA Informatics and Computing Infrastructure.

Figure 4
Workflow for NLP Algorithm Development and Validation.
