Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec:76:143-149.
doi: 10.1016/j.annepidem.2022.07.007. Epub 2022 Jul 23.

An algorithm to predict data completeness in oncology electronic medical records for comparative effectiveness research

Affiliations

An algorithm to predict data completeness in oncology electronic medical records for comparative effectiveness research

David Merola et al. Ann Epidemiol. 2022 Dec.

Abstract

Introduction: Electronic health record (EHR) discontinuity (missing out-of-network encounters) can lead to information bias. We sought to construct an algorithm that identifies high EHR-continuity among oncology patients.

Methods: Using a linked Medicare-EHR database and regression, we sought to 1) measure how often Medicare claims for outpatient encounters were substantiated by visits recorded in the EHR, and 2) predict continuity ratio, defined as the yearly proportion of outpatient encounters reported to Medicare that were captured by EHR data. The prediction model...s performance was evaluated with the coefficient of determination and Spearman...s correlation. We quantified variable misclassification by decile of continuity ratio using standardized difference and sensitivity.

Results: A total of 79,678 subjects met all eligibility criteria. Predicted and observed continuity was highly correlated (σSpearman=0.86). On average across all variables measured, MSD was reduced by a factor of 1/7th and sensitivity was improved 35-fold comparing subjects in the highest vs. lowest decile of CR.

Conclusion: In the oncology population, restricting EHR-based study cohorts to subjects with high continuity may reduce misclassification without greatly impacting representativeness. Further work is needed to elucidate the best manner of implementing continuity prediction rules in cohort studies.

Keywords: Comparative effectiveness research; Continuity; Electronic medical records; Information bias.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1.
Fig. 1.
A schematic of the study design and cohort selection
Fig. 2.
Fig. 2.
Mean standardized difference of select comorbidities and medications by predicted continuity ratio. Note: Certain medication categories were excluded from mean standardized difference calculation to avoid redundancy with the respective drugs that comprise them (i.e., Antiplatelets/Anticoagulans, Antihypertensives, Antihyperlipidemics, Antidiabetics, Psychiatric, Gastroprotective Agents)
Fig. 3.
Fig. 3.
Mean Sensitivity of Select Comorbidities and Medications by Predicted Continuity Ratio. Note: Certain medication categories were excluded from mean sensitivity calculation to avoid redundancy with the respective drugs that comprise them (i.e., Antiplatelets/Anticoagulants, Antihypertensives, Antihyperlipidemics, Antidiabetics, Psychiatric, Gastroprotective Agents)

References

    1. Cartwright TH, Clayton M, Garey JS, Boehm KA. Use of an electronic health record (iKnowMed), in conjunction with evidence-based pathways, for data capture and outcome measurement in colorectal cancer patients treated with first-line therapy in the US Oncology Network. J Clin Oncol 2010;28(15_suppl):3626.
    1. Parikh RB, Galsky MD, Gyawali B, et al. Trends in checkpoint inhibitor therapy for advanced urothelial cell carcinoma at the end of life: insights from real–world practice. Oncologist 2019;24(6):e397–9. - PMC - PubMed
    1. Ruley M, Walker V, Studeny J, Coustasse A. The nationwide health information network: the case of the expansion of health information exchanges in the united states. Health Care Manag (Frederick) 2018;37(4):333–8. - PubMed
    1. Lin KJ, Glynn RJ, Singer DE, Murphy SN, Lii J, Schneeweiss S. Out-of-system care and recording of patient characteristics critical for comparative effectiveness research. Epidemiology 2018;29(3):356–63. - PMC - PubMed
    1. Lin KJ, Singer DE, Glynn RJ, Murphy SN, Lii J, Schneeweiss S. Identifying Patients With High Data Completeness to Improve Validity of Comparative Effectiveness Research in Electronic Health Records Data. Clin Pharmacol Ther 2018;103(5):899–905. - PMC - PubMed

Publication types