Deep learning on time series laboratory test results from electronic health records for early detection of pancreatic cancer
- PMID: 35598881
- PMCID: PMC10286873
- DOI: 10.1016/j.jbi.2022.104095
Deep learning on time series laboratory test results from electronic health records for early detection of pancreatic cancer
Abstract
The multi-modal and unstructured nature of observational data in Electronic Health Records (EHR) is currently a significant obstacle for the application of machine learning towards risk stratification. In this study, we develop a deep learning framework for incorporating longitudinal clinical data from EHR to infer risk for pancreatic cancer (PC). This framework includes a novel training protocol, which enforces an emphasis on early detection by applying an independent Poisson-random mask on proximal-time measurements for each variable. Data fusion for irregular multivariate time-series features is enabled by a "grouped" neural network (GrpNN) architecture, which uses representation learning to generate a dimensionally reduced vector for each measurement set before making a final prediction. These models were evaluated using EHR data from Columbia University Irving Medical Center-New York Presbyterian Hospital. Our framework demonstrated better performance on early detection (AUROC 0.671, CI 95% 0.667 - 0.675, p < 0.001) at 12 months prior to diagnosis compared to a logistic regression, xgboost, and a feedforward neural network baseline. We demonstrate that our masking strategy results greater improvements at distal times prior to diagnosis, and that our GrpNN model improves generalizability by reducing overfitting relative to the feedforward baseline. The results were consistent across reported race. Our proposed algorithm is potentially generalizable to other diseases including but not limited to cancer where early detection can improve survival.
Keywords: Early detection of cancer; Electronic Health Records; Machine learning; Pancreatic cancer.
Copyright © 2022 Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Chin Hur reports financial support was provided by National Cancer Institute of the National Institutes of Health.
Figures
References
-
- Rahib L, Smith BD, Aizenberg R, Rosenzweig AB, Fleshman JM, Matrisian LM, Projecting cancer incidence and deaths to 2030: the unexpected burden of thyroid, liver, and pancreas cancers in the United States, Cancer Res. 74 (11) (2014) 2913–2921. - PubMed
-
- Wagner M, Redaelli C, Lietz M, Seiler CA, Friess H, Büchler MW, Curative resection is the single most important factor determining outcome in patients with pancreatic adenocarcinoma, Br. J. Surg 91 (5) (2004) 586–594. - PubMed
-
- Goggins M, Overbeek KA, Brand R, Syngal S, Del Chiaro M, Bartsch DK, Bassi C, Carrato A, Farrell J, Fishman EK, Fockens P, Gress TM, van Hooft JE, Hruban RH, Kastrinos F, Klein A, Lennon AM, Lucas A, Park W, Rustgi A, Simeone D, Stoffel E, Vasen HFA, Cahen DL, Canto MI, Bruno M, Management of patients with increased risk for familial pancreatic cancer: updated recommendations from the International Cancer of the Pancreas Screening (CAPS) Consortium, Gut 69 (1) (2020) 7–17. - PMC - PubMed
-
- Daly MB, Pal T, Berry MP, Buys SS, Dickson P, Domchek SM, Elkhanany A, Friedman S, Goggins M, Hutton ML, Karlan BY, Khan S, Klein C, Kohlmann W, Kurian AW, Laronga C, Litton JK, Mak JS, Menendez CS, Merajver SD, Norquist BS, Offit K, Pederson HJ, Reiser G, Senter-Jamieson L, Shannon KM, Shatsky R, Visvanathan K, Weitzel JN, Wick MJ, Wisinski KB, Yurgelun MB, Darlow SD, Dwyer MA, Genetic/Familial High-Risk Assessment: Breast, Ovarian, and Pancreatic, Version 2.2021, NCCN Clinical Practice Guidelines in Oncology, J. Natl. Compr. Canc. Netw 19 (1) (2021) 77–102. - PubMed
-
- Stoffel EM, McKernin SE, Brand R, Canto M, Goggins M, Moravek C, Nagarajan A, Petersen GM, Simeone DM, Yurgelun M, Khorana AA, Evaluating Susceptibility to Pancreatic Cancer: ASCO Provisional Clinical Opinion, J. Clin. Oncol 37 (2) (2019) 153–164. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
