Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 2;16(6):e0251876.
doi: 10.1371/journal.pone.0251876. eCollection 2021.

Can we screen for pancreatic cancer? Identifying a sub-population of patients at high risk of subsequent diagnosis using machine learning techniques applied to primary care data

Affiliations

Can we screen for pancreatic cancer? Identifying a sub-population of patients at high risk of subsequent diagnosis using machine learning techniques applied to primary care data

Ananya Malhotra et al. PLoS One. .

Abstract

Background: Pancreatic cancer (PC) represents a substantial public health burden. Pancreatic cancer patients have very low survival due to the difficulty of identifying cancers early when the tumour is localised to the site of origin and treatable. Recent progress has been made in identifying biomarkers for PC in the blood and urine, but these cannot be used for population-based screening as this would be prohibitively expensive and potentially harmful.

Methods: We conducted a case-control study using prospectively-collected electronic health records from primary care individually-linked to cancer registrations. Our cases were comprised of 1,139 patients, aged 15-99 years, diagnosed with pancreatic cancer between January 1, 2005 and June 30, 2009. Each case was age-, sex- and diagnosis time-matched to four non-pancreatic (cancer patient) controls. Disease and prescription codes for the 24 months prior to diagnosis were used to identify 57 individual symptoms. Using a machine learning approach, we trained a logistic regression model on 75% of the data to predict patients who later developed PC and tested the model's performance on the remaining 25%.

Results: We were able to identify 41.3% of patients < = 60 years at 'high risk' of developing pancreatic cancer up to 20 months prior to diagnosis with 72.5% sensitivity, 59% specificity and, 66% AUC. 43.2% of patients >60 years were similarly identified at 17 months, with 65% sensitivity, 57% specificity and, 61% AUC. We estimate that combining our algorithm with currently available biomarker tests could result in 30 older and 400 younger patients per cancer being identified as 'potential patients', and the earlier diagnosis of around 60% of tumours.

Conclusion: After further work this approach could be applied in the primary care setting and has the potential to be used alongside a non-invasive biomarker test to increase earlier diagnosis. This would result in a greater number of patients surviving this devastating disease.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig 1
Fig 1. Schematic of multi-stage screening process starting with machine-learning derived algorithm.
Fig 2
Fig 2. Model parameters obtained in each time window (0–24,1–24,…,20–24 months) prior to diagnosis.
a) AUC (%) b) Sensitivity (%) c) Specificity (%).
Fig 3
Fig 3. AUC, sensitivity and specificity of the models with % of the population recommended for biomarker (Stage 2) testing.

Similar articles

Cited by

References

    1. Pereira SP, Oldfield L, Ney A, Hart PA, Keane MG, Pandol SJ, et al.. Early detection of pancreatic cancer. Lancet Gastroenterol Hepatol. 2020;5(7):698–710. 10.1016/S2468-1253(19)30416-9 - DOI - PMC - PubMed
    1. Mitry E, Rachet B, Quinn MJ, Cooper N, Coleman MP. Survival from cancer of the pancreas in England and Wales up to 2001. Br J Cancer. 2008;99 S21–23. 10.1038/sj.bjc.6604576 - DOI - PMC - PubMed
    1. Carrato A, Falcone A, Ducreux M, Valle JW, Parnaby A, Djazouli K, et al.. A Systematic Review of the Burden of Pancreatic Cancer in Europe: Real-World Impact on Survival, Quality of Life and Costs. J Gastrointest Cancer. 2015;46(3):201–211. 10.1007/s12029-015-9724-1 - DOI - PMC - PubMed
    1. Oldfield LE, Connor AA, Gallinger S. Molecular Events in the Natural History of Pancreatic Cancer. Trends Cancer. 2017;3(5):336–346. 10.1016/j.trecan.2017.04.005 - DOI - PubMed
    1. Exarchakou A, Rachet B, Belot A, Maringe C, Coleman MP. Impact of national cancer policies on cancer survival trends and socioeconomic inequalities in England, 1996–2013: population based study. BMJ. 2018;360:k764. 10.1136/bmj.k764 - DOI - PMC - PubMed

Publication types

MeSH terms