Development and external validation of prediction algorithms to improve early diagnosis of cancer
- PMID: 40335498
- PMCID: PMC12059126
- DOI: 10.1038/s41467-025-57990-5
Development and external validation of prediction algorithms to improve early diagnosis of cancer
Abstract
Cancer prediction algorithms are used in the UK to identify individuals at high probability of having a current, as yet undiagnosed cancer with the intention of improving early diagnosis and treatment. Here we develop and externally validate two diagnostic prediction algorithms to estimate the probability of having cancer for 15 cancer types. The first incorporates multiple predictors including age, sex, deprivation, smoking, alcohol, family history, medical diagnoses and symptoms (both general and cancer-specific symptoms). The second additionally includes commonly used blood tests (full blood count and liver function tests). We use multinomial logistic regression to develop separate equations in men and women to predict the absolute probability of 15 cancer types using a population of 7.46 million adults aged 18 to 84 years in England. We evaluate performance in two separate validation cohorts (total 2.64 million patients in England and 2.74 million from Scotland, Wales and Northern Ireland). The models have improved performance compared with existing models with improved discrimination, calibration, sensitivity and net benefit. These algorithms provide superior prediction estimates in the UK compared with existing scores and could lead to better clinical decision-making and potentially earlier diagnosis of cancer.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: J.H.C. reports grants from National Institute for Health Research, John Fell Oxford University Press Research Fund, Cancer Research U.K. (C5255/A18085), and other research councils, during the conduct of the study. J.H.C. is an unpaid director of QResearch, a not-for-profit organisation which is a partnership between the University of Oxford and EMIS Health who supply the QResearch database used for this work. Until 9 Aug 2023, J.H.C. had a 50% shareholding in ClinRisk Ltd, co-owning it with her husband, who was an executive director. On 9 August 2023, 100% of the share capital was donated to Endeavour Health Care Charitable Trust and the company was renamed to Endeavour Predict Ltd. J.H.C. is a consultant to Endeavour Predict Ltd. and her husband is a non-executive director to cover the transition. The company licences software both to the private sector and to NHS bodies or bodies that provide services to the NHS (through GP electronic health record providers, pharmacies, hospital providers and other NHS providers). This software implements algorithms (including QRISK3) developed from access to the QResearch database during her time at the University of Nottingham. C.C. reports receiving personal fees from ClinRisk Ltd., outside this work.
Figures
References
-
- Cancer Research UK. Cancer in the UK (Cancer Research UK, 2024).
-
- Department of Health. Improving Outcomes: A Strategy for Cancer (Department of Health, 2011).
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
