Development and external validation of prediction algorithms to improve early diagnosis of cancer
- PMID: 40335498
- PMCID: PMC12059126
- DOI: 10.1038/s41467-025-57990-5
Abstract
Cancer prediction algorithms are used in the UK to identify individuals at high probability of having a current, as yet undiagnosed cancer, with the intention of improving early diagnosis and treatment. Here we develop and externally validate two diagnostic prediction algorithms to estimate the probability of having cancer for 15 cancer types. The first incorporates multiple predictors including age, sex, deprivation, smoking, alcohol, family history, medical diagnoses and symptoms (both general and cancer-specific). The second additionally includes commonly used blood tests (full blood count and liver function tests). We use multinomial logistic regression to develop separate equations in men and women to predict the absolute probability of 15 cancer types using a population of 7.46 million adults aged 18 to 84 years in England. We evaluate performance in two separate validation cohorts (total 2.64 million patients in England and 2.74 million from Scotland, Wales and Northern Ireland). Compared with existing models, the new models show improved discrimination, calibration, sensitivity and net benefit. These algorithms provide superior prediction estimates in the UK compared with existing scores and could lead to better clinical decision-making and potentially earlier diagnosis of cancer.
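As a minimal sketch of how a multinomial logistic model yields an absolute probability for each cancer type, the Python below converts per-outcome linear predictors into probabilities that sum to one, with "no cancer" as the reference outcome. The function name, the outcome labels and the numeric values are hypothetical and chosen for illustration only; they are not the published equations or coefficients.

```python
import math


def multinomial_probabilities(linear_predictors):
    """Convert per-outcome linear predictors into absolute probabilities.

    `linear_predictors` maps each cancer type to its linear predictor
    (intercept plus coefficient-weighted predictor values). The reference
    outcome "no cancer" is assumed to have a linear predictor of 0.
    """
    scores = {"no cancer": 0.0, **linear_predictors}
    # Subtract the maximum before exponentiating for numerical stability.
    largest = max(scores.values())
    exps = {name: math.exp(value - largest) for name, value in scores.items()}
    total = sum(exps.values())
    return {name: value / total for name, value in exps.items()}


# Hypothetical linear predictors for one patient (illustration only).
example = {"lung": -5.2, "colorectal": -4.8, "pancreatic": -7.1}
probs = multinomial_probabilities(example)
for outcome, p in probs.items():
    print(f"{outcome}: {p:.4f}")
```

Because all outcomes share one denominator, raising the linear predictor for one cancer type lowers the estimated probability of every other outcome, which is what makes the outputs interpretable as absolute probabilities across the full set of outcomes.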
© 2025. The Author(s).
Conflict of interest statement
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: J.H.C. reports grants from National Institute for Health Research, John Fell Oxford University Press Research Fund, Cancer Research U.K. (C5255/A18085), and other research councils, during the conduct of the study. J.H.C. is an unpaid director of QResearch, a not-for-profit organisation which is a partnership between the University of Oxford and EMIS Health who supply the QResearch database used for this work. Until 9 Aug 2023, J.H.C. had a 50% shareholding in ClinRisk Ltd, co-owning it with her husband, who was an executive director. On 9 August 2023, 100% of the share capital was donated to Endeavour Health Care Charitable Trust and the company was renamed to Endeavour Predict Ltd. J.H.C. is a consultant to Endeavour Predict Ltd. and her husband is a non-executive director to cover the transition. The company licences software both to the private sector and to NHS bodies or bodies that provide services to the NHS (through GP electronic health record providers, pharmacies, hospital providers and other NHS providers). This software implements algorithms (including QRISK3) developed from access to the QResearch database during her time at the University of Nottingham. C.C. reports receiving personal fees from ClinRisk Ltd., outside this work.
Similar articles
- Development and validation of risk prediction equations to estimate survival in patients with colorectal cancer: cohort study. BMJ. 2017 Jun 15;357:j2497. doi: 10.1136/bmj.j2497. PMID: 28620089. Free PMC article.
- Symptoms and risk factors to identify women with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013 Jan;63(606):e11-21. doi: 10.3399/bjgp13X660733. PMID: 23336450. Free PMC article.
- Symptoms and risk factors to identify men with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013 Jan;63(606):e1-10. doi: 10.3399/bjgp13X660724. PMID: 23336443. Free PMC article.
- Cancer diagnostic tools to aid decision-making in primary care: mixed-methods systematic reviews and cost-effectiveness analysis. Health Technol Assess. 2020 Nov;24(66):1-332. doi: 10.3310/hta24660. PMID: 33252328. Free PMC article.
- Prognostic models for newly-diagnosed chronic lymphocytic leukaemia in adults: a systematic review and meta-analysis. Cochrane Database Syst Rev. 2020 Jul 31;7(7):CD012022. doi: 10.1002/14651858.CD012022.pub2. PMID: 32735048. Free PMC article.
Cited by
- Constructing multicancer risk cohorts using national data from medical helplines and secondary care. NPJ Digit Med. 2025 Aug 27;8(1):551. doi: 10.1038/s41746-025-01855-0. PMID: 40866501. Free PMC article.