Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Observational Study
. 2025 Nov;133(8):1137-1144.
doi: 10.1038/s41416-025-03136-9. Epub 2025 Aug 9.

Why NHS hospital co-morbidity research may be wrong: how clinical coding fails to identify the impact of diabetes mellitus on cancer survival

Affiliations
Observational Study

Why NHS hospital co-morbidity research may be wrong: how clinical coding fails to identify the impact of diabetes mellitus on cancer survival

K Zucker et al. Br J Cancer. 2025 Nov.

Abstract

Background: Significant volumes of research rely on secondary care diagnostic coding to identify comorbidities however little is known about its accuracy at a population level or if this influences subsequent analysis.

Methods: Retrospective observational study utilising real world data for all cancers, prostate cancer and breast cancer patients diagnosed at Leeds Cancer Centre from 2005 and 2018. Three different data definitions were used to identify patients with diabetes in each cohort: (1) clinical coding alone, (2) HbA1c blood test alone (3) either clinical coding or abnormal HbA1c. Cohort characteristics, diagnosis dates and Cox derived survival was compared across diabetes definitions.

Results: 123,841 cancer patients were identified including 13,964 with diabetes. Clinical coding failed to identify 14.6% of diabetic cancer patients with a temporal misclassification rate of 17.5%. Sole reliance on clinical coding overestimated the negative effect of DM on median survival across all cancers and 3.17 years in breast cancer.

Discussion: Clinical coding provides inaccurate diabetes diagnosis date and detection resulting in meaningful differences in analytic outcomes. This supports the use of more detailed comorbidity data definitions. Results casts doubt over research reliant on hospital clinical coding alone and the generalisability of some comorbidity and frailty scoring systems.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests. Ethics approval and consent to participate: Study methods used comply with all relevant guidelines and regulations. The study received ethical approval following review from the UK Health Regulatory Authority and Care Research Wales under IRAS 277122 REC Reference 20/HRA/1165. As the study used deidentified data only direct informed consent was not required for data registered opt outs were respected.

Figures

Fig. 1
Fig. 1. Graphical representation of the diabetic subgroups.
Note that the shape area of each subgroup is not scaled to the true numbers in the dataset.
Fig. 2
Fig. 2. Overlap of identification by HbA1c and clinical coding.
Proportion of the diabetic cancer population that are identified by Clinical coding but not HbA1c (Uniquely Identified by Clinical Coding), HbA1c and not clinical coding (Uniquely Identified by HbA1c), or identified by both HbA1c and Clinical Coding (Universally Identified). The results are shown for all cancer patients and the population of cancer patients in the LTHT blood catchment area.
Fig. 3
Fig. 3. Difference in first diabetic diagnosis flag from clinical records comparing clinical coding and abnormal HbA1c.
formula image = patients defined by both an abnormal HbA1c and diabetic clinical coding. formula image = patients defined by clinical coding as post-cancer diabetics but with abnormal HbA1c pre-cancer. Left of formula image = abnormal HbA1c earlier than clinical coding. Right of formula image = clinical coding earlier than abnormal HbA1c.
Fig. 4
Fig. 4. Survival trajectories for each diabetic data definition and the overall survival for all patients in the all cancer cohort.
Survival curves have been plotted for cox models generated using only all patients, coded diabetic patients, abnormal HbA1c patients and diabetic patients identified by the hybrid method. In all cases estimates were adjusted for potential age, sex and deprivation confounding.
Fig. 5
Fig. 5. Cox derived hazard ratios for the impact of diabetes.
Comparison between the estimated hazard across all three cohorts using each of the diabetes definitions. Horizontal dashed line indicates a hazard ratio of 1.0, which is the threshold at which the estimated hazard is equal with and without diabetes

References

    1. Albertsen PC, Fryback DG, Storer BE, Kolon TF, Fine J. The impact of co-morbidity on life expectancy among men with localized prostate cancer. J Urol. 1996;156:127–32. 10.1016/S0022-5347(01)65964-0. - DOI - PubMed
    1. Alibhai SMH, Leach M, Tomlinson GA, Krahn MD, Fleshner NE, Naglie G. Is there an optimal comorbidity index for prostate cancer? Cancer. 2008;112:1043–1050. 10.1002/cncr.23269. - DOI - PubMed
    1. Armitage JN, Van Der Meulen JH. Identifying co-morbidity in surgical patients using administrative data with the Royal College of Surgeons Charlson Score. Br J Surg. 2010;97:772–81. 10.1002/bjs.6930. - DOI - PubMed
    1. Barnett K, Mercer SW, Norbury M, Watt G, Wyke S, Guthrie B. Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study. Lancet. 2012;380:37–43. 10.1016/S0140-6736(12)60240-2. - DOI - PubMed
    1. Bates T, Evans T, Lagord C, Monypenny I, Kearins O, Lawrence G. A population based study of variations in operation rates for breast cancer, of comorbidity and prognosis at diagnosis: failure to operate for early breast cancer in older women. Eur J Surg Oncol. 2014;40:1230–36. 10.1016/j.ejso.2014.06.001. - DOI - PubMed

Publication types