Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar;47(3):102087.
doi: 10.1016/j.clinre.2023.102087. Epub 2023 Jan 18.

Development and validation of a multivariable risk factor questionnaire to detect oesophageal cancer in 2-week wait patients

Collaborators, Affiliations

Development and validation of a multivariable risk factor questionnaire to detect oesophageal cancer in 2-week wait patients

Kai Man Alexander Ho et al. Clin Res Hepatol Gastroenterol. 2023 Mar.

Abstract

Introduction: Oesophageal cancer is associated with poor health outcomes. Upper GI (UGI) endoscopy is the gold standard for diagnosis but is associated with patient discomfort and low yield for cancer. We used a machine learning approach to create a model which predicted oesophageal cancer based on questionnaire responses.

Methods: We used data from 2 separate prospective cross-sectional studies: the Saliva to Predict rIsk of disease using Transcriptomics and epigenetics (SPIT) study and predicting RIsk of diSease using detailed Questionnaires (RISQ) study. We recruited patients from National Health Service (NHS) suspected cancer pathways as well as patients with known cancer. We identified patient characteristics and questionnaire responses which were most associated with the development of oesophageal cancer. Using the SPIT dataset, we trained seven different machine learning models, selecting the best area under the receiver operator curve (AUC) to create our final model. We further applied a cost function to maximise cancer detection. We then independently validated the model using the RISQ dataset.

Results: 807 patients were included in model training and testing, split in a 70:30 ratio. 294 patients were included in model validation. The best model during training was regularised logistic regression using 17 features (median AUC: 0.81, interquartile range (IQR): 0.69-0.85). For testing and validation datasets, the model achieved an AUC of 0.71 (95% CI: 0.61-0.81) and 0.92 (95% CI: 0.88-0.96) respectively. At a set cut off, our model achieved a sensitivity of 97.6% and specificity of 59.1%. We additionally piloted the model in 12 patients with gastric cancer; 9/12 (75%) of patients were correctly classified.

Conclusions: We have developed and validated a risk stratification tool using a questionnaire approach. This could aid prioritising patients at high risk of having oesophageal cancer for endoscopy. Our tool could help address endoscopic backlogs caused by the COVID-19 pandemic.

Keywords: XX.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig 1
Fig. 1
Process flow for model training, testing and validation.
Fig 2
Fig. 2
Boxplot of each model and their spread of area under the receiver operating characteristic curve (ROC) for each model. Table demonstrates median ROC, sensitivity and specificity and their respective inter-quartile range (IQR) during model development using 10-fold cross-validation.
Fig 3
Fig. 3
ROC curve for training, testing and validation datasets for regularised logistic regression for predicting oesophageal and GOJ cancer.

References

    1. Arnold M., et al. Global burden of 5 major types of gastrointestinal cancer. Gastroenterology. 2020;159:335–349. e15. - PMC - PubMed
    1. Arnold M., Soerjomataram I., Ferlay J., Forman D. Global incidence of oesophageal cancer by histological subtype in 2012. Gut. 2015;64:381–387. - PubMed
    1. Abnet C.C., Arnold M., Wei W.Q. Epidemiology of esophageal squamous cell carcinoma. Gastroenterology. 2018;154:360–373. - PMC - PubMed
    1. Coleman H.G., Xie S.H., Lagergren J. The epidemiology of esophageal adenocarcinoma. Gastroenterology. 2018;154:390–405. - PubMed
    1. Cancer Research UK . Cancer research uk. 2018. Cancer statistics for the UK.http://www.cancerresearchuk.org/health-professional/cancer-statistics Available at. Accessed 28th July 2022.