Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul;34(7):3012-3020.
doi: 10.1007/s00586-025-08894-x. Epub 2025 May 17.

How reliable are the Core Outcome Measures Index and Global Treatment Outcome as measures of treatment success after surgery for central lumbar spinal canal stenosis?

Affiliations

How reliable are the Core Outcome Measures Index and Global Treatment Outcome as measures of treatment success after surgery for central lumbar spinal canal stenosis?

Selina Nauer et al. Eur Spine J. 2025 Jul.

Abstract

Introduction: The reliability of patient-reported outcome measures is typically evaluated in patients with chronic or stable symptoms, pre-treatment. After treatment, symptoms may be less extreme and less variable within a group, which may influence indices of reliability. Further, few studies have examined the reliability of retrospective, single item global assessment scores after treatment, despite the latter often being used as the external criterion when determining an instrument's minimal clinically important change (MCIC) score. This study examined the stability of Core Outcome Measures Index (COMI) and Global Treatment Outcome (GTO) scores after spine surgery.

Methods: Data were extracted for patients with lumbar spinal stenosis who had completed the COMI and GTO twice at follow-up, for both an in-house outcomes database (ROUTINE) and as part of a separate prospective study (LSOS). To be included, the questionnaires had to have been completed within 3 months of each other for 1-year follow-up (FU), within 4 months for 2-year FU and within 5 months for 5-year FU. Repeated measures ANOVA, intraclass correlation coefficients (ICC; 2-way mixed, absolute agreement), and weighted Kappa values were calculated.

Results: 64 patients (72.9 ± 6.9 y; 48% female) had a COMI and GTO available from both ROUTINE and LSOS, completed on average 9.8 ± 6.2 weeks apart. There were no significant differences between test and retest scores for any of the COMI domains or for the COMI summary score (all p > 0.05), and ICCs/weighted Kappas were moderate to good (0.73-0.87). In the ROUTINE and LSOS datasets, 83% and 81% patients, respectively, reported a "good global outcome" on the GTO (i.e. treatment helped/helped a lot); the corresponding Kappa for agreement between the ratings on an individual basis was 0.74 ("good agreement").

Conclusion: Despite the more stringent nature of the evaluation given by this "real-life" analysis of the stability of outcome scores, the FU COMI scores showed moderate to good reliability comparable to that reported in previous studies with reliability assessed 1-2 weeks apart. The GTO was also confirmed as a reliable variable yielding stable values at mid to longer term follow-up.

Keywords: Core Outcome Measures Index (COMI); Global Treatment Outcome (GTO); Patient-reported outcomes; Reliability; Spinal canal stenosis.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Similar articles

References

    1. Davidson M, Keating J (2014) Patient-reported outcome measures (PROMs): how should I interpret reports of measurement properties? A practical guide for clinicians and researchers who are not biostatisticians. Br J Sports Med 48:792–796. https://doi.org/10.1136/bjsports-2012-091704 - DOI - PubMed
    1. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, Bouter LM, de Vet HC (2007) Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol 60:34–42 - PubMed
    1. Abdeldaiem A, Saweeres ESB, Shehab-Eldien AA, Mannion AF, Rehan Youssef A (2020) Cross-cultural adaptation and validation of the Arabic version of the core outcome measures index for the back (COMI-back) in patients with non-specific low back pain. Eur Spine J 29:2413–2430. https://doi.org/10.1007/s00586-020-06530-4 - DOI - PubMed
    1. Streiner DL, Norman GR (2003) Health measurement scales: A practical guide to their development and use. Oxford University Press
    1. Ostelo RW, Deyo RA, Stratford P, Waddell G, Croft P, Von Korff M, Bouter LM, de Vet HC (2008) Interpreting change scores for pain and functional status in low back pain: towards international consensus regarding minimal important change. Spine (Phila Pa 1976) 33:90–94. https://doi.org/10.1097/BRS.0b013e31815e3a10 - DOI - PubMed

LinkOut - more resources