J Rheumatol. 2004 Jan;31(1):125-32.

Radiological scoring methods in ankylosing spondylitis. Reliability and change over 1 and 2 years

PMID: 14705231

Anneke Spoorenberg et al. J Rheumatol. 2004 Jan.

Abstract

Objective: To compare reliability and change over time of radiological scoring methods in ankylosing spondylitis (AS).

Methods: Two trained observers scored 217 sets of radiographs from baseline and from 1 and 2 years' followup. Sacroiliac (SI) joints were graded 0-4 by the New York method and the Stoke Ankylosing Spondylitis Spine Score (SASSS). Hips and cervical and lumbar spine were graded 0-4 by the Bath Ankylosing Spondylitis Radiology Index (BASRI). BASRI spinal scores and New York SI were combined into BASRI-spine (score 2-12) and, with the addition of BASRI-hips, into BASRI-total (2-16). Cervical and lumbar spine were also scored in detail (SASSS, 0-36 each) and were combined into SASSS-total or "modified" SASSS (both range 0-72). To assess change, a smallest detectable difference (SDD) was estimated for data on a quasi-interval scale.
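The way the composite scores are built from their components can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class and function names are hypothetical, the composites are taken to be simple sums of the component grades (which is consistent with the score ranges given above, but the abstract does not spell out the arithmetic), and no scoring rules beyond the stated ranges are implied.

```python
from dataclasses import dataclass


@dataclass
class BasriGrades:
    """Hypothetical container for one patient's 0-4 grades."""
    si: int        # sacroiliac joints, New York method
    cervical: int  # cervical spine, BASRI
    lumbar: int    # lumbar spine, BASRI
    hips: int      # hips, BASRI


def _check_grade(name: str, value: int, low: int = 0, high: int = 4) -> None:
    # Each component is graded on the scale stated in the abstract.
    if not low <= value <= high:
        raise ValueError(f"{name} must be between {low} and {high}, got {value}")


def basri_spine(g: BasriGrades) -> int:
    """Assumed sum of SI, cervical and lumbar grades (reported range 2-12)."""
    for name in ("si", "cervical", "lumbar"):
        _check_grade(name, getattr(g, name))
    return g.si + g.cervical + g.lumbar


def basri_total(g: BasriGrades) -> int:
    """Assumed BASRI-spine plus the hip grade (reported range 2-16)."""
    _check_grade("hips", g.hips)
    return basri_spine(g) + g.hips


def sasss_total(cervical: int, lumbar: int) -> int:
    """Assumed sum of cervical and lumbar SASSS, 0-36 each (range 0-72)."""
    _check_grade("cervical", cervical, 0, 36)
    _check_grade("lumbar", lumbar, 0, 36)
    return cervical + lumbar


if __name__ == "__main__":
    patient = BasriGrades(si=3, cervical=1, lumbar=2, hips=0)
    print(basri_spine(patient), basri_total(patient), sasss_total(10, 14))
```

The reported lower bound of 2 for the BASRI composites is not enforced here, since the abstract does not say where it comes from.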

Results: The SI scoring methods showed intra- and interobserver kappa between 0.36 and 0.70. The BASRI-hip reached kappa between 0.59 and 0.84. Combined SASSS scores were most reliable, with intra- and interobserver intraclass correlation coefficients (ICC) between 0.90 and 0.96. The ICCs of the combined BASRI scores were also very good, ranging from 0.85 to 0.95. For SI New York, SI SASSS, and BASRI-hip, 0.3-1.2% of patients deteriorated 1 grade; 7.5% deteriorated 1 grade (6.3% of maximum score) in BASRI-spine and BASRI-total, and observers agreed in up to 48% of the cases that no change occurred. The SDD was lowest (7.5; 10% of maximum score) for "modified" SASSS. Only 0.8% of patients deteriorated more than the SDD, and observers agreed in up to 92% of the cases that no change occurred.
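The "percent of maximum score" figures can be checked with simple arithmetic, and the idea behind an SDD can be illustrated with a short Python sketch. The abstract does not give the SDD formula the authors used; the function below assumes one common formulation (1.96 times the standard deviation of paired interobserver differences, as in Bland-Altman limits of agreement), and all function names and the example differences are hypothetical, not data from the study.

```python
from statistics import stdev
from typing import Sequence


def percent_of_max(value: float, max_score: float) -> float:
    """Express a score change as a percentage of the scale maximum."""
    return 100.0 * value / max_score


def sdd_from_differences(differences: Sequence[float]) -> float:
    """Assumed SDD: 1.96 x SD of paired observer differences.

    This is an illustrative formulation only; the abstract does not
    state which formula was applied.
    """
    return 1.96 * stdev(differences)


if __name__ == "__main__":
    # SDD of 7.5 on the 0-72 "modified" SASSS scale (reported as 10%).
    print(round(percent_of_max(7.5, 72), 1))
    # One grade on a scale with maximum 16 (reported as 6.3%).
    print(round(percent_of_max(1, 16), 2))
    # Made-up interobserver differences, purely to exercise the function.
    print(round(sdd_from_differences([1, -1, 1, -1]), 2))
```

Under this reading, a change smaller than the SDD cannot be distinguished from measurement error between observers, which is why few patients count as having deteriorated when the SDD is large relative to the change that occurs over 2 years.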

Conclusion: Radiological scoring methods for AS are moderately to excellently reliable. Under the selected scoring conditions (concealed time order, average of 2 observers, SDD based on interobserver data, unselected patient population) there was too little change over 2 years to be detected reliably by the scoring methods.
