Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Dec;83(12):1745-52.
doi: 10.1053/apmr.2002.35657.

Impact of quality scales on levels of evidence inferred from a systematic review of exercise therapy and low back pain

Affiliations

Impact of quality scales on levels of evidence inferred from a systematic review of exercise therapy and low back pain

Florence Colle et al. Arch Phys Med Rehabil. 2002 Dec.

Abstract

Objective: To assess whether the scale used affects levels of evidence inferred from a systematic review of studies on exercise therapy and chronic low back pain (LBP).

Design: Twenty trials previously analyzed in a systematic review were assessed by 2 readers using 16 different scales.

Setting: Tertiary care teaching hospital in France.

Participants: Chronic LBP patients.

Interventions: Not applicable.

Main outcome measure: For the scales allowing classification into high- and low-quality trials, a rating system with 4 levels of evidence was used to summarize conclusions drawn. The Spearman rank correlation coefficient was used to assess correlations between the scores obtained with the different scales. Interrater reliability of the scales was assessed with the intraclass correlation coefficient and the Bland and Altman method, and the degree of agreement between the readers was calculated using the kappa coefficient.

Results: Two of the 3 main results of the systematic review (conflicting evidence on the effectiveness of exercise therapy compared with inactive treatments; strong evidence that exercise therapy is more effective than usual care by a general practitioner) were influenced by the scale used. The range of the Spearman rank correlation coefficients between the different scales was wide (range,.49-.94), the interreader reliability of the scales was heterogeneous, and the interreader agreement was often low (kappa<or=.60 for 7/10 tests).

Conclusions: The use of summary scores to identify physical therapy trials of high quality is questionable. Different quality assessment scales should probably be used to assess pharmacologic interventions and physical therapies. Development and validation of quality scales specific to physical treatments are needed.

PubMed Disclaimer

Publication types

LinkOut - more resources