Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Randomized Controlled Trial
. 2022 Aug 12;22(1):224.
doi: 10.1186/s12874-022-01680-z.

Evaluating the impact of calibration of patient-reported outcomes measures on results from randomized clinical trials: a simulation study based on Rasch measurement theory

Affiliations
Randomized Controlled Trial

Evaluating the impact of calibration of patient-reported outcomes measures on results from randomized clinical trials: a simulation study based on Rasch measurement theory

Angély Loubert et al. BMC Med Res Methodol. .

Abstract

Background: Meaningfully interpreting patient-reported outcomes (PRO) results from randomized clinical trials requires that the PRO scores obtained in the trial have the same meaning across patients and previous applications of the PRO instrument. Calibration of PRO instruments warrants this property. In the Rasch measurement theory (RMT) framework, calibration is performed by fixing the item parameter estimates when measuring the targeted concept for each individual of the trial. The item parameter estimates used for this purpose are typically obtained from a previous "calibration" study. But imposing this constraint on item parameters, instead of freely estimating them directly in the specific sample of the trial, may hamper the ability to detect a treatment effect. The objective of this simulation study was to explore the potential negative impact of calibration of PRO instruments that were developed using RMT on the comparison of results between treatment groups, using different analysis methods.

Methods: PRO results were simulated following a polytomous Rasch model, for a calibration and a trial sample. Scenarios included varying sample sizes, with instrument of varying number of items and modalities, and varying item parameters distributions. Different treatment effect sizes and distributions of the two patient samples were also explored. Cross-sectional comparison of treatment groups was performed using different methods based on a random effect Rasch model. Calibrated and non-calibrated approaches were compared based on type-I error, power, bias, and variance of the estimates for the difference between groups.

Results: There was no impact of the calibration approach on type-I error, power, bias, and dispersion of the estimates. Among other findings, mistargeting between the PRO instrument and patients from the trial sample (regarding the level of measured concept) resulted in a lower power and higher position bias than appropriate targeting.

Conclusions: Calibration does not compromise the ability to accurately assess a treatment effect using a PRO instrument developed within the RMT paradigm in randomized clinical trials. Thus, given its essential role in producing interpretable results, calibration should always be performed when using a PRO instrument developed using RMT as an endpoint in a randomized clinical trial.

Keywords: Calibration; Clinical trials; Patient-reported outcomes; Rasch measurement theory.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interest.

Figures

Fig. 1
Fig. 1
Illustration of the archetypes of items distribution, for different scenarios. Legend: Vertical dashed lines represent the item response category thresholds (δjl, with each color corresponding to a different item) in different scenarios, and the probability density function curve represents the distribution of the latent trait in the calibration sample (case with a variance = 1). The left part of the figure includes cases where the item locations δj have a low dispersion (range = 0.5) and the δjl have a high dispersion (SD = 2.5). The right part of the figure includes cases where the item locations δj have a high dispersion (range = 2) and the δjl have a low dispersion (SD = 1.5). Each line corresponds to different scenarios regarding the number of item and modalities: A) J = 4 items, M = 3 modalities. B) J = 4 items, M = 5 modalities. C) J = 10 items, M = 5 modalities. Full values for the response category thresholds δjl are provided in supplementary materials (Additional file 1)
Fig. 2
Fig. 2
Power using calibrated and non-calibrated approaches, depending on mistargeting of the trial sample μ. Legend: power is presented for instruments with varying number of items J and modalities M. Presented results are for comparison of treatment groups based on γ^, for scenarios with the distribution of the item parameters = second archetype with SD of 1.5 and range of 2, γ = 0.2, Ntrial = 500, Ncalibration = 250, variance of the calibration sample = 1

Similar articles

Cited by

References

    1. Health, U.D.o. and H. Services, Guidance for industry-Patient-reported outcome measures: Use in medical product development to support labeling claims. https://www.fda.gov/media/77832/download, 2009. - PMC - PubMed
    1. Mercieca-Bebber R, et al. The importance of patient-reported outcomes in clinical trials and strategies for future optimization. Patient related outcome measures. 2018;9:353. doi: 10.2147/PROM.S156279. - DOI - PMC - PubMed
    1. Fayers P, et al. EORTC QLQ-C30 scoring manual (ed 3). Brussels, Belgium. 2001. EORTC publications.
    1. Phillips SD, et al. A careful consideration of the calibration concept. J Res National Instit Standards Tech. 2001;106(2):371. doi: 10.6028/jres.106.014. - DOI - PMC - PubMed
    1. Fisher Jr WPJM. Invariance and traceability for measures of human, social, and natural capital. Theory Appl. 2009;42(9):1278–1287.

Publication types

LinkOut - more resources