Rasch fit statistics as a test of the invariance of item parameter estimates
- PMID: 12748407
Abstract
The invariance of the estimated parameters across variation in the incidental parameters of a sample is one of the most important properties of Rasch measurement models. This is the property that allows the equating of test forms and the use of computer adaptive testing. It necessarily follows that in Rasch models, if the data fit the model, then the estimates of the parameters of interest must be invariant across sub-samples of the items or persons. This study investigates the degree to which the INFIT and OUTFIT item fit statistics in WINSTEPS detect violations of the invariance property of Rasch measurement models. The test in this study is an 80-item multiple-choice test used to assess mathematics competency. The WINSTEPS analysis of the dichotomous results, based on a sample of 2,000 from a very large number of students who took the exam, indicated that only 7 of the 80 items misfit using the 1.3 mean square criterion advocated by Linacre and Wright. Subsequent calibration of separate samples of 1,000 students from the upper and lower thirds of the person raw score distribution, followed by a t-test comparison of the item calibrations, indicated that the item difficulties for 60 of the 80 items were more than 2 standard errors apart. The separate calibration t-values ranged from +21.00 to -7.00, with the t-values of 41 of the 80 comparisons either larger than +5 or smaller than -5. Clearly, these data do not exhibit the invariance of the item parameters expected if the data fit the model. Yet the INFIT and OUTFIT mean squares are completely insensitive to the lack of invariance in the item parameters. If the OUTFIT ZSTD from WINSTEPS were used with a critical value of |t| > 2.0, then 56 of the 60 items identified by the separate calibration t-test would be identified as misfitting. A fourth measure of misfit, the between ability-group item fit statistic, identified 69 items as misfitting when a critical value of t > 2.0 was used. Clearly, relying solely on the INFIT and OUTFIT mean squares in WINSTEPS to assess the fit of the data to the model would cause one to miss one of the most important threats to the usefulness of the measurement model.
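As a rough illustration of the two kinds of check contrasted in the abstract, the following Python sketch computes a separate-calibration t-value for one item and the INFIT and OUTFIT mean squares for one dichotomous item. It is not the WINSTEPS implementation. The function names, the sign convention (upper-group difficulty minus lower-group difficulty), and the assumption that the two calibrations are independent and already on a common scale are all illustrative choices not specified in the abstract; the mean-square formulas are the standard textbook definitions.

```python
import math


def separate_calibration_t(d_low, se_low, d_high, se_high):
    """t-value comparing two independent difficulty estimates of one item.

    Assumption (not stated in the abstract): the sign is taken as
    upper-group minus lower-group, and both calibrations are on a
    common scale with independent standard errors.
    """
    return (d_high - d_low) / math.sqrt(se_low ** 2 + se_high ** 2)


def rasch_probability(theta, b):
    """Probability of a correct response under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_mean_squares(responses, thetas, b):
    """Return (infit, outfit) mean squares for one dichotomous item.

    outfit = mean of squared standardized residuals
    infit  = sum of squared residuals / sum of model variances
    """
    sq_resid_sum = 0.0   # numerator of INFIT
    variance_sum = 0.0   # denominator of INFIT
    z_sq_sum = 0.0       # sum of squared standardized residuals (OUTFIT)
    for x, theta in zip(responses, thetas):
        p = rasch_probability(theta, b)
        variance = p * (1.0 - p)
        sq_resid = (x - p) ** 2
        sq_resid_sum += sq_resid
        variance_sum += variance
        z_sq_sum += sq_resid / variance
    infit = sq_resid_sum / variance_sum
    outfit = z_sq_sum / len(responses)
    return infit, outfit


def exceeds(value, critical):
    """True when the absolute value is beyond the critical value."""
    return abs(value) > critical


# Hypothetical numbers for a single item, for illustration only.
t = separate_calibration_t(d_low=0.0, se_low=0.3, d_high=1.0, se_high=0.4)
infit, outfit = item_mean_squares([1, 0, 1, 0], [0.0, 0.0, 0.0, 0.0], b=0.0)
flag_by_t = exceeds(t, 2.0)          # separate-calibration criterion, |t| > 2.0
flag_by_mean_square = outfit > 1.3   # mean square criterion cited in the abstract
```

The sketch makes it easy to see how an item can pass the mean square criterion while its difficulty estimates from two ability groups still differ by many standard errors, since the two statistics summarize different aspects of the residuals.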