Detecting DIF with the Multi-Unidimensional Pairwise Preference Model: Lord's Chi-square and IPR-NCDIF Methods

Appl Psychol Meas. 2025 Jul 1:01466216251351949. doi: 10.1177/01466216251351949. Online ahead of print.

Lavanya S Kumar¹, Naidan Tu², Sean Joo³, Stephen Stark¹

Affiliations

¹ Department of Psychology, University of South Florida, Tampa, FL, USA.
² Department of Psychological Sciences, Kansas State University, Manhattan, KS, USA.
³ Department of Educational Psychology, University of Kansas, Lawrence, KS, USA.

PMID: 40612447
PMCID: PMC12213542
DOI: 10.1177/01466216251351949

Lavanya S Kumar et al. Appl Psychol Meas. 2025.

. 2025 Jul 1:01466216251351949.

doi: 10.1177/01466216251351949. Online ahead of print.

Authors

Lavanya S Kumar¹, Naidan Tu², Sean Joo³, Stephen Stark¹

Affiliations

¹ Department of Psychology, University of South Florida, Tampa, FL, USA.
² Department of Psychological Sciences, Kansas State University, Manhattan, KS, USA.
³ Department of Educational Psychology, University of Kansas, Lawrence, KS, USA.

PMID: 40612447
PMCID: PMC12213542
DOI: 10.1177/01466216251351949

Abstract

Multidimensional forced choice (MFC) measures are gaining prominence in noncognitive assessment. Yet there has been little research on detecting differential item functioning (DIF) with models for forced choice measures. This research extended two well-known DIF detection methods to MFC measures. Specifically, the performance of Lord's chi-square and item parameter replication (IPR) methods with MFC tests based on the Multi-Unidimensional Pairwise Preference (MUPP) model was investigated. The Type I error rate and power of the DIF detection methods were examined in a Monte Carlo simulation that manipulated sample size, impact, DIF source, and DIF magnitude. Both methods showed consistent power and were found to control Type I error well across study conditions, indicating that established approaches to DIF detection work well with the MUPP model. Lord's chi-square outperformed the IPR method when DIF source was statement discrimination while the opposite was true when DIF source was statement threshold. Also, both methods performed similarly and showed better power when DIF source was statement location, in line with previous research. Study implications and practical recommendations for DIF detection with MFC tests, as well as limitations, are discussed.

Keywords: differential item functioning; item response theory; linking; measurement invariance; multi-unidimensional pairwise preference model; multidimensional forced choice.
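As a rough illustration of the first method named in the abstract, Lord's chi-square compares an item's parameter estimates across the reference and focal groups, after linking to a common metric, by weighting the difference vector with the inverse of the summed sampling covariance matrices. The sketch below is a generic version of that statistic, not the authors' implementation; the function names and the example numbers are hypothetical, and which MUPP statement parameters enter the vector is left to the analyst.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("summed covariance matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        rest = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - rest) / m[i][i]
    return x


def lords_chi_square(ref_est, foc_est, ref_cov, foc_cov):
    """Return d' (S_ref + S_foc)^-1 d, where d = foc_est - ref_est.

    Inputs are assumed to be on a common metric already (hypothetical
    linked estimates). Compare the result with a chi-square critical
    value whose degrees of freedom equal len(ref_est).
    """
    n = len(ref_est)
    diff = [foc_est[i] - ref_est[i] for i in range(n)]
    summed = [[ref_cov[i][j] + foc_cov[i][j] for j in range(n)]
              for i in range(n)]
    weighted = solve_linear(summed, diff)
    return sum(diff[i] * weighted[i] for i in range(n))


# Hypothetical two-parameter example (values are made up for illustration).
stat = lords_chi_square(
    ref_est=[1.0, 0.5],
    foc_est=[1.5, 0.5],
    ref_cov=[[0.04, 0.0], [0.0, 0.09]],
    foc_cov=[[0.06, 0.0], [0.0, 0.07]],
)
```

Solving the linear system directly avoids forming an explicit matrix inverse, which is usually the more stable choice when the summed covariance matrix is poorly conditioned.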

Conflict of interest statement

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

**Figure 1.**
Comparison of Reference and Focal MUPP IRFs of a Simulated Item in 1.20 𝛼 DIF Condition. (a) Reference group IRF (𝛼_𝑠= 0.98, δ_𝑠= 0.79, τ_𝑠= –0.70, 𝛼_𝑡= 1.21, δ_𝑡= 1.10, τ_𝑡= –0.83). (b) Focal group IRF (𝛼_𝑠= 2.18). (c) Squared differences between reference and focal group IRFs.

**Figure 2.**
Comparison of Reference and Focal MUPP IRFs of a Simulated Item in 1.20 δ DIF Condition. (a) Focal group IRF (δ_𝑠= 1.99; Reference group IRF & item parameters in Figure 1(a)). (b) Squared differences between reference and focal group IRFs.

**Figure 3.**
Comparison of Reference and Focal MUPP IRFs of a Simulated Item in 1.20 τ DIF Condition. (a) Focal group IRF (τ_𝑠= –1.90; Reference group IRF & item parameters in Figure 1(a)). (b) Squared differences between reference and focal group IRFs.
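
The quantities plotted in these figures can be approximated with a short sketch. It assumes that statement endorsement follows the dichotomous generalized graded unfolding model (GGUM), which the (α, δ, τ) parameterization in the captions suggests but this record does not state, and that the MUPP preference probability combines the two statement probabilities in the usual way. The function names, the trait grid, and the unweighted averaging are illustrative choices, not the authors' procedure; NCDIF proper averages squared differences over the focal group's trait values.

```python
import math


def ggum_agree(theta, alpha, delta, tau):
    """Probability of endorsing one statement (dichotomous GGUM, assumed)."""
    d = theta - delta
    num = math.exp(alpha * (d - tau)) + math.exp(alpha * (2 * d - tau))
    return num / (1.0 + math.exp(3 * alpha * d) + num)


def mupp_prob(theta_s, theta_t, s, t):
    """Probability of preferring statement s over statement t in a pair."""
    p_s = ggum_agree(theta_s, *s)
    p_t = ggum_agree(theta_t, *t)
    prefer_s = p_s * (1.0 - p_t)
    prefer_t = (1.0 - p_s) * p_t
    return prefer_s / (prefer_s + prefer_t)


# (alpha, delta, tau) values taken from the Figure 1 caption.
REF_S = (0.98, 0.79, -0.70)
REF_T = (1.21, 1.10, -0.83)
# Focal group in the 1.20 alpha DIF condition; the other parameters of
# statement s are assumed unchanged.
FOC_S = (2.18, 0.79, -0.70)

# Hypothetical evaluation grid on both trait dimensions: -3.0 to 3.0.
GRID = [-3.0 + 0.5 * i for i in range(13)]


def mean_squared_difference(ref_s, foc_s, t, grid):
    """Unweighted grid average of squared reference-focal IRF differences."""
    total = 0.0
    for theta_s in grid:
        for theta_t in grid:
            gap = (mupp_prob(theta_s, theta_t, foc_s, t)
                   - mupp_prob(theta_s, theta_t, ref_s, t))
            total += gap * gap
    return total / (len(grid) ** 2)


index_alpha_dif = mean_squared_difference(REF_S, FOC_S, REF_T, GRID)
```

Swapping `FOC_S` for `(0.98, 1.99, -0.70)` or `(0.98, 0.79, -1.90)` gives the corresponding sketch for the δ and τ conditions described in the Figure 2 and Figure 3 captions.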

