Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec;88(4):1097-1122.
doi: 10.1007/s11336-023-09930-9. Epub 2023 Aug 7.

DIF Statistical Inference Without Knowing Anchoring Items

Affiliations

DIF Statistical Inference Without Knowing Anchoring Items

Yunxiao Chen et al. Psychometrika. 2023 Dec.

Abstract

Establishing the invariance property of an instrument (e.g., a questionnaire or test) is a key step for establishing its measurement validity. Measurement invariance is typically assessed by differential item functioning (DIF) analysis, i.e., detecting DIF items whose response distribution depends not only on the latent trait measured by the instrument but also on the group membership. DIF analysis is confounded by the group difference in the latent trait distributions. Many DIF analyses require knowing several anchor items that are DIF-free in order to draw inferences on whether each of the rest is a DIF item, where the anchor items are used to identify the latent trait distributions. When no prior information on anchor items is available, or some anchor items are misspecified, item purification methods and regularized estimation methods can be used. The former iteratively purifies the anchor set by a stepwise model selection procedure, and the latter selects the DIF-free items by a LASSO-type regularization approach. Unfortunately, unlike the methods based on a correctly specified anchor set, these methods are not guaranteed to provide valid statistical inference (e.g., confidence intervals and p-values). In this paper, we propose a new method for DIF analysis under a multiple indicators and multiple causes (MIMIC) model for DIF. This method adopts a minimal [Formula: see text] norm condition for identifying the latent trait distributions. Without requiring prior knowledge about an anchor set, it can accurately estimate the DIF effects of individual items and further draw valid statistical inferences for quantifying the uncertainty. Specifically, the inference results allow us to control the type-I error for DIF detection, which may not be possible with item purification and regularized estimation methods. We conduct simulation studies to evaluate the performance of the proposed method and compare it with the anchor-set-based likelihood ratio test approach and the LASSO approach. The proposed method is applied to analysing the three personality scales of the Eysenck personality questionnaire-revised (EPQ-R).

Keywords: confidence interval; differential item functioning; item response theory; least absolute deviations; measurement invariance.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The path diagram of a MIMIC model for DIF analysis. The subscript i is omitted for simplicity. The dashed lines from x to Yj indicate the DIF effects.
Fig. 2
Fig. 2
Function h(c)=j=1J|γj-ajc|, where J=10, aj=1 for all j, γj=0 and 1 for j=1,,8 and j=9,10, respectively. The minimal value of h(c) is achieved when c=0.
Algorithm 1
Algorithm 1
.
Algorithm 2
Algorithm 2
.
Fig. 3
Fig. 3
Scatter plots of the coverage rates of the 95% confidence intervals for γj’s. x-axes and y-axes are labelled with item numbers and coverage rates, respectively. Panels ad correspond to our proposed method, and panels eh correspond to the Wald intervals constructed with five anchor items. Blue solid circle corresponds to small dj with high proportion DIF items. Purple solid triangle corresponds to small dj with medium proportion DIF items. Red solid square corresponds to small dj with low proportion DIF items. Blue square cross corresponds to large dj with high proportion DIF items. Purple diamond plus corresponds to large dj with medium proportion DIF items. Red circle plus corresponds to large dj with low proportion DIF items.
Fig. 4
Fig. 4
Plots of 95% confidence intervals for the DIF parameters γjs on scale P, N, and E data sets. The red horizontal lines denote γ=0. Items are arranged according to the increasing P-values.

References

    1. Asparouhov T, Muthén B. Multiple-group factor analysis alignment. Structural Equation Modeling: A Multidisciplinary Journal. 2014;21(4):495–508.
    1. Barnett V, Lewis T. Outliers in statistical data. Hoboken: Wiley; 1994.
    1. Bauer DJ, Belzak WC, Cole VT. Simplifying the assessment of measurement invariance over multiple background variables: Using regularized moderated nonlinear factor analysis to detect differential item functioning. Structural Equation Modeling: A Multidisciplinary Journal. 2020;27(1):43–55. - PMC - PubMed
    1. Bechger TM, Maris G. A statistical test for differential item pair functioning. Psychometrika. 2015;80(2):317–340. - PubMed
    1. Belzak W, Bauer DJ. Improving the assessment of measurement invariance: Using regularization to select anchor items and identify differential item functioning. Psychological Methods. 2020;25(6):673–690. - PMC - PubMed

Publication types

LinkOut - more resources