A Comparison of Label Switching Algorithms in the Context of Growth Mixture Models
- PMID: 34267396
- PMCID: PMC8243206
- DOI: 10.1177/0013164420970614
A Comparison of Label Switching Algorithms in the Context of Growth Mixture Models
Abstract
Simulation studies involving mixture models inevitably aggregate parameter estimates and other output across numerous replications. A primary issue that arises in these methodological investigations is label switching. The current study compares several label switching corrections that are commonly used when dealing with mixture models. A growth mixture model is used in this simulation study, and the design crosses three manipulated variables-number of latent classes, latent class probabilities, and class separation, yielding a total of 18 conditions. Within each of these conditions, the accuracy of a priori identifiability constraints, a priori training of the algorithm, and four post hoc algorithms developed by Tueller et al.; Cho; Stephens; and Rodriguez and Walker are tested to determine their classification accuracy. Findings reveal that, of all a priori methods, training of the algorithm leads to the most accurate classification under all conditions. In a case where an a priori algorithm is not selected, Rodriguez and Walker's algorithm is an excellent choice if interested specifically in aggregating class output without consideration as to whether the classes are accurately ordered. Using any of the post hoc algorithms tested yields improvement over baseline accuracy and is most effective under two-class models when class separation is high. This study found that if the class constraint algorithm was used a priori, it should be combined with a post hoc algorithm for accurate classification.
Keywords: growth mixture model; label switching; mixture modeling; simulation; training data set.
© The Author(s) 2020.
Conflict of interest statement
Declaration of Conflicting Interests: The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Figures
References
-
- Casella G., George E. I. (1992). Explaining the Gibbs sampler. The American Statistician, 46(3), 167-174. 10.1080/00031305.1992.10475878 - DOI
-
- Cho Y. (2013). The mixture distribution polytomous Rasch model used to account for response styles on rating scales: A simulation study of parameter recovery and classification accuracy. [Unpublished doctoral dissertation]. University of Maryland.
-
- Dempster A. P., Laird N. M., Rubin D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39(1), 1-22. 10.1111/j.2517-6161.1977.tb01600.x - DOI
-
- Diebolt J., Robert C. P. (1994). Estimation of finite mixture distributions through Bayesian sampling. Journal of the Royal Statistical Society, Series B, 56, 363-375. 10.1111/j.2517-6161.1994.tb01985.x - DOI
LinkOut - more resources
Full Text Sources