Penalized optimal scaling for ordinal variables with an application to international classification of functioning core sets
- PMID: 36627229
- DOI: 10.1111/bmsp.12297
Penalized optimal scaling for ordinal variables with an application to international classification of functioning core sets
Abstract
Ordinal data occur frequently in the social sciences. When applying principal component analysis (PCA), however, those data are often treated as numeric, implying linear relationships between the variables at hand; alternatively, non-linear PCA is applied where the obtained quantifications are sometimes hard to interpret. Non-linear PCA for categorical data, also called optimal scoring/scaling, constructs new variables by assigning numerical values to categories such that the proportion of variance in those new variables that is explained by a predefined number of principal components (PCs) is maximized. We propose a penalized version of non-linear PCA for ordinal variables that is a smoothed intermediate between standard PCA on category labels and non-linear PCA as used so far. The new approach is by no means limited to monotonic effects and offers both better interpretability of the non-linear transformation of the category labels and better performance on validation data than unpenalized non-linear PCA and/or standard linear PCA. In particular, an application of penalized optimal scaling to ordinal data as given with the International Classification of Functioning, Disability and Health (ICF) is provided.
Keywords: Likert scale; categorical data; chronic widespread pain; non-linear principal component analysis; optimal scoring; smoothing.
© 2023 The Authors. British Journal of Mathematical and Statistical Psychology published by John Wiley & Sons Ltd on behalf of British Psychological Society.
References
REFERENCES
-
- Bürkner, P.-C., & Charpentier, E. (2020). Modelling monotonic effects of ordinal predictors in bayesian regression models. British Journal of Mathematical and Statistical Psychology, 73(3), 420-451.
-
- Cattell, R. B. (1966). The scree test for the number of factors. Multivariate Behavioral Research, 1(2), 245-276.
-
- Cieza, A., Stucki, G., Weigl, M., Kullmann, L., Stoll, T., Kamen, L., Kostanjsek, N., & Walsh, N. (2004). ICF Core sets for chronic widespread pain. Journal of Rehabilitation Medicine, 36, 63-68.
-
- Eilers, P. H. C., & Marx, B. D. (1996). Flexible smoothing with B-splines and penalties. Statistical Science, 11, 89-121.
-
- Fahrmeir, L., & Tutz, G. (2001). Multivariate statistical modelling based on generalized linear models (2nd ed.). Springer.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
