Distributional regression modeling via generalized additive models for location, scale, and shape: An overview through a data set from learning analytics

Affiliations

¹ Centre for Change and Complexity in Learning, University of South Australia, Adelaide, Australia.
² Instituto de Estadística, Universidad de Valparaíso, Valparaíso, Chile.
³ Department of Statistical Modelling, Institute of Computer Science of the Czech Academy of Sciences, Prague, Czech Republic.
⁴ Czech Institute of Informatics, Robotics and Cybernetics, CTU, Prague, Czech Republic.
⁵ Computer Science Education/Computer Science and Society Research Group, Humboldt University of Berlin, Berlin, Germany.
⁶ Departamento de Estadística, Pontificia Universidad Católica de Chile, Santiago de Chile, Chile.
⁷ Campus Institute Data Science (CIDAS) and Chair of Statistics, Georg-August-Universität Göttingen, Göttingen, Germany.
⁸ Seminar for Statistics, ETH Zürich, Zürich, Switzerland.
⁹ Epidemiology, Biostatistics, and Prevention Institute, University of Zurich, Zurich, Switzerland.
¹⁰ Institute of Data Analysis and Process Design, Zurich University of Applied Sciences, Winterthur, Switzerland.
¹¹ Department of Statistics, TU Dortmund University, Dortmund, Germany.
¹² Department of Statistics, CASTLab, Federal University of Pernambuco, Recife, Brazil.

PMID: 37502671
PMCID: PMC10369920
DOI: 10.1002/widm.1479

Review

Fernando Marmolejo-Ramos et al. Wiley Interdiscip Rev Data Min Knowl Discov. 2023 Jan-Feb;13(1):e1479. doi: 10.1002/widm.1479. Epub 2022 Oct 21.

Abstract

Technological developments now make it possible to gather large amounts of data in many research fields. Learning analytics (LA)/educational data mining has access to big, observational, unstructured data captured from educational settings and relies mostly on unsupervised machine learning (ML) algorithms to make sense of such data. Generalized additive models for location, scale, and shape (GAMLSS) are a supervised statistical learning framework that allows all the parameters of the response variable's distribution to be modeled as functions of the explanatory variables. This article gives an overview of the power and flexibility of GAMLSS in relation to some ML techniques. It also comments briefly on how GAMLSS can be tailored toward causality via causal regularization. The overview is illustrated with a data set from the field of LA. This article is categorized under: Application Areas > Education and Learning; Algorithmic Development > Statistics; Technologies > Machine Learning.

Keywords: causal regularization; causality; educational data mining; generalized additive models for location, scale, and shape; learning analytics; machine learning; statistical learning; statistical modeling; supervised learning.
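
The abstract's central idea, that every parameter of the response distribution can depend on the explanatory variables, can be illustrated with a minimal sketch. It is not the authors' code and does not use the GAMLSS software: it assumes a Gaussian response, a single covariate `x`, an identity link for the mean and a log link for the standard deviation, and it evaluates the likelihood at known parameter values on simulated data rather than running a fitting algorithm. All names (`gaussian_nll`, `true_params`, `const_sd`) are hypothetical.

```python
import math
import random


def gaussian_nll(params, xs, ys):
    """Negative log-likelihood when both the mean and the log-sd are linear in x."""
    b0, b1, g0, g1 = params
    total = 0.0
    for x, y in zip(xs, ys):
        mu = b0 + b1 * x                # identity link for the location
        sigma = math.exp(g0 + g1 * x)   # log link keeps the scale positive
        z = (y - mu) / sigma
        total += 0.5 * math.log(2 * math.pi) + math.log(sigma) + 0.5 * z * z
    return total


# Simulated (hypothetical) data whose spread grows with x.
rng = random.Random(0)
true_params = (1.0, 2.0, -1.0, 1.5)
xs = [rng.uniform(0.0, 1.0) for _ in range(2000)]
ys = [
    rng.gauss(true_params[0] + true_params[1] * x,
              math.exp(true_params[2] + true_params[3] * x))
    for x in xs
]

# Location-only comparison: same mean, one constant sd
# (the maximum likelihood estimate of the sd given that mean).
const_sd = math.sqrt(
    sum((y - (1.0 + 2.0 * x)) ** 2 for x, y in zip(xs, ys)) / len(xs)
)

nll_location_scale = gaussian_nll(true_params, xs, ys)
nll_location_only = gaussian_nll((1.0, 2.0, math.log(const_sd), 0.0), xs, ys)

# A lower value means the model describes the data better.
print(nll_location_scale, nll_location_only)
```

When the spread of the response really does change with the covariate, the model that also links the scale to `x` attains the lower negative log-likelihood. Distributional regression extends the same idea to further shape parameters and to smooth, nonlinear terms.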

Conflict of interest statement

The authors have declared no conflicts of interest for this article.

Figures

**FIGURE 1**
Course schedule (timeline in weeks).

**FIGURE 2**
Kernel density estimates of FAS superimposed on a histogram (a) and empirical and theoretical CDFs of FAS (b). The vertical dotted line in the left plot indicates the variable's mean. The black line in the right plot shows the ECDF of FAS, and the colored lines represent five theoretical CDFs (ranked from best fit, GB1, to worst fit, RG). CDF, cumulative distribution function; ECDF, empirical CDF; GB1, generalized beta type 1; RG, reverse Gumbel.

**FIGURE 3**
Kernel density estimates of FAS conditioned on the covariates gender (two levels; F = females and M = males), disability (two levels; first row = disability, second row = no disability), and highest education (five levels). The graph also shows that the data are imbalanced: not all combinations of covariate levels have values. While there are FAS values for people without a disability at all education levels, there are FAS values for people with disabilities at only three education levels.

**FIGURE 4**
Diagnostic worm plots for assessing the fit of models to the FAS variable using the generalized beta type 1 (GB1) distribution (a), skew t‐distribution type 2 (ST2) (b), beta (BE) distribution (c), and normal (NO) distribution (d). A good fit is indicated by ≈ 95% of values lying between the two green dotted elliptic lines and close to the deviation value of 0.0. In this example, the GB1 and ST2 distributions fit most of the data well but struggle to fit the values in the tails of the FAS variable (although the ST2 distribution models the right tail of the data better than the GB1 distribution). Compared to the GB1 and ST2 models, BE and NO exhibit a poor fit overall.
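
The quantities a worm plot displays can be sketched as follows. This is an illustration under stated assumptions, not the plotting routine used for the figure: it assumes the model's residuals have already been transformed to be approximately standard normal under a good fit, and it uses the common pointwise 95% band of half-width 1.96 · sqrt(p(1 − p)/n) / φ(q) for the elliptic lines. The function name `worm_points` and the simulated residuals are hypothetical.

```python
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def worm_points(residuals):
    """Return (theoretical quantile, deviation, 95% half-width) per residual."""
    n = len(residuals)
    points = []
    for i, r in enumerate(sorted(residuals), start=1):
        p = (i - 0.5) / n                 # plotting position
        q = STD_NORMAL.inv_cdf(p)         # theoretical normal quantile
        # Approximate pointwise standard error of the ordered residual.
        half_width = 1.96 * (p * (1 - p) / n) ** 0.5 / STD_NORMAL.pdf(q)
        points.append((q, r - q, half_width))  # detrended: residual minus quantile
    return points


# Hypothetical residuals from a well-specified model.
rng = random.Random(1)
residuals = [rng.gauss(0.0, 1.0) for _ in range(1000)]

points = worm_points(residuals)
inside = sum(abs(dev) <= hw for _, dev, hw in points) / len(points)
print(f"share of points inside the band: {inside:.2f}")
```

Plotting the deviation against the theoretical quantile, with the half-width drawn above and below zero, gives the worm and its band. Systematic shapes (a slope, a U, or an S) point to misfit in scale, skewness, or kurtosis respectively.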

**FIGURE 5**
Term plot for the μ submodel when it includes a smooth term (P‐splines) on the covariate “number of clicks” (a). Plot (b) shows the diagnostic worm plot for assessing the fit of the GB1 model.

**FIGURE 6**
Worm plot for the GB1 model when the μ, σ, ν, and τ parameters were modeled.

**FIGURE 7**
Violin plots of the cross‐validation results. The mean and its 95% confidence interval (CI) are represented by the red dots and error bars. The dot plots overlaid on each violin plot represent the result of each fold of the 10‐fold cross‐validation.
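
The summary shown in such a figure (one score per fold, then a mean with a 95% CI) can be sketched as below. The caption does not state which model or score was cross‐validated, so the placeholder predictor (the training mean), the squared‐error score, the simulated response, and the t‐based interval are all assumptions made for illustration. The names `k_fold_indices` and `cross_validate` are hypothetical.

```python
import random
from statistics import mean, stdev


def k_fold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k disjoint folds."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validate(ys, k=10):
    """Return one held-out mean squared error per fold for a mean-only predictor."""
    scores = []
    for held_out in k_fold_indices(len(ys), k):
        held = set(held_out)
        train = [y for i, y in enumerate(ys) if i not in held]
        prediction = mean(train)  # placeholder model: predict the training mean
        scores.append(mean((ys[i] - prediction) ** 2 for i in held_out))
    return scores


# Hypothetical bounded response, simulated for the example.
rng = random.Random(42)
ys = [rng.betavariate(5, 2) for _ in range(300)]

scores = cross_validate(ys, k=10)

T_CRIT_9DF = 2.262  # two-sided 95% t quantile for 10 - 1 = 9 degrees of freedom
half_width = T_CRIT_9DF * stdev(scores) / len(scores) ** 0.5
ci = (mean(scores) - half_width, mean(scores) + half_width)
print(mean(scores), ci)
```

Replacing the placeholder predictor with a fitted model and the squared error with the chosen score yields the ten per‐fold values that the dots represent, and `ci` corresponds to the error bars.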
