. 2014 Jan 9;9(1):e84601.

doi: 10.1371/journal.pone.0084601. eCollection 2014.

Generalized linear mixed models for binary data: are matching results from penalized quasi-likelihood and numerical integration less biased?

Andrea Benedetti¹, Robert Platt², Juli Atherton³

Affiliations

¹ Department of Medicine, McGill University, Montreal, Canada ; Department of Epidemiology, Biostatistics & Occupational Health, McGill University, Montreal, Canada ; Respiratory Epidemiology and Clinical Research Unit, Montreal Chest Institute, Montreal, Canada.
² Department of Medicine, McGill University, Montreal, Canada ; Department of Pediatrics, McGill University, Montreal, Canada.
³ Département de Mathématiques, Université de Québec à Montréal, Montreal, Canada.

PMID: 24416249
PMCID: PMC3886992
DOI: 10.1371/journal.pone.0084601

Generalized linear mixed models for binary data: are matching results from penalized quasi-likelihood and numerical integration less biased?

Andrea Benedetti et al. PLoS One. 2014.

. 2014 Jan 9;9(1):e84601.

doi: 10.1371/journal.pone.0084601. eCollection 2014.

Authors

Andrea Benedetti¹, Robert Platt², Juli Atherton³

Affiliations

¹ Department of Medicine, McGill University, Montreal, Canada ; Department of Epidemiology, Biostatistics & Occupational Health, McGill University, Montreal, Canada ; Respiratory Epidemiology and Clinical Research Unit, Montreal Chest Institute, Montreal, Canada.
² Department of Medicine, McGill University, Montreal, Canada ; Department of Pediatrics, McGill University, Montreal, Canada.
³ Département de Mathématiques, Université de Québec à Montréal, Montreal, Canada.

PMID: 24416249
PMCID: PMC3886992
DOI: 10.1371/journal.pone.0084601

Abstract

Background: Over time, adaptive Gaussian Hermite quadrature (QUAD) has become the preferred method for estimating generalized linear mixed models with binary outcomes. However, penalized quasi-likelihood (PQL) is still used frequently. In this work, we systematically evaluated whether matching results from PQL and QUAD indicate less bias in estimated regression coefficients and variance parameters via simulation.

Methods: We performed a simulation study in which we varied the size of the data set, probability of the outcome, variance of the random effect, number of clusters and number of subjects per cluster, etc. We estimated bias in the regression coefficients, odds ratios and variance parameters as estimated via PQL and QUAD. We ascertained if similarity of estimated regression coefficients, odds ratios and variance parameters predicted less bias.

Results: Overall, we found that the absolute percent bias of the odds ratio estimated via PQL or QUAD increased as the PQL- and QUAD-estimated odds ratios became more discrepant, though results varied markedly depending on the characteristics of the dataset.

Conclusions: Given how markedly results varied depending on data set characteristics, specifying a rule above which indicated biased results proved impossible. This work suggests that comparing results from generalized linear mixed models estimated via PQL and QUAD is a worthwhile exercise for regression coefficients and variance components obtained via QUAD, in situations where PQL is known to give reasonable results.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1. Boxplot depicting the slopes from separate simple linear regressions for the effect of the absolute percent difference in OR_PQL and OR_QUAD on the absolute percent bias in OR_QUAD or OR_PQL, respectively, overall and by data generation parameters.
Median (interquartile range) of the estimated slope is the center of the box, box edges are the 25^th and 75^th percentile respectively, ends of the dashed lines are the 10^th and 90^th percentile, respectively.

Figure 2. Barplot depicting the proportion of scenarios in which the effect of the absolute percent difference in OR_PQL and OR_QUAD was a statistically significant predictor of the absolute percent bias in OR_QUAD or OR_PQL, respectively from separate simple linear regressions, overall and by data generation parameters.

Figure 3. Boxplot depicting the R² from separate simple linear regressions for the effect of the absolute percent difference in OR_PQL and OR_QUAD on the absolute percent bias in OR_QUAD or OR_PQL, respectively, overall and by data generation parameters.
Median (interquartile range) of the R² is the center of the box, box edges are the 25^th and 75^th percentile respectively, ends of the dashed lines are the 10^th and 90^th percentile, respectively.

Figure 4. Boxplot depicting the slopes from separate simple linear regressions for the effect of the absolute percent difference in σ_PQL and σ_QUAD on the absolute percent bias in σ_QUAD or σ_PQL, respectively, overall and by data generation parameters.
Median (interquartile range) of the estimated slope is the center of the box, box edges are the 25^th and 75^th percentile respectively, ends of the dashed lines are the 10^th and 90^th percentile, respectively.

Figure 5. Barplot depicting the proportion of scenarios in which the effect of the absolute percent difference in σ_PQL and σ_QUAD was a statistically significant predictor on the absolute percent bias in σ_QUAD or σ_PQL, respectively from separate simple linear regressions, overall and by data generation parameters.

Figure 6. Boxplot depicting the R² from separate simple linear regressions for the effect of the absolute percent difference in σ_PQL and σ_QUAD on the absolute percent bias in σ_QUAD or σ_PQL, respectively, overall and by data generation parameters.
Median (interquartile range) of the R² is the center of the box, box edges are the 25^th and 75^th percentile respectively, ends of the dashed lines are the 10^th and 90^th percentile, respectively.

See this image and copyright information in PMC

Cited by

Association between dietary inflammatory index and mental disorders using multilevel modeling with GLIMMIX.
Beiranvand R, Mansournia MA, Vahid F, Nejatisafa AA, Nedjat S. Beiranvand R, et al. Front Nutr. 2024 Jan 12;11:1288793. doi: 10.3389/fnut.2024.1288793. eCollection 2024. Front Nutr. 2024. PMID: 38282958 Free PMC article.
Intimate Partner Violence During COVID-19 Restrictions: A Study of 30 Countries From the I-SHARE Consortium.
Campbell L, Tan RKJ, Uhlich M, Francis JM, Mark K, Miall N, Eleuteri S, Gabster A, Shamu S, Plášilová L, Kemigisha E, Olumide A, Kosana P, Hurtado-Murillo F, Larsson EC, Cleeve A, Calvo González S, Perrotta G, Fernández Albamonte V, Blanco L, Schröder J, Adebayo A, Hendriks J, Saltis H, Marks M, Wu D, Morroni C, Esho T, Briken P, Hlatshwako TG, Ryan R, Farid NDN, Gomez Bravo R, Van de Velde S, Tucker JD; the ISHARE research consortium. Campbell L, et al. J Interpers Violence. 2023 Jun;38(11-12):7115-7142. doi: 10.1177/08862605221141865. Epub 2023 Jan 26. J Interpers Violence. 2023. PMID: 36703528 Free PMC article.
Evaluation of the Efficacy of Rotational Corrections for Standard-Fractionation Head and Neck Image-Guided Radiotherapy.
Kung JS, Tran WT, Poon I, Atenafu EG, Courneyea L, Higgins K, Enepekides D, Sahgal A, Chin L, Karam I. Kung JS, et al. Technol Cancer Res Treat. 2018 Jan-Dec;18:1533033819853824. doi: 10.1177/1533033819853824. Technol Cancer Res Treat. 2018. PMID: 31122178 Free PMC article. Clinical Trial.
A comparison of analytic approaches for individual patient data meta-analyses with binary outcomes.
Thomas D, Platt R, Benedetti A. Thomas D, et al. BMC Med Res Methodol. 2017 Feb 16;17(1):28. doi: 10.1186/s12874-017-0307-7. BMC Med Res Methodol. 2017. PMID: 28202011 Free PMC article.
Laplace approximation, penalized quasi-likelihood, and adaptive Gauss-Hermite quadrature for generalized linear mixed models: towards meta-analysis of binary outcome with sparse data.
Ju K, Lin L, Chu H, Cheng LL, Xu C. Ju K, et al. BMC Med Res Methodol. 2020 Jun 11;20(1):152. doi: 10.1186/s12874-020-01035-6. BMC Med Res Methodol. 2020. PMID: 32539721 Free PMC article.

See all "Cited by" articles

References

1. Molenberghs G, Verbeke G (2005) Models for Discrete Longitudinal Data. New York: Springer.
1. Diggle P, Heagerty P, Liang K-Y, Zeger SL (2002) Analysis of Longitudinal Data. Oxford: Oxford University Press.
1. Jang JY, Kang SK, Chung HK (1993) Biological exposure indices of organic solvents for Korean workers. International Archives of Occupational & Environmental Health 65: S219–S222 15. - PubMed
1. Neuhaus JM, Kalbfleisch JD, Hauck WW (1991) A comparison of cluster-specific and population average approaches for analyzing correlated binary data. International Statistical Review 59: 25–35.
1. Breslow N, Clayton D (1993) Approximate inference in generalized linear mixed models. J Am Stat Assoc 88: 9–25.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Generalized linear mixed models for binary data: are matching results from penalized quasi-likelihood and numerical integration less biased?

Affiliations

Generalized linear mixed models for binary data: are matching results from penalized quasi-likelihood and numerical integration less biased?

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical