Penalized linear mixed models for structured genetic data
- PMID: 33998038
- DOI: 10.1002/gepi.22384
Penalized linear mixed models for structured genetic data
Abstract
Many genetic studies that aim to identify genetic variants associated with complex phenotypes are subject to unobserved confounding factors arising from environmental heterogeneity. This poses a challenge to detecting associations of interest and is known to induce spurious associations when left unaccounted for. Penalized linear mixed models (LMMs) are an attractive method to correct for unobserved confounding. These methods correct for varying levels of relatedness and population structure by modeling it as a random effect with a covariance structure estimated from observed genetic data. Despite an extensive literature on penalized regression and LMMs separately, the two are rarely discussed together. The aim of this review is to do so while examining the statistical properties of penalized LMMs in the genetic association setting. Specifically, the ability of penalized LMMs to accurately estimate genetic effects in the presence of environmental confounding has not been well studied. To clarify the important yet subtle distinction between population structure and environmental heterogeneity, we present a detailed review of relevant concepts and methods. In addition, we evaluate the performance of penalized LMMs and competing methods in terms of estimation and selection accuracy in the presence of a number of confounding structures.
Keywords: confounding; lasso; linear mixed model; penalized regression; population stratification.
© 2021 Wiley Periodicals LLC.
References
REFERENCES
-
- Amin, N. , Van Duijn, C. M. , & Aulchenko, Y. S. (2007). A genomic background based method for association analysis in related individuals. PLOS One, 2, e1274.
-
- Astle, W. , & Balding, D. J. (2009). Population structure and cryptic relatedness in genetic association studies. Statistical Science, 24, 451-471.
-
- Bacanu, S.-A. , Devlin, B. , & Roeder, K. (2000). The power of genomic control. The American Journal of Human Genetics, 66, 1933-1944.
-
- Barton, N. , Hermisson, J. , & Nordborg, M. (2019). Population genetics: Why structure matters. Elife, 8, e45380.
-
- Bhatnagar, S. R. , Yang, Y. , Lu, T. , Schurr, E. , Loredo-Osti, J. , Forest, M. , Oualkacha, K. , & Greenwood, C. M. (2020). Simultaneous SNP selection and adjustment for population structure in high dimensional prediction models. PLOS Genetics, 16, e1008766.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
