Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Jul;45(5):427-444.
doi: 10.1002/gepi.22384. Epub 2021 May 16.

Penalized linear mixed models for structured genetic data

Affiliations
Review

Penalized linear mixed models for structured genetic data

Anna C Reisetter et al. Genet Epidemiol. 2021 Jul.

Abstract

Many genetic studies that aim to identify genetic variants associated with complex phenotypes are subject to unobserved confounding factors arising from environmental heterogeneity. This poses a challenge to detecting associations of interest and is known to induce spurious associations when left unaccounted for. Penalized linear mixed models (LMMs) are an attractive method to correct for unobserved confounding. These methods correct for varying levels of relatedness and population structure by modeling it as a random effect with a covariance structure estimated from observed genetic data. Despite an extensive literature on penalized regression and LMMs separately, the two are rarely discussed together. The aim of this review is to do so while examining the statistical properties of penalized LMMs in the genetic association setting. Specifically, the ability of penalized LMMs to accurately estimate genetic effects in the presence of environmental confounding has not been well studied. To clarify the important yet subtle distinction between population structure and environmental heterogeneity, we present a detailed review of relevant concepts and methods. In addition, we evaluate the performance of penalized LMMs and competing methods in terms of estimation and selection accuracy in the presence of a number of confounding structures.

Keywords: confounding; lasso; linear mixed model; penalized regression; population stratification.

PubMed Disclaimer

References

REFERENCES

    1. Amin, N. , Van Duijn, C. M. , & Aulchenko, Y. S. (2007). A genomic background based method for association analysis in related individuals. PLOS One, 2, e1274.
    1. Astle, W. , & Balding, D. J. (2009). Population structure and cryptic relatedness in genetic association studies. Statistical Science, 24, 451-471.
    1. Bacanu, S.-A. , Devlin, B. , & Roeder, K. (2000). The power of genomic control. The American Journal of Human Genetics, 66, 1933-1944.
    1. Barton, N. , Hermisson, J. , & Nordborg, M. (2019). Population genetics: Why structure matters. Elife, 8, e45380.
    1. Bhatnagar, S. R. , Yang, Y. , Lu, T. , Schurr, E. , Loredo-Osti, J. , Forest, M. , Oualkacha, K. , & Greenwood, C. M. (2020). Simultaneous SNP selection and adjustment for population structure in high dimensional prediction models. PLOS Genetics, 16, e1008766.

LinkOut - more resources