Mixed-model association for biobank-scale datasets
- PMID: 29892013
- PMCID: PMC6309610
- DOI: 10.1038/s41588-018-0144-6
Abstract
Biobank-based genome-wide association studies are enabling exciting insights in complex trait genetics, but much uncertainty remains over best practices for optimizing statistical power and computational efficiency in GWAS while controlling confounders. Here, we introduce a much faster version of our BOLT-LMM Bayesian mixed model association method—capable of running analyses of the full UK Biobank cohort in a few days on a single compute node—and show that it produces highly powered, robust test statistics when run on all 459K European samples (retaining related individuals). When used to conduct a GWAS for height in UK Biobank, BOLT-LMM achieved power equivalent to linear regression on 650K samples—a 93% increase in effective sample size versus the common practice of analyzing unrelated British samples using linear regression (UK Biobank documentation; Bycroft et al. bioRxiv). Across a broader set of 23 highly heritable traits, the total number of independent GWAS loci detected increased from 5,839 to 10,759, an 84% increase. We recommend the use of BOLT-LMM (retaining related individuals) for biobank-scale analyses, and we have publicly released BOLT-LMM summary association statistics for the 23 traits analyzed as a resource for all researchers.
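The percentages quoted in the abstract can be checked with a few lines of arithmetic. The sketch below is only illustrative: the helper `percent_increase` and the variable names are hypothetical, and the baseline sample size is not stated in the abstract but is back-calculated here on the assumption that the 93% figure is a simple ratio of effective sample sizes.

```python
def percent_increase(before: float, after: float) -> float:
    """Relative increase from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# Independent GWAS loci across the 23 traits, as reported in the abstract.
loci_before, loci_after = 5_839, 10_759
loci_gain = percent_increase(loci_before, loci_after)

# Effective sample size for height (650K) and the reported 93% increase.
# Assumption: effective_n = baseline_n * (1 + 0.93), so the baseline
# (unrelated British samples, linear regression) is implied, not quoted.
effective_n = 650_000
reported_gain = 0.93
implied_baseline_n = effective_n / (1.0 + reported_gain)

print(f"Increase in loci: {loci_gain:.1f}%")
print(f"Implied baseline sample size: {implied_baseline_n:,.0f}")
```

Under these assumptions the loci count rises by roughly 84%, matching the abstract, and the implied baseline is on the order of 337K samples.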
References
- Yu J et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nature Genetics 38, 203–208 (2006).
- Bycroft C et al. Genome-wide genetic data on ~500,000 UK Biobank participants. bioRxiv (2017).