A scalable variational inference approach for increased mixed-model association power
- PMID: 39789286
- PMCID: PMC11821521
- DOI: 10.1038/s41588-024-02044-7
A scalable variational inference approach for increased mixed-model association power
Abstract
The rapid growth of modern biobanks is creating new opportunities for large-scale genome-wide association studies (GWASs) and the analysis of complex traits. However, performing GWASs on millions of samples often leads to trade-offs between computational efficiency and statistical power, reducing the benefits of large-scale data collection efforts. We developed Quickdraws, a method that increases association power in quantitative and binary traits without sacrificing computational efficiency, leveraging a spike-and-slab prior on variant effects, stochastic variational inference and graphics processing unit acceleration. We applied Quickdraws to 79 quantitative and 50 binary traits in 405,088 UK Biobank samples, identifying 4.97% and 3.25% more associations than REGENIE and 22.71% and 7.07% more than FastGWA. Quickdraws had costs comparable to REGENIE, FastGWA and SAIGE on the UK Biobank Research Analysis Platform service, while being substantially faster than BOLT-LMM. These results highlight the promise of leveraging machine learning techniques for scalable GWASs without sacrificing power or robustness.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: The authors declare no competing interests.
Figures







Similar articles
-
A generalized linear mixed model association tool for biobank-scale data.Nat Genet. 2021 Nov;53(11):1616-1621. doi: 10.1038/s41588-021-00954-4. Epub 2021 Nov 4. Nat Genet. 2021. PMID: 34737426
-
Evaluation of GENESIS, SAIGE, REGENIE and fastGWA-GLMM for genome-wide association studies of binary traits in correlated data.Front Genet. 2022 Sep 23;13:897210. doi: 10.3389/fgene.2022.897210. eCollection 2022. Front Genet. 2022. PMID: 36212134 Free PMC article.
-
A resource-efficient tool for mixed model association analysis of large-scale data.Nat Genet. 2019 Dec;51(12):1749-1755. doi: 10.1038/s41588-019-0530-8. Epub 2019 Nov 25. Nat Genet. 2019. PMID: 31768069
-
The UK Biobank: A Shining Example of Genome-Wide Association Study Science with the Power to Detect the Murky Complications of Real-World Epidemiology.Annu Rev Genomics Hum Genet. 2022 Aug 31;23:569-589. doi: 10.1146/annurev-genom-121321-093606. Epub 2022 May 4. Annu Rev Genomics Hum Genet. 2022. PMID: 35508184 Review.
-
Review on GPU accelerated methods for genome-wide SNP-SNP interactions.Mol Genet Genomics. 2024 Dec 29;300(1):10. doi: 10.1007/s00438-024-02214-6. Mol Genet Genomics. 2024. PMID: 39738695 Review.
Cited by
-
Gene-environment interactions contribute to blood pressure variation across global populations.medRxiv [Preprint]. 2025 Jul 3:2025.07.02.25330727. doi: 10.1101/2025.07.02.25330727. medRxiv. 2025. PMID: 40630594 Free PMC article. Preprint.
References
MeSH terms
Grants and funding
- WT_/Wellcome Trust/United Kingdom
- 108861/Z/15/Z/Wellcome Trust (Wellcome)
- 850869/EC | EU Framework Programme for Research and Innovation H2020 | H2020 Priority Excellent Science | H2020 European Research Council (H2020 Excellent Science - European Research Council)
- EP/L016044/1/RCUK | Engineering and Physical Sciences Research Council (EPSRC)
LinkOut - more resources
Full Text Sources