A generalized linear mixed model association tool for biobank-scale data
- PMID: 34737426
- DOI: 10.1038/s41588-021-00954-4
A generalized linear mixed model association tool for biobank-scale data
Abstract
Compared with linear mixed model-based genome-wide association (GWA) methods, generalized linear mixed model (GLMM)-based methods have better statistical properties when applied to binary traits but are computationally much slower. In the present study, leveraging efficient sparse matrix-based algorithms, we developed a GLMM-based GWA tool, fastGWA-GLMM, that is severalfold to orders of magnitude faster than the state-of-the-art tools when applied to the UK Biobank (UKB) data and scalable to cohorts with millions of individuals. We show by simulation that the fastGWA-GLMM test statistics of both common and rare variants are well calibrated under the null, even for traits with extreme case-control ratios. We applied fastGWA-GLMM to the UKB data of 456,348 individuals, 11,842,647 variants and 2,989 binary traits (full summary statistics available at http://fastgwa.info/ukbimpbin ), and identified 259 rare variants associated with 75 traits, demonstrating the use of imputed genotype data in a large cohort to discover rare variants for binary complex traits.
© 2021. The Author(s), under exclusive licence to Springer Nature America, Inc.
Similar articles
-
A resource-efficient tool for mixed model association analysis of large-scale data.Nat Genet. 2019 Dec;51(12):1749-1755. doi: 10.1038/s41588-019-0530-8. Epub 2019 Nov 25. Nat Genet. 2019. PMID: 31768069
-
Evaluation of GENESIS, SAIGE, REGENIE and fastGWA-GLMM for genome-wide association studies of binary traits in correlated data.Front Genet. 2022 Sep 23;13:897210. doi: 10.3389/fgene.2022.897210. eCollection 2022. Front Genet. 2022. PMID: 36212134 Free PMC article.
-
A scalable variational inference approach for increased mixed-model association power.Nat Genet. 2025 Feb;57(2):461-468. doi: 10.1038/s41588-024-02044-7. Epub 2025 Jan 9. Nat Genet. 2025. PMID: 39789286 Free PMC article.
-
Dissecting the genetics of complex traits using summary association statistics.Nat Rev Genet. 2017 Feb;18(2):117-127. doi: 10.1038/nrg.2016.142. Epub 2016 Nov 14. Nat Rev Genet. 2017. PMID: 27840428 Free PMC article. Review.
-
Formalising recall by genotype as an efficient approach to detailed phenotyping and causal inference.Nat Commun. 2018 Feb 19;9(1):711. doi: 10.1038/s41467-018-03109-y. Nat Commun. 2018. PMID: 29459775 Free PMC article. Review.
Cited by
-
Plasma homocysteine levels and risk of congestive heart failure or cardiomyopathy: A Mendelian randomization study.Front Cardiovasc Med. 2023 Jan 26;10:1030257. doi: 10.3389/fcvm.2023.1030257. eCollection 2023. Front Cardiovasc Med. 2023. PMID: 36776266 Free PMC article.
-
A cross-trait study of lung cancer and its related respiratory diseases based on large-scale exome sequencing population.Transl Lung Cancer Res. 2024 Mar 29;13(3):512-525. doi: 10.21037/tlcr-24-4. Epub 2024 Mar 14. Transl Lung Cancer Res. 2024. PMID: 38601445 Free PMC article.
-
Investigating the causal association of postpartum depression with cerebrovascular diseases and cognitive impairment: a Mendelian randomization study.Front Psychiatry. 2023 Jun 22;14:1196055. doi: 10.3389/fpsyt.2023.1196055. eCollection 2023. Front Psychiatry. 2023. PMID: 37426101 Free PMC article.
-
Exploring the causal relationship of gut microbiota in nonunion: a Mendelian randomization analysis mediated by immune cell.Front Microbiol. 2024 Dec 16;15:1447877. doi: 10.3389/fmicb.2024.1447877. eCollection 2024. Front Microbiol. 2024. PMID: 39736989 Free PMC article.
-
BG2: Bayesian variable selection in generalized linear mixed models with nonlocal priors for non-Gaussian GWAS data.BMC Bioinformatics. 2023 Sep 15;24(1):343. doi: 10.1186/s12859-023-05468-w. BMC Bioinformatics. 2023. PMID: 37715138 Free PMC article.
References
-
- Tin, A. et al. Target genes, variants, tissues and transcriptional pathways influencing human serum urate levels. Nat. Genet. 51, 1459–1474 (2019).
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources