A resource-efficient tool for mixed model association analysis of large-scale data

Longda Jiang^#¹, Zhili Zheng^#^{1

2}, Ting Qi¹, Kathryn E Kemper¹, Naomi R Wray^{1

3}, Peter M Visscher¹, Jian Yang^{4

5}

Affiliations

¹ Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia.
² Institute for Advanced Research, Wenzhou Medical University, Wenzhou, Zhejiang, China.
³ Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia.
⁴ Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia. jian.yang.qt@gmail.com.
⁵ Institute for Advanced Research, Wenzhou Medical University, Wenzhou, Zhejiang, China. jian.yang.qt@gmail.com.

^# Contributed equally.

PMID: 31768069
DOI: 10.1038/s41588-019-0530-8

A resource-efficient tool for mixed model association analysis of large-scale data

Longda Jiang et al. Nat Genet. 2019 Dec.

. 2019 Dec;51(12):1749-1755.

doi: 10.1038/s41588-019-0530-8. Epub 2019 Nov 25.

Authors

Longda Jiang^#¹, Zhili Zheng^#^{1

2}, Ting Qi¹, Kathryn E Kemper¹, Naomi R Wray^{1

3}, Peter M Visscher¹, Jian Yang^{4

5}

Affiliations

¹ Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia.
² Institute for Advanced Research, Wenzhou Medical University, Wenzhou, Zhejiang, China.
³ Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia.
⁴ Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia. jian.yang.qt@gmail.com.
⁵ Institute for Advanced Research, Wenzhou Medical University, Wenzhou, Zhejiang, China. jian.yang.qt@gmail.com.

^# Contributed equally.

PMID: 31768069
DOI: 10.1038/s41588-019-0530-8

Abstract

The genome-wide association study (GWAS) has been widely used as an experimental design to detect associations between genetic variants and a phenotype. Two major confounding factors, population stratification and relatedness, could potentially lead to inflated GWAS test statistics and hence to spurious associations. Mixed linear model (MLM)-based approaches can be used to account for sample structure. However, genome-wide association (GWA) analyses in biobank samples such as the UK Biobank (UKB) often exceed the capability of most existing MLM-based tools especially if the number of traits is large. Here, we develop an MLM-based tool (fastGWA) that controls for population stratification by principal components and for relatedness by a sparse genetic relationship matrix for GWA analyses of biobank-scale data. We demonstrate by extensive simulations that fastGWA is reliable, robust and highly resource-efficient. We then apply fastGWA to 2,173 traits on array-genotyped and imputed samples from 456,422 individuals and to 2,048 traits on whole-exome-sequenced samples from 46,191 individuals in the UKB.

PubMed Disclaimer

References

1. Visscher, P. M. et al. 10 Years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101, 5–22 (2017). - PubMed - PMC
1. Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019). - PubMed
1. Klein, R. J. et al. Complement factor H polymorphism in age-related macular degeneration. Science 308, 385–389 (2005). - PubMed - PMC
1. DeWan, A. et al. HTRA1 promoter polymorphism in wet age-related macular degeneration. Science 314, 989–992 (2006). - PubMed
1. Burton, P. R. et al. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
Other Literature Sources
- H1 Connect - Access expert opinions and insights on biomedical research.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A resource-efficient tool for mixed model association analysis of large-scale data

Affiliations

A resource-efficient tool for mixed model association analysis of large-scale data

Authors

Affiliations

Abstract

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources