Integrative functional linear model for genome-wide association studies with multiple traits
- PMID: 33040145
- PMCID: PMC9007435
- DOI: 10.1093/biostatistics/kxaa043
Integrative functional linear model for genome-wide association studies with multiple traits
Abstract
In recent biomedical research, genome-wide association studies (GWAS) have demonstrated great success in investigating the genetic architecture of human diseases. For many complex diseases, multiple correlated traits have been collected. However, most of the existing GWAS are still limited because they analyze each trait separately without considering their correlations and suffer from a lack of sufficient information. Moreover, the high dimensionality of single nucleotide polymorphism (SNP) data still poses tremendous challenges to statistical methods, in both theoretical and practical aspects. In this article, we innovatively propose an integrative functional linear model for GWAS with multiple traits. This study is the first to approximate SNPs as functional objects in a joint model of multiple traits with penalization techniques. It effectively accommodates the high dimensionality of SNPs and correlations among multiple traits to facilitate information borrowing. Our extensive simulation studies demonstrate the satisfactory performance of the proposed method in the identification and estimation of disease-associated genetic variants, compared to four alternatives. The analysis of type 2 diabetes data leads to biologically meaningful findings with good prediction accuracy and selection stability.
Keywords: Functional data analysis; Genome-wide association studies; Joint analysis of multiple traits; Penalization.
© The Author 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Figures




Similar articles
-
A novel association test for multiple secondary phenotypes from a case-control GWAS.Genet Epidemiol. 2017 Jul;41(5):413-426. doi: 10.1002/gepi.22045. Epub 2017 Apr 10. Genet Epidemiol. 2017. PMID: 28393390 Free PMC article. Clinical Trial.
-
Integrative functional logistic regression model for genome-wide association studies.Comput Biol Med. 2025 Mar;187:109766. doi: 10.1016/j.compbiomed.2025.109766. Epub 2025 Feb 6. Comput Biol Med. 2025. PMID: 39919666
-
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21. BMC Genomics. 2015. PMID: 25708662 Free PMC article.
-
Methods for meta-analysis of multiple traits using GWAS summary statistics.Genet Epidemiol. 2018 Mar;42(2):134-145. doi: 10.1002/gepi.22105. Epub 2017 Dec 10. Genet Epidemiol. 2018. PMID: 29226385 Free PMC article.
-
Comprehensive identification of pleiotropic loci for body fat distribution using the NHGRI-EBI Catalog of published genome-wide association studies.Obes Rev. 2019 Mar;20(3):385-406. doi: 10.1111/obr.12806. Epub 2018 Nov 22. Obes Rev. 2019. PMID: 30565845 Review.
Cited by
-
Simulation Research on the Methods of Multi-Gene Region Association Analysis Based on a Functional Linear Model.Genes (Basel). 2022 Mar 2;13(3):455. doi: 10.3390/genes13030455. Genes (Basel). 2022. PMID: 35328009 Free PMC article.
-
Gene Association Analysis of Quantitative Trait Based on Functional Linear Regression Model with Local Sparse Estimator.Genes (Basel). 2023 Mar 30;14(4):834. doi: 10.3390/genes14040834. Genes (Basel). 2023. PMID: 37107592 Free PMC article.
-
Gene Region Association Analysis of Longitudinal Quantitative Traits Based on a Function-On-Function Regression Model.Front Genet. 2022 Feb 21;13:781740. doi: 10.3389/fgene.2022.781740. eCollection 2022. Front Genet. 2022. PMID: 35265102 Free PMC article.
-
Bi-level structured functional analysis for genome-wide association studies.Biometrics. 2023 Dec;79(4):3359-3373. doi: 10.1111/biom.13871. Epub 2023 May 7. Biometrics. 2023. PMID: 37098961 Free PMC article.
-
Prior information-assisted integrative analysis of multiple datasets.Bioinformatics. 2023 Aug 1;39(8):btad452. doi: 10.1093/bioinformatics/btad452. Bioinformatics. 2023. PMID: 37490475 Free PMC article.
References
-
- Chiu, C., Zhang, B., Wang, S., Shao, J. Lakhal-Chaieb, M.L., Cook, R.J., Wilson, A.F., Bailey-Wilson J.E., Xiong, M. and Fan, R. (2019). Gene-based association analysis of survival traits via functional regression-based mixed effect Cox models for related samples. Genetic Epidemiology 43, 952–965. - PMC - PubMed
-
- Cornelis, M., Agrawal, A., Cole, J., Hansel, N.Barnes K.C., Beaty, T.H., Bennett, S.N., Bierut. L.J., Boerwinkle, E., Doheny, K.F. and others. (2010). The Gene, Environment Association Studies Consortium (Geneva): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions. Genetic Epidemiology 34, 364–372. - PMC - PubMed
-
- Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 1348–1360.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical