. 2013 Nov;37(7):726-42.

doi: 10.1002/gepi.21757.

Functional linear models for association analysis of quantitative traits

Ruzong Fan¹, Yifan Wang, James L Mills, Alexander F Wilson, Joan E Bailey-Wilson, Momiao Xiong

Affiliations

Affiliation

¹ Biostatistics and Bioinformatics Branch, Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Rockville, Maryland, United States of America.

PMID: 24130119
PMCID: PMC4163942
DOI: 10.1002/gepi.21757

Functional linear models for association analysis of quantitative traits

Ruzong Fan et al. Genet Epidemiol. 2013 Nov.

. 2013 Nov;37(7):726-42.

doi: 10.1002/gepi.21757.

Authors

Ruzong Fan¹, Yifan Wang, James L Mills, Alexander F Wilson, Joan E Bailey-Wilson, Momiao Xiong

Affiliation

¹ Biostatistics and Bioinformatics Branch, Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Rockville, Maryland, United States of America.

PMID: 24130119
PMCID: PMC4163942
DOI: 10.1002/gepi.21757

Abstract

Functional linear models are developed in this paper for testing associations between quantitative traits and genetic variants, which can be rare variants or common variants or the combination of the two. By treating multiple genetic variants of an individual in a human population as a realization of a stochastic process, the genome of an individual in a chromosome region is a continuum of sequence data rather than discrete observations. The genome of an individual is viewed as a stochastic function that contains both linkage and linkage disequilibrium (LD) information of the genetic markers. By using techniques of functional data analysis, both fixed and mixed effect functional linear models are built to test the association between quantitative traits and genetic variants adjusting for covariates. After extensive simulation analysis, it is shown that the F-distributed tests of the proposed fixed effect functional linear models have higher power than that of sequence kernel association test (SKAT) and its optimal unified test (SKAT-O) for three scenarios in most cases: (1) the causal variants are all rare, (2) the causal variants are both rare and common, and (3) the causal variants are common. The superior performance of the fixed effect functional linear models is most likely due to its optimal utilization of both genetic linkage and LD information of multiple genetic variants in a genome and similarity among different individuals, while SKAT and SKAT-O only model the similarities and pairwise LD but do not model linkage and higher order LD information sufficiently. In addition, the proposed fixed effect models generate accurate type I error rates in simulation studies. We also show that the functional kernel score tests of the proposed mixed effect functional linear models are preferable in candidate gene analysis and small sample problems. The methods are applied to analyze three biochemical traits in data from the Trinity Students Study.

Keywords: association mapping; common variants; complex traits; functional data analysis; quantitative trait loci; rare variants.

PubMed Disclaimer

Figures

**Figure 1**
The empirical power of the F-test statistics of the fixed effect models (3), (4), and (6), and SKAT and SKAT-O using both rare and common variants in analysis, when causal variants were both rare and common, and all causal variants had positive effects. The simulations were based on COSI sequence data.

**Figure 2**
The empirical power of the F-test statistics of the fixed effect models (3), (4), and (6), and SKAT and SKAT-O using both rare and common variants in analysis, when causal variants were both rare and common, and 20%/80% causal variants had negative/positive effects. The simulations were based on COSI sequence data.

**Figure 3**
The empirical power of the F-test statistics of the fixed effect models (3), (4), and (6), and SKAT and SKAT-O using both rare and common variants in analysis, when causal variants were both rare and common, and 50%/50% causal variants had negative/positive effects. The simulations were based on COSI sequence data.

**Figure 4**
The empirical power of the F-test statistics of the fixed effect models (3), (4), and (6), and SKAT and SKAT-O using rare variants in analysis, when causal variants were only rare, and all causal variants had positive effects. The simulations were based on COSI sequence data.

**Figure 5**
The empirical power of the F-test statistics of the fixed effect models (3), (4), and (6), and SKAT and SKAT-O using rare variants in analysis, when causal variants were only rare, and 20%/80% causal variants had negative/positive effects. The simulations were based on COSI sequence data.

**Figure 6**
The empirical power of the F-test statistics of the fixed effect models (3), (4), and (6), and SKAT and SKAT-O using rare variants in analysis, when causal variants were only rare, and 50%/50% causal variants had negative/positive effects. The simulations were based on COSI sequence data.

See this image and copyright information in PMC

Cited by

A Multi-Marker Test for Analyzing Paired Genetic Data in Transplantation.
Arthur VL, Li Z, Cao R, Oetting WS, Israni AK, Jacobson PA, Ritchie MD, Guan W, Chen J. Arthur VL, et al. Front Genet. 2021 Oct 13;12:745773. doi: 10.3389/fgene.2021.745773. eCollection 2021. Front Genet. 2021. PMID: 34721531 Free PMC article. Review.
Meta-analysis of Complex Diseases at Gene Level with Generalized Functional Linear Models.
Fan R, Wang Y, Chiu CY, Chen W, Ren H, Li Y, Boehnke M, Amos CI, Moore JH, Xiong M. Fan R, et al. Genetics. 2016 Feb;202(2):457-70. doi: 10.1534/genetics.115.180869. Epub 2015 Dec 29. Genetics. 2016. PMID: 26715663 Free PMC article.
Generalized functional linear models for gene-based case-control association studies.
Fan R, Wang Y, Mills JL, Carter TC, Lobach I, Wilson AF, Bailey-Wilson JE, Weeks DE, Xiong M. Fan R, et al. Genet Epidemiol. 2014 Nov;38(7):622-637. doi: 10.1002/gepi.21840. Epub 2014 Sep 9. Genet Epidemiol. 2014. PMID: 25203683 Free PMC article.
A Comparison Study of Fixed and Mixed Effect Models for Gene Level Association Studies of Complex Traits.
Fan R, Chiu CY, Jung J, Weeks DE, Wilson AF, Bailey-Wilson JE, Amos CI, Chen Z, Mills JL, Xiong M. Fan R, et al. Genet Epidemiol. 2016 Dec;40(8):702-721. doi: 10.1002/gepi.21984. Epub 2016 Jul 4. Genet Epidemiol. 2016. PMID: 27374056 Free PMC article.
sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
Belonogova NM, Svishcheva GR, Kirichenko AV, Zorkoltseva IV, Tsepilov YA, Axenovich TI. Belonogova NM, et al. PLoS Comput Biol. 2022 Jun 2;18(6):e1010172. doi: 10.1371/journal.pcbi.1010172. eCollection 2022 Jun. PLoS Comput Biol. 2022. PMID: 35653402 Free PMC article.

See all "Cited by" articles

References

1. Bansal V, Harismendy O, Tewhey R, Murray SS, Schork NJ, Topol EJ, Frazer KA. Accurate detection and genotyping of SNPs utilizing population sequencing data. Genome Res. 2010a;20:537–545. - PMC - PubMed
1. Bansal V, Libiger O, Torkamani A, Schork NJ. Statistical analysis strategies for association studies involving rare variants. Nat Rev Genet. 2010b;11:773–785. - PMC - PubMed
1. Barnett IJ, Lee S, Lin X. Detecting rare variant effects using extreme phenotype sampling in sequencing association studies. Genet Epidemiol. 2013;37:142–151. - PMC - PubMed
1. Clarke J, Wu HC, Jayasinghe L, Patel A, Reid S, Bayley H. Continuous base identification for single-molecule nanopore DNA sequencing. Nat Nanotechnol. 2009;4:265–270. - PubMed
1. Davies R. The distribution of a linear combination of chi-square random variables. J R Stat Soc Ser C Appl Stat. 1980;29:323–333.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Functional linear models for association analysis of quantitative traits

Affiliation

Functional linear models for association analysis of quantitative traits

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials