A weighted U-statistic for genetic association analyses of sequencing data
- PMID: 25331574
- PMCID: PMC4236269
- DOI: 10.1002/gepi.21864
Abstract
With advancements in next-generation sequencing technology, a massive amount of sequencing data is being generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high dimensionality of sequencing data poses a great challenge for statistical analysis. Association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a Weighted U Sequencing test, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a nonparametric U-statistic, WU-SEQ makes no assumption about the underlying disease model or phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed the commonly used sequence kernel association test (SKAT) when the underlying assumptions were violated (e.g., when the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained performance comparable to that of SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL4 and very-low-density lipoprotein cholesterol.
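The abstract does not give the form of the statistic, so the following is only a minimal sketch of what a weighted U-statistic for a variant set can look like, not the authors' WU-SEQ implementation. It assumes a statistic of the form U = sum over i != j of K_ij * q_i * q_j, where q_i are rank-based normal scores of the phenotype (which makes the test insensitive to heavy tails) and K_ij is a genetic similarity that up-weights rare variants. The function names, the 1/sqrt(MAF(1-MAF)) weighting, the cross-product similarity, and the permutation p-value are all illustrative assumptions.

```python
import numpy as np
from statistics import NormalDist


def normal_scores(y):
    """Rank-based normal quantile scores; ties are broken by input order."""
    y = np.asarray(y, dtype=float)
    n = y.size
    ranks = np.argsort(np.argsort(y)) + 1  # ranks 1..n
    inv_cdf = NormalDist().inv_cdf
    return np.array([inv_cdf((r - 0.5) / n) for r in ranks])


def similarity_weights(G, maf_floor=1e-3):
    """Hypothetical weight matrix: frequency-weighted genotype cross-product.

    G is an (n_subjects, n_variants) matrix of minor-allele counts (0/1/2).
    Rarer variants receive larger weights via 1/sqrt(maf * (1 - maf)).
    """
    G = np.asarray(G, dtype=float)
    maf = np.clip(G.mean(axis=0) / 2.0, maf_floor, 1.0 - maf_floor)
    Gw = G / np.sqrt(maf * (1.0 - maf))
    return Gw @ Gw.T / G.shape[1]


def weighted_u(K, q):
    """Sum of K[i, j] * q[i] * q[j] over all pairs with i != j."""
    return float(q @ K @ q - np.sum(np.diag(K) * q ** 2))


def wu_permutation_test(G, y, n_perm=999, seed=0):
    """Observed statistic and a one-sided permutation p-value (assumed scheme)."""
    rng = np.random.default_rng(seed)
    K = similarity_weights(G)
    q = normal_scores(y)
    u_obs = weighted_u(K, q)
    u_null = np.array(
        [weighted_u(K, rng.permutation(q)) for _ in range(n_perm)]
    )
    p_value = (1 + np.sum(u_null >= u_obs)) / (1 + n_perm)
    return u_obs, float(p_value)


if __name__ == "__main__":
    # Simulated example: 200 subjects, 30 rare variants, heavy-tailed noise.
    rng = np.random.default_rng(1)
    G = rng.binomial(2, 0.02, size=(200, 30))
    y = 0.8 * G[:, :5].sum(axis=1) + rng.standard_t(df=2, size=200)
    print(wu_permutation_test(G, y, n_perm=499))
```

Permutation is used here only because it needs no distributional theory; it is slow for genome-wide scans, where an asymptotic null distribution would normally be preferred.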
Keywords: next-generation sequencing; rare variants; weighted U-statistic.
© 2014 WILEY PERIODICALS, INC.