. 2014 Apr;38(3):242-53.

doi: 10.1002/gepi.21790. Epub 2014 Jan 30.

A generalized genetic random field method for the genetic association analysis of sequencing data

Ming Li¹, Zihuai He, Min Zhang, Xiaowei Zhan, Changshuai Wei, Robert C Elston, Qing Lu

Affiliations

PMID: 24482034
PMCID: PMC5241166
DOI: 10.1002/gepi.21790

A generalized genetic random field method for the genetic association analysis of sequencing data

Ming Li et al. Genet Epidemiol. 2014 Apr.

. 2014 Apr;38(3):242-53.

doi: 10.1002/gepi.21790. Epub 2014 Jan 30.

Authors

Ming Li¹, Zihuai He, Min Zhang, Xiaowei Zhan, Changshuai Wei, Robert C Elston, Qing Lu

Affiliation

¹ Division of Biostatistics, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, Arkansas, United States of America.

PMID: 24482034
PMCID: PMC5241166
DOI: 10.1002/gepi.21790

Abstract

With the advance of high-throughput sequencing technologies, it has become feasible to investigate the influence of the entire spectrum of sequencing variations on complex human diseases. Although association studies utilizing the new sequencing technologies hold great promise to unravel novel genetic variants, especially rare genetic variants that contribute to human diseases, the statistical analysis of high-dimensional sequencing data remains a challenge. Advanced analytical methods are in great need to facilitate high-dimensional sequencing data analyses. In this article, we propose a generalized genetic random field (GGRF) method for association analyses of sequencing data. Like other similarity-based methods (e.g., SIMreg and SKAT), the new method has the advantages of avoiding the need to specify thresholds for rare variants and allowing for testing multiple variants acting in different directions and magnitude of effects. The method is built on the generalized estimating equation framework and thus accommodates a variety of disease phenotypes (e.g., quantitative and binary phenotypes). Moreover, it has a nice asymptotic property, and can be applied to small-scale sequencing data without need for small-sample adjustment. Through simulations, we demonstrate that the proposed GGRF attains an improved or comparable power over a commonly used method, SKAT, under various disease scenarios, especially when rare variants play a significant role in disease etiology. We further illustrate GGRF with an application to a real dataset from the Dallas Heart Study. By using GGRF, we were able to detect the association of two candidate genes, ANGPTL3 and ANGPTL4, with serum triglyceride.

Keywords: generalized estimating equation; rare variants; small-scale sequencing studies.

PubMed Disclaimer

Figures

**Figure B1**
Type I error and Power of GGRF, SKAT, and Burden test with decreasing ratio of casual variants/noise variants. Left: Quantitative Phenotypes, Right: Binary Phenotypes; T1E: Type I Error; 1 Direction: one-direction of effect sizes; Bidirection: bidirection of effect sizes.

**Figure 1**
Distribution of the minor allele frequencies of 508 sequence variants on chromosome 22 in exome sequencing data from the 1,000 Genome Project.

**Figure 2**
Shape of four types of weight functions used in the simulations. Maximum weight at MAF of 0.07% was rescaled to be 1 for each weight function. The scaling does not change the relative contribution of variants.

**Figure 3**
Type I error and Power of GGRF and SKAT on using four SNP-specific weights under four disease models. Left: Quantitative Phenotypes, Right: Binary Phenotypes; T1E: Type I Error; S1–S4: power under various disease scenarios. S1: effect sizes of causal variants are all equal; S2: effect sizes of causal variants are proportional to BETA weights; S3: effect sizes of causal variants are proportional to WSS weights; S4: effect sizes of causal variants are proportional to LOG weights.

**Figure 4**
Type I error and Power of GGRF and SKAT with decreasing ratio of casual variants/noise variants. Left: Quantitative Phenotypes, Right: Binary Phenotypes; T1E: Type I Error; S1–S4: power under various disease scenarios. S1: effect sizes of causal variants are all equal; S2: effect sizes of causal variants are proportional to BETA weights; S3: effect sizes of causal variants are proportional to WSS weights; S4: effect sizes of causal variants are proportional to LOG weights.

**Figure 5**
Type I error and power of GGRF/SKAT with various similarity-metrics/kernel-metrics. Top left: type I error for quantitative phenotypes; Bottom left: power for quantitative phenotypes. Top right: type I error for binary phenotypes; Bottom right: power for binary phenotypes. ADJ: bootstrap adjustment for SKAT, only available with binary phenotypes, linear kernel, and BETA weight.

**Figure 6**
Distribution of minor allele frequencies in *ANGPTL*3, *ANGPTL*4, *ANGPTL*5, and *ANGPTL*6 genes in 2,658 subjects from the DHS sequencing data.

See this image and copyright information in PMC

Cited by

Random field modeling of multi-trait multi-locus association for detecting methylation quantitative trait loci.
Lyu C, Huang M, Liu N, Chen Z, Lupo PJ, Tycko B, Witte JS, Hobbs CA, Li M. Lyu C, et al. Bioinformatics. 2022 Aug 10;38(16):3853-3862. doi: 10.1093/bioinformatics/btac443. Bioinformatics. 2022. PMID: 35781319 Free PMC article.
A gene-based association test of interactions for maternal-fetal genotypes identifies genes associated with nonsyndromic congenital heart defects.
Huang M, Lyu C, Liu N, Nembhard WN, Witte JS, Hobbs CA, Li M; National Birth Defects Prevention Study. Huang M, et al. Genet Epidemiol. 2023 Oct;47(7):475-495. doi: 10.1002/gepi.22533. Epub 2023 Jun 21. Genet Epidemiol. 2023. PMID: 37341229 Free PMC article.
Detecting methylation quantitative trait loci using a methylation random field method.
Lyu C, Huang M, Liu N, Chen Z, Lupo PJ, Tycko B, Witte JS, Hobbs CA, Li M. Lyu C, et al. Brief Bioinform. 2021 Nov 5;22(6):bbab323. doi: 10.1093/bib/bbab323. Brief Bioinform. 2021. PMID: 34414410 Free PMC article.
A conditional autoregressive model for genetic association analysis accounting for genetic heterogeneity.
Shen X, Wen Y, Cui Y, Lu Q. Shen X, et al. Stat Med. 2022 Feb 10;41(3):517-542. doi: 10.1002/sim.9257. Epub 2021 Nov 22. Stat Med. 2022. PMID: 34811777 Free PMC article.
Detecting the Genomic Signature of Divergent Selection in Presence of Gene Flow.
Zeng P, Wang T. Zeng P, et al. Curr Genomics. 2015 Jun;16(3):194-202. doi: 10.2174/1389202916666150313230943. Curr Genomics. 2015. PMID: 26069459 Free PMC article.

See all "Cited by" articles

References

1. Adler RJ, Taylor JE. Random Fields and Geometry. Springer; New York: 2007.
1. Almasy L, Dyer TD, Peralta JM, Kent JW, Jr, Charlesworth JC, Curran JE, Blangero J. Genetic Analysis Workshop 17 mini-exome simulation. BMC Proc. 2011;5(Suppl 9):S2. - PMC - PubMed
1. Ansorge WJ. Next-generation DNA sequencing techniques. N Biotechnol. 2009;25(4):195–203. - PubMed
1. Besag J. Spatial interaction and statistical analysis of lattice systems. J R Stat Soc B. 1974;48:259–302.
1. Bodmer W, Bonilla C. Common and rare variants in multifactorial susceptibility to common diseases. Nat Genet. 2008;40(6):695–701. - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

K01DA033346/DA/NIDA NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A generalized genetic random field method for the genetic association analysis of sequencing data

Affiliation

A generalized genetic random field method for the genetic association analysis of sequencing data

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Miscellaneous

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Miscellaneous