BMC Genomics. 2012 Nov 24;13:667.

doi: 10.1186/1471-2164-13-667.

Weighted pedigree-based statistics for testing the association of rare variants

Yin Yao Shugart¹, Yun Zhu, Wei Guo, Momiao Xiong

PMID: 23176082
PMCID: PMC3827928
DOI: 10.1186/1471-2164-13-667

Affiliation

¹ Unit of Statistical Genomics, Division of Intramural Research Programs, National Institute of Mental Health, National Institutes of Health, Bethesda, MD, USA.

Abstract

Background: With the advent of next-generation sequencing (NGS) technologies, researchers are now generating a deluge of data on high-dimensional genomic variation, whose analysis is likely to reveal rare variants involved in the complex etiology of disease. Standing in the way of such discoveries, however, is the fact that statistics for rare variants are currently designed for use with population-based data. In this paper, we introduce a pedigree-based statistic specifically designed to test for rare variants in family-based data. The additional power of pedigree-based statistics stems from the fact that, while rare variants related to diseases or traits of interest occur only infrequently in populations, such variants are enriched in families with multiple affected individuals. The proposed statistic can be applied with or without statistical weighting, but our simulations show that its power increases when a weighting scheme (WSS or VT) is applied.

Results: Our working hypothesis was that, since rare variants are concentrated in families with multiple affected individuals, pedigree-based statistics should detect rare variants more powerfully than population-based statistics. To evaluate how well our new pedigree-based statistics perform in association studies, we developed a general framework for sequence-based association studies capable of handling data from pedigrees of various types and also from unrelated individuals. In short, we developed a procedure for transforming population-based statistics into tests for family-based associations. Furthermore, we modified two existing tests, the weighted sum-square test and the variable-threshold test, and applied both to our family-based collapsing methods. We demonstrate that the new family-based tests are more powerful than the corresponding population-based tests and that they generate a reasonable type I error rate. To demonstrate feasibility, we applied the newly developed tests to a pedigree-based GWAS data set from the Framingham Heart Study (FHS). The FHS-GWAS data contain approximately 5000 uncommon variants with frequencies less than 0.05. Potential association findings in these data demonstrate the feasibility of the software PB-STAR, which is now freely available to the public.
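As a rough illustration of the collapsing idea that the family-based tests build on, the sketch below collapses rare variants into a single carrier indicator and computes an ordinary Pearson χ² statistic on the resulting 2×2 table. It is a population-based sketch only: it assumes unrelated individuals and does not reproduce the pedigree correction described in the paper. The function names, the 0/1/2 genotype coding, and the default threshold of 0.005 (borrowed from the figure captions) are assumptions made for this example, not part of PB-STAR.

```python
import numpy as np


def collapse_rare_variants(genotypes, maf_threshold=0.005):
    """Return a 0/1 carrier indicator per individual.

    genotypes: (n_individuals, n_variants) array of minor-allele counts (0, 1, 2).
    A variant is treated as rare when its sample allele frequency is at or
    below maf_threshold (hypothetical default, taken from the figure captions).
    """
    genotypes = np.asarray(genotypes)
    allele_freq = genotypes.mean(axis=0) / 2.0
    rare = allele_freq <= maf_threshold
    # An individual is a carrier if it has at least one rare allele.
    return (genotypes[:, rare].sum(axis=1) > 0).astype(int)


def chi_square_2x2(carrier, affected):
    """Pearson chi-square statistic for carrier status versus disease status.

    Assumes unrelated individuals and non-empty margins; related individuals
    would need a variance correction that is not implemented here.
    """
    carrier = np.asarray(carrier, dtype=bool)
    affected = np.asarray(affected, dtype=bool)
    table = np.array(
        [
            [np.sum(carrier & affected), np.sum(carrier & ~affected)],
            [np.sum(~carrier & affected), np.sum(~carrier & ~affected)],
        ],
        dtype=float,
    )
    expected = np.outer(table.sum(axis=1), table.sum(axis=0)) / table.sum()
    return float(((table - expected) ** 2 / expected).sum())
```

With 1 degree of freedom, the statistic would be compared against the χ² critical value of about 3.84 at α = 0.05, again under the assumption of independent samples.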

Conclusion: Our tests show that when analyzing for rare variants, a pedigree-based design is more powerful than a population-based case-control design. We further demonstrate that a pedigree-based statistic's power to detect rare variants increases in direct relation to the proportion of affected individuals within the pedigree.
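For readers unfamiliar with the WSS weighting mentioned in the abstract, the following is a minimal sketch of the original population-based weights from Madsen and Browning's weighted-sum method, in which each variant is down-weighted according to its allele frequency estimated in unaffected individuals. The paper's family-based modification is not reproduced here, and the function name and genotype coding are assumptions for this example.

```python
import numpy as np


def wss_scores(genotypes, affected):
    """Weighted-sum genetic scores in the style of Madsen and Browning.

    genotypes: (n_individuals, n_variants) array of minor-allele counts (0, 1, 2).
    affected:  boolean-like vector, True for affected individuals.
    Returns (scores, weights). Assumes unrelated individuals.
    """
    g = np.asarray(genotypes, dtype=float)
    unaffected = ~np.asarray(affected, dtype=bool)
    n_total = g.shape[0]
    n_unaffected = unaffected.sum()
    # Minor-allele count per variant among unaffected individuals.
    m_unaffected = g[unaffected].sum(axis=0)
    # Frequency estimate with +1/+2 smoothing so that it is never 0 or 1.
    q = (m_unaffected + 1.0) / (2.0 * n_unaffected + 2.0)
    weights = np.sqrt(n_total * q * (1.0 - q))
    # Rarer variants have smaller weights and so contribute more to the score.
    scores = (g / weights).sum(axis=1)
    return scores, weights
```

In the original method, the scores are ranked, the ranks of affected individuals are summed, and significance is assessed by permuting affection status; that permutation step is omitted from this sketch.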

Figures

**Figure 1**
The power curves of the family-based corrected single marker χ² test statistic as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

**Figure 2**
The power curves of the family-based collapsing test (variants with frequencies ≤0.005 were collapsed) statistic as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

**Figure 3**
The power curves of the family-based VT test statistic as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

**Figure 4**
The power curves of the family-based WSS test statistic as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

**Figure 5**
The power curves of the family-based corrected single marker χ² test statistic as a function of the proportion of risk variants at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, a total of 1,800 sampled individuals and a baseline penetrance of 0.01.

**Figure 6**
The power curves of the family-based collapsing test (variants with frequencies ≤0.005 were collapsed) statistic as a function of the proportion of risk variants at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, a total of 1,800 sampled individuals and a baseline penetrance of 0.01.

**Figure 7**
The power curves of the family-based VT test statistic as a function of the proportion of risk variants at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, a total of 1,800 sampled individuals and a baseline penetrance of 0.01.

**Figure 8**
The power curves of the family-based WSS test statistic as a function of the proportion of risk variants at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, a total of 1,800 sampled individuals and a baseline penetrance of 0.01.

**Figure 9**
The power curves of the family-based corrected single marker χ² statistic under opposite directions of association as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

**Figure 10**
The power curves of the family-based collapsing test (variants with frequencies ≤0.005 were collapsed) statistic under opposite directions of association as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

**Figure 11**
The power curves of the family-based VT statistic under opposite directions of association as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

**Figure 12**
The power curves of the family-based WSS test statistic under opposite directions of association as a function of the total number of individuals at the significance level α = 0.05 in the test under seven settings: unrelated individuals in cases-controls study, nuclear family groups 1 and 2, sib-pair groups 1 and 2 and three generation family groups 1 and 2, assuming a dominant model, 20% of the risk variants and a baseline penetrance of 0.01.

Cited by

Pathway-based approach using hierarchical components of collapsed rare variants.
Lee S, Choi S, Kim YJ, Kim BJ; T2d-Genes Consortium; Hwang H, Park T. Bioinformatics. 2016 Sep 1;32(17):i586-i594. doi: 10.1093/bioinformatics/btw425. PMID: 27587678.
The power comparison of the haplotype-based collapsing tests and the variant-based collapsing tests for detecting rare variants in pedigrees.
Guo W, Shugart YY. BMC Genomics. 2014 Jul 28;15(1):632. doi: 10.1186/1471-2164-15-632. PMID: 25070353.
Amish revisited: next-generation sequencing studies of psychiatric disorders among the Plain people.
Hou L, Faraci G, Chen DT, Kassem L, Schulze TG, Shugart YY, McMahon FJ. Trends Genet. 2013 Jul;29(7):412-8. doi: 10.1016/j.tig.2013.01.007. Epub 2013 Feb 17. PMID: 23422049. Review.
A multistep approach to single nucleotide polymorphism-set analysis: an evaluation of power and type I error of gene-based tests of association after pathway-based association tests.
Valcarcel A, Grinde K, Cook K, Green A, Tintle N. BMC Proc. 2016 Oct 18;10(Suppl 7):349-355. doi: 10.1186/s12919-016-0055-4. eCollection 2016. PMID: 27980661.
A complete pedigree-based graph workflow for rare candidate variant analysis.
Markello C, Huang C, Rodriguez A, Carroll A, Chang PC, Eizenga J, Markello T, Haussler D, Paten B. Genome Res. 2022 May;32(5):893-903. doi: 10.1101/gr.276387.121. Epub 2022 Apr 28. PMID: 35483961.

Grants and funding

1R01HL106034-01/HL/NHLBI NIH HHS/United States
