Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Nov 29;5 Suppl 9(Suppl 9):S30.
doi: 10.1186/1753-6561-5-S9-S30.

Challenges and directions: an analysis of Genetic Analysis Workshop 17 data by collapsing rare variants within family data

Affiliations

Challenges and directions: an analysis of Genetic Analysis Workshop 17 data by collapsing rare variants within family data

Peng Lin et al. BMC Proc. .

Abstract

Recent studies suggest that the traditional case-control study design does not have sufficient power to discover rare risk variants. Two different methods-collapsing and family data-are suggested as alternatives for discovering these rare variants. Compared with common variants, rare variants have unique characteristics. In this paper, we assess the distribution of rare variants in family data. We notice that a large number of rare variants exist only in one or two families and that the association result is largely shaped by those families. Therefore we explore the possibility of integrating both the collapsing method and the family data method. This combinational approach offers a potential power boost for certain causal genes, including VEGFA, VEGFC, SIRT1, SREBF1, PIK3R3, VLDLR, PLAT, and FLT4, and thus deserves further investigation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of rare causal SNPs within families. In the GAW17 data set, 145 of 162 casual SNPs are rare variants. Of these 145 rare variants, 103 do not exist in the family data. Eighty-five percent of the existing rare variants exist in only one or two families. The number above each bar indicates the exact number of rare SNPs in this category. It partly explains why many rare variants cannot be discovered using family data.
Figure 2
Figure 2
Distribution of association signals withinfamilies. Each category indicates the number of families that report an association signal for each SNP. The number above each bar indicates the total number of rare causal SNPs in this category. The distribution of association signals matches well to the distribution of rare SNPs within families. It shows that when all families are analyzed together, the final result is largely shaped by only a few families.

Similar articles

Cited by

References

    1. Manolio TA. Genomewide association studies and assessment of the risk of disease. New Engl J Med. 2010;363:166–176. doi: 10.1056/NEJMra0905980. - DOI - PubMed
    1. Dering C, Pugh E, Ziegler A. Statistical analysis of rare sequence variants: an overview of collapsing methods. Genet Epidemiol. 2011;X(suppl X):X–X. - PMC - PubMed
    1. Li B, Leal SM. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am J Hum Genet. 2008;83:311–321. doi: 10.1016/j.ajhg.2008.06.024. - DOI - PMC - PubMed
    1. Thornton T, McPeek MS. Case-control association testing with related individuals: a more powerful quasi-likelihood score test. Am J Hum Genet. 2007;81:321–337. doi: 10.1086/519497. - DOI - PMC - PubMed
    1. Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, et al.. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006;38:203–208. doi: 10.1038/ng1702. - DOI - PubMed

LinkOut - more resources