Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Apr;40(7):e53.
doi: 10.1093/nar/gkr1257. Epub 2012 Jan 12.

A comprehensive framework for prioritizing variants in exome sequencing studies of Mendelian diseases

Affiliations

A comprehensive framework for prioritizing variants in exome sequencing studies of Mendelian diseases

Miao-Xin Li et al. Nucleic Acids Res. 2012 Apr.

Abstract

Exome sequencing strategy is promising for finding novel mutations of human monogenic disorders. However, pinpointing the casual mutation in a small number of samples is still a big challenge. Here, we propose a three-level filtration and prioritization framework to identify the casual mutation(s) in exome sequencing studies. This efficient and comprehensive framework successfully narrowed down whole exome variants to very small numbers of candidate variants in the proof-of-concept examples. The proposed framework, implemented in a user-friendly software package, named KGGSeq (http://statgenpro.psychiatry.hku.hk/kggseq), will play a very useful role in exome sequencing-based discovery of human Mendelian disease genes.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The three-level filtration and prioritization framework implemented in KGGSeq.
Figure 2.
Figure 2.
Protein–protein interaction network of MYH3 with four candidate genes. The five involved genes are in dashed circle. Each filled node denotes a gene; edges between notes indicate PPIs between protein products of the corresponding genes. Different edge colors represent the types of evidence for the association. This figure was produced by STRING (V9.0).
Figure 3.
Figure 3.
Pair-wise correlation of the five deleteriousness scores in the (a) disease-causal and (b) neutral rare variant sets. The Spearman's rank correlation method was used to calculate the pair-wise correlation coefficients.
Figure 4.
Figure 4.
Receiver operating characteristic (ROC) curves of various methods. (a) The control (neutral) variants are rare (MAF < 0.01); (b) the control (neutral) variants have MAF ≥ 0.01. The true positive rate (sensitivity) and false positive rate (1-specificity) of logistic model were obtained by 10-fold cross validation procedure. Logistic model: performance of conventional multiple logistic regression model when combining the five deleterious scores (PhyloP, SIFT, PolyPhen2, LRT and MutationTaster). The true positive rate and false positive rate of the five different prediction methods were generated by varying the threshold scores for prediction in the entire data set.

References

    1. Antonarakis SE, Beckmann JS. Mendelian disorders deserve more attention. Nat. Rev. Genet. 2006;7:277–282. - PubMed
    1. Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, Dent KM, Huff CD, Shannon PT, Jabs EW, Nickerson DA, et al. Exome sequencing identifies the cause of a mendelian disorder. Nat. Genet. 2010;42:30–35. - PMC - PubMed
    1. Chen JM, Ferec C, Cooper DN. Revealing the human mutome. Clin. Genet. 2010;78:310–320. - PubMed
    1. McCarthy MI, Hattersley AT. Learning from molecular genetics: novel insights arising from the definition of genes for monogenic and type 2 diabetes. Diabetes. 2008;57:2889–2898. - PMC - PubMed
    1. McCarthy MI. Exploring the unknown: assumptions about allelic architecture and strategies for susceptibility variant discovery. Genome Med. 2009;1:66. - PMC - PubMed

Publication types