Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007;8(11):R252.
doi: 10.1186/gb-2007-8-11-r252.

Using protein complexes to predict phenotypic effects of gene mutation

Affiliations

Using protein complexes to predict phenotypic effects of gene mutation

Hunter B Fraser et al. Genome Biol. 2007.

Abstract

Background: Predicting the phenotypic effects of mutations is a central goal of genetics research; it has important applications in elucidating how genotype determines phenotype and in identifying human disease genes.

Results: Using a wide range of functional genomic data from the yeast Saccharomyces cerevisiae, we show that the best predictor of a protein's knockout phenotype is the knockout phenotype of other proteins that are present in a protein complex with it. Even the addition of multiple datasets does not improve upon the predictions made from protein complex membership. Similarly, we find that a proxy for protein complexes is a powerful predictor of disease phenotypes in humans.

Conclusion: We propose that identifying human protein complexes containing known disease genes will be an efficient method for large-scale disease gene discovery, and that yeast may prove to be an informative model system for investigating, and even predicting, the genetic basis of both Mendelian and complex disease phenotypes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Predictors of phenotype pairs in yeast. (a) Enrichments for phenotype pairs among 20 predictors. An enrichment value of 1 reflects random performance (shown as 'all pairs', the left-most column) and greater than 1 indicates better than random predictive power. Predictors are arranged in order of increasing predictive power. Error bars indicate the hypergeometric standard deviation, which reflects the range of expected variation in the enrichment value. In the inset, red bars are the same as in the main panel a, and are in the same order. Green bars indicate enrichments in the intersection of each dataset with co-expression (r > 0.3). The seven datasets with significant improvements in predictive power are indicated by asterisks. Note that the four co-expression datasets are not counted in the multiple testing correction because they cannot possibly show any improvement when intersecting with another dataset that is a superset. (b) Green bars are the same as in Figure 1a inset. Blue bars indicate the level of enrichment that would be expected by chance, if co-expression was entirely independent of each dataset. Green bars significantly lower than the paired blue bar indicate a dataset that is not independent of co-expression. Error bars indicate the hypergeometric standard deviation, which reflects the range of expected variation in the enrichment value. In the inset, red bars are the same as in panel a and are in the same order as in both panels a and b. Green bars indicate enrichments in the intersection of each dataset with Munich Information Center for Protein Sequences (MIPS) complexes. Note that although many green bars are significantly higher than the paired red bars, no green bars are significantly higher than the MIPS complexes (rightmost) bars. This indicates that no dataset adds to the predictive power of complexes among the set of proteins in MIPS complexes. HTP, high-throughput; LTP, low-throughput; TF, transcription factor.
Figure 2
Figure 2
Predictors of disease gene pairs in human. (a) Enrichments for disease gene pairs among eight predictors. An enrichment value of 1 reflects random performance (shown as 'all pairs', the left-most column). Predictors are arranged in order of increasing predictive power. Error bars indicate the hypergeometric standard deviation, which reflects the range of expected variation in the enrichment value. In the inset red bars are the same as in the main panel a and are in the same order (note the tenfold change in scale). Green bars indicate enrichments in the intersection of each dataset with Human Protein Reference Database (HPRD) interactions. Aside from HPRD intersected with itself or with all pairs, all but one dataset (high-throughput [HTP] interactions) exhibit a significant improvement in predictive power over HPRD interactions alone when intersected with HPRD; this indicates that these datasets are at least partially independent of HPRD. (b) Green bars are the same as in panel a (inset). Blue bars indicate the level of enrichment that would be expected by chance, if HPRD interactions were entirely independent of each dataset. Green bars significantly lower than the paired blue bar indicate a dataset that is not entirely independent of HPRD interactions. The three right-most blue bars are truncated for clarity; their enrichment values are written above each bar. Error bars indicate the hypergeometric standard deviation, which reflects the range of expected variation in the enrichment value.

References

    1. McKusick VA. Mendelian Inheritance in Man and its online version, OMIM. Am J Hum Genet. 2007;80:588–604. doi: 10.1086/514346. - DOI - PMC - PubMed
    1. Steinmetz LM, Scharfe C, Deutschbauer AM, Mokranjac D, Herman ZS, Jones T, Chu AM, Giaever G, Prokisch H, Oefner PJ, et al. Systematic screen for human disease genes in yeast. Nat Genet. 2002;31:400–404. - PubMed
    1. Calvo S, Jain M, Xie X, Sheth SA, Chang B, Goldberger OA, Spinazzola A, Zeviani M, Carr SA, Mootha VK. Systematic identification of human mitochondrial disease genes through integrative genomics. Nat Genet. 2006;38:576–582. doi: 10.1038/ng1776. - DOI - PubMed
    1. Dudley AM, Janse DM, Tanay A, Shamir R, Church GM. A global view of pleiotropy and phenotypically derived gene function in yeast. Mol Syst Biol. 2005;1:2005.0001. doi: 10.1038/msb4100004. - DOI - PMC - PubMed
    1. Parsons AB, Lopez A, Givoni IE, Williams DE, Gray CA, Porter J, Chua G, Sopko R, Brost RL, Ho CH, et al. Exploring the mode-of-action of bioactive compounds by chemical-genetic profiling in yeast. Cell. 2006;126:611–625. doi: 10.1016/j.cell.2006.06.040. - DOI - PubMed

Publication types

LinkOut - more resources