Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Feb 1;25(3):372-8.
doi: 10.1093/bioinformatics/btn640. Epub 2008 Dec 17.

Precision and recall estimates for two-hybrid screens

Affiliations

Precision and recall estimates for two-hybrid screens

Hailiang Huang et al. Bioinformatics. .

Abstract

Motivation: Yeast two-hybrid screens are an important method to map pairwise protein interactions. This method can generate spurious interactions (false discoveries), and true interactions can be missed (false negatives). Previously, we reported a capture-recapture estimator for bait-specific precision and recall. Here, we present an improved method that better accounts for heterogeneity in bait-specific error rates.

Result: For yeast, worm and fly screens, we estimate the overall false discovery rates (FDRs) to be 9.9%, 13.2% and 17.0% and the false negative rates (FNRs) to be 51%, 42% and 28%. Bait-specific FDRs and the estimated protein degrees are then used to identify protein categories that yield more (or fewer) false positive interactions and more (or fewer) interaction partners. While membrane proteins have been suggested to have elevated FDRs, the current analysis suggests that intrinsic membrane proteins may actually have reduced FDRs. Hydrophobicity is positively correlated with decreased error rates and fewer interaction partners. These methods will be useful for future two-hybrid screens, which could use ultra-high-throughput sequencing for deeper sampling of interacting bait-prey pairs.

Availability: All software (C source) and datasets are available as supplemental files and at http://www.baderzone.org under the Lesser GPL v. 3 license.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
The distributions of FDRs for baits are displayed for Beta/TPL (solid line), Beta/PL (dashed line) and Mixture/TPL (black impulses) for yeast (A), worm (B) and fly (C). Posterior maximum likelihood estimates are displayed for the Beta/TPL model (open circles). Baits with a single clone do not contribute to the estimator and are not included in the histograms.

References

    1. Ashburner M, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000;25:25–29. - PMC - PubMed
    1. Bader JS, Chant J. Systems biology. When proteomes collide. Science. 2006;311:187–188. - PubMed
    1. Bateman A, et al. The Pfam protein families database. Nucleic Acids Res. 2004;32:D138–D141. - PMC - PubMed
    1. Cornette JL, et al. Hydrophobicity scales and computational techniques for detecting amphipathic structures in proteins. J. Mol. Biol. 1987;195:659–685. - PubMed
    1. Deane CM, et al. Protein interactions: two methods for assessment of the reliability of high throughput observations. Mol. Cell Proteomics. 2002;1:349–356. - PubMed

Publication types