Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jan;6(1):39-46.
doi: 10.1038/nmeth.1284.

Literature-curated protein interaction datasets

Affiliations

Literature-curated protein interaction datasets

Michael E Cusick et al. Nat Methods. 2009 Jan.

Abstract

High-quality datasets are needed to understand how global and local properties of protein-protein interaction, or 'interactome', networks relate to biological mechanisms, and to guide research on individual proteins. In an evaluation of existing curation of protein interaction experiments reported in the literature, we found that curation can be error-prone and possibly of lower quality than commonly assumed.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of the number of published papers supporting each interaction: in the dataset of yeast protein interactions downloaded from the BioGRID database; in the literature-curated dataset of human protein interactions; and in the literature-curated dataset of Arabidopsis protein interactions.
Figure 2
Figure 2
Distribution of the publications in literature-curated datasets by the number of interactions reported in the publication. Distribution in the (a) yeast (b) human and (c) Arabidopsis literature-curated PPI datasets supported by a single publication.
Figure 3
Figure 3
Overlaps of reported curation for yeast PPIs (a) Overlaps of the total number of reported binary PPIs, or after removing the largest high-throughput yeast PPI reports (numbers in parentheses). (b) Overlaps of the Pubmed reports curated. (c) Overlaps after removing multiply supported interactions.
Figure 4
Figure 4
Summary of recuration results. (a) 100 interacting pairs randomly drawn from the yeast literature curated dataset supported by only a single publication. Score 0: erroneous, not reported in the associated publication; score 1: reported in the associated publication but not verified; score 2: reported and verified. (b) Recuration results of the literature curated sample for human PPIs reported in multiple publications. Proportion of correct and erroneous curation units (left panel) and a distribution of different types of curation errors (right panel). (c) Summary of curation results of randomly sampled sets from human literature curated interacting pairs reported in a single publication. Correct and erroneous curation units (left side); distribution of different types of curation errors (right side).

Comment in

  • Recurated protein interaction datasets.
    Salwinski L, Licata L, Winter A, Thorneycroft D, Khadake J, Ceol A, Aryamontri AC, Oughtred R, Livstone M, Boucher L, Botstein D, Dolinski K, Berardini T, Huala E, Tyers M, Eisenberg D, Cesareni G, Hermjakob H. Salwinski L, et al. Nat Methods. 2009 Dec;6(12):860-1. doi: 10.1038/nmeth1209-860. Nat Methods. 2009. PMID: 19935838 No abstract available.

References

    1. Cusick ME, Klitgord N, Vidal M, Hill DE. Interactome: Gateway into systems biology. Hum. Mol. Genet. 2005;14:R171–81. - PubMed
    1. Bader S, Kuhner S, Gavin AC. Interaction networks for systems biology. FEBS Lett. 2008;582:1220–4. - PubMed
    1. Vidal M. Interactome modeling. FEBS Lett. 2005;579:1834–8. - PubMed
    1. Roberts PM. Mining literature for systems biology. Brief. Bioinform. 2006;7:399–406. - PubMed
    1. Venkatesan K, et al. An empirical framework for binary interactome mapping. Nat. Methods. 2008 in press. - PMC - PubMed

Publication types