Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2001 Jan 1;29(1):75-9.
doi: 10.1093/nar/29.1.75.

YPD, PombePD and WormPD: model organism volumes of the BioKnowledge library, an integrated resource for protein information

Affiliations

YPD, PombePD and WormPD: model organism volumes of the BioKnowledge library, an integrated resource for protein information

M C Costanzo et al. Nucleic Acids Res. .

Abstract

The BioKnowledge Library is a relational database and web site (http://www.proteome.com) composed of protein-specific information collected from the scientific literature. Each Protein Report on the web site summarizes and displays published information about a single protein, including its biochemical function, role in the cell and in the whole organism, localization, mutant phenotype and genetic interactions, regulation, domains and motifs, interactions with other proteins and other relevant data. This report describes four species-specific volumes of the BioKnowledge Library, concerned with the model organisms Saccharomyces cerevisiae (YPD), Schizosaccharomyces pombe (PombePD) and Caenorhabditis elegans (WormPD), and with the fungal pathogen Candida albicans (CalPD). Protein Reports of each species are unified in format, easily searchable and extensively cross-referenced between species. The relevance of these comprehensively curated resources to analysis of proteins in other species is discussed, and is illustrated by a survey of model organism proteins that have similarity to human proteins involved in disease.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(A) Proportion of proteins conserved among the model organisms. Each circle represents the set of proteins analyzed from that species and contained in the corresponding Proteome database. Complete sets of the predicted proteins (as of September 2000) of S.cerevisiae (6145 proteins) and C.elegans (18 546 proteins) and a nearly complete set of S.pombe predicted proteins (4837) were compared with each other using BLAST analysis (6,7), and matches with an E value of 1 × 10–10 or lower were counted. In the intersection of all three circles (colored in black) are proteins that find a match in all three species, represented as the percentage of total proteins in that species. At the intersection of each pair of circles is the percentage of proteins conserved between those two species but not found in the third, by these criteria. For example, 38% of S.cerevisiae proteins find a match in both S.pombe and C.elegans; 20% match S.pombe but not C.elegans; and 3% match C.elegans but not S.pombe. The proportions of proteins similar to a protein of at least one of the other species are: 61% of S.cerevisiae proteins, 71% of S.pombe proteins and 23% of C.elegans proteins find a match in at least one of the other species. (B) Comparison of predicted proteins of the model yeasts (S.cerevisiae, 6145 proteins and S.pombe, 4837 proteins), C.elegans (18 546 proteins) and mammals (10 229 from human, 5966 from mouse and 3188 from rat), as represented in the indicated volumes of the BioKnowledge Library as of September 2000. The proteins analyzed represent the complete predicted proteomes of S.cerevisiae and C.elegans, the nearly complete proteome of S.pombe and a partial set of mammalian proteins. Criteria for counting similar proteins were the same as for the comparison of (A). Proteins conserved in all three species are indicated in the intersection of all three circles, as in (A). The proportions of proteins similar to a protein of at least one of the other species are: 79% of the mammalian proteins, 43% of the C.elegans proteins and 52% of the model yeast proteins.

References

    1. Goffeau A., Barrell,B.G., Bussey,H., Davis,R.W., Dujon,B., Feldmann,H., Galibert,F., Hoheisel,J.D., Jacq,C., Johnston,M. et al. (1996) Life with 6000 genes. Science, 274, 563–567. - PubMed
    1. The C. elegans Sequencing Consortium (1998) Genome sequence of the nematode C. elegans: a platform for investigating biology. Science, 282, 2012–2018. - PubMed
    1. Adams M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D., Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F. et al. (2000) The genome sequence of Drosophila melanogaster. Science, 287, 2185–2195. - PubMed
    1. Costanzo M.C., Hogan,J.D., Cusick,M.E., Davis,B.P., Fancher,A.M., Hodges,P.E., Kondu,P., Lengieza,C., Lew-Smith,J.E., Lingner,C. et al. (2000) The Yeast Proteome Database (YPD) and Caenorhabditis elegans Proteome Database (WormPD): comprehensive resources for the organization and comparison of model organism protein information. Nucleic Acids Res., 28, 73–76. - PMC - PubMed
    1. Forsburg S.L. (1999) The best yeast? Trends Genet., 15, 340–344. - PubMed

Publication types