Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Jan 1;33(Database issue):D169-73.
doi: 10.1093/nar/gki093.

SPD--a web-based secreted protein database

Affiliations

SPD--a web-based secreted protein database

Yunjia Chen et al. Nucleic Acids Res. .

Abstract

With the improved secreted protein prediction approach and comprehensive data sources, including Swiss-Prot, TrEMBL, RefSeq, Ensembl and CBI-Gene, we have constructed secretomes of human, mouse and rat, with a total of 18 152 secreted proteins. All the entries are ranked according to the prediction confidence. They were further annotated via a proteome annotation pipeline that we developed. We also set up a secreted protein classification pipeline and classified our predicted secreted proteins into different functional categories. To make the dataset more convincing and comprehensive, nine reference datasets are also integrated, such as the secreted proteins from the Gene Ontology Annotation (GOA) system at the European Bioinformatics Institute, and the vertebrate secreted proteins from Swiss-Prot. All these entries were grouped via a TribeMCL based clustering pipeline. We have constructed a web-based secreted protein database, which has been publicly available at http://spd.cbi.pku.edu.cn. Users can browse the database via a GO assignment or chromosomal-location-based interface. Moreover, text query and sequence similarity search are also provided, and the sequence and annotation data can be downloaded freely from the SPD website.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The pipeline we used to collect secreted proteins. Sequence data were downloaded by December 2003. Non-redundant: identical sequences were excluded via pairwise BLASTP. Sec-HMMER (CJ-SPHMM/TMHMM): the tool used to predict signal peptides and transmembrane region. Set1 and Set2: proteins predicted by PSORT and Sec-HMMER, respectively; Set3: proteins matched by Sec-HMMER only. Rank0: known secreted proteins in Swiss-Prot; Rank1: predicted by both PSORT and Sec-HMMER; Rank2: predicted by either PSORT or Sec-HMMER; and Rank3: predicted by Sec-HMMER only and with a signal peptide >70 amino acids.
Figure 2
Figure 2
Screen snapshots of the SPD database, only partially drawn (A and B) for clarity. (A) An example page of the chromosomal location of human protease secreted proteins. The clickable ‘+’ and ‘−’ symbols denotes the plus and minus strands as indicated in the browser at the SPD website. (B) An example of an SPD core dataset entry (TAFA 3.2) showing four divisions of the entry format and various fields with links to detail information and original data source. (C) An example of an SPD reference dataset entry (AY359017) showing two divisions with general information.

References

    1. Damas J.K., Gullestad,L. and Aukrust,P. (2001) Cytokines as new treatment targets in chronic heart failure. Curr. Control Trials Cardiovasc. Med., 2, 271–277. - PMC - PubMed
    1. Blobel G. (2000) Protein targeting (Nobel lecture). Chembiochem, 1, 86–102. - PubMed
    1. Walter P., Gilmore,R. and Blobel,G. (1984) Protein translocation across the endoplasmic reticulum. Cell, 38, 5–8. - PubMed
    1. Sakaguchi M. (1997) Eukaryotic protein secretion. Curr. Opin. Biotechnol., 8, 595–601. - PubMed
    1. Clark H.F., Gurney,A.L., Abaya,E., Baker,K., Baldwin,D., Brush,J., Chen,J., Chow,B., Chui,C., Crowley,C. et al. (2003) The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment. Genome Res., 13, 2265–2270. - PMC - PubMed

Publication types