. 2002 Jan 1;30(1):35-7.

doi: 10.1093/nar/30.1.35.

The Protein Information Resource: an integrated public resource of functional annotation of proteins

Cathy H Wu¹, Hongzhan Huang, Leslie Arminski, Jorge Castro-Alvear, Yongxing Chen, Zhang-Zhi Hu, Robert S Ledley, Kali C Lewis, Hans-Werner Mewes, Bruce C Orcutt, Baris E Suzek, Akira Tsugita, C R Vinayaka, Lai-Su L Yeh, Jian Zhang, Winona C Barker

Affiliations

Affiliation

¹ National Biomedical Research Foundation, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA. pirmail@nbrf.georgetown.edu

PMID: 11752247
PMCID: PMC99125
DOI: 10.1093/nar/30.1.35

The Protein Information Resource: an integrated public resource of functional annotation of proteins

Cathy H Wu et al. Nucleic Acids Res. 2002.

. 2002 Jan 1;30(1):35-7.

doi: 10.1093/nar/30.1.35.

Authors

Affiliation

¹ National Biomedical Research Foundation, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA. pirmail@nbrf.georgetown.edu

PMID: 11752247
PMCID: PMC99125
DOI: 10.1093/nar/30.1.35

Abstract

The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system is developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, as well as cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases).

PubMed Disclaimer

Cited by

DAVID: Database for Annotation, Visualization, and Integrated Discovery.
Dennis G Jr, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA. Dennis G Jr, et al. Genome Biol. 2003;4(5):P3. Epub 2003 Apr 3. Genome Biol. 2003. PMID: 12734009
Proteomic definition of a desmoglein linear determinant common to Pemphigus vulgaris and Pemphigus foliaceous.
Lucchese A, Mittelman A, Tessitore L, Serpico R, Sinha AA, Kanduc D. Lucchese A, et al. J Transl Med. 2006 Aug 22;4:37. doi: 10.1186/1479-5876-4-37. J Transl Med. 2006. PMID: 16925820 Free PMC article.
Exegesis: a procedure to improve gene predictions and its use to find immunoglobulin superfamily proteins in the human and mouse genomes.
de Bono B, Chothia C. de Bono B, et al. Nucleic Acids Res. 2003 Nov 1;31(21):6096-103. doi: 10.1093/nar/gkg828. Nucleic Acids Res. 2003. PMID: 14576296 Free PMC article.
SEQOPTICS: a protein sequence clustering system.
Chen Y, Reilly KD, Sprague AP, Guan Z. Chen Y, et al. BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S10. doi: 10.1186/1471-2105-7-S4-S10. BMC Bioinformatics. 2006. PMID: 17217502 Free PMC article.
Bioinformatics Resources for In Silico Proteome Analysis.
Pruess M, Apweiler R. Pruess M, et al. J Biomed Biotechnol. 2003;2003(4):231-236. doi: 10.1155/S1110724303209219. J Biomed Biotechnol. 2003. PMID: 14615630 Free PMC article.

See all "Cited by" articles

References

1. Barker W.C., Pfeiffer,F. and George,D.G. (1996) Superfamily classification in PIR-International Protein Sequence Database. Methods Enzymol., 266, 59–71. - PubMed
1. Wu C.H., Xiao,C., Hou,Z., Huang,H. and Barker,W.C. (2001) iProClass: an integrated, comprehensive, and annotated protein classification database. Nucleic Acids Res., 29, 52–54. - PMC - PubMed
1. Bateman A., Birney,E., Durbin,R., Eddy,S.R., Howe,K.L. and Sonnhammer,E.L.L. (2000) The Pfam protein families database. Nucleic Acids Res., 28, 263–266. Updated article in this issue: Nucleic Acids Res. (2002), 30, 276–280. - PMC - PubMed
1. Huang H., Xiao,C. and Wu,C.H. (2000) ProClass protein family database. Nucleic Acids Res., 28, 273–276. - PMC - PubMed
1. Hofmann K., Bucher,P., Falquet,L. and Bairoch,A. (1999) The PROSITE database, its status in 1999. Nucleic Acids Res., 27, 215–219. Updated article in this issue: Nucleic Acids Res. (2002), 30, 235–238. - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

P41 LM05978/LM/NLM NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The Protein Information Resource: an integrated public resource of functional annotation of proteins

Affiliation

The Protein Information Resource: an integrated public resource of functional annotation of proteins

Authors

Affiliation

Abstract

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources