Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL
- PMID: 9322012
Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL
Abstract
SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.
Similar articles
-
The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.Nucleic Acids Res. 2000 Jan 1;28(1):45-8. doi: 10.1093/nar/28.1.45. Nucleic Acids Res. 2000. PMID: 10592178 Free PMC article.
-
The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999.Nucleic Acids Res. 1999 Jan 1;27(1):49-54. doi: 10.1093/nar/27.1.49. Nucleic Acids Res. 1999. PMID: 9847139 Free PMC article.
-
The SWISS-PROT protein sequence data bank and its new supplement TREMBL.Nucleic Acids Res. 1996 Jan 1;24(1):21-5. doi: 10.1093/nar/24.1.21. Nucleic Acids Res. 1996. PMID: 8594581 Free PMC article.
-
The Swiss-Prot protein knowledgebase and ExPASy: providing the plant community with high quality proteomic data and tools.Plant Physiol Biochem. 2004 Dec;42(12):1013-21. doi: 10.1016/j.plaphy.2004.10.009. Epub 2004 Dec 15. Plant Physiol Biochem. 2004. PMID: 15707838 Review.
-
The annotation of both human and mouse kinomes in UniProtKB/Swiss-Prot: one small step in manual annotation, one giant leap for full comprehension of genomes.Mol Cell Proteomics. 2008 Aug;7(8):1409-19. doi: 10.1074/mcp.R700001-MCP200. Epub 2008 Apr 24. Mol Cell Proteomics. 2008. PMID: 18436524 Free PMC article. Review.
Cited by
-
Prediction of carbohydrate-binding proteins from sequences using support vector machines.Adv Bioinformatics. 2010;2010:289301. doi: 10.1155/2010/289301. Epub 2010 Sep 27. Adv Bioinformatics. 2010. PMID: 20936154 Free PMC article.
-
SCANMOT: searching for similar sequences using a simultaneous scan of multiple sequence motifs.Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W274-6. doi: 10.1093/nar/gki493. Nucleic Acids Res. 2005. PMID: 15980468 Free PMC article.
-
Isolation of a novel multiple-heavy metal resistant Lampropedia aestuarii GYF-1 and investigation of its bioremediation potential.BMC Microbiol. 2023 Nov 7;23(1):330. doi: 10.1186/s12866-023-03093-4. BMC Microbiol. 2023. PMID: 37936059 Free PMC article.
-
Prediction of functional residues in water channels and related proteins.Protein Sci. 1998 Jun;7(6):1458-68. doi: 10.1002/pro.5560070623. Protein Sci. 1998. PMID: 9655351 Free PMC article.