TranScout: prediction of gene expression regulatory proteins from their sequences
- PMID: 12016057
- DOI: 10.1093/bioinformatics/18.4.597
TranScout: prediction of gene expression regulatory proteins from their sequences
Abstract
Motivation: The advent of genomics yields thousands of reading frames in search of function. Identification of conserved functional motifs in protein sequences can be helpful for function prediction.
Results: A database and a classification of reported DNA-binding protein motifs has been designed. A program ('TranScout') has been developed for the detection and evaluation of conserved motifs in prokaryotic and eukaryotic sequences of proteins with a gene regulatory function. The efficiency of the program is shown in a benchmark against a database obtained from SWISS-PROT without the protein sequences used to train the program. All motifs were detected with a mean average sensitivity of 0.98 and a mean average specificity of 0.92.
Availability: The program is freely available for use on the internet at http://luz.uab.es/transcout/. The user can find additional information at this site.
Similar articles
-
TOPDOM: database of domains and motifs with conservative location in transmembrane proteins.Bioinformatics. 2008 Jun 15;24(12):1469-70. doi: 10.1093/bioinformatics/btn202. Epub 2008 Apr 23. Bioinformatics. 2008. PMID: 18434342 Free PMC article.
-
Condition specific transcription factor binding site characterization in Saccharomyces cerevisiae.Bioinformatics. 2002 Oct;18(10):1289-96. doi: 10.1093/bioinformatics/18.10.1289. Bioinformatics. 2002. PMID: 12376372
-
ORFer--retrieval of protein sequences and open reading frames from GenBank and storage into relational databases or text files.BMC Bioinformatics. 2002 Dec 19;3:40. doi: 10.1186/1471-2105-3-40. Epub 2002 Dec 19. BMC Bioinformatics. 2002. PMID: 12493080 Free PMC article.
-
ACGT-a comparative genomics tool.Bioinformatics. 2003 May 22;19(8):1039-40. doi: 10.1093/bioinformatics/btg121. Bioinformatics. 2003. PMID: 12761070
-
Identification of motifs in protein sequences.Curr Protoc Cell Biol. 2001 May;Appendix 1:Appendix 1C. doi: 10.1002/0471143030.cba01cs00. Curr Protoc Cell Biol. 2001. PMID: 18228275 Review.
Cited by
-
TrSDB: a proteome database of transcription factors.Nucleic Acids Res. 2004 Jan 1;32(Database issue):D171-3. doi: 10.1093/nar/gkh101. Nucleic Acids Res. 2004. PMID: 14681387 Free PMC article.
-
Prediction of functional class of proteins and peptides irrespective of sequence homology by support vector machines.Bioinform Biol Insights. 2009 Nov 24;1:19-47. doi: 10.4137/bbi.s315. Bioinform Biol Insights. 2009. PMID: 20066123 Free PMC article.
-
Iterative reconstruction of transcriptional regulatory networks: an algorithmic approach.PLoS Comput Biol. 2006 May;2(5):e52. doi: 10.1371/journal.pcbi.0020052. Epub 2006 May 19. PLoS Comput Biol. 2006. PMID: 16710450 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources