Bioinformatic tools for DNA/protein sequence analysis, functional assignment of genes and protein classification
- PMID: 11778865
- DOI: 10.1007/s00253-001-0844-0
Bioinformatic tools for DNA/protein sequence analysis, functional assignment of genes and protein classification
Abstract
The development of efficient DNA sequencing methods has led to the achievement of the DNA sequence of entire genomes from (to date) 55 prokaryotes, 5 eukaryotic organisms and 10 eukaryotic chromosomes. Thus, an enormous amount of DNA sequence data is available and even more will be forthcoming in the near future. Analysis of this overwhelming amount of data requires bioinformatic tools in order to identify genes that encode functional proteins or RNA. This is an important task, considering that even in the well-studied Escherichia coli more than 30% of the identified open reading frames are hypothetical genes. Future challenges of genome sequence analysis will include the understanding of gene regulation and metabolic pathway reconstruction including DNA chip technology, which holds tremendous potential for biomedicine and the biotechnological production of valuable compounds. The overwhelming volume of information often confuses scientists. This review intends to provide a guide to choosing the most efficient way to analyze a new sequence or to collect information on a gene or protein of interest by applying current publicly available databases and Web services. Recently developed tools that allow functional assignment of genes, mainly based on sequence similarity of the deduced amino acid sequence, using the currently available and increasing biological databases will be discussed.
Similar articles
-
VISTA family of computational tools for comparative analysis of DNA sequences and whole genomes.Methods Mol Biol. 2006;338:69-89. doi: 10.1385/1-59745-097-9:69. Methods Mol Biol. 2006. PMID: 16888351
-
Functional and structural genomics using PEDANT.Bioinformatics. 2001 Jan;17(1):44-57. doi: 10.1093/bioinformatics/17.1.44. Bioinformatics. 2001. PMID: 11222261
-
MitoRes: a resource of nuclear-encoded mitochondrial genes and their products in Metazoa.BMC Bioinformatics. 2006 Jan 24;7:36. doi: 10.1186/1471-2105-7-36. BMC Bioinformatics. 2006. PMID: 16433928 Free PMC article.
-
Statistical significance in biological sequence analysis.Brief Bioinform. 2006 Mar;7(1):2-24. doi: 10.1093/bib/bbk001. Brief Bioinform. 2006. PMID: 16761361 Review.
-
Pairwise sequence alignment--it's all about us!Brief Bioinform. 2006 Mar;7(1):113-5. doi: 10.1093/bib/bbk008. Brief Bioinform. 2006. PMID: 16761368 Review. No abstract available.
Cited by
-
HMGB1 protein does not mediate the inflammatory response in spontaneous spinal cord regeneration: a hint for CNS regeneration.J Biol Chem. 2013 Jun 21;288(25):18204-18. doi: 10.1074/jbc.M113.463810. Epub 2013 May 6. J Biol Chem. 2013. PMID: 23649623 Free PMC article.
-
Characterization and expression of AmphiCL encoding cathepsin l proteinase from amphioxus Branchiostoma belcheri tsingtauense.Mar Biotechnol (NY). 2005 Jul-Aug;7(4):279-86. doi: 10.1007/s10126-004-4084-9. Mar Biotechnol (NY). 2005. PMID: 15776312
-
Label noise in subtype discrimination of class C G protein-coupled receptors: A systematic approach to the analysis of classification errors.BMC Bioinformatics. 2015 Sep 29;16:314. doi: 10.1186/s12859-015-0731-9. BMC Bioinformatics. 2015. PMID: 26415951 Free PMC article.
-
Mechanism of drug resistance in bacteria: efflux pump modulation for designing of new antibiotic enhancers.Folia Microbiol (Praha). 2021 Oct;66(5):727-739. doi: 10.1007/s12223-021-00910-z. Epub 2021 Aug 25. Folia Microbiol (Praha). 2021. PMID: 34431062 Review.
-
The molecular cloning of glial fibrillary acidic protein in Gekko japonicus and its expression changes after spinal cord transection.Cell Mol Biol Lett. 2010 Dec;15(4):582-99. doi: 10.2478/s11658-010-0029-x. Epub 2010 Aug 14. Cell Mol Biol Lett. 2010. PMID: 20711818 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources