Dynamite: a flexible code generating language for dynamic programming methods used in sequence comparison
- PMID: 9322016
Abstract
We have developed a code generating language, called Dynamite, specialised for the production and subsequent manipulation of complex dynamic programming methods for biological sequence comparison. From a relatively simple text definition file, Dynamite will produce a variety of implementations of a dynamic programming method, including database searches and linear space alignments. The speed of the generated code is comparable to that of hand-written code, and the additional flexibility has proved invaluable in designing and testing new algorithms. An innovation is a flexible labelling system, which can be used to annotate the original sequences with biological information. We illustrate the Dynamite syntax and flexibility by showing definitions for dynamic programming routines (i) to align two protein sequences under the assumption that they are both polytopic transmembrane proteins, with the simultaneous assignment of transmembrane helices, and (ii) to align protein information to genomic DNA, allowing for introns and sequencing error.
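For readers unfamiliar with the class of methods the abstract refers to, the following is a minimal Python sketch of a dynamic programming recurrence for pairwise sequence comparison. It is not Dynamite syntax and not code generated by Dynamite; the function name `global_alignment_score`, the linear gap penalty, and the default scores are assumptions chosen for illustration. The sketch keeps only two rows of the matrix, so it returns the optimal global alignment score but not the alignment itself, which is simpler than the linear space alignment methods mentioned above.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Optimal global alignment score of strings a and b.

    Illustrative only: uses a linear gap penalty and hypothetical
    default scores, not the models described in the abstract.
    """
    # Row 0: aligning an empty prefix of a against prefixes of b costs only gaps.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        # Column 0: aligning a[:i] against an empty prefix of b.
        curr = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                prev[j - 1] + sub,  # align a[i-1] with b[j-1]
                prev[j] + gap,      # a[i-1] aligned to a gap
                curr[j - 1] + gap,  # b[j-1] aligned to a gap
            )
        prev = curr  # only the previous row is needed for the next one
    return prev[len(b)]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "ACGT"))  # 4
    print(global_alignment_score("ACGT", "AGT"))   # 1
```

Each cell depends only on its three neighbours, which is why a single definition of the recurrence can be turned mechanically into several implementations, such as a full-matrix alignment or a score-only database search.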