Improving the practical space and time efficiency of the shortest-paths approach to sum-of-pairs multiple sequence alignment
- PMID: 8521275
- DOI: 10.1089/cmb.1995.2.459
Improving the practical space and time efficiency of the shortest-paths approach to sum-of-pairs multiple sequence alignment
Abstract
The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of MSA. The improvements make feasible a variety of problem instances that were not feasible previously. On some runs we achieve an order of magnitude reduction in space usage and a significant multiplicative factor speedup in running time. To explain how these improvements work, we give a much more detailed description of MSA than has been previously available. In practice, MSA rarely produces a provably optimal alignment and we explain why.
Similar articles
-
The practical use of the A* algorithm for exact multiple sequence alignment.J Comput Biol. 2000;7(5):655-71. doi: 10.1089/106652701446134. J Comput Biol. 2000. PMID: 11153092
-
Automated alignment of RNA sequences to pseudoknotted structures.Proc Int Conf Intell Syst Mol Biol. 1997;5:311-8. Proc Int Conf Intell Syst Mol Biol. 1997. PMID: 9322055
-
Evaluation measures of multiple sequence alignments.J Comput Biol. 2000 Feb-Apr;7(1-2):261-76. doi: 10.1089/10665270050081513. J Comput Biol. 2000. PMID: 10890401
-
Protein multiple sequence alignment.Methods Mol Biol. 2008;484:379-413. doi: 10.1007/978-1-59745-398-1_25. Methods Mol Biol. 2008. PMID: 18592193 Review.
-
Finding homologs to nucleic acid or protein sequences using the framesearch program.Curr Protoc Bioinformatics. 2002 Aug;Chapter 3:Unit 3.2. doi: 10.1002/0471250953.bi0302s00. Curr Protoc Bioinformatics. 2002. PMID: 18792937 Review.
Cited by
-
Interaction of the repressors Nrg1 and Nrg2 with the Snf1 protein kinase in Saccharomyces cerevisiae.Genetics. 2001 Jun;158(2):563-72. doi: 10.1093/genetics/158.2.563. Genetics. 2001. PMID: 11404322 Free PMC article.
-
Alignment of multiple proteins with an ensemble of hidden Markov models.Int J Data Min Bioinform. 2010;4(1):60-71. doi: 10.1504/ijdmb.2010.030967. Int J Data Min Bioinform. 2010. PMID: 20376922 Free PMC article.
-
Genome-wide analysis of positively selected genes in seasonal and non-seasonal breeding species.PLoS One. 2015 May 22;10(5):e0126736. doi: 10.1371/journal.pone.0126736. eCollection 2015. PLoS One. 2015. PMID: 26000771 Free PMC article.
-
A yeast taf17 mutant requires the Swi6 transcriptional activator for viability and shows defects in cell cycle-regulated transcription.Genetics. 2000 Apr;154(4):1561-76. doi: 10.1093/genetics/154.4.1561. Genetics. 2000. PMID: 10747053 Free PMC article.
-
Protein multiple sequence alignment by hybrid bio-inspired algorithms.Nucleic Acids Res. 2011 Mar;39(6):1980-92. doi: 10.1093/nar/gkq1052. Epub 2010 Nov 10. Nucleic Acids Res. 2011. PMID: 21071394 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources