Gap costs for multiple sequence alignment
- PMID: 2593679
- DOI: 10.1016/s0022-5193(89)80196-1
Gap costs for multiple sequence alignment
Abstract
Standard methods for aligning pairs of biological sequences charge for the most common mutations, which are substitutions, deletions and insertions. Because a single mutation may insert or delete several nucleotides, gap costs that are not directly proportional to gap length are usually the most effective. How to extend such gap costs to alignments of three or more sequences is not immediately obvious, and a variety of approaches have been taken. This paper argues that, since gap and substitution costs together specify optimal alignments, they should be defined using a common rationale. Specifically, a new definition of gap costs for multiple alignments is proposed and compared with previous ones. Since the new definition links a multiple alignment's cost to that of its pairwise projections, it allows knowledge gained about two-sequence alignments to bear on the multiple alignment problem. Also, such linkage is a key element of recent algorithms that have rendered practical the simultaneous alignment of as many as six sequences.
Similar articles
-
Post-processing long pairwise alignments.Bioinformatics. 1999 Dec;15(12):1012-9. doi: 10.1093/bioinformatics/15.12.1012. Bioinformatics. 1999. PMID: 10745991
-
Fast, optimal alignment of three sequences using linear gap costs.J Theor Biol. 2000 Dec 7;207(3):325-36. doi: 10.1006/jtbi.2000.2177. J Theor Biol. 2000. PMID: 11082303
-
Ancestral sequence alignment under optimal conditions.BMC Bioinformatics. 2005 Nov 17;6:273. doi: 10.1186/1471-2105-6-273. BMC Bioinformatics. 2005. PMID: 16293191 Free PMC article.
-
Sequence alignment and penalty choice. Review of concepts, case studies and implications.J Mol Biol. 1994 Jan 7;235(1):1-12. doi: 10.1016/s0022-2836(05)80006-3. J Mol Biol. 1994. PMID: 8289235 Review.
-
Sequence Alignment.In: Rosen KH, Shier DR, Goddard W, editors. Handbook of Discrete and Combinatorial Mathematics. 2nd edition. Boca Raton (FL): CRC Press/Taylor & Francis; 2017 Nov. 20.1. In: Rosen KH, Shier DR, Goddard W, editors. Handbook of Discrete and Combinatorial Mathematics. 2nd edition. Boca Raton (FL): CRC Press/Taylor & Francis; 2017 Nov. 20.1. PMID: 29206392 Free Books & Documents. Review.
Cited by
-
MACSE: Multiple Alignment of Coding SEquences accounting for frameshifts and stop codons.PLoS One. 2011;6(9):e22594. doi: 10.1371/journal.pone.0022594. Epub 2011 Sep 16. PLoS One. 2011. PMID: 21949676 Free PMC article.
-
SAGA: sequence alignment by genetic algorithm.Nucleic Acids Res. 1996 Apr 15;24(8):1515-24. doi: 10.1093/nar/24.8.1515. Nucleic Acids Res. 1996. PMID: 8628686 Free PMC article.
-
Multiple sequence alignment by conformational space annealing.Biophys J. 2008 Nov 15;95(10):4813-9. doi: 10.1529/biophysj.108.129684. Epub 2008 Aug 8. Biophys J. 2008. PMID: 18689453 Free PMC article.
-
Efficient methods for multiple sequence alignment with guaranteed error bounds.Bull Math Biol. 1993 Jan;55(1):141-54. doi: 10.1007/BF02460299. Bull Math Biol. 1993. PMID: 7680269
-
Two Simple and Efficient Algorithms to Compute the SP-Score Objective Function of a Multiple Sequence Alignment.PLoS One. 2016 Aug 9;11(8):e0160043. doi: 10.1371/journal.pone.0160043. eCollection 2016. PLoS One. 2016. PMID: 27505054 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous