Multiple alignment using hidden Markov models
- PMID: 7584426
Multiple alignment using hidden Markov models
Abstract
A simulated annealing method is described for training hidden Markov models and producing multiple sequence alignments from initially unaligned protein or DNA sequences. Simulated annealing in turn uses a dynamic programming algorithm for correctly sampling suboptimal multiple alignments according to their probability and a Boltzmann temperature factor. The quality of simulated annealing alignments is evaluated on structural alignments of ten different protein families, and compared to the performance of other HMM training methods and the ClustalW program. Simulated annealing is better able to find near-global optima in the multiple alignment probability landscape than the other tested HMM training methods. Neither ClustalW nor simulated annealing produce consistently better alignments compared to each other. Examination of the specific cases in which ClustalW outperforms simulated annealing, and vice versa, provides insight into the strengths and weaknesses of current hidden Markov model approaches.
Similar articles
-
Hidden Markov models in computational biology. Applications to protein modeling.J Mol Biol. 1994 Feb 4;235(5):1501-31. doi: 10.1006/jmbi.1994.1104. J Mol Biol. 1994. PMID: 8107089
-
Simultaneous sequence alignment and tree construction using hidden Markov models.Pac Symp Biocomput. 2003:180-91. Pac Symp Biocomput. 2003. PMID: 12603027
-
HMM-ModE--improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences.BMC Bioinformatics. 2007 Mar 27;8:104. doi: 10.1186/1471-2105-8-104. BMC Bioinformatics. 2007. PMID: 17389042 Free PMC article.
-
Hidden Markov models.Curr Opin Struct Biol. 1996 Jun;6(3):361-5. doi: 10.1016/s0959-440x(96)80056-x. Curr Opin Struct Biol. 1996. PMID: 8804822 Review.
-
Sequence alignment and penalty choice. Review of concepts, case studies and implications.J Mol Biol. 1994 Jan 7;235(1):1-12. doi: 10.1016/s0022-2836(05)80006-3. J Mol Biol. 1994. PMID: 8289235 Review.
Cited by
-
HDAC8 mutations in Cornelia de Lange syndrome affect the cohesin acetylation cycle.Nature. 2012 Sep 13;489(7415):313-7. doi: 10.1038/nature11316. Nature. 2012. PMID: 22885700 Free PMC article.
-
Cluster oligonucleotide signatures for rapid identification by sequencing.BMC Bioinformatics. 2018 Oct 29;19(1):395. doi: 10.1186/s12859-018-2363-3. BMC Bioinformatics. 2018. PMID: 30522439 Free PMC article.
-
Homology induction: the use of machine learning to improve sequence similarity searches.BMC Bioinformatics. 2002 Apr 23;3:11. doi: 10.1186/1471-2105-3-11. BMC Bioinformatics. 2002. PMID: 11972320 Free PMC article.
-
Helix breaking transition in the S4 of HCN channel is critical for hyperpolarization-dependent gating.Elife. 2019 Nov 27;8:e53400. doi: 10.7554/eLife.53400. Elife. 2019. PMID: 31774399 Free PMC article.
-
Protein sequence alignment with family-specific amino acid similarity matrices.BMC Res Notes. 2011 Aug 16;4:296. doi: 10.1186/1756-0500-4-296. BMC Res Notes. 2011. PMID: 21846354 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Other Literature Sources