Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Sep;10(6):402-15.
doi: 10.2174/138920209789177575.

Hidden Markov Models and their Applications in Biological Sequence Analysis

Affiliations

Hidden Markov Models and their Applications in Biological Sequence Analysis

Byung-Jun Yoon. Curr Genomics. 2009 Sep.

Abstract

Hidden Markov models (HMMs) have been extensively used in biological sequence analysis. In this paper, we give a tutorial review of HMMs and their applications in a variety of problems in molecular biology. We especially focus on three types of HMMs: the profile-HMMs, pair-HMMs, and context-sensitive HMMs. We show how these HMMs can be used to solve various sequence analysis problems, such as pairwise and multiple sequence alignments, gene annotation, classification, similarity search, and many others.

Keywords: Hidden Markov model (HMM); context-sensitive HMM (csHMM); pair-HMM; profile-HMM; profile-csHMM; sequence analysis..

PubMed Disclaimer

Figures

Fig. (1)
Fig. (1)
A simple HMM for modeling eukaryotic genes.
Fig. (2)
Fig. (2)
Profile hidden Markov model. (a) Multiple sequence alignment for constructing the profile-HMM. (b) The ungapped HMM that represents the consensus sequence of the alignment. (c) The final profile-HMM that allows insertions and deletions.
Fig. (3)
Fig. (3)
Example of a pair hidden Markov model. A pair-HMM generates an aligned pair of sequences. In this example, two DNA sequences x and z are simultaneously generated by the pair-HMM, where the underlying state sequence is y. Note that the state sequence y uniquely determines the pairwise alignment between x and z .
Fig. (4)
Fig. (4)
A context-sensitive HMM that generates only symmetric sequences, or palindromes.
Fig. (5)
Fig. (5)
Constructing a profile-csHMM from a multiple RNA sequence alignment. (a) Example of an RNA sequence alignment. The consensus RNA structure has two base-pairs. (b) An ungapped csHMM constructed from the given alignment. (c) The final profile-csHMM that can handle symbol matches, insertions, and deletions.

Similar articles

Cited by

References

    1. Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE. 1989;77:257–286.
    1. Munch K, Krogh A. Automatic generation of gene finders for eukaryotic species. BMC Bioinform. 2006;7:263. - PMC - PubMed
    1. Durbin R, Eddy S, Krogh A, Mitchison G. Biological Sequence Analysis. Cambridge, UK: Cambridge University Press; 1998.
    1. Pachter L, Alexandersson M, Cawley S. Applications of generalized pair hidden Markov models to alignment and gene finding problems. J. Comput. Biol. 2002;9:389–399. - PubMed
    1. Liang K C, Wang X, Anastassiou D. Bayesian basecalling for DNA sequence analysis using hidden Markov models. IEEE/ACM Trans. Comput. Biol. Bioinform. 2007;4:430–440. - PubMed

LinkOut - more resources