A Hidden Markov Model approach to variation among sites in rate of evolution
- PMID: 8583911
- DOI: 10.1093/oxfordjournals.molbev.a025575
A Hidden Markov Model approach to variation among sites in rate of evolution
Abstract
The method of Hidden Markov Models is used to allow for unequal and unknown evolutionary rates at different sites in molecular sequences. Rates of evolution at different sites are assumed to be drawn from a set of possible rates, with a finite number of possibilities. The overall likelihood of phylogeny is calculated as a sum of terms, each term being the probability of the data given a particular assignment of rates to sites, times the prior probability of that particular combination of rates. The probabilities of different rate combinations are specified by a stationary Markov chain that assigns rate categories to sites. While there will be a very large number of possible ways of assigning rates to sites, a simple recursive algorithm allows the contributions to the likelihood from all possible combinations of rates to be summed, in a time proportional to the number of different rates at a single site. Thus with three rates, the effort involved is no greater than three times that for a single rate. This "Hidden Markov Model" method allows for rates to differ between sites and for correlations between the rates of neighboring sites. By summing over all possibilities it does not require us to know the rates at individual sites. However, it does not allow for correlation of rates at nonadjacent sites, nor does it allow for a continuous distribution of rates over sites. It is shown how to use the Newton-Raphson method to estimate branch lengths of a phylogeny and to infer from a phylogeny what assignment of rates to sites has the largest posterior probability. An example is given using beta-hemoglobin DNA sequences in eight mammal species; the regions of high and low evolutionary rates are inferred and also the average length of patches of similar rates.
Similar articles
-
Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo.Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3955-64. doi: 10.1098/rstb.2008.0178. Philos Trans R Soc Lond B Biol Sci. 2008. PMID: 18852097 Free PMC article.
-
Taking variation of evolutionary rates between sites into account in inferring phylogenies.J Mol Evol. 2001 Oct-Nov;53(4-5):447-55. doi: 10.1007/s002390010234. J Mol Evol. 2001. PMID: 11675604
-
Inferring complex DNA substitution processes on phylogenies using uniformization and data augmentation.Syst Biol. 2006 Apr;55(2):259-69. doi: 10.1080/10635150500541599. Syst Biol. 2006. PMID: 16551582
-
Computational advances in maximum likelihood methods for molecular phylogeny.Genome Res. 1998 Mar;8(3):222-33. doi: 10.1101/gr.8.3.222. Genome Res. 1998. PMID: 9521926 Review.
-
Phylogenetic model evaluation.Methods Mol Biol. 2008;452:331-64. doi: 10.1007/978-1-60327-159-2_16. Methods Mol Biol. 2008. PMID: 18566772 Review.
Cited by
-
Non-Markovian effects on protein sequence evolution due to site dependent substitution rates.BMC Bioinformatics. 2016 Jun 24;17:258. doi: 10.1186/s12859-016-1135-1. BMC Bioinformatics. 2016. PMID: 27342318 Free PMC article.
-
Stepwise iterative maximum likelihood clustering approach.BMC Bioinformatics. 2016 Aug 24;17(1):319. doi: 10.1186/s12859-016-1184-5. BMC Bioinformatics. 2016. PMID: 27553625 Free PMC article.
-
Population genomics of the facultatively mutualistic bacteria Sinorhizobium meliloti and S. medicae.PLoS Genet. 2012;8(8):e1002868. doi: 10.1371/journal.pgen.1002868. Epub 2012 Aug 2. PLoS Genet. 2012. PMID: 22876202 Free PMC article.
-
A genome-wide association study of venous thromboembolism identifies risk variants in chromosomes 1q24.2 and 9q.J Thromb Haemost. 2012 Aug;10(8):1521-31. doi: 10.1111/j.1538-7836.2012.04810.x. J Thromb Haemost. 2012. PMID: 22672568 Free PMC article.
-
Evolutionary forces shaping genomic islands of population differentiation in humans.BMC Genomics. 2012 Mar 22;13:107. doi: 10.1186/1471-2164-13-107. BMC Genomics. 2012. PMID: 22439654 Free PMC article.
Publication types
MeSH terms
Associated data
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources