The effect of branch length variation on the selection of models of molecular evolution
- PMID: 11443347
- DOI: 10.1007/s002390010173
The effect of branch length variation on the selection of models of molecular evolution
Abstract
Models of sequence evolution play an important role in molecular evolutionary studies. The use of inappropriate models of evolution may bias the results of the analysis and lead to erroneous conclusions. Several procedures for selecting the best-fit model of evolution for the data at hand have been proposed, like the likelihood ratio test (LRT) and the Akaike (AIC) and Bayesian (BIC) information criteria. The relative performance of these model-selecting algorithms has not yet been studied under a range of different model trees. In this study, the influence of branch length variation upon model selection is characterized. This is done by simulating sequence alignments under a known model of nucleotide substitution, and recording how often this true model is recovered by different model-fitting strategies. Results of this study agree with previous simulations and suggest that model selection is reasonably accurate. However, different model selection methods showed distinct levels of accuracy. Some LRT approaches showed better performance than the AIC or BIC information criteria. Within the LRTs, model selection is affected by the complexity of the initial model selected for the comparisons, and only slightly by the order in which different parameters are added to the model. A specific hierarchy of LRTs, which starts from a simple model of evolution, performed overall better than other possible LRT hierarchies, or than the AIC or BIC.
Similar articles
-
Does choice in model selection affect maximum likelihood analysis?Syst Biol. 2008 Feb;57(1):76-85. doi: 10.1080/10635150801898920. Syst Biol. 2008. PMID: 18275003
-
Using models of nucleotide evolution to build phylogenetic trees.Dev Comp Immunol. 2005;29(3):211-27. doi: 10.1016/j.dci.2004.07.007. Dev Comp Immunol. 2005. PMID: 15572070 Review.
-
Model selection for mixtures of mutagenetic trees.Stat Appl Genet Mol Biol. 2006;5:Article17. doi: 10.2202/1544-6115.1164. Epub 2006 Jun 23. Stat Appl Genet Mol Biol. 2006. PMID: 17049028
-
The devil in the details: interactions between the branch-length prior and likelihood model affect node support and branch lengths in the phylogeny of the Psoraceae.Syst Biol. 2011 Jul;60(4):541-61. doi: 10.1093/sysbio/syr022. Epub 2011 Mar 24. Syst Biol. 2011. PMID: 21436107
-
Coalescent methods for estimating phylogenetic trees.Mol Phylogenet Evol. 2009 Oct;53(1):320-8. doi: 10.1016/j.ympev.2009.05.033. Epub 2009 Jun 6. Mol Phylogenet Evol. 2009. PMID: 19501178 Review.
Cited by
-
Consequences of Substitution Model Selection on Protein Ancestral Sequence Reconstruction.Mol Biol Evol. 2022 Jul 2;39(7):msac144. doi: 10.1093/molbev/msac144. Mol Biol Evol. 2022. PMID: 35789388 Free PMC article.
-
Population genetics of microbial pathogens estimated from multilocus sequence typing (MLST) data.Infect Genet Evol. 2006 Mar;6(2):97-112. doi: 10.1016/j.meegid.2005.02.003. Epub 2005 Mar 24. Infect Genet Evol. 2006. PMID: 16503511 Free PMC article.
-
Robustness of ancestral sequence reconstruction to phylogenetic uncertainty.Mol Biol Evol. 2010 Sep;27(9):1988-99. doi: 10.1093/molbev/msq081. Epub 2010 Apr 5. Mol Biol Evol. 2010. PMID: 20368266 Free PMC article.
-
Protein evolution along phylogenetic histories under structurally constrained substitution models.Bioinformatics. 2013 Dec 1;29(23):3020-8. doi: 10.1093/bioinformatics/btt530. Epub 2013 Sep 12. Bioinformatics. 2013. PMID: 24037213 Free PMC article.
-
Model selection may not be a mandatory step for phylogeny reconstruction.Nat Commun. 2019 Feb 25;10(1):934. doi: 10.1038/s41467-019-08822-w. Nat Commun. 2019. PMID: 30804347 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources