Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative
- PMID: 16785212
- DOI: 10.1080/10635150600755453
Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative
Abstract
We revisit statistical tests for branches of evolutionary trees reconstructed upon molecular data. A new, fast, approximate likelihood-ratio test (aLRT) for branches is presented here as a competitive alternative to nonparametric bootstrap and Bayesian estimation of branch support. The aLRT is based on the idea of the conventional LRT, with the null hypothesis corresponding to the assumption that the inferred branch has length 0. We show that the LRT statistic is asymptotically distributed as a maximum of three random variables drawn from the chi(0)2 + chi(1)2 distribution. The new aLRT of interior branch uses this distribution for significance testing, but the test statistic is approximated in a slightly conservative but practical way as 2(l1- l2), i.e., double the difference between the maximum log-likelihood values corresponding to the best tree and the second best topological arrangement around the branch of interest. Such a test is fast because the log-likelihood value l2 is computed by optimizing only over the branch of interest and the four adjacent branches, whereas other parameters are fixed at their optimal values corresponding to the best ML tree. The performance of the new test was studied on simulated 4-, 12-, and 100-taxon data sets with sequences of different lengths. The aLRT is shown to be accurate, powerful, and robust to certain violations of model assumptions. The aLRT is implemented within the algorithm used by the recent fast maximum likelihood tree estimation program PHYML (Guindon and Gascuel, 2003).
Similar articles
-
The devil in the details: interactions between the branch-length prior and likelihood model affect node support and branch lengths in the phylogeny of the Psoraceae.Syst Biol. 2011 Jul;60(4):541-61. doi: 10.1093/sysbio/syr022. Epub 2011 Mar 24. Syst Biol. 2011. PMID: 21436107
-
Calculating the evolutionary rates of different genes: a fast, accurate estimator with applications to maximum likelihood phylogenetic analysis.Syst Biol. 2005 Dec;54(6):900-15. doi: 10.1080/10635150500354829. Syst Biol. 2005. PMID: 16282169
-
Survey of branch support methods demonstrates accuracy, power, and robustness of fast likelihood-based approximation schemes.Syst Biol. 2011 Oct;60(5):685-99. doi: 10.1093/sysbio/syr041. Epub 2011 May 3. Syst Biol. 2011. PMID: 21540409 Free PMC article.
-
When being "most likely" is not enough: examining the performance of three uses of the parametric bootstrap in phylogenetics.J Mol Evol. 2003 Feb;56(2):198-222. doi: 10.1007/s00239-002-2394-1. J Mol Evol. 2003. PMID: 12574867
-
Statistical measures of uncertainty for branches in phylogenetic trees inferred from molecular sequences by using model-based methods.J Appl Genet. 2008;49(1):49-67. doi: 10.1007/BF03195249. J Appl Genet. 2008. PMID: 18263970 Review.
Cited by
-
Tyrosyl-DNA phosphodiesterase 2 (Tdp2) repairs DNA-protein crosslinks and protects against double strand breaks in vivo.Front Cell Dev Biol. 2024 Aug 20;12:1394531. doi: 10.3389/fcell.2024.1394531. eCollection 2024. Front Cell Dev Biol. 2024. PMID: 39228401 Free PMC article.
-
Alfalfa Leaf Curl Virus: an Aphid-Transmitted Geminivirus.J Virol. 2015 Sep;89(18):9683-8. doi: 10.1128/JVI.00453-15. Epub 2015 Jun 24. J Virol. 2015. PMID: 26109720 Free PMC article. Review.
-
Comparative genomics of rhizobia nodulating soybean suggests extensive recruitment of lineage-specific genes in adaptations.Proc Natl Acad Sci U S A. 2012 May 29;109(22):8629-34. doi: 10.1073/pnas.1120436109. Epub 2012 May 14. Proc Natl Acad Sci U S A. 2012. PMID: 22586130 Free PMC article.
-
Identification and Molecular Characterization of Two Acetylcholinesterases from the Salmon Louse, Lepeophtheirus salmonis.PLoS One. 2015 May 4;10(5):e0125362. doi: 10.1371/journal.pone.0125362. eCollection 2015. PLoS One. 2015. PMID: 25938836 Free PMC article.
-
The wtf meiotic driver gene family has unexpectedly persisted for over 100 million years.Elife. 2022 Oct 13;11:e81149. doi: 10.7554/eLife.81149. Elife. 2022. PMID: 36227631 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources