A rapid bootstrap algorithm for the RAxML Web servers
- PMID: 18853362
- DOI: 10.1080/10635150802429642
A rapid bootstrap algorithm for the RAxML Web servers
Abstract
Despite recent advances achieved by application of high-performance computing methods and novel algorithmic techniques to maximum likelihood (ML)-based inference programs, the major computational bottleneck still consists in the computation of bootstrap support values. Conducting a probably insufficient number of 100 bootstrap (BS) analyses with current ML programs on large datasets-either with respect to the number of taxa or base pairs-can easily require a month of run time. Therefore, we have developed, implemented, and thoroughly tested rapid bootstrap heuristics in RAxML (Randomized Axelerated Maximum Likelihood) that are more than an order of magnitude faster than current algorithms. These new heuristics can contribute to resolving the computational bottleneck and improve current methodology in phylogenetic analyses. Computational experiments to assess the performance and relative accuracy of these heuristics were conducted on 22 diverse DNA and AA (amino acid), single gene as well as multigene, real-world alignments containing 125 up to 7764 sequences. The standard BS (SBS) and rapid BS (RBS) values drawn on the best-scoring ML tree are highly correlated and show almost identical average support values. The weighted RF (Robinson-Foulds) distance between SBS- and RBS-based consensus trees was smaller than 6% in all cases (average 4%). More importantly, RBS inferences are between 8 and 20 times faster (average 14.73) than SBS analyses with RAxML and between 18 and 495 times faster than BS analyses with competing programs, such as PHYML or GARLI. Moreover, this performance improvement increases with alignment size. Finally, we have set up two freely accessible Web servers for this significantly improved version of RAxML that provide access to the 200-CPU cluster of the Vital-IT unit at the Swiss Institute of Bioinformatics and the 128-CPU cluster of the CIPRES project at the San Diego Supercomputer Center. These Web servers offer the possibility to conduct large-scale phylogenetic inferences to a large part of the community that does not have access to, or the expertise to use, high-performance computing resources.
Similar articles
-
RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.Bioinformatics. 2006 Nov 1;22(21):2688-90. doi: 10.1093/bioinformatics/btl446. Epub 2006 Aug 23. Bioinformatics. 2006. PMID: 16928733
-
RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees.Bioinformatics. 2005 Feb 15;21(4):456-63. doi: 10.1093/bioinformatics/bti191. Epub 2004 Dec 17. Bioinformatics. 2005. PMID: 15608047
-
IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.Mol Biol Evol. 2015 Jan;32(1):268-74. doi: 10.1093/molbev/msu300. Epub 2014 Nov 3. Mol Biol Evol. 2015. PMID: 25371430 Free PMC article.
-
RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.Bioinformatics. 2019 Nov 1;35(21):4453-4455. doi: 10.1093/bioinformatics/btz305. Bioinformatics. 2019. PMID: 31070718 Free PMC article.
-
A RESTful API for Access to Phylogenetic Tools via the CIPRES Science Gateway.Evol Bioinform Online. 2015 Mar 16;11:43-8. doi: 10.4137/EBO.S21501. eCollection 2015. Evol Bioinform Online. 2015. PMID: 25861210 Free PMC article. Review.
Cited by
-
Comparative genomic and phylogenetic approaches to characterize the role of genetic recombination in mycobacterial evolution.PLoS One. 2012;7(11):e50070. doi: 10.1371/journal.pone.0050070. Epub 2012 Nov 26. PLoS One. 2012. PMID: 23189179 Free PMC article.
-
Comparative description of ten transcriptomes of newly sequenced invertebrates and efficiency estimation of genomic sampling in non-model taxa.Front Zool. 2012 Nov 29;9(1):33. doi: 10.1186/1742-9994-9-33. Front Zool. 2012. PMID: 23190771 Free PMC article.
-
Geodermatophilus arenarius sp. nov., a xerophilic actinomycete isolated from Saharan desert sand in Chad.Extremophiles. 2012 Nov;16(6):903-9. doi: 10.1007/s00792-012-0486-4. Epub 2012 Oct 19. Extremophiles. 2012. PMID: 23081798
-
Phylogeny and origins of hantaviruses harbored by bats, insectivores, and rodents.PLoS Pathog. 2013 Feb;9(2):e1003159. doi: 10.1371/journal.ppat.1003159. Epub 2013 Feb 7. PLoS Pathog. 2013. PMID: 23408889 Free PMC article.
-
Diversification of land plants: insights from a family-level phylogenetic analysis.BMC Evol Biol. 2011 Nov 21;11:341. doi: 10.1186/1471-2148-11-341. BMC Evol Biol. 2011. PMID: 22103931 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources