Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Jan 19:7:29.
doi: 10.1186/1471-2105-7-29.

Recrafting the neighbor-joining method

Affiliations

Recrafting the neighbor-joining method

Thomas Mailund et al. BMC Bioinformatics. .

Abstract

Background: The neighbor-joining method by Saitou and Nei is a widely used method for constructing phylogenetic trees. The formulation of the method gives rise to a canonical Theta(n3) algorithm upon which all existing implementations are based.

Results: In this paper we present techniques for speeding up the canonical neighbor-joining method. Our algorithms construct the same phylogenetic trees as the canonical neighbor-joining method. The best-case running time of our algorithms are O(n2) but the worst-case remains O(n3). We empirically evaluate the performance of our algoritms on distance matrices obtained from the Pfam collection of alignments. The experiments indicate that the running time of our algorithms evolve as Theta(n2) on the examined instance collection. We also compare the running time with that of the QuickTree tool, a widely used efficient implementation of the canonical neighbor-joining method.

Conclusion: The experiments show that our algorithms also yield a significant speed-up, already for medium sized instances.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Performance of our methods using the simple approximation to Q. The plots show the running time of QuickTree, and QuickJoin with the depth-first search (DFS) method and with the priority queue (p.queue) method with and without sampling (see Methods), with the first approximation of Q described in the Methods section. The input for the runs is distance matrices for the Pfam alignments with 200 to 1000 sequences. The depth-first search without sampling performs very poorly and is removed on the plot on the right to better show the performance of the remaining methods.
Figure 2
Figure 2
Performance of our methods using the linear functions approximation to Q. The plots show the running time of QuickTree, and QuickJoin with the depth-first search method and with the priority queue method with and without sampling, with the linear functions approximation of Q described in the Methods section. The input for the runs is distance matrices for the Pfam alignments with 200 to 8000 sequences. The new methods perform significantly better than the basic neighbor-joining method, as implemented in QuickTree. To better compare the new methods the QuickTree plot is removed on the right.
Figure 3
Figure 3
A quad-tree with three levels of nodes. A quad-tree with three levels of nodes, and the corresponding subdivision of a square. The root covers the entire square, its children each of the four quadrants, and the leaves a further division of these.
Figure 4
Figure 4
The lower bound linear function. A linear function that is the best lower bound of two other linear functions on the interval r' - r'/m to r'. The dashed line is the linear function that is the greatest lower bound of the two linear functions shown as solid lines.

References

    1. Saitou N, Nei M. The Neighbor-Joining Method: A New Method for Reconstructing Phylogenetic Trees. Mol Biol Evol. 1987;4(4):406–425. - PubMed
    1. Studier JA, Keppler KJ. A Note on the Neighbor-Joining Method of Saitou and Nei. Mol Biol Evol. 1988;5(6):729–731. - PubMed
    1. St John K, Warnow T, Moret B, Vawter L. Performance Study of Phylogenetic Methods: (Unweighted) Quartet Methods and Neighbor-Joining. J Algorithms. 2003;48:173–193. doi: 10.1016/S0196-6774(03)00049-X. [(Special issue on best papers from SODA'01.)]. - DOI
    1. Fuellen G, Spitzer M, Cullen P, Lorkowski S. BLASTing Proteomes, Yielding Phylogenies. Silico Biology. 2003;3 [To appear.]. - PubMed
    1. Gascuel O. A note on Sattath and Tversky's, Saitou and Nei's, and Studier and Keppler's algorithms for inferring phylogenies from evolutionary distances. Mol Biol Evol. 1994;11(6):961–963. - PubMed

Publication types

LinkOut - more resources