Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Dec 4:9:516.
doi: 10.1186/1471-2105-9-516.

A fast algorithm for the multiple genome rearrangement problem with weighted reversals and transpositions

Affiliations

A fast algorithm for the multiple genome rearrangement problem with weighted reversals and transpositions

Martin Bader et al. BMC Bioinformatics. .

Abstract

Background: Due to recent progress in genome sequencing, more and more data for phylogenetic reconstruction based on rearrangement distances between genomes become available. However, this phylogenetic reconstruction is a very challenging task. For the most simple distance measures (the breakpoint distance and the reversal distance), the problem is NP-hard even if one considers only three genomes.

Results: In this paper, we present a new heuristic algorithm that directly constructs a phylogenetic tree w.r.t. the weighted reversal and transposition distance. Experimental results on previously published datasets show that constructing phylogenetic trees in this way results in better trees than constructing the trees w.r.t. the reversal distance, and recalculating the weight of the trees with the weighted reversal and transposition distance. An implementation of the algorithm can be obtained from the authors.

Conclusion: The possibility of creating phylogenetic trees directly w.r.t. the weighted reversal and transposition distance results in biologically more realistic scenarios. Our algorithm can solve today's most challenging biological datasets in a reasonable amount of time.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Adding a node. A new node π can either be added to (a) a node v in the tree or (b) to a point s in a cloud of an edge (v, v'). In the latter case, we have to split the edge (v, v') into two edges (v, s) and (s, v'). Clouds are removed/generated accordingly.
Figure 2
Figure 2
Improving the tree topology. The subtrees T1 and T2 are reconnected by an edge from s to v2, where s Cloud(v1, v1) and v2 V2. The edge (v1, v1) must be split into two edges (v1, s) and (s, v1) before the new edge (s, v2) is added. Clouds are removed/generated accordingly.
Figure 3
Figure 3
Variance of the Campanulaceae dataset. The results of our algorithm for the Campanulaceae dataset over all runs as box plots. For each improvement strategy, also the average running time per run is provided. (a) contains the results for the weight ratio 1:2, (b) for the weight ratio 1:1.5, and (c) for the weight ratio 1:1.
Figure 4
Figure 4
Variance of the metazoan dataset. The results of our algorithm for the metazoan dataset over all runs as box plots. For each improvement strategy, also the average running time per run is provided. (a) contains the results for the weight ratio 1:2, (b) for the weight ratio 1:1.5, and (c) for the weight ratio 1:1.
Figure 5
Figure 5
Variance of the protostomes dataset. The results of our algorithm for the protostomes dataset over all runs as box plots. For each improvement strategy, also the average running time per run is provided. (a) contains the results for the weight ratio 1:2, (b) for the weight ratio 1:1.5, and (c) for the weight ratio 1:1.

References

    1. Pevzner P, Tesler G. Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes. Genome Research. 2003;13:37–45. - PMC - PubMed
    1. Bader D, Moret B, Yan M. A Linear-Time Algorithm for Computing Inversion Distance between Signed Permutations with an Experimental Study. Journal of Computational Biology. 2001;8:483–491. - PubMed
    1. Bergeron A, Mixtacki J, Stoye J. Reversal Distance without Hurdles and Fortresses. Proc 15th Annual Symposium on Combinatorial Pattern Matching, of Lecture Notes in Computer Science. 2004;3109:388–399.
    1. Bader M, Ohlebusch E. Sorting by Weighted Reversals, Transpositions, and Inverted Transpositions. Journal of Computational Biology. 2007;14:615–636. - PubMed
    1. Bader M. Diploma thesis. University of Ulm; 2005. Sorting by Weighted Transpositions and Reversals.http://www.uni-ulm.de/fileadmin/website_uni_ulm/iui.inst.190/Mitarbeiter...

Publication types

LinkOut - more resources