Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Sep 15;28(18):i382-i388.
doi: 10.1093/bioinformatics/bts374.

Evolution of gene neighborhoods within reconciled phylogenies

Affiliations

Evolution of gene neighborhoods within reconciled phylogenies

Sèverine Bérard et al. Bioinformatics. .

Abstract

Motivation: Most models of genome evolution integrating gene duplications, losses and chromosomal rearrangements are computationally intract able, even when comparing only two genomes. This prevents large-scale studies that consider different types of genome structural variations.

Results: We define an 'adjacency phylogenetic tree' that describes the evolution of an adjacency, a neighborhood relation between two genes, by speciation, duplication or loss of one or both genes, and rearrangement. We describe an algorithm that, given a species tree and a set of gene trees where the leaves are connected by adjacencies, computes an adjacency forest that minimizes the number of gains and breakages of adjacencies (caused by rearrangements) and runs in polynomial time. We use this algorithm to reconstruct contiguous regions of mammalian and plant ancestral genomes in a few minutes for a dozen species and several thousand genes. We show that this method yields reduced conflict between ancestral adjacencies. We detect duplications involving several genes and compare the different modes of evolution between phyla and among lineages.

Availability: C++ implementation using BIO++ package, available upon request to Sèverine Bérard.

Contact: Severine.Berard@cirad.fr or Eric.Tannier@inria.fr

Supplementary information: Supplementary material is available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Examples of a species tree (left), two gene trees (middle) and an adjacency tree (right). Blue dots are speciation nodes. Leaves are extant (species, genes, adjacencies), except the one labeled by a red cross (gene loss) or a red flash (breakage). Green squares are (gene or adjacency) duplication nodes. Gene labels refer to the species they belong to. Every node of the adjacency tree is labeled by a couple of nodes from gene trees
Fig. 2.
Fig. 2.
Example of the application of the algorithm on two genes trees, G1 and G2, a species tree S and an adjacency list shown on the line Input. The costs are C(Gain)= C(Break)= 1. All the costs cb(Ei,Ej) are computed for b∈0,1, EA,B,C, i,j ∈[1..8], with Ei in G1 and Ej in G2. As a result c0(C5,C8) = 2 while c1(C5,C8) = 1. Therefore, the adjacency forest on the line Output contains C5C8. The left tree has cost 0 while the right one costs C(Gain)= 1 for the gain of the adjacency B1B3
Fig. 3.
Fig. 3.
Proportion of genes having k neighbors, function of k. Red plain line is obtained with DeCo and PhylDog trees. Green dashed line is obtained with PhylDog trees and the pairwise alternative. Blue dotted line is obtained with TreeBeST trees and DeCo
Fig. 4.
Fig. 4.
Angiosperm and mammalian phylogenies, where branch lengths are proportional to the number of adjacency duplications normalized by the number of genes. The scale is indicated at the bottom left of the two figures

References

    1. Alekseyev M.A., Pevzner P.A. Breakpoint graphs and ancestral genome reconstructions. Genome Res. 2009;19:943–957. - PMC - PubMed
    1. Bertrand D., et al. Proceedings of WABI‘10, Algorithms in Bioinformatics. Berlin Heidelberg: Lecture Notes in Bioinformatics, Springer; 2010. Reconstruction of ancestral genome subject to whole genome duplication, speciation, rearrangement and loss; pp. 78–89.
    1. Boussau B., et al. 2012. Genome-scale coestimation of species and gene trees, in revision. - PMC - PubMed
    1. Chauve C., Tannier E. A methodological framework for the reconstruction of contiguous regions of ancestral genomes and its application to mammalian genomes. PLoS Comput. Biol. 2008;4:e1000234. - PMC - PubMed
    1. Chauve C., et al. Yeast ancestral genome reconstructions: the possibilities of computational methods II. J. Comput. Biol. 2010;17:1097–1112. - PubMed

Publication types