Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Mar 1;33(5):640-649.
doi: 10.1093/bioinformatics/btw686.

Xenolog classification

Affiliations

Xenolog classification

Charlotte A Darby et al. Bioinformatics. .

Abstract

Motivation: Orthology analysis is a fundamental tool in comparative genomics. Sophisticated methods have been developed to distinguish between orthologs and paralogs and to classify paralogs into subtypes depending on the duplication mechanism and timing, relative to speciation. However, no comparable framework exists for xenologs: gene pairs whose history, since their divergence, includes a horizontal transfer. Further, the diversity of gene pairs that meet this broad definition calls for classification of xenologs with similar properties into subtypes.

Results: We present a xenolog classification that uses phylogenetic reconciliation to assign each pair of genes to a class based on the event responsible for their divergence and the historical association between genes and species. Our classes distinguish between genes related through transfer alone and genes related through duplication and transfer. Further, they separate closely-related genes in distantly-related species from distantly-related genes in closely-related species. We present formal rules that assign gene pairs to specific xenolog classes, given a reconciled gene tree with an arbitrary number of duplications and transfers. These xenology classification rules have been implemented in software and tested on a collection of ∼13 000 prokaryotic gene families. In addition, we present a case study demonstrating the connection between xenolog classification and gene function prediction.

Availability and implementation: The xenolog classification rules have been implemented in N otung 2.9, a freely available phylogenetic reconciliation software package. http://www.cs.cmu.edu/~durand/Notung . Gene trees are available at http://dx.doi.org/10.7488/ds/1503 .

Contact: durand@cmu.edu.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Gene tree (thin black lines), with a duplication and a transfer from species Y to species X, embedded in a species tree (shown in gray). The cenancestor of the transfer is designated as. Species sets D, R and O are labeled below the leaves
Fig. 2.
Fig. 2.
Xenolog class hierarchy: (top) Gene tree with one transfer, shown in the context of the species tree. (bottom) The reconciled gene tree. Each leaf is annotated with its xenolog class. Nodes g1,g2,g3 and g4 are the common ancestors of g^ and, respectively, the Primary, Sibling Donor, Sibling Recipient and Outgroup xenologs in the tree, as indicated by the labels on internal nodes. The labels on the path from g^ to the root satisfy the hierarchy, PX<XSDX<XSRX<XOX, consistent with Theorem 2.1
Fig. 3.
Fig. 3.
Paraxenolog classification: The gene tree in Figure 1, which contains a duplication followed by a transfer. Each leaf is annotated with its xenolog class. Each internal node on the path from g^ to the root is labeled with the xenolog class of all genes in its right subtree (i.e. the subtree that does not contain a transfer.) The progression of labels satisfy the xenolog hierarchy, consistent with Theorem 2.1.
Fig. 4.
Fig. 4.
Xenolog classification with multiple transfers: (top) Gene tree with two comparable transfers (solid arrows) and the associated super-transfer (dashed arrow), shown in the context of the species tree. (bottom) The reconciled gene tree. Each leaf is annotated with its xenolog class. Genes gX and gU are classified with respect to t2 and obey the hierarchy: MRCA(g^,gX)<GMRCA(g^,gU) and X(g^,gX)=PX<XX(g^,gU)=SDX. All other genes are classified with respect to the super-transfer, t*. Their xenolog classes are consistent with the hierarchy (Theorem 2.1): MRCA(g^,gY)<GMRCA(g^,gZ)<GMRCA(g^,gW)<GMRCA(g^,gV) and PX<XSDX<XSRX<XOX
Fig. 5.
Fig. 5.
Xenolog classification with incomparable transfers: (top) Gene tree with two incomparable transfers shown in the context of the species tree. Species sets associated with transfers t1 and t2 are shown below the leaves. (bottom) The reconciled gene tree. Each leaf is annotated with its xenolog class in reference to t2 (top row) and t1 (bottom row). Genes g^1 and g^2 are separated by both transfers. Since g^2Δ(gd1),X(g^1,g2)=PX. In contrast, X(g^2,g1)=IX since g^1Δ(gd2). Xenolog classes for other genes are consistent with their relatedness in the gene tree (Theorem 2.1): MRCA(g^2,gY)<GMRCA(g^2,gZ)<GMRCA(g^2,gW)=MRCA(g^2,gX)<GMRCA(g^2,gV) and PX<XSDX<XSRX<XOX
Fig. 6.
Fig. 6.
(left) Proportions of orthologs, paralogs and xenologs (all classes) in the 13 194-tree bacterial dataset. (right) Proportions of xenolog classes
Fig. 7.
Fig. 7.
Summary of the BIO4 gene family event history. Dashed lines represent lineages with a putative dual-function DTBS + DAPAS enzyme; solid lines represent lineages with a putative DTBS-only function. With respect to the gene g^ in S.cerevisiae, all other fungal genes are SRX, α-proteobacterial genes are PX, and genes in Firmicutes are SDX

References

    1. Akerborg O. et al. (2009) Simultaneous Bayesian gene tree reconstruction and reconciliation analysis. Proc. Natl. Acad. Sci. U. S. A., 106, 5714–5719. - PMC - PubMed
    1. Ali R.H. et al. (2016) GenFamClust: an accurate, synteny-aware and reliable homology inference algorithm. BMC Evol. Biol., 16, 120.. - PMC - PubMed
    1. Azad R., Lawrence J. (2012) Detecting laterally transferred genes. Methods Mol. Biol., 855, 281–308. - PubMed
    1. Boto L. (2010) Horizontal gene transfer in evolution: facts and challenges. Proc. Biol. Sci., 277, 819–827. - PMC - PubMed
    1. Boto L. (2016) Accepting foreign genes. J. Mol. Evol., 82, 173–175. - PubMed

Publication types