Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Nov 4:1:429-38.
doi: 10.1093/gbe/evp044.

Lateral transfer of genes and gene fragments in prokaryotes

Affiliations

Lateral transfer of genes and gene fragments in prokaryotes

Cheong Xin Chan et al. Genome Biol Evol. .

Abstract

Lateral genetic transfer (LGT) involves the movement of genetic material from one lineage into another and its subsequent incorporation into the new host genome via genetic recombination. Studies in individual taxa have indicated lateral origins for stretches of DNA of greatly varying length, from a few nucleotides to chromosome size. Here we analyze 1,462 sets of single-copy, putatively orthologous genes from 144 fully sequenced prokaryote genomes, asking to what extent complete genes and fragments of genes have been transferred and recombined in LGT. Using a rigorous phylogenetic approach, we find evidence for LGT in at least 476 (32.6%) of these 1,462 gene sets: 286 (19.6%) clearly show one or more "observable recombination breakpoints" within the boundaries of the open reading frame, while a further 190 (13.0%) yield trees that are topologically incongruent with the reference tree but do not contain a recombination breakpoint within the open reading frame. We refer to these gene sets as observable recombination breakpoint positive (ORB(+)) and negative (ORB(-)) respectively. The latter are prima facie instances of lateral transfer of an entire gene or beyond. We observe little functional bias between ORB(+) and ORB(-) gene sets, but find that incorporation of entire genes is potentially more frequent in pathogens than in nonpathogens. As ORB(+) gene sets are about 50% more common than ORB(-) sets in our data, the transfer of gene fragments has been relatively frequent, and the frequency of LGT may have been systematically underestimated in phylogenetic studies.

Keywords: comparative genomics; genetic recombination; genome evolution; horizontal genetic transfer; lateral genetic transfer.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.—
FIG. 1.—
Definition of ORB+ and ORB gene sets. This simplified example shows six cases of LGT, each involving an orthologous region of six genomes (the white rectangles); adjacent genes A and B lie fully within this region in each genome, with gene boundaries as shown by the vertical dashed lines. Each case 1–6 corresponds to the presence of a single genetically recombined region (colored in red); thus, for the LGT event depicted in case 4, a recombination breakpoint is detected in gene set A (but not in gene set B), and the tree inferred for gene set B is topologically incongruent with the reference tree. We therefore label gene set A as ORB+ and gene set B as ORB as shown in the table on the right. ORB+ cases are shaded in light blue.
F<sc>IG</sc>. 2.—
FIG. 2.—
Size distribution of (A) ORB+ and (B) ORB gene sets. The solid red bars indicate overrepresented size classes; the solid blue bars indicate underrepresented classes; and the gray bars indicate classes neither over- nor underrepresented in comparison with the corresponding size-class frequency over all 1,462 gene sets, at P ≤ 0.05.
F<sc>IG</sc>. 3.—
FIG. 3.—
Representation of functional categories assigned to protein sequences corresponding to the (A) ORB+ and (B) ORB gene sets (solid yellow bars). The solid blue bars show the representation of these same functional categories in the full data set (16,639 gene sets of size ≥4, 119,695 proteins). Categories are numbered (differently for A and B) as shown in the boxes. Significance of over- or underrepresentation is represented by single (P ≤ 0.05) and double asterisks (P ≤ 0.01).
F<sc>IG</sc>. 4.—
FIG. 4.—
Taxonomic origins (National Center for Biotechnology Information level-4 taxa) of genes in the (A) ORB+ and (B) ORB gene sets. Overrepresentation relative to the 16,639 gene sets is shown in red; underrepresentation is shown in blue; gray indicates that there is neither over- nor underrepresentation at P ≤ 0.05.

References

    1. Barlow M. What antimicrobial resistance has taught us about horizontal gene transfer. Methods Mol Biol. 2009;532:397–411. - PubMed
    1. Beiko RG, Chan CX, Ragan MA. A word-oriented approach to alignment validation. Bioinformatics. 2005a;21:2230–2239. - PubMed
    1. Beiko RG, Hamilton N. Phylogenetic identification of lateral genetic transfer events. BMC Evol Biol. 2006;6:15. - PMC - PubMed
    1. Beiko RG, Harlow TJ, Ragan MA. Highways of gene sharing in prokaryotes. Proc Natl Acad Sci U S A. 2005b;102:14332–14337. - PMC - PubMed
    1. Bork P, Doolittle RF. Proposed acquisition of an animal protein domain by bacteria. Proc Natl Acad Sci U S A. 1992;89:8990–8994. - PMC - PubMed