Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Apr 1;35(4):899-913.
doi: 10.1093/molbev/msy001.

Bipartite Network Analysis of Gene Sharings in the Microbial World

Affiliations

Bipartite Network Analysis of Gene Sharings in the Microbial World

Eduardo Corel et al. Mol Biol Evol. .

Erratum in

Abstract

Extensive microbial gene flows affect how we understand virology, microbiology, medical sciences, genetic modification, and evolutionary biology. Phylogenies only provide a narrow view of these gene flows: plasmids and viruses, lacking core genes, cannot be attached to cellular life on phylogenetic trees. Yet viruses and plasmids have a major impact on cellular evolution, affecting both the gene content and the dynamics of microbial communities. Using bipartite graphs that connect up to 149,000 clusters of homologous genes with 8,217 related and unrelated genomes, we can in particular show patterns of gene sharing that do not map neatly with the organismal phylogeny. Homologous genes are recycled by lateral gene transfer, and multiple copies of homologous genes are carried by otherwise completely unrelated (and possibly nested) genomes, that is, viruses, plasmids and prokaryotes. When a homologous gene is present on at least one plasmid or virus and at least one chromosome, a process of "gene externalization," affected by a postprocessed selected functional bias, takes place, especially in Bacteria. Bipartite graphs give us a view of vertical and horizontal gene flow beyond classic taxonomy on a single very large, analytically tractable, graph that goes beyond the cellular Web of Life.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
Construction of the bipartite graphs and identification of the twins and articulation points. (A) Construction of the genome-CHG bipartite graphs. Top nodes represent genomes of cells and mobile genetic elements. Bottom nodes represent CHG: we display the corresponding connected component of the sequence similarity network (see Materials and Methods), edge color (from yellow to red) indicates increasing % ID. (B) Bottom twins and articulation points: bottom nodes forming a twin class and their incident edges are drawn in the same shade of gray. Nodes 7 and 8 have the same neighbors (nodes 1 and 2) thus form twin class 1. Twins 2 and 4 are trivial since they contain only one node. Node 9 is an articulation point since its removal disconnects the graph.
<sc>Fig</sc>. 2
Fig. 2
Support composition of twins and CHGs. The taxonomy of the carriers of the genes in a CHG is called its support composition. For every similarity threshold (on the x-axis), we report how many CHGs (on the right) or twins of CHGs (on the left) are exclusively found in Archaea, in Bacteria, in both kinds of prokaryotes (“mixed prokaryotes”), in viruses only, in plasmids only, in both kinds of MGE (“mixed MGE”), or in both cellular and mobile elements (“externalized”).
<sc>Fig</sc>. 3
Fig. 3
ESCHG and core. The “shared pangenome” of a lineage is composed of all CHGs that are shared by at least two genomes and contain at least one member of the lineage. (A) For each taxonomic group, we report the percentage of CHGs forming the shared pangenome that are core exclusive, exclusive but noncore and not exclusive. (B) For each taxonomic group, we report the percentage of the shared pangenome that is comprised of core exclusive, and core but not exclusive CHGs. The total height of the bar represents the core itself.
<sc>Fig</sc>. 4.
Fig. 4.
Percentage of gene externalization at ≥ 30% ID for the 382 prokaryotes in our data set. The proportion of externalized genes from Bacteria (green triangles) is significantly higher (Student’s t-test, p value < 10−16) than for Archaea (yellow dots), to the notable exception of Haloarchaea (red dots). It is moreover largely uncorrelated with genome size (regression lines with shaded confidence interval at 95%).
<sc>Fig</sc>. 5.
Fig. 5.
Distribution of functional categories among externalized genes. Color bars represent the percentage of a given COG category among externalized genes above a given identity threshold (according to the color code on the right side of the figure). On the upper bar, COG categories are grouped by large functional groups (including “Poorly characterized,” which includes COG categories R and S). The “No”/0 class (on the left) refers to the genes for which no COG category was attributed). Note the very conspicuous peak for the “L” category at thresholds ≥ 90% and 95% ID.

References

    1. Ageevets V, Julia S, Irina L, Maya M, Elena I, Elena K, Vladislav B.. 2017. Genetic environment of the blaKPC-2 gene in a Klebsiella pneumoniae isolate that may have been imported to Russia from Southeast Asia. Antimicrob Agents Chemother. 612:e01856–16. - PMC - PubMed
    1. Ahmad N, Chong TM, Hashim R, Shukor S, Yin W-F, Chan K-G.. 2015. Draft genome of multidrug-resistant Klebsiella pneumoniae 223/14 carrying KPC-6, isolated from a general hospital in Malaysia. J Genomics 3:97–98. - PMC - PubMed
    1. Ahn Y-Y, Ahnert SE, Bagrow JP, Barabási A-L.. 2011. Flavor network and the principles of food pairing. Sci Rep. 11:196.. - PMC - PubMed
    1. Aleksandrzak-Piekarczyk T, Koryszewska-Bagińska A, Grynberg M, Nowak A, Cukrowska B, Kozakova H, Bardowski J.. 2016. Genomic and functional characterization of the unusual pLOCK 0919 plasmid harboring the spaCBA Pili cluster in Lactobacillus casei LOCK 0919. Genome Biol Evol. 81:202–217. - PMC - PubMed
    1. Alzahrani T, Horadam KJ.. 2016. Complex systems and networks In: Lü J, Yu X, Chen G, Yu W, editors. Understanding complex systems. Vol. 73 Berlin, Heidelberg: Springer Berlin Heidelberg.

Publication types

LinkOut - more resources