Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Sep 9;7(10):a018143.
doi: 10.1101/cshperspect.a018143.

Microbial Speciation

Affiliations
Review

Microbial Speciation

B Jesse Shapiro et al. Cold Spring Harb Perspect Biol. .

Abstract

What are species? How do they arise? These questions are not easy to answer and have been particularly controversial in microbiology. Yet, for those microbiologists studying environmental questions or dealing with clinical issues, the ability to name and recognize species, widely considered the fundamental units of ecology, can be practically useful. On a more fundamental level, the speciation problem, the focus here, is more mechanistic and conceptual. What is the origin of microbial species, and what evolutionary and ecological mechanisms keep them separate once they begin to diverge? To what extent are these mechanisms universal across diverse types of microbes, and more broadly across the entire the tree of life? Here, we propose that microbial speciation must be viewed in light of gene flow, which defines units of genetic similarity, and of natural selection, which defines units of phenotype and ecological function. We discuss to what extent ecological and genetic units overlap to form cohesive populations in the wild, based on recent evolutionary modeling and population genomics studies. These studies suggest a continuous "speciation spectrum," which microbial populations traverse in different ways depending on their balance of gene flow and natural selection.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A model for bacterial speciation under different recombination/selection balances. Five stages in the process of speciation are illustrated under the r/s ≫1 regime (top). The illustration shows an ancestral population of bacteria (purple), which diverges into two incipient species (red or green) adapted to different habitats following the acquisition of niche-specifying genes (red or green arrows). Thin gray or black arrows show recombination events within and between incipient species, respectively. In the r/s ≪1 regime (bottom, not illustrated), stages 2–4 differ, but the start and end points are the same. The r/s ≪1 regime corresponds closely to the stable ecotype model (s.e.m.) (Box 3). Selection (s) is defined here as the average fitness difference experienced by a niche-specifying (adaptive) allele in different niches and recombination (r) is the recombination rate per locus per generation. The stages represent rough, potentially overlapping, and potentially terminal steps (e.g., stage 2 need not lead to stage 3). HGT, Horizontal gene transfer. (From Shapiro et al. 2012; adapted, with permission, from the author.)
Figure 2.
Figure 2.
Islands and continents of speciation. Based on data from aligned core genomes of Vibrio cyclitrophicus (Shapiro et al. 2012) (upper panel) and Sulfolobus islandicus (Cadillo-Quiroz et al. 2012) (lower panel), the distribution of divergent single nucleotide polymorphisms (SNPs) (divSNPs) fixed between populations is plotted along the genome as black bars. Vibrio and Sulfolobus contain, respectively, 725 divSNPs distributed more than ∼1% of the genome, and 4232 divSNPs distributed more than ∼36% of the genome. Scale is approximate for divSNPs (y-axis) and genome position (x-axis). The y-axis is not to scale for SNPs rejecting differentiation between populations (regions of the genome shown in gray).
Figure 3.
Figure 3.
Ecological differentiation via a gene-specific sweep. This illustration follows the basic steps of the symsim model (Friedman et al. 2013). At the first time point (t1), a niche-specifying gene (red triangle) arrives into a homogeneously recombining population occupying a single niche. Between t1 and t2 (as between all time points), recombination (r) occurs at random from a sympatric pool (one to two events per genome, illustrated as arrows), then genomes reproduce clonally, and are culled to a carrying capacity of four genomes per niche. Because the niche-specifying gene confers a selective advantage(s) in the new niche, genomes that contain it grow exponentially until the carrying capacity is reached at t3. Other genomes are culled at random, because the rest of the gene pool is neutral to fitness. By t4, the gene-specific sweep is complete. The niche-specifying gene is in perfect association with the new niche, but all other genes are randomly distributed across niches. At this point, barriers to recombination between niches (dashed line) may or may not emerge. (Note that recombination events at t4 are not shown for purposes of clarity, but this does not mean they do not occur.)
Figure 4.
Figure 4.
Barriers to recombination emerge via a competition-dispersal tradeoff. (A) The particle-associated Vibrio population (L) attaches to nutrient-rich particles and forms biofilms, whereas the free-living Vibrio population (S) hovers near the surface and scavenges loose nutrients. (B) When a new particle becomes available, only the S population is able to rapidly disperse to the new nutrient source. (From Yawata et al. 2014; adapted, with permission, from the author.)
Figure 5.
Figure 5.
Islands of speciation are distinct from genomic islands. (A) Different colors denote different allelic variants at a chromosomal locus. (B) Divergence between species (left y-axis) is often measured as the number of fixed nucleotide substitutions between species, or a measure, such as the fixation index (FST); metagenomic coverage (right y-axis) is simply the number of metagenomic sequencing reads that align at a given position of the reference genome.
Figure 6.
Figure 6.
Evolutionary convergence as the basis for genome-wide association studies (GWAS). In this simplified example, the genome-wide core phylogeny has been inferred for a sample of mostly clonal bacterial isolates (A–L). A convergent (homoplasic) single nucleotide polymorphism (SNP) is identified in four genomes. Importantly, this corresponds to only two independent mutation events (T → A, indicated in the phylogenetic tree by the red “X” with an “A” above it), which associate perfectly with two independent transitions from the antibiotic sensitive to resistant (R) state. The significance of the association can be assessed by calculating a P-value by resampling from the genome-wide distribution of mutations and phenotypic states (resistant or sensitive) on the phylogeny. By failing to account for population structure (e.g., the phylogenetic information), four events would be counted, thereby overestimating the significance of the association. GWAS can also be performed considering entire genes, instead of individual nucleotide sites, as targets of convergent mutations (for examples, see Sheppard et al. 2013 and Farhat et al. 2013, 2014).

References

    1. Barrick JE, Yu DS, Yoon SH, Jeong H, Oh TK, Schneider D, Lenski RE, Kim JF. 2009. Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature 461: 1243–1247. - PubMed
    1. Berg OG, Kurland CG. 2002. Evolution of microbial genomes: Sequence acquisition and loss. Mol Biol Evol 19: 2265–2276. - PubMed
    1. Blount ZD, Barrick JE, Davidson CJ, Lenski RE. 2012. Genomic analysis of a key innovation in an experimental Escherichia coli population. Nature 488: 513–518. - PMC - PubMed
    1. Boucher Y, Cordero OX, Takemura A, Hunt DE, Schliep K, Bapteste E, Lopez P, Tarr CL, Polz MF. 2011. Local mobile gene pools rapidly cross species boundaries to create endemicity within global Vibrio cholerae populations. mBio 2: e00335–10. - PMC - PubMed
    1. Burke C, Steinberg P, Rusch D, Kjelleberg S, Thomas T. 2011. Bacterial community assembly based on functional genes rather than species. Proc Natl Acad Sci 108: 14288–14293. - PMC - PubMed

Publication types

LinkOut - more resources