Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 21;18(7):e0288926.
doi: 10.1371/journal.pone.0288926. eCollection 2023.

A macroecological perspective on genetic diversity in the human gut microbiome

Affiliations

A macroecological perspective on genetic diversity in the human gut microbiome

William R Shoemaker. PLoS One. .

Abstract

While the human gut microbiome has been intensely studied, we have yet to obtain a sufficient understanding of the genetic diversity that it harbors. Research efforts have demonstrated that a considerable fraction of within-host genetic variation in the human gut is driven by the ecological dynamics of co-occurring strains belonging to the same species, suggesting that an ecological lens may provide insight into empirical patterns of genetic diversity. Indeed, an ecological model of self-limiting growth and environmental noise known as the Stochastic Logistic Model (SLM) was recently shown to successfully predict the temporal dynamics of strains within a single human host. However, its ability to predict patterns of genetic diversity across human hosts has yet to be tested. In this manuscript I determine whether the predictions of the SLM explain patterns of genetic diversity across unrelated human hosts for 22 common microbial species. Specifically, the stationary distribution of the SLM explains the distribution of allele frequencies across hosts and predicts the fraction of hosts harboring a given allele (i.e., prevalence) for a considerable fraction of sites. The accuracy of the SLM was correlated with independent estimates of strain structure, suggesting that patterns of genetic diversity in the gut microbiome follow statistically similar forms across human hosts due to the existence of strain-level ecology.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Distributions of genetic diversity exhibit similar statistical forms across phylogenetically distant species in the human gut.
a,b) Similarity in patterns of genetic diversity was evaluated for sites obtained from the 22 most prevalent bacterial species. c) The distribution of within-host allele frequencies across all hosts as well as d) the distribution of mean allele frequencies were rescaled to determine whether they exhibited similar forms, specifically by rescaling their logarithm using the standard score (i.e., z-score). In c, statistical fits of a gamma (the distribution predicted by the SLM) and a lognormal (a point of comparison) are illustrated as black lines. To limit the effect of the bounded nature of allele frequencies on the distribution, mean frequencies containing observations of f = 1 were excluded from subplot d. e) The relationship between statistical moments of within-host allele frequencies was consistent across species, as there was a strong linear relationship between the mean frequency of an allele and its variance on a log-log scale (i.e., Taylor’s Law). To reduce the contribution of an excess number of zeros towards estimates of f¯ and σf2, alleles with non-zero values of f in <35% of hosts were excluded. f) Finally, the fraction of sites harboring alleles present in a given number of hosts decreased in a similar manner across species. All sites in this analysis are synonymous, identical analyses were performed on alleles at nonsynonymous sites (S1 Fig). Species within the same genus were assigned the same primary or secondary color with different degrees of saturation.
Fig 2
Fig 2. The SLM successfully predicts genetic patterns for prevalent alleles.
a) Predicted values of prevalence were obtained (Eq 6) and matched with their observed values, where the data generally fell on the one-to-one line (dashed black line) for sites with a prevalence ≳ 0.01. b) The pattern of the SLM performing better for higher values of prevalence was illustrated by quantifying the relative error of the prevalence predictions. The mean of the logarithm of the relative error of the SLM over all sites (log10¯ε, dashed black line) was ∼0.1. c) The contingency of the SLM’s success was illustrated by examining the relationship between the mean frequency of an allele across hosts (f¯) and its prevalence. The predictions of the SLM (not a statistical fit) succeed for high mean frequency alleles (dashed black line). All analyses here were performed on alleles at synonymous sites using the common commensal gut species B. vulgatus. The color of each datapoint is proportionate to the number of sites. Visualizations of the predictions in this plot for all species for nonsynonymous and synonymous sites can be found in S1 Text.
Fig 3
Fig 3. The SLM succeeds and fails in a consistent manner across phylogenetically distant species.
a) Distributions of logarithmic relative errors of prevalence obtained from Eqs 6 and 21 were rescaled by their mean and standard deviation to illustrate their similarity across species. b) Binning observed prevalences reveals how the predicted values tend to follow a one-to-one relationship (dashed black line) across species, with variation among species. c) By calculating the correlation between log10-transformed observed prevalence and the relative error of our predictions, one finds a negative correlation for the majority of species. A permutation test confirms that these negative correlations are significant (95% CIs as black lines) for the majority of species (16/22). All analyses in this plot were performed using synonymous sites. Identical analyses and equivalent results for nonsynonymous sites can be found in S11 Fig.
Fig 4
Fig 4. The presence of strain structure is correlated with the accuracy of the SLM.
a) The presence or absence of strain structure was inferred from the distribution of allele frequencies for each species within each host using StrainFinder, providing an estimate of the fraction of hosts that harbor strain structure. b) This per-species estimate of strain structure can be compared to the mean relative error of allelic prevalence predictions obtained using the SLM (Eq 6) to determine whether the success of the SLM is correlated with the existence of strains. By examining this relationship when sites with rare alleles (i.e., low prevalence) are included (ϱ^0.05: dashed line) and excluded (ϱ^0.15: solid line), one sees a stronger correlation ffor alleles with a high prevalence threshold. c) This trend can be systematically evaluated by calculating the correlation across a range of prevalence values (black dots). A permutation test establishes 95% confidence intervals of the null (grey window).

Similar articles

Cited by

References

    1. Zhao S, Lieberman TD, Poyet M, Kauffman KM, Gibbons SM, Groussin M, et al.. Adaptive Evolution within Gut Microbiomes of Healthy People. Cell Host & Microbe. 2019;25(5):656–667.e8. doi: 10.1016/j.chom.2019.03.007 - DOI - PMC - PubMed
    1. Ghosh OM, Good BH. Emergent evolutionary forces in spatial models of luminal growth in the human gut microbiota; 2021. Available from: https://www.biorxiv.org/content/10.1101/2021.07.15.452569v1. - DOI - PMC - PubMed
    1. Garud NR, Good BH, Hallatschek O, Pollard KS. Evolutionary dynamics of bacteria in the gut microbiome within and across hosts. PLOS Biology. 2019;17(1):e3000102. doi: 10.1371/journal.pbio.3000102 - DOI - PMC - PubMed
    1. Yaffe E, Relman DA. Tracking microbial evolution in the human gut using Hi-C reveals extensive horizontal gene transfer, persistence and adaptation. Nature Microbiology. 2020;5(2):343–353. doi: 10.1038/s41564-019-0625-0 - DOI - PMC - PubMed
    1. Roodgar M, Good BH, Garud NR, Martis S, Avula M, Zhou W, et al.. Longitudinal linked-read sequencing reveals ecological and evolutionary responses of a human gut microbiome during antibiotic treatment. Genome Research. 2021;31(8):1433–1446. doi: 10.1101/gr.265058.120 - DOI - PMC - PubMed