Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan;637(8045):422-429.
doi: 10.1038/s41586-024-08242-x. Epub 2024 Nov 27.

Gut microbiota strain richness is species specific and affects engraftment

Affiliations

Gut microbiota strain richness is species specific and affects engraftment

Alice Chen-Liaw et al. Nature. 2025 Jan.

Erratum in

  • Author Correction: Gut microbiota strain richness is species specific and affects engraftment.
    Chen-Liaw A, Aggarwala V, Mogno I, Haifer C, Li Z, Eggers J, Helmus D, Hart A, Wehkamp J, Lamousé-Smith ESN, Kerby RL, Rey FE, Colombel JF, Kamm MA, Olle B, Norman JM, Menon R, Watson AR, Crossette E, Terveer EM, Keller JJ, Borody TJ, Grinspan A, Paramsothy S, Kaakoush NO, Dubinsky MC, Faith JJ. Chen-Liaw A, et al. Nature. 2025 Feb;638(8050):E4. doi: 10.1038/s41586-024-08566-8. Nature. 2025. PMID: 39875608 No abstract available.

Abstract

Despite the fundamental role of bacterial strain variation in gut microbiota function1-6, the number of unique strains of a species that can stably colonize the human intestine is still unknown for almost all species. Here we determine the strain richness (SR) of common gut species using thousands of sequenced bacterial isolates with paired metagenomes. We show that SR varies across species, is transferable by faecal microbiota transplantation, and is uniquely low in the gut compared with soil and lake environments. Active therapeutic administration of supraphysiologic numbers of strains per species increases recipient SR, which then converges back to the population average after dosing is ceased. Stratifying engraftment outcomes by high or low SR shows that SR predicts microbial addition or replacement in faecal transplants. Together, these results indicate that properties of the gut ecosystem govern the number of strains of each species colonizing the gut and thereby influence strain addition and replacement in faecal microbiota transplantation and defined live biotherapeutic products.

PubMed Disclaimer

Conflict of interest statement

Competing interests: J.J.F. is a scientific advisory board member and consultant to Vedanta Biosciences, Inc. A.H., J.W., E.S.N.L.-S. are employees of Janssen Research & Development. B.O., J.M.N., R.M., A.R.W. and E.C. are employees of Vedanta Biosciences. J.K. and E.T. received research grants from Vedanta Biosciences. The remaining authors declare no competing interests.

Figures

Extended Data Fig. 1 |
Extended Data Fig. 1 |. Determination of SRj strain threshold and validation of SRj with deeper sampling of a subset of cultured gut bacterial species.
A, K-mer overlap was calculated for all pairwise combinations of isolates from a species cultured from two unrelated individuals (purple) with no direct microbial transfer between them or an individual’s own microbes from a single timepoint (green). Dotted line shows the threshold of 0.96 k-mer similarity. B, fastANI versus pairwise k-mer overlap for isolates of the same species. Dotted line represents k-mer overlap threshold of 0.96 (k-mer distance of <0.04). C, Comparison of B. fragilis SRij for the individuals in this study and in the study by Zhao et al. D, Preliminary calculation of SRj using genomes isolated from the Broad pipeline (standard pipeline used to create libraries of cultured gut bacteria). For deeper sampling estimates of SRj, we isolated additional genomes from the same species from E, the original human stool sample and F, mouse stool samples from gnotobiotic mice colonized with the same human stool (N = 2–3 mice per microbiota/diet combination) and given unique diets for strain enrichment. Error bars in DF represent SEM. Comparison of SRj calculated with genomes from the broad pipeline or the broad pipeline plus additional genomes from G, human stool samples or H, human and mouse stool samples. IL, Rarefaction curves for validation species. Dotted line shows the mean isolates/species for each species (overall mean isolates/species across the dataset was 4.7 isolates as demonstrated in Fig. 1b). M,N, Rarefaction curves for a soil species (M) and lake species (N). **p < 0.01, paired Wilcoxon test, each point represents the average SRj measured from microbiomes of a healthy or disease state. ns: not significant by paired Wilcoxon test. Grey regions on rarefaction curves indicate 95% confidence intervals.
Extended Data Fig. 2 |
Extended Data Fig. 2 |. The influence of core genome size and disease state on strain richness.
A, Spearman rank correlation for SRj versus core genome fraction for several genera. B, SRj was compared for 59 species present in both healthy and CD microbiomes. C, SRj was compared for 51 species present in both healthy and UC microbiomes. Each point represents the average SRj of a species as calculated using isolates cultured from healthy microbiomes or isolates cultured from disease microbiomes. Lines in B,C connect the SRj for each shared species between a healthy subject and a subject with IBD. ns: not significant by paired Wilcoxon test.
Extended Data Fig. 3 |
Extended Data Fig. 3 |. Metagenomics-quantified SRj correlates with the cultured SRj measured across our cohort.
A, Spearman correlation between donor SRj as measured by metagenomics and SRj from our original cultured cohort (first panel) and differences in SRj between the two groups (second panel). B, Spearman correlation between recipient SRj as measured by metagenomics at week 8 post-FMT and SRj from our original cohort (first panel) and differences in SRj between the two groups (second panel). A,B, Each point represents the average SRj for species. C, Heatmap representing the remaining six donors who donated stool to six different recipients. D, Correlation of SRj across all recipients and all donors in an independent FMT validation cohort (Leiden).
Extended Data Fig. 4 |
Extended Data Fig. 4 |. An ecological framework for strain persistence, replacement, or addition based on strain richness.
A, Schematic for FMT experimental design with multi-donor stool batches administered to each recipient. B, Within a species, bacterial strains vary in their engraftment frequency when administered in the context of a multi-donor FMT product. C, Spearman rank correlation between the metagenomics SRj of the individual FOCUS donors with the overall cultured SRj measured across our cohort. D, Expected outcomes for donor strain engraftment based on species SRj.
Fig. 1 |
Fig. 1 |. SRj of 92 human gut species.
a, SRj varies by species (Kruskal–Wallis, p <2.2 × 10−16 Data are represented as mean value ± s.e.m. b, SRj using the isolate genome set in this study (n = 4,773) is highly correlated with SRj estimated from an independent set of bacterial genomes (n = 1,947) isolated from 11 humans. Spearman rank correlation was applied. Grey area, 95% confidence interval. c, The SRj of human gut species is lower than SRj of species isolated from lake and soil microbiomes (Kruskal–Wallis test). Blue points, average SRj for a species found in each of the environments; black points, mean of the environment; error bars, s.e.m.
Fig. 2 |
Fig. 2 |. Factors influencing SRj.
a, Spearman rank correlation of SRj versus species frequency in our cohort. No corrections were used. Grey area, 95% confidence interval. b, Pairwise k-mer distances between unique conspecific strains for species with SRj < 1.1 and SRj > 1.2. c, Functional categories over- and under-represented in genes that vary between species with SRj < 1.1 versus SRj > 1.2.
Fig. 3 |
Fig. 3 |. FMT durably transmits healthy donor SRj to rCDI patients.
a, Schematic of FMT experimental design with 1:1 donor–recipient pairs. b, Representative heatmap showing the transmission of SRij from donor D283 to seven different recipients (R282, R285 and so on). The numbers in the cells of the heatmap indicate the number of strains of the given species detected in the donor or recipient across five time points for donor D283 and up to three time points for the seven recipients. c, Spearman rank correlation between recipient SRj at week 8 post-FMT and donor SRj. d, Spearman rank correlation between donor SRj at 5 years and time 0 (pre-FMT stool sample). e, Spearman rank correlation between recipient SRj at 5 years post-FMT and 8 weeks post-FMT. ce, Each datapoint represents the average SRj for a tracked species in b; grey area, 95% confidence interval, and Spearman rank correlations are two-tailed. Credit: a, © Jeremiah Faith/123rf.com.
Fig. 4 |
Fig. 4 |. Supraphysiologic manipulation of SRj in FMT recipients converges to the population baseline observed in untransplanted people.
a, Heatmap showing the SRij of single donors, donor batches and recipients at post-FMT time points both during and after FMT drug administration. b, SRj for representative species across the donor batch, recipient post-FMT time points and cultured cohort from Fig. 1a (individual donors). Data are presented as the mean value ± s.e.m. with each point representing the SRj for a species at each time point or for the cultured cohort. c, Spearman rank correlation (two-tailed) between recipient SRj at drug week 8 and donor batch SRj. Each point represents the SRj of a species as measured in the pooled donor batch versus as measured in the recipient post-FMT week 8. d, SRj across donors, batches, recipient time points and culture (previously measured in Fig. 1a and Extended Data Table 1). Blue points, tracked SRj of a species as measured in each donor group (individual donors versus pooled batch) or in each recipient post-FMT time point; black points, mean SRj across the overall time point or group. Two-sided Wilcoxon tests were used to compare groups. e, Proportional occurrence of addition, persistence and replacement events across species with low SRj and high SRj. ***P < 10−3, Wilcoxon test; ****P < 10−4, Wilcoxon test; NS, not significant by Wilcoxon test. Exact P values: individual donors versus recipients 5 years (P = 5.1 × 10−5), donor batch versus recipients week 4 (P = 6 × 10−8), recipients week 4 versus week 8 (P = 8.5 × 10−4), recipients week 8 versus week 16 (P = 1.9 × 10−5), recipients week 16 versus recipients 5 years (P = 1.2 × 10−5), recipients 5 years versus cultured SRj (P = 0.071).

References

    1. Yang C et al. Immunoglobulin A antibody composition is sculpted to bind the self gut microbiome. Sci. Immunol. 7, eabg3208 (2022). - PMC - PubMed
    1. Parida S et al. A procarcinogenic colon microbe promotes breast tumorigenesis and metastatic progression and concomitantly activates notch and β-catenin axes. Cancer Discov. 11, 1138–1157 (2021). - PubMed
    1. Arthur JC et al. Intestinal inflammation targets cancer-inducing activity of the microbiota. Science 338, 120–123 (2012). - PMC - PubMed
    1. Britton GJ et al. Defined microbiota transplant restores Th17/RORγt+ regulatory T cell balance in mice colonized with inflammatory bowel disease microbiotas. Proc Natl Acad. Sci. USA 117, 21536–21545 (2020). - PMC - PubMed
    1. Yang C et al. Fecal IgA levels are determined by strain-level differences in bacteroides ovatus and are modifiable by gut microbiota manipulation. Cell Host Microbe 27, 467–475.e6 (2020). - PMC - PubMed