Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2025 May 20;10(5):e0136424.
doi: 10.1128/msystems.01364-24. Epub 2025 Apr 8.

Improving gut virome comparisons using predicted phage host information

Affiliations
Comparative Study

Improving gut virome comparisons using predicted phage host information

Michael Shamash et al. mSystems. .

Abstract

The human gut virome is predominantly made up of bacteriophages (phages), viruses that infect bacteria. Metagenomic studies have revealed that phages in the gut are highly individual specific and dynamic. These features make it challenging to perform meaningful cross-study comparisons. While several taxonomy frameworks exist to group phages and improve these comparisons, these strategies provide little insight into the potential effects phages have on their bacterial hosts. Here, we propose the use of predicted phage host families (PHFs) as a functionally relevant, qualitative unit of phage classification to improve these cross-study analyses. We first show that bioinformatic predictions of phage hosts are accurate at the host family level by measuring their concordance to Hi-C sequencing-based predictions in human and mouse fecal samples. Next, using phage host family predictions, we determined that PHFs reduce intra- and interindividual ecological distances compared to viral contigs in a previously published cohort of 10 healthy individuals, while simultaneously improving longitudinal virome stability. Lastly, by reanalyzing a previously published metagenomics data set with >1,000 samples, we determined that PHFs are prevalent across individuals and can aid in the detection of inflammatory bowel disease-specific virome signatures. Overall, our analyses support the use of predicted phage hosts in reducing between-sample distances and providing a biologically relevant framework for making between-sample virome comparisons.

Importance: The human gut virome consists mainly of bacteriophages (phages), which infect bacteria and show high individual specificity and variability, complicating cross-study comparisons. Furthermore, existing taxonomic frameworks offer limited insight into their interactions with bacterial hosts. In this study, we propose using predicted phage host families (PHFs) as a higher-level classification unit to enhance functional cross-study comparisons. We demonstrate that bioinformatic predictions of phage hosts align with Hi-C sequencing results at the host family level in human and mouse fecal samples. We further show that PHFs reduce ecological distances and improve virome stability over time. Additionally, reanalysis of a large metagenomics data set revealed that PHFs are widespread and can help identify disease-specific virome patterns, such as those linked to inflammatory bowel disease.

Keywords: bacteriophages; bioinformatics; gut microbiome; microbial interactions; virome.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest

Figures

Fig 1
Fig 1
Computationally predicted bacterial hosts for vOTUs are concordant with in situ associations to the bacterial family level. Agreement between iPHoP predicted host range and Hi-C assigned host range at various taxonomic ranks for 1,243 vOTUs. Additional comparisons were made when iPHoP predicted multiple hosts for a vOTU (see main text for details on the three comparisons).
Fig 2
Fig 2
PHFs reduce interindividual variation and increase intraindividual virome stability in a cohort of 10 healthy individuals. Data were analyzed from a previously published study of 10 healthy individuals (1). (A) Taxonomic bar plots of virome composition at the PHF level for each individual over time. Facet labels above the bar plots correspond to the subject IDs from the original study. (B) Ecological distances between samples with Bray-Curtis at the contig level, Bray-Curtis at the PHF level, and Weighted UniFrac at the PHF level. Interindividual and intraindividual comparisons are both shown. Significance was assessed using the Friedman test with the post-hoc Wilcoxon signed-rank test, using Bonferroni correction for multiple comparisons (***P < 0.001, ****P < 0.0001). (C) Virome stability, defined here as (1—ecological distance from the previous sample), was calculated for each individual using the Bray-Curtis distance metrics at the contig level and at the PHF level. Significance was assessed using the Wilcoxon signed-rank test (*P < 0.05, **P < 0.01, ***P < 0.001).
Fig 3
Fig 3
PHFs are prevalent and reduce intra- and interindividuality in a large human IBD cohort. Data were analyzed from the previously published HMP2 data set (13). Samples with low viral read counts (<1,500) were removed from analyses. In total, bulk metagenomes from 1,093 samples from 115 individuals (57 CD, 31 UC, 27 non-IBD controls) were included for downstream analyses. (A, B) Rank prevalence distributions of vOTUs (A) and PHFs (B) across individuals. In total, there were 3,870 distinct vOTUs and 74 distinct PHFs. The dotted red line indicates the rank at which features are more, or less than, 50% prevalent. (C) Mean relative abundance of features (PHFs vs vOTUs) that were present in more than 50% of individuals in the data set. (D) Bray-Curtis distance between samples according to interindividual or intraindividual comparisons. Significance was assessed using the Wilcoxon signed-rank test (****P ≤ 0.0001).
Fig 4
Fig 4
PHFs reveal disease-specific signatures of IBD. Data were analyzed from the previously published HMP2 data set (13). Samples with low viral read counts (< 1,500) were removed from analyses. (A) PCoA plots generated from Bray-Curtis distance matrices using vOTUs (left) and PHFs (right). Samples are color-coded according to the dysbiotic status identified in (13). (B) Differentially abundant PHFs based on dysbiosis status. Only individuals which had both a dysbiotic and non-dysbiotic sample were included. Only PHFs that were more than 50% prevalent across individuals were considered for these analyses. PHFs with an adjusted P value ≤ 0.05 and with a log2 fold-change ≥1 or with a log2 fold-change ≤ −1 were considered differentially abundant.

Similar articles

References

    1. Shkoporov AN, Clooney AG, Sutton TDS, Ryan FJ, Daly KM, Nolan JA, McDonnell SA, Khokhlova EV, Draper LA, Forde A, Guerin E, Velayudhan V, Ross RP, Hill C. 2019. The human gut virome is highly diverse, stable, and individual specific. Cell Host Microbe 26:527–541. doi:10.1016/j.chom.2019.09.009 - DOI - PubMed
    1. Clooney AG, Sutton TDS, Shkoporov AN, Holohan RK, Daly KM, O’Regan O, Ryan FJ, Draper LA, Plevy SE, Ross RP, Hill C. 2019. Whole-virome analysis sheds light on viral dark matter in inflammatory bowel disease. Cell Host Microbe 26:764–778. doi:10.1016/j.chom.2019.10.009 - DOI - PubMed
    1. Nishijima S, Nagata N, Kiguchi Y, Kojima Y, Miyoshi-Akiyama T, Kimura M, Ohsugi M, Ueki K, Oka S, Mizokami M, Itoi T, Kawai T, Uemura N, Hattori M. 2022. Extensive gut virome variation and its associations with host and environmental factors in a population-level cohort. Nat Commun 13:5252. doi:10.1038/s41467-022-32832-w - DOI - PMC - PubMed
    1. Beller L, Deboutte W, Vieira-Silva S, Falony G, Tito RY, Rymenans L, Yinda CK, Vanmechelen B, Van Espen L, Jansen D, Shi C, Zeller M, Maes P, Faust K, Van Ranst M, Raes J, Matthijnssens J. 2022. The virota and its transkingdom interactions in the healthy infant gut. Proc Natl Acad Sci USA 119:e2114619119. doi:10.1073/pnas.2114619119 - DOI - PMC - PubMed
    1. Zuo T, Sun Y, Wan Y, Yeoh YK, Zhang F, Cheung CP, Chen N, Luo J, Wang W, Sung JJY, Chan PKS, Wang K, Chan FKL, Miao Y, Ng SC. 2020. Human-gut-DNA virome variations across geography, ethnicity, and urbanization. Cell Host Microbe 28:741–751. doi:10.1016/j.chom.2020.08.005 - DOI - PubMed

Publication types

LinkOut - more resources