Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2025 Jun 17;23(1):676.
doi: 10.1186/s12967-025-06676-z.

Cross-cohort analysis identifies shared gut microbial signatures and validates microbial risk scores for colorectal cancer

Affiliations
Meta-Analysis

Cross-cohort analysis identifies shared gut microbial signatures and validates microbial risk scores for colorectal cancer

Yuhan Zhang et al. J Transl Med. .

Abstract

Background: Microbiome-wide association studies showed links between colorectal cancer (CRC) and gut microbiota. However, the clinical application of gut microbiota in CRC prevention has been hindered by the diversity of study populations and technical variations. We aimed to determine CRC-related gut microbial signatures based on cross-regional, cross-population, and cross-cohort metagenomic datasets, and elucidate its application value in CRC risk assessment.

Methods: We used the MMUPHin tool to perform a meta-analysis of our own cohort and seven publicly available metagenomics datasets to identify gut microbial species associated with CRC across different cohorts, comprising of 570 CRC cases and 557 controls. Based on differential species sets, we constructed the microbial risk score (MRS) using α-diversity of the sub-community (MRSα), weighted/unweighted summation methods and machine learning algorithms. Cohort-to-cohort training and validation were performed to demonstrate the transferability.

Results: We found that MRSα of core species was better validated and more interpretable than those constructed with summation methods or machine learning algorithms. Six species, including Parvimonas micra, Clostridium symbiosum, Peptostreptococcus stomatis, Bacteroides fragilis, Gemella morbillorum, and Fusobacterium nucleatum, were included in MRSα constructed by half or more of the cohorts. The AUC of MRSα, calculated based on the sub-community of six species, varied between 0.619 and 0.824 across the eight cohorts.

Conclusion: We identified six CRC-related species across regions, populations, and cohorts. The constructed MRSα could contribute to the risk prediction of CRC in different populations.

Keywords: Colorectal cancer; Cross-cohort; Disease prediction; Gut microbiota; Microbial risk score.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Data use from the Target-C study was approved by the Ethics Committee of the National Cancer Center/Cancer Hospital, Chinese Academy of Medical Sciences, and Peking Union Medical College (18-013/1615). All the participants provided written informed consent. Competing interests: The authors declare no conflict of interest.

Figures

Fig. 1
Fig. 1
The workflow of the microbial risk score (MRS) establishment
Fig. 2
Fig. 2
Geographic distribution of participants in 8 colorectal cancer-related studies
Fig. 3
Fig. 3
Differential species across cohorts identified via MMUPHin analysis of 8 colorectal cancer study populations. Bar plots of significant differential species (adjusted P < 0.05, Benjamini–Hochberg method) were presented with red for enriched species in colorectal cancer group and blue for enriched species in control group
Fig. 4
Fig. 4
Cohort-to-cohort validation of microbial risk score (MRS) constructed with α-diversity of the sub-community. The heatmap shows the AUC values of MRS built with α-diversity of the sub-community of differential species in each cohort. The average value refers to the AUC values in the other validation cohorts
Fig. 5
Fig. 5
Cohort-to-cohort validation of microbial risk score (MRS) constructed with LightGBMXT. The heatmap shows the AUC values of MRS built with LightGBMXT in each cohort. The average value refers to the AUC values in the other validation cohorts. The gray squares indicate an AUC less than 0.5

Similar articles

References

    1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2024;74:229–63. - PubMed
    1. Siegel RL, Wagle NS, Cercek A, Smith RA, Jemal A. Colorectal cancer statistics, 2023. CA Cancer J Clin. 2023;73:233–54. - PubMed
    1. Ladabaum U, Dominitz JA, Kahi C, Schoen RE. Strategies for colorectal cancer screening. Gastroenterology. 2020;158:418–32. - PubMed
    1. Kaminski MF, Robertson DJ, Senore C, Rex DK. Optimizing the quality of colorectal cancer screening worldwide. Gastroenterology. 2020;158:404–17. - PubMed
    1. Kastrinos F, Kupfer SS, Gupta S. Colorectal cancer risk assessment and precision approaches to screening: brave new world or worlds apart? Gastroenterology. 2023;164:812–27. - PMC - PubMed

Publication types

LinkOut - more resources