. 2019 Apr;25(4):667-678.

doi: 10.1038/s41591-019-0405-7. Epub 2019 Apr 1.

Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation

Andrew Maltez Thomas^#^{1

2

3}, Paolo Manghi^#¹, Francesco Asnicar¹, Edoardo Pasolli¹, Federica Armanini¹, Moreno Zolfo¹, Francesco Beghini¹, Serena Manara¹, Nicolai Karcher¹, Chiara Pozzi⁴, Sara Gandini⁴, Davide Serrano⁴, Sonia Tarallo⁵, Antonio Francavilla⁵, Gaetano Gallo^{6

7}, Mario Trompetto⁷, Giulio Ferrero⁸, Sayaka Mizutani^{9

10}, Hirotsugu Shiroma⁹, Satoshi Shiba¹¹, Tatsuhiro Shibata^{11

12}, Shinichi Yachida^{11

13}, Takuji Yamada^{9

14}, Jakob Wirbel¹⁵, Petra Schrotz-King¹⁶, Cornelia M Ulrich¹⁷, Hermann Brenner^{16

18

19}, Manimozhiyan Arumugam^{20

21}, Peer Bork^{15

22

23

24}, Georg Zeller¹⁵, Francesca Cordero⁸, Emmanuel Dias-Neto^{3

25}, João Carlos Setubal^{2

26}, Adrian Tett¹, Barbara Pardini^{5

27}, Maria Rescigno²⁸, Levi Waldron^{29

30}, Alessio Naccarati^{5

31}, Nicola Segata³²

Affiliations

¹ Department CIBIO, University of Trento, Trento, Italy.
² Biochemistry Department, Chemistry Institute, University of São Paulo, São Paulo, Brazil.
³ Medical Genomics Laboratory, CIPE/A.C. Camargo Cancer Center, São Paulo, Brazil.
⁴ IEO, European Institute of Oncology IRCCS, Milan, Italy.
⁵ Italian Institute for Genomic Medicine, Turin, Italy.
⁶ Department of Surgical and Medical Sciences, University of Catanzaro, Catanzaro, Italy.
⁷ Department of Colorectal Surgery, Clinica S. Rita, Vercelli, Italy.
⁸ Department of Computer Science, University of Turin, Turin, Italy.
⁹ School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan.
¹⁰ Research Fellow of Japan Society for the Promotion of Science, Tokyo, Japan.
¹¹ Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan.
¹² Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.
¹³ Department of Cancer Genome Informatics, Osaka University, Osaka, Japan.
¹⁴ PRESTO, Japan Science and Technology Agency, Saitama, Japan.
¹⁵ Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany.
¹⁶ Division of Preventive Oncology, National Center for Tumor Diseases and German Cancer Research Center, Heidelberg, Germany.
¹⁷ Huntsman Cancer Institute and Department of Population Health Sciences, University of Utah, Salt Lake City, UT, USA.
¹⁸ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center, Heidelberg, Germany.
¹⁹ German Cancer Consortium, German Cancer Research Center, Heidelberg, Germany.
²⁰ Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
²¹ Faculty of Healthy Sciences, University of Southern Denmark, Odense, Denmark.
²² Molecular Medicine Partnership Unit, Heidelberg, Germany.
²³ Max Delbrück Centre for Molecular Medicine, Berlin, Germany.
²⁴ Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany.
²⁵ Laboratory of Neurosciences, Institute of Psychiatry, University of São Paulo, São Paulo, Brazil.
²⁶ Biocomplexity Institute of Virginia Tech, Blacksburg, VA, USA.
²⁷ Department of Medical Sciences, University of Turin, Turin, Italy.
²⁸ Mucosal Immunology and Microbiota Unit, Humanitas Research Hospital, Milan, Italy.
²⁹ Graduate School of Public Health and Health Policy, City University of New York, New York, NY, USA.
³⁰ Institute for Implementation Science in Population Health, City University of New York, New York, NY, USA.
³¹ Department of Molecular Biology of Cancer, Institute of Experimental Medicine, Prague, Czech Republic.
³² Department CIBIO, University of Trento, Trento, Italy. nicola.segata@unitn.it.

^# Contributed equally.

PMID: 30936548
PMCID: PMC9533319
DOI: 10.1038/s41591-019-0405-7

Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation

Andrew Maltez Thomas et al. Nat Med. 2019 Apr.

. 2019 Apr;25(4):667-678.

doi: 10.1038/s41591-019-0405-7. Epub 2019 Apr 1.

Authors

Affiliations

¹ Department CIBIO, University of Trento, Trento, Italy.
² Biochemistry Department, Chemistry Institute, University of São Paulo, São Paulo, Brazil.
³ Medical Genomics Laboratory, CIPE/A.C. Camargo Cancer Center, São Paulo, Brazil.
⁴ IEO, European Institute of Oncology IRCCS, Milan, Italy.
⁵ Italian Institute for Genomic Medicine, Turin, Italy.
⁶ Department of Surgical and Medical Sciences, University of Catanzaro, Catanzaro, Italy.
⁷ Department of Colorectal Surgery, Clinica S. Rita, Vercelli, Italy.
⁸ Department of Computer Science, University of Turin, Turin, Italy.
⁹ School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan.
¹⁰ Research Fellow of Japan Society for the Promotion of Science, Tokyo, Japan.
¹¹ Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan.
¹² Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.
¹³ Department of Cancer Genome Informatics, Osaka University, Osaka, Japan.
¹⁴ PRESTO, Japan Science and Technology Agency, Saitama, Japan.
¹⁵ Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany.
¹⁶ Division of Preventive Oncology, National Center for Tumor Diseases and German Cancer Research Center, Heidelberg, Germany.
¹⁷ Huntsman Cancer Institute and Department of Population Health Sciences, University of Utah, Salt Lake City, UT, USA.
¹⁸ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center, Heidelberg, Germany.
¹⁹ German Cancer Consortium, German Cancer Research Center, Heidelberg, Germany.
²⁰ Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
²¹ Faculty of Healthy Sciences, University of Southern Denmark, Odense, Denmark.
²² Molecular Medicine Partnership Unit, Heidelberg, Germany.
²³ Max Delbrück Centre for Molecular Medicine, Berlin, Germany.
²⁴ Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany.
²⁵ Laboratory of Neurosciences, Institute of Psychiatry, University of São Paulo, São Paulo, Brazil.
²⁶ Biocomplexity Institute of Virginia Tech, Blacksburg, VA, USA.
²⁷ Department of Medical Sciences, University of Turin, Turin, Italy.
²⁸ Mucosal Immunology and Microbiota Unit, Humanitas Research Hospital, Milan, Italy.
²⁹ Graduate School of Public Health and Health Policy, City University of New York, New York, NY, USA.
³⁰ Institute for Implementation Science in Population Health, City University of New York, New York, NY, USA.
³¹ Department of Molecular Biology of Cancer, Institute of Experimental Medicine, Prague, Czech Republic.
³² Department CIBIO, University of Trento, Trento, Italy. nicola.segata@unitn.it.

^# Contributed equally.

PMID: 30936548
PMCID: PMC9533319
DOI: 10.1038/s41591-019-0405-7

Erratum in

Author Correction: Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation.
Thomas AM, Manghi P, Asnicar F, Pasolli E, Armanini F, Zolfo M, Beghini F, Manara S, Karcher N, Pozzi C, Gandini S, Serrano D, Tarallo S, Francavilla A, Gallo G, Trompetto M, Ferrero G, Mizutani S, Shiroma H, Shiba S, Shibata T, Yachida S, Yamada T, Wirbel J, Schrotz-King P, Ulrich CM, Brenner H, Arumugam M, Bork P, Zeller G, Cordero F, Dias-Neto E, Setubal JC, Tett A, Pardini B, Rescigno M, Waldron L, Naccarati A, Segata N. Thomas AM, et al. Nat Med. 2019 Dec;25(12):1948. doi: 10.1038/s41591-019-0663-4. Nat Med. 2019. PMID: 31664237

Abstract

Several studies have investigated links between the gut microbiome and colorectal cancer (CRC), but questions remain about the replicability of biomarkers across cohorts and populations. We performed a meta-analysis of five publicly available datasets and two new cohorts and validated the findings on two additional cohorts, considering in total 969 fecal metagenomes. Unlike microbiome shifts associated with gastrointestinal syndromes, the gut microbiome in CRC showed reproducibly higher richness than controls (P < 0.01), partially due to expansions of species typically derived from the oral cavity. Meta-analysis of the microbiome functional potential identified gluconeogenesis and the putrefaction and fermentation pathways as being associated with CRC, whereas the stachyose and starch degradation pathways were associated with controls. Predictive microbiome signatures for CRC trained on multiple datasets showed consistently high accuracy in datasets not considered for model training and independent validation cohorts (average area under the curve, 0.84). Pooled analysis of raw metagenomes showed that the choline trimethylamine-lyase gene was overabundant in CRC (P = 0.001), identifying a relationship between microbiome choline metabolism and CRC. The combined analysis of heterogeneous CRC cohorts thus identified reproducible microbiome biomarkers and accurate disease-predictive models that can form the basis for clinical prognostic tests and hypothesis-driven mechanistic studies.

PubMed Disclaimer

Figures

**Fig. 1. Sequencing depths and species richness across CRC datasets**
**(A)** Boxplots reporting the total number of reads in each dataset. P-values between the carcinoma and control groups were calculated by two-tailed Wilcoxon rank-sum tests. **(B)** Boxplots showing the total number of microbial species per dataset. P-values were calculated by two-tailed Wilcoxon rank-sum tests. **(C)** Boxplots showing the total number of microbial species per dataset calculated on metagenomes subsampled in each dataset to the number of reads of the 10th percentile. P-values were calculated by two-tailed Wilcoxon rank-sum tests. **(D)** Multivariate analysis of species richness using crude and age, sex and BMI-adjusted coefficients obtained from linear models. **(E)** Meta-analysis of crude and adjusted multivariate richness coefficients using a random effects model. Bold lines represent the 95% confidence interval for the random effects model estimate.

**Fig. 2. Meta-analysis of species diversity andoral species richness in CRC datasets**
A) Boxplots reporting the Shannon species diversity in each dataset. P-values between the carcinoma and control groups were calculated by two-tailed Wilcoxon rank-sum tests. **(B)** Boxplots reporting the Shannon species diversity calculated on metagenomes subsampled in each dataset to the number of reads of the 10th percentile. P-values were calculated by two-tailed Wilcoxon rank-sum tests. **(C)** Multivariate analysis of species diversity using crude and age, sex and BMI-adjusted coefficients obtained from linear models. **(D)** Meta-analysis of crude and adjusted multivariate Shannon diversity coefficients using a random effects model. Bold lines represent the 95% confidence interval for the random effects model estimate. **(E)** Boxplots reporting the total number of oral microbial species per dataset. P-values were calculated by two-tailed Wilcoxon rank-sum tests comparing values between controls and carcinomas for each dataset. **(F)** Multivariate analysis of putative oral species richness using crude and age, sex and BMI-adjusted coefficients obtained from linear models. **(G)** Meta-analysis of crude and adjusted multivariate putative oral species richness coefficients using a random effects model. Bold lines represent the 95% confidence interval for the random effects model estimate.

**Fig. 3. Two novel metagenomic cohorts identify clear but only partially overlapping microbiome signatures associated with CRC**
**(A)** Relative abundances (log scale) and effect sizes (estimated using the LDA score in LEfSe) for the significantly different microbial species in CRC samples compared to control samples for Cohort1 (significance assessed by the non-parametric test in LEfSe) and **(B)** for Cohort2. **(C)** Alpha diversities measured as the total number of species and the total number of UniProt90 gene families in each sample for the two cohorts. **(D)** Beta-diversities estimated with the Bray-Curtis dissimilarity metric for intra- and inter-condition comparisons in the two cohorts.

**Fig. 4. Analysis of F. *nucleatum* markers, and taxonomic meta-analysis of CRC datasets.**
**(A)** Percentages of F. *nucleatum* clade-specific markers (200 in total) in each per dataset. P-values were obtained by two-tailed Wilcoxon rank-sum tests comparing values between controls and carcinomas for each dataset. **(B)** Meta-analysis of CRC datasets using species-level MetaPhlAn2 profiles. Bold lines represent the 95% confidence interval for the random effects model estimate. **(C)** Multivariate analysis of meta-analysis species-level abundance biomarkers. Crude and age, sex and BMI adjusted coefficients for species associated with disease status in the meta-analysis of standardized mean differences.

**Fig. 5. Analysis of putative oral species’ abundances in CRC datasets and gene families richness across CRC datasets.**
**(A)** Effect sizes of the abundances of significant putative oral species identified using a meta-analysis of standardized mean differences and a random effects model. Bold lines represent the 95% confidence interval for the random effects model estimate. **(B)** Total abundance of putative oral species in each gut metagenomic dataset. P-values were obtained by two-tailed Wilcoxon rank-sum tests comparing values between controls and carcinomas for each dataset. **(C)** The total number of reads in each sample of each dataset correlates with the total number of gene families identified using HUMANn2. Ellipses represent the 95% confidence level assuming a multivariate t-distribution. **(D)** Distribution of the total number of gene families identified in the samples of each dataset. P-values were obtained by two-tailed Wilcoxon rank-sum tests comparing values between controls and carcinomas for each dataset. **(E)** Distribution of the percentages of unmapped reads across datasets for UniProt90 gene families.

**Fig. 6. Cross-validation, cross-cohort, and LODO predictions using pathway abundances, species abundances, and species-specific markers.**
**(A)** Prediction matrix reporting prediction performances as AUC values obtained using a Random Forest (RF) model on pathway relative abundances. Values on the diagonal refer to 20 times repeated 10-fold stratified cross validations. Off-diagonal values refer to the AUC values obtained by training the classifier on the dataset of the corresponding row and applying it on the dataset of the corresponding column. The Leave-One-Dataset-Out (LODO) row refers to the performances obtained by training the model on pathway abundances using all but the dataset of the corresponding column and applying it on the dataset of the corresponding column. **(B)** Prediction matrix as in (A) but using MetaPhlAn2 marker presence and absence information. **(C)** Prediction of samples-to-cohort assignments using species-level relative abundances. Only control samples from each dataset are considered. **(D)** Principal coordinate analysis of Bray-Curtis distances computed on MetaPhlAn2 species-level abundances across datasets. Ellipses represent the 95% confidence level assuming a multivariate t-distribution. **(E)** Cross prediction matrix for the performances of RF models in predicting adenomas versus CRC conditions. **(F)** Cross prediction matrix as described in (E) but on the distinction of adenomas versus controls.

**Fig. 7. Prediction performances at increasing numbers of external datasets considered in the training model**
**(A)** Prediction performances computed based on MetaPhlAn2 species abundances. The dark yellow line interpolates the median AUC at each number of training datasets considered. (B) Prediction performances computed based on HUMANn2 gene-family abundances.

**Fig. 8. Identification of a minimal number of microbial gene-families for CRC-detection.**
Prediction performances in the LODO-settings at increasing number of gene-families. Each ranking is obtained excluding the testing dataset to avoid overfitting.

**Fig. 9. Metagenomic analysis of genes involved in the TMA-synthesis pathway**
**(A)** ShortBRED analysis of *yeaW* and *caiT* gene abundances. Points represent the log of reads per kilobase per million mapped reads (RPKM) for each sample and crosses represent mean values per group/dataset. **(B)** ShortBRED analysis of *cutD* gene abundances. Boxplots reports the RKPM abundances obtained using ShortBRED for the gene of the activating TMA-lyase enzyme *cutD*. P-values were calculated by two-tailed Wilcoxon rank-sum tests comparing values between controls and carcinomas for each dataset. **(C)** Forest plot showing effect sizes calculated using a meta-analysis of standardized mean differences and a random effects model on *cutD* RPKM abundances between carcinomas and controls. **(D)** Breadth of coverage of *cutC* gene sequence clusters across CRC datasets. **(E)** Depth of coverage of *cutC* gene sequence clusters across CRC datasets..

**Fig. 10. Cluster analysis of samples’ representative cutC sequence variants.**
**(A)** Prediction strengths at differing number of clusters showing optimum numbers at 2 and 4 clusters. **(B)** Tables showing the number of samples for carcinomas, adenomas and controls with breadth of coverage >80% at two different cluster thresholds. P-values were calculated using a Fisher T-test and taxonomy was assigned by BLASTn and the *cutC* sequence database (criteria of 80% coverage, >97% identity and minimum 2000nt alignment length).

**Figure 1.. Reproducible taxonomic and functional microbial biomarkers across datasets when contrasting carcinoma against healthy controls (no adenoma samples considered).**
**(A)** UpSet plot showing the number of taxonomic biomarkers identified using LEfSE on MetaPhlAn2 species profiles shared by combinations of datasets (see Suppl. Table 3 for all single significant associations). **(B)** Pooled effect sizes for the 20 significant features with the largest effect size calculated using a meta-analysis of standardized mean differences and a random effects model on MetaPhlAn2 species abundances and on **(C)** HUMANn2 pathway abundances. Bold lines represent the 95% confidence interval for the random effects model coefficient estimate. **(D)** Scatter plot of crude and age-, sex-, and BMI-adjusted coefficients obtained from linear models using MetaPhlAn2 species abundances. **(E)** Scatter plot of crude and age-, sex-, and BMI-adjusted coefficients obtained from linear models using HUMANn2 pathway abundances.

**Figure 2.. Assessment of prediction performances of the gut microbiome for CRC detection within and across cohorts.**
**(A)** Cross prediction matrix reporting prediction performances as AUC values obtained using a Random Forest (RF) model on species-level relative abundances (see Methods). Values on the diagonal refer to 20 times repeated 10-fold stratified cross validations. Off-diagonal values refer to the AUC values obtained by training the classifier on the dataset of the corresponding row and applying it on the dataset of the corresponding column. The Leave-One-Dataset-Out (LODO) row refers to the performances obtained by training the model on the species-level abundances and MetaPhlAn2 markers using all but the dataset of the corresponding column and applying it on the dataset of the corresponding column. See Extended Data 6 for the marker cross-study validation matrix. **(B)** Cross prediction matrix of AUC values obtained using HUMANn2 UniRef90 gene-family abundances and HUMANn2 pathway relative abundances. See Extended Data 6 for the pathway cross-study validation matrix. **(C)** Prediction performances for the two Italian cohorts at increasing numbers of external datasets considered for training the model. The dark yellow line interpolates the median AUC at each number of training datasets considered. See Extended Data 7 for the plots referred to the other testing datasets. **(D)** Prediction performances at increasing number of datasets in the training, using HUMANn2 UniProt90 gene-family abundances. See Extended Data 7 for the plots referred to the other testing datasets.

**Figure 3.. Ranking relevance of each species in the predictive models for each dataset and identification of a minimal microbial signature for CRC detection.**
**(A)** The importance of each species for the cross-validation prediction performance in each dataset estimated using the internal RF scores. Only species appearing in the five top ranking features in at least one dataset are reported. Prediction performances at increasing number of microbial species obtained by re-training the RF classifier on the N top ranked features identified with a first RF model training in a cross-validation **(B)** and LODO-setting **(C)**. The rankings are obtained excluding the testing dataset to avoid overfitting.

**Figure 4.. Choline TMA-lyase gene *cutC* and its genetic variants are strong biomarkers for CRC-associated stool samples.**
**(A)** Distribution of reads per kilobase million (RKPM) abundances obtained using ShortBRED for the choline TMA-lyase enzyme gene *cutC*. P-values were computed by two-tailed Wilcoxon Signed-Rank tests comparing values between controls and carcinomas for each dataset. **(B)** Forest plot reporting effect sizes calculated using a meta-analysis of standardized mean differences and a random effects model on *cutC* RPKM abundances between carcinomas and controls. **(C)** Phylogenetic tree of sample-specific *cutC* sequence variants identified four main sequence variants. Tips with no circles represent *cutC* sequence variants from genomes absent from the datasets. Taxonomy was assigned based on mapping against existing *cutC* sequences (criteria of 80% coverage, >97% identity and minimum 2,000nt alignment length). **(D)** qPCR validation of *cutC* gene abundance and **(E)** *cutC* transcript abundance (normalized by total 16S rRNA gene/transcript abundance) on a subset of DNA samples from Cohort1. qPCR validation P-values are obtained by 1-tail Wilcoxon Signed-Rank test.

**Figure 5 -. Clinical potential and validation of the predictive biomarkers.**
**(A)** Prediction performance of the taxonomic models trained on the 7 datasets of Table 1 and applied on the new validation cohorts confirmed the strong reproducibility of metagenomic models for CRC across cohorts when sufficiently large training cohorts are available. Feature ranking of the 16-species model are obtained the testing cohort to avoid overfitting. **(B)** Species richness, rarefied oral species richness, and *cutC* gene abundance (RPKM) are confirmed to be strong biomarkers of CRC in the validation datasets . P-values underlying the panels refer to one-tailed Wilcoxon Signed Rank test; P-values overlying the panels refer to the one-sided permutation-based Wilcoxon-Mann-Whitney tests, blocked for cohort. **(C)** Prediction performances as AUC values on the validation cohorts when adding external set of case and controls samples from metagenomic cohorts of diseases other than CRC (Crohn’s disease, ulcerative colitis, type-2 diabetes). **(D)** Assessment of the potential of microbiome-based prediction models in comparison and in combination with current non-invasive clinical screening tests. Models integrating our LODO machine learning approach with the FOBT or the Wif-1 Methylation tests are termed OR and AND, depending on whether only one or both need to be positive for the combined test to be positive.

See this image and copyright information in PMC

Comment in

Microbial signatures of colorectal cancer.
Koch L. Koch L. Nat Rev Genet. 2019 Jun;20(6):318-319. doi: 10.1038/s41576-019-0126-2. Nat Rev Genet. 2019. PMID: 30971807 No abstract available.

References

1. Ferlay J. et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int. J. Cancer 136, E359–86 (2015). - PubMed
1. Siegel R, Desantis C. & Jemal A. Colorectal cancer statistics, 2014. CA Cancer J. Clin 64, 104–117 (2014). - PubMed
1. Frank C, Sundquist J, Yu H, Hemminki A. & Hemminki K. Concordant and discordant familial cancer: Familial risks, proportions and population impact. Int. J. Cancer 140, 1510–1516 (2017). - PubMed
1. Foulkes WD Inherited susceptibility to common cancers. N. Engl. J. Med 359, 2143–2153 (2008). - PubMed
1. Johnson CM et al. Meta-analyses of colorectal cancer risk factors. Cancer Causes Control 24, 1207–1222 (2013). - PMC - PubMed

Methods-only References

1. Langmead B. & Salzberg SL Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357 (2012). - PMC - PubMed
1. Truong DT et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat. Methods 12, 902–903 (2015). - PubMed
1. Abubucker S. et al. Metabolic reconstruction for metagenomic data and its application to the human microbiome. PLoS Comput. Biol 8, e1002358 (2012). - PMC - PubMed
1. Breiman L. Random Forests. Mach. Learn 45, 5–32 (2001).
1. Pedregosa F. et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res 12, 2825–2830 (2011).

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- H1 Connect - Access expert opinions and insights on biomedical research.
- The Lens - Patent Citations Database
Medical
- ClinicalTrials.gov
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation

Affiliations

Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation

Authors

Affiliations

Erratum in

Abstract

Figures

Comment in

References

Methods-only References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical