Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct 31;14(5):e0160723.
doi: 10.1128/mbio.01607-23. Epub 2023 Oct 9.

Major data analysis errors invalidate cancer microbiome findings

Affiliations

Major data analysis errors invalidate cancer microbiome findings

Abraham Gihawi et al. mBio. .

Abstract

Recent reports showing that human cancers have a distinctive microbiome have led to a flurry of papers describing microbial signatures of different cancer types. Many of these reports are based on flawed data that, upon re-analysis, completely overturns the original findings. The re-analysis conducted here shows that most of the microbes originally reported as associated with cancer were not present at all in the samples. The original report of a cancer microbiome and more than a dozen follow-up studies are, therefore, likely to be invalid.

Keywords: bioinformatics; cancer; computational biology; metagenomics; microbiome.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig 1
Fig 1
Average number of reads per sample in bladder cancer (BLCA) in the top 20 most-abundant genera reported in Poore et al. (left), averaged across 156 whole-genome sequencing samples. On the right are the counts for the same samples and the same genera, in the same order, as computed in our re-analysis. Note that the y-axis scales are different by a factor of 2,000. The x-axis shows genus names.
Fig 2
Fig 2
Distribution of normalized counts for Hepandensovirus for adrenocortical carcinoma (blue) vs all other samples (orange). The inset shows a zoomed-in view of the distribution for the small values. All raw values were zero.
Fig 3
Fig 3
Distribution of normalized counts for Thiorhodospira reads in KICH cancer (blue) and normal (orange) samples. Nearly all raw values were zero except for seven samples with a raw count of 1.
Fig 4
Fig 4
Distribution of normalized read counts in the APCR data set for Nitrospira reads found in lung squamous cell carcinoma (blue) and all other cancer types (orange). For clarity, the y-axis is truncated at 500, but the peak of the distribution for other cancers is at 1,389.
Fig 5
Fig 5
Distribution of normalized counts for Mulikevirus reads in head and neck squamous cell cancer (orange) and normal (blue) samples. All raw values were zero.
Fig 6
Fig 6
Accuracies for one-vs-all tumor classification models obtained from a selection of samples and genera with zero classified reads prior to normalization. Each row shows the accuracies of a classifier that distinguished one cancer type from all other cancer types in the table. AUC: maximum measured area under the sensitivity-specificity curve. PPV: positive predictive value. NPV: negative predictive value.

Update of

References

    1. Bosch FX, Lorincz A, Muñoz N, Meijer C, Shah KV. 2002. The causal relation between human papillomavirus and cervical cancer. J Clin Pathol 55:244–265. doi:10.1136/jcp.55.4.244 - DOI - PMC - PubMed
    1. Warren JR, Marshall B. 1983. Unidentified curved bacilli on gastric epithelium in active chronic gastritis. Lancet 1:1273–1275. doi:10.1016/S0140-6736(83)92719-8 - DOI - PubMed
    1. Castellarin M, Warren RL, Freeman JD, Dreolini L, Krzywinski M, Strauss J, Barnes R, Watson P, Allen-Vercoe E, Moore RA, Holt RA. 2012. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Res 22:299–306. doi:10.1101/gr.126516.111 - DOI - PMC - PubMed
    1. Poore GD, Kopylova E, Zhu Q, Carpenter C, Fraraccio S, Wandro S, Kosciolek T, Janssen S, Metcalf J, Song SJ, Kanbar J, Miller-Montgomery S, Heaton R, Mckay R, Patel SP, Swafford AD, Knight R. 2020. Microbiome analyses of blood and tissues suggest cancer diagnostic approach. Nat 579:567–574. doi:10.1038/s41586-020-2095-1 - DOI - PMC - PubMed
    1. Breitwieser FP, Pertea M, Zimin AV, Salzberg SL. 2019. Human contamination in bacterial genomes has created thousands of spurious proteins. Genome Res 29:954–960. doi:10.1101/gr.245373.118 - DOI - PMC - PubMed