This is a preprint.
Major data analysis errors invalidate cancer microbiome findings
- PMID: 37577699
- PMCID: PMC10418105
- DOI: 10.1101/2023.07.28.550993
Major data analysis errors invalidate cancer microbiome findings
Update in
-
Major data analysis errors invalidate cancer microbiome findings.mBio. 2023 Oct 31;14(5):e0160723. doi: 10.1128/mbio.01607-23. Epub 2023 Oct 9. mBio. 2023. PMID: 37811944 Free PMC article.
Abstract
We re-analyzed the data from a recent large-scale study that reported strong correlations between microbial organisms and 33 different cancer types, and that created machine learning predictors with near-perfect accuracy at distinguishing among cancers. We found at least two fundamental flaws in the reported data and in the methods: (1) errors in the genome database and the associated computational methods led to millions of false positive findings of bacterial reads across all samples, largely because most of the sequences identified as bacteria were instead human; and (2) errors in transformation of the raw data created an artificial signature, even for microbes with no reads detected, tagging each tumor type with a distinct signal that the machine learning programs then used to create an apparently accurate classifier. Each of these problems invalidates the results, leading to the conclusion that the microbiome-based classifiers for identifying cancer presented in the study are entirely wrong. These flaws have subsequently affected more than a dozen additional published studies that used the same data and whose results are likely invalid as well.
Conflict of interest statement
Conflicts of interest. CSC, DSB and AG are coinventors on a patent application (UK Patent Application No. 2200682.9) from the University of East Anglia/UEA Enterprises Limited regarding the application of biomarker bacterial genera in prostate cancer. All other authors declare no conflicts.
Figures
References
-
- Warren JR, Marshall B. 1983. Unidentified curved bacilli on gastric epithelium in active chronic gastritis. Lancet 1:1273–5. - PubMed
-
- Poore GD, Kopylova E, Zhu Q, Carpenter C, Fraraccio S, Wandro S, Kosciolek T, Janssen S, Metcalf J, Song SJ, Kanbar J, Miller-Montgomery S, Heaton R, McKay R, Patel SP, Swafford AD, Knight R. 2020. Microbiome analyses of blood and tissues suggest cancer diagnostic approach. Nature 579:567–574. - PMC - PubMed
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources