Consistent and correctable bias in metagenomic sequencing experiments
- PMID: 31502536
- PMCID: PMC6739870
- DOI: 10.7554/eLife.46923
Consistent and correctable bias in metagenomic sequencing experiments
Abstract
Marker-gene and metagenomic sequencing have profoundly expanded our ability to measure biological communities. But the measurements they provide differ from the truth, often dramatically, because these experiments are biased toward detecting some taxa over others. This experimental bias makes the taxon or gene abundances measured by different protocols quantitatively incomparable and can lead to spurious biological conclusions. We propose a mathematical model for how bias distorts community measurements based on the properties of real experiments. We validate this model with 16S rRNA gene and shotgun metagenomics data from defined bacterial communities. Our model better fits the experimental data despite being simpler than previous models. We illustrate how our model can be used to evaluate protocols, to understand the effect of bias on downstream statistical analyses, and to measure and correct bias given suitable calibration controls. These results illuminate new avenues toward truly quantitative and reproducible metagenomics measurements.
Keywords: 16S rRNA gene; bias; calibration; computational biology; infectious disease; metagenomics; microbiology; microbiome; reproducibility; systems biology.
© 2019, McLaren et al.
Conflict of interest statement
MM, AW, BC No competing interests declared
Figures













Similar articles
-
The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies.BMC Microbiol. 2015 Mar 21;15:66. doi: 10.1186/s12866-015-0351-6. BMC Microbiol. 2015. PMID: 25880246 Free PMC article.
-
Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data.Microbiome. 2018 Dec 17;6(1):226. doi: 10.1186/s40168-018-0605-2. Microbiome. 2018. PMID: 30558668 Free PMC article.
-
Full-length 16S rRNA gene classification of Atlantic salmon bacteria and effects of using different 16S variable regions on community structure analysis.Microbiologyopen. 2019 Oct;8(10):e898. doi: 10.1002/mbo3.898. Epub 2019 Jul 4. Microbiologyopen. 2019. PMID: 31271529 Free PMC article.
-
Current challenges and best-practice protocols for microbiome analysis.Brief Bioinform. 2021 Jan 18;22(1):178-193. doi: 10.1093/bib/bbz155. Brief Bioinform. 2021. PMID: 31848574 Free PMC article. Review.
-
Field and laboratory guidelines for reliable bioinformatic and statistical analysis of bacterial shotgun metagenomic data.Crit Rev Biotechnol. 2024 Sep;44(6):1164-1182. doi: 10.1080/07388551.2023.2254933. Epub 2023 Sep 20. Crit Rev Biotechnol. 2024. PMID: 37731336 Review.
Cited by
-
Patient propagules: Do soil archives preserve the legacy of fungal and prokaryotic communities?PLoS One. 2020 Aug 11;15(8):e0237368. doi: 10.1371/journal.pone.0237368. eCollection 2020. PLoS One. 2020. PMID: 32780777 Free PMC article.
-
De novo identification of microbial contaminants in low microbial biomass microbiomes with Squeegee.Nat Commun. 2022 Nov 10;13(1):6799. doi: 10.1038/s41467-022-34409-z. Nat Commun. 2022. PMID: 36357382 Free PMC article.
-
The community ecology perspective of omics data.Microbiome. 2022 Dec 13;10(1):225. doi: 10.1186/s40168-022-01423-8. Microbiome. 2022. PMID: 36510248 Free PMC article.
-
Disentangling bias for non-destructive insect metabarcoding.PeerJ. 2022 Feb 23;10:e12981. doi: 10.7717/peerj.12981. eCollection 2022. PeerJ. 2022. PMID: 35228909 Free PMC article.
-
Ensuring That Fundamentals of Quantitative Microbiology Are Reflected in Microbial Diversity Analyses Based on Next-Generation Sequencing.Front Microbiol. 2022 Mar 1;13:728146. doi: 10.3389/fmicb.2022.728146. eCollection 2022. Front Microbiol. 2022. PMID: 35300475 Free PMC article.
References
-
- Aitchison J. The Statistical Analysis of Compositional Data. London: Chapman and Hall; 1986.
-
- Aitchison J. On criteria for measures of compositional difference. Mathematical Geology. 1992;24:365–379. doi: 10.1007/BF00891269. - DOI
-
- Aitchison J. A concise guide to compositional data analysis. 2nd Compositional Data Analysis Workshop; 2003.
-
- Barceló-Vidal C, Martín-Fernández JA, Pawlowsky-Glahn V. Mathematical foundations of compositional data analysis. Proceedings of IAMG'01—The Sixth Annual Conference of the International Association for Mathematical Geology; 2001.
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources