Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data
- PMID: 30558668
- PMCID: PMC6298009
- DOI: 10.1186/s40168-018-0605-2
Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data
Abstract
Background: The accuracy of microbial community surveys based on marker-gene and metagenomic sequencing (MGS) suffers from the presence of contaminants-DNA sequences not truly present in the sample. Contaminants come from various sources, including reagents. Appropriate laboratory practices can reduce contamination, but do not eliminate it. Here we introduce decontam ( https://github.com/benjjneb/decontam ), an open-source R package that implements a statistical classification procedure that identifies contaminants in MGS data based on two widely reproduced patterns: contaminants appear at higher frequencies in low-concentration samples and are often found in negative controls.
Results: Decontam classified amplicon sequence variants (ASVs) in a human oral dataset consistently with prior microscopic observations of the microbial taxa inhabiting that environment and previous reports of contaminant taxa. In metagenomics and marker-gene measurements of a dilution series, decontam substantially reduced technical variation arising from different sequencing protocols. The application of decontam to two recently published datasets corroborated and extended their conclusions that little evidence existed for an indigenous placenta microbiome and that some low-frequency taxa seemingly associated with preterm birth were contaminants.
Conclusions: Decontam improves the quality of metagenomic and marker-gene sequencing by identifying and removing contaminant DNA sequences. Decontam integrates easily with existing MGS workflows and allows researchers to generate more accurate profiles of microbial communities at little to no additional cost.
Keywords: 16S rRNA gene; DNA contamination; Marker-gene; Metagenomics; Microbiome.
Conflict of interest statement
Ethics approval and consent to participate
The study was approved by the Administrative Panels on Human Subjects Research (Stanford IRB protocol #21586) and by the Human Research Protection Program at UCSF (UCSF IRB protocol ##11-06283). All research subjects provided written informed consent prior to specimen collection.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures






Comment in
-
Towards precision quantification of contamination in metagenomic sequencing experiments.Microbiome. 2019 Apr 16;7(1):62. doi: 10.1186/s40168-019-0678-6. Microbiome. 2019. PMID: 30992055 Free PMC article.
Similar articles
-
Does the human placenta delivered at term have a microbiota? Results of cultivation, quantitative real-time PCR, 16S rRNA gene sequencing, and metagenomics.Am J Obstet Gynecol. 2019 Mar;220(3):267.e1-267.e39. doi: 10.1016/j.ajog.2018.10.018. Am J Obstet Gynecol. 2019. PMID: 30832984 Free PMC article.
-
Controlling for Contaminants in Low-Biomass 16S rRNA Gene Sequencing Experiments.mSystems. 2019 Jun 4;4(4):e00290-19. doi: 10.1128/mSystems.00290-19. mSystems. 2019. PMID: 31164452 Free PMC article.
-
Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens.BMC Microbiol. 2020 May 12;20(1):113. doi: 10.1186/s12866-020-01795-7. BMC Microbiol. 2020. PMID: 32397992 Free PMC article.
-
Is there a placental microbiota? A critical review and re-analysis of published placental microbiota datasets.BMC Microbiol. 2023 Mar 18;23(1):76. doi: 10.1186/s12866-023-02764-6. BMC Microbiol. 2023. PMID: 36934229 Free PMC article. Review.
-
Identifying a healthy oral microbiome through metagenomics.Clin Microbiol Infect. 2012 Jul;18 Suppl 4:54-7. doi: 10.1111/j.1469-0691.2012.03857.x. Clin Microbiol Infect. 2012. PMID: 22647051 Review.
Cited by
-
Associations of upper respiratory mucosa microbiota with Rheumatoid arthritis, autoantibodies, and disease activity.PLoS One. 2024 Aug 6;19(8):e0308010. doi: 10.1371/journal.pone.0308010. eCollection 2024. PLoS One. 2024. PMID: 39106279 Free PMC article.
-
Multiomics Characterization of the Canada Goose Fecal Microbiome Reveals Selective Efficacy of Simulated Metagenomes.Microbiol Spectr. 2022 Dec 21;10(6):e0238422. doi: 10.1128/spectrum.02384-22. Epub 2022 Nov 1. Microbiol Spectr. 2022. PMID: 36318011 Free PMC article.
-
Impact of intrapartum antibiotic prophylaxis on the oral and fecal bacteriomes of children in the first week of life.Sci Rep. 2024 Aug 6;14(1):18163. doi: 10.1038/s41598-024-68953-z. Sci Rep. 2024. PMID: 39107353 Free PMC article.
-
Fecal virus-like particles are sufficient to reduce necrotizing enterocolitis.Gut Microbes. 2024 Jan-Dec;16(1):2392876. doi: 10.1080/19490976.2024.2392876. Epub 2024 Aug 22. Gut Microbes. 2024. PMID: 39172643 Free PMC article.
-
Next-Generation Sequencing to Elucidate the Semen Microbiome in Male Reproductive Disorders.Medicina (Kaunas). 2023 Dec 22;60(1):25. doi: 10.3390/medicina60010025. Medicina (Kaunas). 2023. PMID: 38256286 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical