Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Apr;65(3):709-19.
doi: 10.1007/s00248-012-0145-4. Epub 2012 Dec 12.

Effects of OTU clustering and PCR artifacts on microbial diversity estimates

Affiliations

Effects of OTU clustering and PCR artifacts on microbial diversity estimates

Nastassia V Patin et al. Microb Ecol. 2013 Apr.

Abstract

Next-generation sequencing has increased the coverage of microbial diversity surveys by orders of magnitude, but differentiating artifacts from rare environmental sequences remains a challenge. Clustering 16S rRNA sequences into operational taxonomic units (OTUs) organizes sequence data into groups of 97 % identity, helping to reduce data volumes and avoid analyzing sequencing artifacts by grouping them with real sequences. Here, we analyze sequence abundance distributions across environmental samples and show that 16S rRNA sequences of >99 % identity can represent functionally distinct microorganisms, rendering OTU clustering problematic when the goal is an accurate analysis of organism distribution. Strict postsequencing quality control (QC) filters eliminated the most prevalent artifacts without clustering. Further experiments proved that DNA polymerase errors in polymerase chain reaction (PCR) generate a significant number of substitution errors, most of which pass QC filters. Based on our findings, we recommend minimizing the number of PCR cycles in DNA library preparation and applying strict postsequencing QC filters to reduce the most prevalent artifacts while maintaining a high level of accuracy in diversity estimates. We further recommend correlating rare and abundant sequences across environmental samples, rather than clustering into OTUs, to identify remaining sequence artifacts without losing the resolution afforded by high-throughput sequencing.

PubMed Disclaimer

References

    1. Appl Environ Microbiol. 2011 Jun;77(11):3846-52 - PubMed
    1. Science. 2009 May 29;324(5931):1190-2 - PubMed
    1. Appl Environ Microbiol. 2007 Jul;73(14):4532-42 - PubMed
    1. ISME J. 2012 Jan;6(1):183-94 - PubMed
    1. Nature. 2004 Mar 4;428(6978):37-43 - PubMed

MeSH terms

LinkOut - more resources