Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 4:6:771.
doi: 10.3389/fmicb.2015.00771. eCollection 2015.

Primer and platform effects on 16S rRNA tag sequencing

Affiliations

Primer and platform effects on 16S rRNA tag sequencing

Julien Tremblay et al. Front Microbiol. .

Abstract

Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6-V8, and V7-V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as well as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. Beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.

Keywords: 16S rRNA gene sequencing; amplification; community assembly; high throughput sequencing; microbial diversity; microbial population and community ecology; sequencing error.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Timeline indicating major breakthroughs in experimental and theoretical work in the field of 16S rRNA gene sequencing.
Figure 2
Figure 2
Error estimation for various sequencing configurations. (A) Insertion, deletion and substitution error frequency per 1000 P. suwonensis reads before and after lenient and stringent QC. Error frequency was calculated from triplicates for each sequencing condition. Error bars represent standard deviation. (B) Position of insertion, deletion and substitution error frequency in 16S tag amplicon sequences in which no QC filter has been applied.
Figure 3
Figure 3
Observed OTUs rarefaction estimation curves for P. suwonensis and synthetic community DNA pool. A dotted black line shows the theoretical number of expected OTUs for P. suwonensis (PS) and a red line for the synthetic community (SC). All OTU tables used to generate rarefaction curves were rarefied to 2893 reads per sample.
Figure 4
Figure 4
Taxonomy heatmaps of all 16S data from (A) the synthetic community DNA pool and (B) samples from a wetlands sampling site. Color scale is defined as log2 of percentage values of each taxon.
Figure 5
Figure 5
Procrustes rotation comparison of weighted UniFrac, unweighted UniFrac and Bray-Curtis coordinates metrics for various wetland 16S tag data types.

References

    1. Acinas S. G., Sarma-Rupavtarm R., Klepac-Ceraj V., Polz M. F. (2005). PCR-induced sequence artifacts and bias: insights from comparison of two 16S rRNA clone libraries constructed from the same sample. Appl. Environ. Microbiol. 71, 8966–8969. 10.1128/AEM.71.12.8966-8969.2005 - DOI - PMC - PubMed
    1. Baker G. C., Smith J. J., Cowan D. A. (2003). Review and re-analysis of domain-specific 16S primers. J. Microbiol. Methods 55, 541–555. 10.1016/j.mimet.2003.08.009 - DOI - PubMed
    1. Benjamini Y., Speed T. P. (2012). Summarizing and correcting the GC content bias in high-throughput sequencing. Nucleic Acids Res. 40, e72. 10.1093/nar/gks001 - DOI - PMC - PubMed
    1. Bulgarelli D., Rott M., Schlaeppi K., Ver Loren Van Themaat E., Ahmadinejad N., Assenza F., et al. . (2012). Revealing structure and assembly cues for Arabidopsis root-inhabiting bacterial microbiota. Nature 488, 91–95. 10.1038/nature11336 - DOI - PubMed
    1. Caporaso J. G., Kuczynski J., Stombaugh J., Bittinger K., Bushman F. D., Costello E. K., et al. . (2010). QIIME allows analysis of high-throughput community sequencing data. Nat. Methods 7, 335–336. 10.1038/nmeth.f.303 - DOI - PMC - PubMed