Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar;21(3):494-504.
doi: 10.1101/gr.112730.110. Epub 2011 Jan 6.

Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons

Affiliations

Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons

Brian J Haas et al. Genome Res. 2011 Mar.

Abstract

Bacterial diversity among environmental samples is commonly assessed with PCR-amplified 16S rRNA gene (16S) sequences. Perceived diversity, however, can be influenced by sample preparation, primer selection, and formation of chimeric 16S amplification products. Chimeras are hybrid products between multiple parent sequences that can be falsely interpreted as novel organisms, thus inflating apparent diversity. We developed a new chimera detection tool called Chimera Slayer (CS). CS detects chimeras with greater sensitivity than previous methods, performs well on short sequences such as those produced by the 454 Life Sciences (Roche) Genome Sequencer, and can scale to large data sets. By benchmarking CS performance against sequences derived from a controlled DNA mixture of known organisms and a simulated chimera set, we provide insights into the factors that affect chimera formation such as sequence abundance, the extent of similarity between 16S genes, and PCR conditions. Chimeras were found to reproducibly form among independent amplifications and contributed to false perceptions of sample diversity and the false identification of novel taxa, with less-abundant species exhibiting chimera rates exceeding 70%. Shotgun metagenomic sequences of our mock community appear to be devoid of 16S chimeras, supporting a role for shotgun metagenomics in validating novel organisms discovered in targeted sequence surveys.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Formation of chimeric sequences during PCR. An aborted extension product from an earlier cycle of PCR can function as a primer in a subsequent PCR cycle. If this aborted extension product anneals to and primes DNA synthesis from an improper template, a chimeric molecule is formed.
Figure 2.
Figure 2.
Comparison of chimera detection sensitivity among methods. (A) Chimera detection sensitivity as a function of chimera divergence; (B) chimera detection sensitivity according to the shared level of taxonomy between the proposed parental sequences. Cumulative false-positive rates were as follows: CS, 1.6%; WigeoN, 0.67%; BellerophonGG, 7.13%; KmerGenus, 0%.
Figure 3.
Figure 3.
Correlation of chimera content with sequence homology and organism abundance. (A) Percent of other organism abundance corresponding to chimeras with the indicated more abundant species (y-axis), plotted according to percent identity (x-axis) between homologous 16S genes. (B) Number of chimeric sequences corresponding to a given genus were plotted as a function of total genus-level classified reads for the even (eMC) and staggered (sMC) mock community. Total read counts were based on best BLASTN match (E ≤ 10−10) to reference sequences for nonchimeras in addition to the genus representation within the CS-predicted chimeras. (C) Percent of sequences that correspond to chimeras for each genus plotted according to genus-level sequence abundance. Error bars correspond to standard error from the mean based on four technical replicates.
Figure 4.
Figure 4.
Alignment of sequences corresponding to chimeras between Streptococcus and Staphylococcus 16S rRNA genes. Only columns from the NAST multiple alignment containing nonidentical nucleotides between the reference sequences (top and bottom) are shown. Nucleotides matching Streptococcus sequences are colored red. Sequence prefixes correspond to the four experimental replicates A–D.

References

    1. Acinas SG, Sarma-Rupavtarm R, Klepac-Ceraj V, Polz MF 2005. PCR-induced sequence artifacts and bias: Insights from comparison of two 16S rRNA clone libraries constructed from the same sample. Appl Environ Microbiol 71: 8966–8969 - PMC - PubMed
    1. Ashelford KE, Chuzhanova NA, Fry JC, Jones AJ, Weightman AJ 2005. At least 1 in 20 16S rRNA sequence records currently held in public repositories is estimated to contain substantial anomalies. Appl Environ Microbiol 71: 7724–7736 - PMC - PubMed
    1. Ashelford KE, Chuzhanova NA, Fry JC, Jones AJ, Weightman AJ 2006. New screening software shows that most recent large 16S rRNA gene clone libraries contain chimeras. Appl Environ Microbiol 72: 5734–5741 - PMC - PubMed
    1. Boucher Y, Douady CJ, Sharma AK, Kamekura M, Doolittle WF 2004. Intragenomic heterogeneity and intergenomic recombination among haloarchaeal rRNA genes. J Bacteriol 186: 3980–3990 - PMC - PubMed
    1. Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, Fierer N, Pena AG, Goodrich JK, Gordon JI, et al. 2010. QIIME allows analysis of high-throughput community sequencing data. Nat Methods 7: 335–336 - PMC - PubMed

Publication types

LinkOut - more resources