Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Nov 27;7(12):3416-25.
doi: 10.1093/gbe/evv234.

Assessing the Unseen Bacterial Diversity in Microbial Communities

Affiliations

Assessing the Unseen Bacterial Diversity in Microbial Communities

Alejandro Caro-Quintero et al. Genome Biol Evol. .

Abstract

For both historical and technical reasons, 16S ribosomal RNA has been the most common molecular marker used to analyze the contents of microbial communities. However, its slow rate of evolution hinders the resolution of closely related bacteria--individual 16S-phylotypes, particularly when clustered at 97% sequence identity, conceal vast amounts of species- and strain-level variation. Protein-coding genes, which evolve more quickly, are useful for differentiating among more recently diverged lineages, but their application is complicated by difficulties in designing low-redundancy primers that amplify homologous regions from distantly related taxa. Given the now-common practice of multiplexing hundreds of samples, adopting new genes usually entails the synthesis of large sets of barcoded primers. To circumvent problems associated with use of protein-coding genes to survey microbial communities, we develop an approach--termed phyloTAGs--that offers an automatic solution for primer design and can be easily adapted to target different taxonomic groups and/or different protein-coding regions. We applied this method to analyze diversity within the gorilla gut microbiome and recovered hundreds of strains that went undetected after deep-sequencing of 16S rDNA amplicons. PhyloTAGs provides a powerful way to recover the fine-level diversity within microbial communities and to study stability and dynamics of bacterial populations.

Keywords: bacterial species; bacterial strain diversity; community profiling; microbiome; phylotypes; population structure.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—
Fig. 1.—
Correspondence between levels of sequence divergence and estimates of OTU richness for gyrB and 16S rDNA. (A) Association between the degree of sequence identity of 16S rDNA and the gyrB gene for pairs of genomes assigned to the same species. Note that 1) a 16S rDNA sequence identity value of 97%, which is conventionally used to delineate bacterial species, corresponds to 88% nucleotide sequence identity for gyrB, and 2) a 16S rDNA sequence identity value of 99%, which has been used to delineate strains within a designated bacterial species, corresponds to 96% nucleotide sequence identity for gyrB. A total of 604 genomes were examined. (B) Richness of the Lachnospiraceae family within all samples, as estimated by the Chao1 index for gyrB phyloTAGs and the 16S iTAGs at several values of OTU clustering. Estimation of parameters based on subsampling the data sets for each marker gene to the same depth with 100 bootstraps replicates. Shaded zones around rarefaction curves represent the 95% confidence intervals. Dashes show read numbers obtained after extrapolation to sample sizes larger than the actual total number of reads for the 16S data set. (C) Rarefaction analysis of sample coverage for data sets analyzed in panel (B), using identical subsampling parameters. As in (B), shaded zones around the rarefaction curves represent 95% confidence intervals, and dashed lines indicating trends after extrapolation to sample sizes larger than the actual total number of reads.
F<sc>ig</sc>. 2.—
Fig. 2.—
Community structure and OTU diversity recovered by gyrB phyloTAGs. Progressive clustering of gyrB phyloTAG sequences at decreasing levels of amino acid identity (based on conceptually translated nucleotide sequences) reconstructs the diversity and number of taxa within the Lachnospiraceae at different phylogenetic depths. Branches in cladogram are colored according to level of identity at which sequences are clustered. Four major branches (i.e., lineages) occur when clustered at 55% amino acid identity (black), 5 at 60% identity (turquoise), 12 at 65% identity (blue), and so on, with amount of diversification at lower taxonomic scales being highly variable among lineages. Black circles at the terminal end of each branch are sized according to the number of sequences affiliated to a cluster at levels >95% amino acid identity, and the five largest clusters are labeled with roman numerals (I–V). The inset shows the fine-level resolution of OTU variation in Lachnospiraceae cluster I. In this case, progressive clustering of gyrB phyloTAGs was performed on nucleotide sequences and revealed that most of the sequence variation assorts into 99% OTUs, that is, at the level of closely related strains within species.

References

    1. Acinas SG, Sarma-Rupavtarm R, Klepac-Ceraj V, Polz MF. 2005. PCR-induced sequence artifacts and bias: insights from comparison of two 16S rRNA clone libraries constructed from the same sample. Appl Environ Microbiol. 71:8966–8969. - PMC - PubMed
    1. Barret M, et al. 2015. Emergence shapes the structure of the seed-microbiota. Appl Environ Microbiol. 81:1257–1266. - PMC - PubMed
    1. Bartram AK, Lynch MDJ, Stearns JC, Moreno-Hagelsieb G, Neufeld JD. 2011. Generation of multimillion-sequence 16S rRNA gene libraries from complex microbial communities by assembling paired-end Illumina reads. Appl Environ Microbiol. 77:3846–3852. - PMC - PubMed
    1. Brettar I, Moore ER, Hofle MG. 2001. Phylogeny and abundance of novel denitrifying bacteria isolated from the water column of the central Baltic Sea. Microb Ecol 42:295–305. - PubMed
    1. Caporaso JG, et al. 2011. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc Natl Acad Sci U S A. 108:4516–4522. - PMC - PubMed

Publication types

LinkOut - more resources