Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jul;8(7):1428-39.
doi: 10.1038/ismej.2013.248. Epub 2014 Jan 23.

Evolutionary analysis of a streamlined lineage of surface ocean Roseobacters

Affiliations

Evolutionary analysis of a streamlined lineage of surface ocean Roseobacters

Haiwei Luo et al. ISME J. 2014 Jul.

Abstract

The vast majority of surface ocean bacteria are uncultivated. Compared with their cultured relatives, they frequently exhibit a streamlined genome, reduced G+C content and distinct gene repertoire. These genomic traits are relevant to environmental adaptation, and have generally been thought to become fixed in marine bacterial populations through selection. Using single-cell genomics, we sequenced four uncultivated cells affiliated with the ecologically relevant Roseobacter clade and used a composition-heterogeneous Bayesian phylogenomic model to resolve these single-cell genomes into a new clade. This lineage has no representatives in culture, yet accounts for ∼35% of Roseobacters in some surface ocean waters. Analyses of multiple genomic traits, including genome size, G+C content and percentage of noncoding DNA, suggest that these single cells are representative of oceanic Roseobacters but divergent from isolates. Population genetic analyses showed that substitution of physicochemically dissimilar amino acids and replacement of G+C-rich to G+C-poor codons are accelerated in the uncultivated clade, processes that are explained equally well by genetic drift as by the more frequently invoked explanation of natural selection. The relative importance of drift vs selection in this clade, and perhaps in other marine bacterial clades with streamlined G+C-poor genomes, remains unresolved until more evidence is accumulated.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Regression model for Roseobacter SAG genome size estimation based on genome statistics for 40 cultured Roseobacter strains. The x axis shows the ratio of the number of conserved single-copy genes universally present in fully sequenced Roseobacter genomes to the number of predicted protein-coding genes in a genome; the y axis is the number of nucleotides sequenced. The data were fit to a polynomial regression model (R2=0.93), and the model was used to estimate the genome sizes of the four SAGs (C03, O19, K06 and J04), which were found to be 2.64, 3.10, 3.50 and 2.65 Mb, respectively. The prediction interval (PI) is also shown.
Figure 2
Figure 2
Genomic characteristics of the three SAG-O19 clade members compared with metagenomic roseobacter sequences and isolates. Distribution of G+C content sampled from Global Ocean Survey (GOS) metagenomic data sets (a) and cultured Roseobacter genomes (b). For panel a, the GOS roseobacter reads (n=5608) were identified using the dN pipeline software, which assigns metagenomic reads to a microbial clade with high confidence (Luo et al., 2012). For panel b, the cultured roseobacter genomes were randomly sheared to generate an in silico metagenome, and then analyzed as described for panel a. The gray arrow in panel b indicates the average G+C content of the SAGs. The fraction of noncoding DNA (c) and estimated average genome size (d) in Roseobacter GOS reads, SAGs and isolates. The box-and-whiskers plots indicate the median (horizontal line), the boundary of the first (Q1) and third quartile (Q3) (box) and the range from Q1–1.5(Q3–Q1) to Q3+1.5(Q3–Q1) (whiskers); circles indicate outliers. The details for noncoding DNA fraction and genome size estimation can be found in the Materials and methods.
Figure 3
Figure 3
Bayesian phylogenomic tree of the Roseobacter clade using a composition-heterogeneous model in the P4 software package based on a concatenation of 52 single-copy orthologous protein sequences. The scale bar indicates the number of substitutions per site. The value near each internal branch is the posterior probability for that branch. The tree is rooted using four genomes from sister clades in the Alphaproteobacteria (Hyphomonadaceae, Caulobacterales, Rhizobiales); the outgroups are not shown. The SAG clade is highlighted in blue, an example control clade is highlighted in orange and the remaining members used to calculate dR/dC ratios are indicated in green. The basal lineage consisting of HTCC2255 and SAG C03 were not included in the analyses. G+C content and genome size are indicated in the two right columns, the latter estimated using a regression model for incomplete genomes (Figure 1). Plasmids, if any, are not included in genome size calculation.
Figure 4
Figure 4
The ratio of radical (dR) to conservative (dC) substitution rates based on physicochemical properties (shown here, charge) of amino acids for the SAG-O19 clade and six control clades. Bars indicate one standard deviation of the mean. For the control clades, only strain names are shown; full names can be found in Figure 3.

Similar articles

Cited by

References

    1. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. - PMC - PubMed
    1. Balbi KJ, Rocha EPC, Feil EJ. The temporal dynamics of slightly deleterious mutations in Escherichia coli and Shigella spp. Mol Biol Evol. 2009;26:345–355. - PubMed
    1. Bragg JG, Hyder CL. Nitrogen versus carbon use in prokaryotic genomes and proteomes. P Roy Soc B-Biol Sci. 2004;271:S374–S377. - PMC - PubMed
    1. Buchan A, Gonzalez JM, Moran MA. Overview of the marine Roseobacter lineage. Appl Environ Microbiol. 2005;71:5665–5677. - PMC - PubMed
    1. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. - PMC - PubMed

Publication types