Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug 6;19(1):589.
doi: 10.1186/s12864-018-4984-3.

Modeling of the GC content of the substituted bases in bacterial core genomes

Affiliations

Modeling of the GC content of the substituted bases in bacterial core genomes

Jon Bohlin et al. BMC Genomics. .

Abstract

Background: The purpose of the present study was to examine the GC content of substituted bases (sbGC) in the core genomes of 35 bacterial species. Each species, or core genome, constituted genomes from at least 10 strains. We also wanted to explore whether sbGC for each strain was associated with the corresponding species' core genome GC content (cgGC). We present a simple mathematical model that estimates sbGC from cgGC. The model assumes only that the estimated sbGC is a function of cgGC proportional to fixed AT→GC (α) and GC → AT (β) mutation rates. Non-linear regression was used to estimate parameters α and β from the empirical data described above.

Results: We found that sbGC for each strain showed a non-linear association with the corresponding cgGC with a bias towards higher GC content for most core genomes (66.3% of the strains), assuming as a null-hypothesis that sbGC should be approximately equal to cgGC. The most GC rich core genomes (i.e. approximately %GC > 60), on the other hand, exhibited slightly less GC-biased sbGC than expected. The best fitted regression model indicates that GC → AT mutation rates β = (1.91 ± 0.13) p < 0.001 are approximately (1.91/0.79) = 2.42 times as high, on average, as AT→GC α = (- 0.79 ± 0.25) p < 0.001 mutation rates. Whether the observed sbGC GC-bias for all but the most GC-rich prokaryotic species is due to selection, compensating for the GC → AT mutation bias, and/or selective neutral processes is currently debated. Residual standard error was found to be σ = 0.076 indicating estimated errors of sbGC to be approximately within ±15.2% GC (95% confidence interval) for the strains of all species in the study.

Conclusion: Not only did our mathematical model give reasonable estimates of sbGC it also provides further support to previous observations that mutation rates in prokaryotes exhibit a universal GC → AT bias that appears to be remarkably consistent between taxa.

Keywords: Core genome; Core genome GC content; Mathematical modeling; Microbial genomics; SNP GC content; Statistical parameter estimation.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
sbGC plotted against cgGC. The graph shows sbGC, for each strain, on the y-axis plotted against corresponding cgGC on the x-axis each point coloured according to phyla. The dashed line designates sbGC = cgGC while the blue points represent gcMOD fitted to the data using non-linear regression
Fig. 2
Fig. 2
Difference in sbGC- and cgGC plotted against cgGC. The figure shows the difference between sbGC and corresponding cgGC (i.e. sbGC subtracted from cgGC) plotted against cgGC (horizontal axis) as well as the estimated values from (gcMOD-cgGC) (blue points)

References

    1. McCutcheon JP, Moran NA. Extreme genome reduction in symbiotic bacteria. Nat Rev Microbiol. 2012;10(1):13–26. doi: 10.1038/nrmicro2670. - DOI - PubMed
    1. Nishida H. Evolution of genome base composition and genome size in bacteria. Front Microbiol. 2012;3:420. doi: 10.3389/fmicb.2012.00420. - DOI - PMC - PubMed
    1. Reichenberger ER, Rosen G, Hershberg U, Hershberg R. Prokaryotic nucleotide composition is shaped by both phylogeny and the environment. Genome Biol Evol. 2015;7(5):1380–1389. doi: 10.1093/gbe/evv063. - DOI - PMC - PubMed
    1. Foerstner KU, von Mering C, Hooper SD, Bork P. Environments shape the nucleotide composition of genomes. EMBO Rep. 2005;6(12):1208–1213. doi: 10.1038/sj.embor.7400538. - DOI - PMC - PubMed
    1. Hildebrand F, Meyer A, Eyre-Walker A. Evidence of selection upon genomic GC-content in bacteria. PLoS Genet. 2010;6(9):e1001107. doi: 10.1371/journal.pgen.1001107. - DOI - PMC - PubMed

LinkOut - more resources