Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Oct 24;8(1):15679.
doi: 10.1038/s41598-018-33944-4.

Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes

Affiliations

Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes

Yun Ding et al. Sci Rep. .

Abstract

Genomes provide a platform for storage of chemical information that must be stable under the context in which an organism thrives. The 2'-deoxyguanosine (G) nucleotide has the potential to provide additional chemical information beyond its Watson-Crick base-pairing capacity. Sequences with four or more runs of three G nucleotides each are potential G-quadruplex forming sequences (PQSs) that can adopt G-quadruplex folds. Herein, we analyzed sequenced genomes from the NCBI database to determine the PQS densities of the genome sequences. First, we found organisms with large genomes, including humans, alligators, and maize, have similar densities of PQSs (~300 PQSs/Mbp), and the genomes are significantly enriched in PQSs with more than four G tracks. Analysis of microorganism genomes found a greater diversity of PQS densities. In general, PQS densities positively tracked with the GC% of the genome. Exceptions to this observation were the genomes from thermophiles that had many more PQSs than expected by random chance. Analysis of the location of these PQSs in annotated genomes from the order Thermales showed these G-rich sequences to be randomly distributed; in contrast, in the order Deinococcales the PQSs were enriched and biased around transcription start sites of genes. Four representative PQSs, two each from the Thermales and Deinococcales, were studied by biophysical methods to establish the ability of them to fold to G-quadruplexes. The experiments found the two PQSs in the Thermales did not adopt G-quadruplex folds, while the two most common in the Deinococcales adopted stable parallel-stranded G-quadruplexes. The findings lead to a hypothesis that thermophilic organisms are enriched with PQSs as an unavoidable consequence to stabilize thermally their genomes to live at high temperature; in contrast, the genomes from stress-resistant bacteria found in the Deinococcales may utilize PQSs for gene regulatory purposes.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Characteristics of a G-quadruplex. (A) General sequence formula for a PQS. (B) Structure of a G-tetrad. (C) Cartoon representations for a parallel and antiparallel G4.
Figure 2
Figure 2
The density of PQSs in the genomes of select organisms as a function of the genome length, GC%, and percentage of PQSs with >4 G tracks.
Figure 3
Figure 3
Plots of PQS densities and GC% for the phyla of bacteria with representative genomes found in the NCBI database. There exist six phyla of bacteria (Caldiserica, Calditrichaeota, Chrysiogenetes, Dictyoglomi, Elusimicrobia, and Thermodesulfobacteria) that have fewer than four representative genomes sequenced; these data can be found in the supporting information (Fig. S2). Three panels represent superphyla of bacteria that include the FCB group (phyla = Fibrobacteres, Chlorobi, and Bacteroidetes), PVC group (phyla = Planctomycetes, Verrucomicrobia, Chlamydiae, and Lentisphaerae), and the Terrabacteria group (phyla = Cyanobacteria, Chlorflexi, and Deinococcus-Thermus).
Figure 4
Figure 4
Profiles of the PQSs found in the phylum Deinococcus-Thermus. (A) Plot of densities of PQSs vs. GC%. The red box plots represent the theoretical PQS densities determined from analysis of randomized genomes with defined GC%. (B) Distribution of PQSs around TSSs binned at 30 nts. (C) Distributions of PQSs on the coding or template strand in the orders Deinoccocales and Thermales. (D) Distributions of >4 G-track PQSs on the coding or template strand in the orders Deinoccocales and Thermales.
Figure 5
Figure 5
Hierarchical classification in the phylum Deinococcus-Thermus on the basis of the distribution of PQSs around TSSs. Unsupervised hierarchical classification was performed on the vectors of the PQS distributions around the TSSs. The bacterial names were colored coded after classification.
Figure 6
Figure 6
Representative PQSs selected for biophysical characterization to establish G4 folding potential. (A) The sequences selected for study by (B) 1H-NMR, (C) CD spectroscopy, and (D) thermal melting analysis (i.e., Tm).

Similar articles

Cited by

References

    1. O’Sullivan RJ, Karlseder J. Telomeres: protecting chromosomes against genome instability. Nat. Rev. Mol. Cell Biol. 2010;11:171–181. doi: 10.1038/nrm2848. - DOI - PMC - PubMed
    1. Morgunova E, Taipale J. Structural perspective of cooperative transcription factor binding. Curr. Opin. Struct. Biol. 2017;47:1–8. doi: 10.1016/j.sbi.2017.03.006. - DOI - PubMed
    1. Brazda V, Laister RC, Jagelska EB, Arrowsmith C. Cruciform structures are a common DNA feature important for regulating biological processes. BMC Mol. Biol. 2011;12:33. doi: 10.1186/1471-2199-12-33. - DOI - PMC - PubMed
    1. Matos J, West SC. Holliday junction resolution: regulation in space and time. DNA Repair (Amst) 2014;19:176–181. doi: 10.1016/j.dnarep.2014.03.013. - DOI - PMC - PubMed
    1. Kendrick S, Hurley LH. The role of G-quadruplex/i-motif secondary structures as cis-acting regulatory elements. Pure Appl. Chem. 2010;82:1609–1621. doi: 10.1351/PAC-CON-09-09-29. - DOI - PMC - PubMed

Publication types