Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 May 1;8(1):8.
doi: 10.1186/1479-7364-8-8.

QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs

Affiliations

QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs

Scott Frees et al. Hum Genomics. .

Abstract

Background: Nucleic acids containing guanine tracts can form quadruplex structures via non-Watson-Crick base pairing. Formation of G-quadruplexes is associated with the regulation of important biological functions such as transcription, genetic instability, DNA repair, DNA replication, epigenetic mechanisms, regulation of translation, and alternative splicing. G-quadruplexes play important roles in human diseases and are being considered as targets for a variety of therapies. Identification of functional G-quadruplexes and the study of their overall distribution in genomes and transcriptomes is an important pursuit. Traditional computational methods map sequence motifs capable of forming G-quadruplexes but have difficulty in distinguishing motifs that occur by chance from ones which fold into G-quadruplexes.

Results: We present Quadruplex forming 'G'-rich sequences (QGRS)-Conserve, a computational method for calculating motif conservation across exomes and supports filtering to provide researchers with more precise methods of studying G-quadruplex distribution patterns. Our method quantitatively evaluates conservation between quadruplexes found in homologous nucleotide sequences based on several motif structural characteristics. QGRS-Conserve also efficiently manages overlapping G-quadruplex sequences such that the resulting datasets can be analyzed effectively.

Conclusions: We have applied QGRS-Conserve to identify a large number of G-quadruplex motifs in the human exome conserved across several mammalian and non-mammalian species. We have successfully identified multiple homologs of many previously published G-quadruplexes that play post-transcriptional regulatory roles in human genes. Preliminary large-scale analysis identified many homologous G-quadruplexes in the 5'- and 3'-untranslated regions of mammalian species. An expectedly smaller set of G-quadruplex motifs was found to be conserved across larger phylogenetic distances. QGRS-Conserve provides means to build datasets that can be filtered and categorized in a variety of biological dimensions for more targeted studies in order to better understand the roles that G-quadruplexes play.

PubMed Disclaimer

Figures

Figure 1
Figure 1
G-quadruplex structure. Left: an intramolecular G-quadruplex formed by a G3AG3CUG3CG3 RNA motif. Right: a G-tetrad structure.
Figure 2
Figure 2
Overlap percentage computation. The extension of sequence length using 50% of the total sequence length as padding inflates overlap scores for proximate sequences.
Figure 3
Figure 3
Algorithm stages for SNCA mRNA. Overview of the algorithm stages for QGRS identification (independently) on each mRNA, performing a semi-global alignment on the homologs, and finally evaluating the QGRS pair for conservation.
Figure 4
Figure 4
Overlapping QGRS yielding different G-scores. A small sample of hundreds of overlapping QGRS found in the ‘GGATGAGCCTGTGGGTGGAGGGCAGTGGAGCGGGCTGGG’ sequence of human GREB1 gene.
Figure 5
Figure 5
Choosing QGRS considered for conservation calculations. The number next to each QGRS motif represents its calculated G-score. This figure demonstrates how it is possible that QGRS with highest G-score do not necessarily yield the highest conservation score.

Similar articles

Cited by

References

    1. Gellert M, Lipsett MN, Davies DR. Helix formation by guanylic acid. Proc Natl Acad Sci U S A. 1962;48:2013–2018. doi: 10.1073/pnas.48.12.2013. - DOI - PMC - PubMed
    1. Simonsson T. G-quadruplex DNA structures–variations on a theme. Biol Chem. 2001;382:621–628. - PubMed
    1. Davis JT. G-quartets 40 years later: from 5′-GMP to molecular biology and supramolecular chemistry. Angew Chem Int Ed Engl. 2004;43:668–698. doi: 10.1002/anie.200300589. - DOI - PubMed
    1. Burge S, Parkinson GN, Hazel P, Todd AK, Neidle S. Quadruplex DNA: sequence, topology and structure. Nucleic Acids Res. 2006;34:5402–5415. doi: 10.1093/nar/gkl655. - DOI - PMC - PubMed
    1. Ji X, Sun H, Zhou H, Xiang J, Tang Y, Zhao C. Research progress of RNA quadruplex. Nucleic Acid Ther. 2011;21:185–200. doi: 10.1089/nat.2010.0272. - DOI - PubMed

Publication types