Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jan;43(Database issue):D103-9.
doi: 10.1093/nar/gku977. Epub 2014 Oct 17.

GBshape: a genome browser database for DNA shape annotations

Affiliations

GBshape: a genome browser database for DNA shape annotations

Tsu-Pei Chiu et al. Nucleic Acids Res. 2015 Jan.

Abstract

Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Architecture of the GBshape database. GBshape consists of a high-throughput prediction platform, data depositories and a user interface. DNA shape annotations of entire genomes can be generated by the high-throughput prediction platform, which runs on a high-performance computing cluster (HPCC), and, together with genome sequences and UCSC Genome Browser standard annotations, stored in the data depositories. The user interface provides multiple functionalities for users to either visualize or download structural annotations.
Figure 2.
Figure 2.
Visual display of GBshape annotations in the genome browser for a specific position in the S. cerevisiae genome. (A) Genome positions and UCSC Genome Browser standard annotation tracks. (B) DNA shape annotation tracks for MGW, ProT, Roll, HelT and hydroxyl radical cleavage intensity (ORChID2). (C) Heat map views for DNA shape annotations.
Figure 3.
Figure 3.
Variation in MGW (blue) and ORChID2 (green) signals on average in nucleosome sequences from the (A) Caenorhabditis elegans, (B) Drosophila melanogaster and (C) human genomes. Numbering of the nucleotide position starts with −1 and 1 for the central two base pairs, respectively.
Figure 4.
Figure 4.
Variation in Roll (blue), HelT (green) and ProT (red) on average in nucleosome sequences from the (A) Caenorhabditis elegans, (B) Drosophila melanogaster and (C) human genomes. Numbering of the nucleotide position starts with −1 and 1 for the central two base pairs, respectively.
Figure 5.
Figure 5.
Average heat maps for four DNA shape features of TSSs and 50 bp up- and downstream in four fly species. The analysis is based on 3823 TSSs from the D. melanogaster, 6909 TSSs from the D. simulans, 7234 TSSs from the D. sechellia and 7397 TSSs from the D. pseudoobscura genomes. Column numbers in each heat map indicate the nucleotide position relative to the TSS. Black frames mark the locations of the Initiatior (Inr) element and TATA box.

References

    1. Rohs R., West S.M., Sosinsky A., Liu P., Mann R.S., Honig B. The role of DNA shape in protein-DNA recognition. Nature. 2009;461:1248–1253. - PMC - PubMed
    1. Slattery M., Zhou T., Yang L., Dantas Machado A.C., Gordân R., Rohs R. Absence of a simple code: how transcription factors read the genome. Trends Biochem. Sci. 2014;39:381–399. - PMC - PubMed
    1. Rohs R., Jin X., West S.M., Joshi R., Honig B., Mann R.S. Origins of specificity in protein-DNA recognition. Annu. Rev. Biochem. 2010;79:233–269. - PMC - PubMed
    1. Parker S.C., Tullius T.D. DNA shape, genetic codes, and evolution. Curr. Opin. Struct. Biol. 2011;21:342–347. - PMC - PubMed
    1. Slattery M., Riley T., Liu P., Abe N., Gomez-Alcala P., Dror I., Zhou T., Rohs R., Honig B., Bussemaker H.J., et al. Cofactor binding evokes latent differences in DNA binding specificity between Hox proteins. Cell. 2011;147:1270–1282. - PMC - PubMed

Publication types