Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Oct 1;32(19):3024-6.
doi: 10.1093/bioinformatics/btw338. Epub 2016 Jun 10.

RTFBSDB: an integrated framework for transcription factor binding site analysis

Affiliations

RTFBSDB: an integrated framework for transcription factor binding site analysis

Zhong Wang et al. Bioinformatics. .

Abstract

Transcription factors (TFs) regulate complex programs of gene transcription by binding to short DNA sequence motifs. Here, we introduce rtfbsdb, a unified framework that integrates a database of more than 65 000 TF binding motifs with tools to easily and efficiently scan target genome sequences. Rtfbsdb clusters motifs with similar DNA sequence specificities and integrates RNA-seq or PRO-seq data to restrict analyses to motifs recognized by TFs expressed in the cell type of interest. Our package allows common analyses to be performed rapidly in an integrated environment.

Availability and implementation: rtfbsdb available at (https://github.com/Danko-Lab/rtfbs_db).

Contact: dankoc@gmail.com

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Schematic illustrating the rtfbsdb workflow. Motifs are loaded into a CisBP.db object in R using an automated web scraper that imports data directly from the Cis-BP database. The set of motifs is reduced to those most applicable for analysis (right side) by removing TFs that are not expressed in the cell system of interest, and subsequently grouping motifs recognizing similar DNA sequences by clustering. The final set of motifs can be used to complete several common tasks in genomics (bottom row), including testing a set of DNA sequences for enriched motifs, scanning a target genome, or visualizing motifs as sequence logos

References

    1. Bodenhofer U. et al. (2011) APCluster: an R package for affinity propagation clustering. Bioinformatics, 27, 2463–2464. - PubMed
    1. Core L.J. et al. (2008) Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science, 322, 1845–1848. - PMC - PubMed
    1. Danko C.G. et al. (2015) Identification of active transcriptional regulatory elements from GRO-seq data. Nat. Methods, 12, 433–438. - PMC - PubMed
    1. Goodman R.H., Smolik S. (2000) CBP/p300 in cell growth, transformation, and development. Genes Dev., 14, 1553–1577. - PubMed
    1. Grant C.E. et al. (2011) FIMO: scanning for occurrences of a given motif. Bioinformatics, 27, 1017–1018. - PMC - PubMed

Substances