Genome Biol. 2011 Aug 22;12(8):R83.
doi: 10.1186/gb-2011-12-8-r83.

Cistrome: an integrative platform for transcriptional regulation studies

Tao Liu et al. Genome Biol.

Abstract

The increasing volume of ChIP-chip and ChIP-seq data being generated creates a challenge for standard, integrative and reproducible bioinformatics data analysis platforms. We developed a web-based application called Cistrome, based on the Galaxy open source framework. In addition to the standard Galaxy functions, Cistrome has 29 ChIP-chip- and ChIP-seq-specific tools in three major categories, from preliminary peak calling and correlation analyses to downstream genome feature association, gene expression analyses, and motif discovery. Cistrome is available at http://cistrome.org/ap/.

Figures

Figure 1
Workflow within the Cistrome analysis platform. Cistrome functions can be divided into three categories: data preprocessing, gene expression and integrative analysis. A general workflow using Cistrome is to upload datasets, preprocess them using peak calling tools to generate peak locations in BED format and signal profiles in WIGGLE format, upload gene expression data to produce specific gene lists, and then use various integrative analysis tools to generate figures and reports. The bottom figure shows the web interface of the Cistrome platform based on the Galaxy framework. The left panel shows available tools, the middle panel shows messages, tool options, or result details, and the right panel shows the datasets organized in the user's history, including datasets that have been or are being processed (in green and yellow, respectively), or waiting in the queue (in gray). CEAS, Cis-regulatory Element Annotation System; DC, Data Collection module; GEO, Gene Expression Omnibus; NPS, Nucleosome Positioning from Sequencing; TF, transcription factor.
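
The workflow above passes peak locations between tools in BED format. As a rough illustration of what such a file contains, the following sketch parses BED-style lines into records. It is not Cistrome code: the `Peak` type, the `parse_bed` helper, and the example peaks are hypothetical, and only the first five standard BED columns (chrom, start, end, name, score) are read.

```python
from typing import List, NamedTuple


class Peak(NamedTuple):
    chrom: str
    start: int    # 0-based, inclusive (BED convention)
    end: int      # 0-based, exclusive (BED convention)
    name: str
    score: float


def parse_bed(text: str) -> List[Peak]:
    """Parse BED-style text, skipping blank, comment, track and browser lines."""
    peaks = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split()
        chrom, start, end = fields[0], int(fields[1]), int(fields[2])
        # Columns 4 and 5 are optional in BED; fall back to placeholders.
        name = fields[3] if len(fields) > 3 else "."
        score = float(fields[4]) if len(fields) > 4 else 0.0
        peaks.append(Peak(chrom, start, end, name, score))
    return peaks


# Hypothetical peaks, made up for illustration only.
example = """track name=hypothetical_peaks
chrI\t1000\t1500\tpeak_1\t87.5
chrI\t9000\t9200\tpeak_2\t12.0
chrII\t300\t900
"""

peaks = parse_bed(example)
widths = [p.end - p.start for p in peaks]
print(widths)
```

Because BED intervals are half-open, the width of a peak is simply `end - start`.
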
Figure 2
Correlation and association tools. (a) Correlation plots using different histone marks in C. elegans early embryos [43]. Cistrome correlation tools can generate either a heatmap with hierarchical clustering according to pair-wise correlation coefficients or a grid of scatterplots. (b) Venn diagram showing the overlap of H3K4me3 peaks (in blue) with transcription start sites (TSS) for all the genes (in red) in the C. elegans genome. (c) Meta-gene plot generated by CEAS showing the H3K4me3 signals enriched at gene promoter regions; the top expressed genes (red) have higher H3K4me3 signals than the bottom expressed genes (purple). (d) Conservation plot showing that the human androgen receptor (AR) binding sites from ChIP-chip [24] are more conserved than their flanking regions in placental mammals.
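
The pair-wise correlation coefficients mentioned in panel (a) can be sketched as follows. This is a minimal illustration, assuming Pearson correlation over signal values binned along the genome; the source does not state which coefficient Cistrome uses. The profile names and values are made up, and the hierarchical clustering step is omitted.

```python
import math
from typing import Dict, List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length signal vectors."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two vectors of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(
    profiles: Dict[str, List[float]]
) -> Dict[str, Dict[str, float]]:
    """All pair-wise coefficients, keyed by profile name."""
    names = list(profiles)
    return {
        a: {b: pearson(profiles[a], profiles[b]) for b in names}
        for a in names
    }


# Hypothetical binned signals for three marks (not real data).
profiles = {
    "mark_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "mark_B": [2.0, 4.0, 6.0, 8.0, 10.0],
    "mark_C": [5.0, 4.0, 3.0, 2.0, 1.0],
}

matrix = correlation_matrix(profiles)
for name, row in matrix.items():
    print(name, [round(value, 2) for value in row.values()])
```

A matrix like this is the input a heatmap with hierarchical clustering would be drawn from; each off-diagonal cell corresponds to one scatterplot in the grid view.
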
Figure 3
Heatmap analysis with k-means clustering. By combining H3K27me3, H3K9me3, H3K4me3, H3K4me2, H3K36me3 and MES-4 (the histone H3K36 methyltransferase) ChIP-chip signals, as in Figure 2a, the Cistrome heatmap tool separates the ±1-kbp regions for all of the C. elegans TSSs into five clusters using k-means clustering. From top to bottom, the clusters are as follows: (1) about 3,000 TSSs related to active genes have high H3K4me3 upstream of the TSSs and high H3K36me3 downstream of the TSSs; (2) about 2,000 TSSs have slightly lower H3K4me3 levels downstream of the TSSs and no significant H3K36me3 enrichment; (3) about 2,000 TSSs have high H3K27me3 and H3K9me3 related to inactive genes; (4) about 2,500 TSSs with low H3K27me3, moderate H3K4me3 and high H3K36me3 enrichment around the TSS related to genes in operons; and (5) about 10,000 TSSs have no strong marks.
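
The k-means step described above can be illustrated with a small, self-contained sketch. This is plain Lloyd's algorithm on made-up rows, where each row stands for the signal values around one TSS. It is not the Cistrome heatmap tool's implementation, and the function names, the seed, and the toy data are assumptions made for illustration.

```python
import random
from typing import List, Sequence, Tuple


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(
    rows: Sequence[Sequence[float]], k: int, iterations: int = 50, seed: int = 0
) -> Tuple[List[int], List[List[float]]]:
    """Lloyd's algorithm: returns one cluster label per row, plus the centroids."""
    rng = random.Random(seed)
    # Initialise centroids from k distinct rows chosen at random.
    centroids = [list(row) for row in rng.sample(list(rows), k)]
    labels = None
    for _ in range(iterations):
        # Assignment step: each row joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(row, centroids[c]))
            for row in rows
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical signal rows: columns could be different marks or positions.
rows = [
    [9.0, 8.0, 0.0],
    [8.0, 9.0, 1.0],
    [0.0, 1.0, 9.0],
    [1.0, 0.0, 8.0],
]

labels, centroids = kmeans(rows, k=2)
print(labels)
```

Cluster numbers are arbitrary, so only the grouping matters: here the first two rows share one label and the last two share the other. In a heatmap, rows would then be ordered by label to produce the stacked clusters.
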
Figure 4
Cistrome SeqPos motif analysis. A screenshot of the SeqPos output. The enriched motifs at the androgen receptor binding sites without FoxA1 binding are displayed in an interactive HTML page. When the user clicks on the row of a particular motif, the motif logo and detailed information are shown at the top of the page.

References

    1. Ren B, Robert F, Wyrick JJ, Aparicio O, Jennings EG, Simon I, Zeitlinger J, Schreiber J, Hannett N, Kanin E, Volkert TL, Wilson CJ, Bell SP, Young RA. Genome-wide location and function of DNA binding proteins. Science. 2000;290:2306–2309. doi: 10.1126/science.290.5500.2306.
    2. Johnson DS, Mortazavi A, Myers RM, Wold B. Genome-wide mapping of in vivo protein-DNA interactions. Science. 2007;316:1497–1502. doi: 10.1126/science.1141319.
    3. Ji H, Jiang H, Ma W, Johnson DS, Myers RM, Wong WH. An integrated software system for analyzing ChIP-chip and ChIP-seq data. Nat Biotechnol. 2008;26:1293–1300. doi: 10.1038/nbt.1505.
    4. Ye T, Krebs AR, Choukrallah MA, Keime C, Plewniak F, Davidson I, Tora L. seqMINER: an integrated ChIP-seq data interpretation platform. Nucleic Acids Res. 2010;39:e35.
    5. Goecks J, Nekrutenko A, Taylor J. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010;11:R86. doi: 10.1186/gb-2010-11-8-r86.
