Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 7;50(D1):D1077-D1084.
doi: 10.1093/nar/gkab1043.

proChIPdb: a chromatin immunoprecipitation database for prokaryotic organisms

Affiliations

proChIPdb: a chromatin immunoprecipitation database for prokaryotic organisms

Katherine T Decker et al. Nucleic Acids Res. .

Erratum in

Abstract

The transcriptional regulatory network in prokaryotes controls global gene expression mostly through transcription factors (TFs), which are DNA-binding proteins. Chromatin immunoprecipitation (ChIP) with DNA sequencing methods can identify TF binding sites across the genome, providing a bottom-up, mechanistic understanding of how gene expression is regulated. ChIP provides indispensable evidence toward the goal of acquiring a comprehensive understanding of cellular adaptation and regulation, including condition-specificity. ChIP-derived data's importance and labor-intensiveness motivate its broad dissemination and reuse, which is currently an unmet need in the prokaryotic domain. To fill this gap, we present proChIPdb (prochipdb.org), an information-rich, interactive web database. This website collects public ChIP-seq/-exo data across several prokaryotes and presents them in dashboards that include curated binding sites, nucleotide-resolution genome viewers, and summary plots such as motif enrichment sequence logos. Users can search for TFs of interest or their target genes, download all data, dashboards, and visuals, and follow external links to understand regulons through biological databases and the literature. This initial release of proChIPdb covers diverse organisms, including most major TFs of Escherichia coli, and can be expanded to support regulon discovery across the prokaryotic domain.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Overview of proChIPdb's sources, pipeline, and content. Gray arrows in this figure present the flow of data. Data sources (upper left) include the European Nucleotide Archive, Gene Expression Omnibus (GEO), Pubmed, the Sequence Read Archive, and in-house data (which has been posted to GEO). This data enters the processing pipeline (lower) through the various file types indicated by the blue flow arrows. Processing through the pipeline occurs from left to right via the gray arrows, with green rectangles indicating data types and files and black text indicating processing steps and tools. Towards the right of the pipeline, data feeds through the green arrows into the proChIPdb site (upper right), which consists of a binding site table, genome viewer, and feature visualization panel.
Figure 2.
Figure 2.
TF dashboard webpage for E. coli K-12 MG1655 Fur regulator. (A) Metadata and relevant links. (B) Binding site table with curated list of binding peaks. Each tab contains the TFBSs for a unique condition (DPD versus Fe supplementation in this example). (C) Embedded Integrative Genomics Viewer (igv.js) component with annotation tracks and genome-wide peaks from raw data. (D–G) Feature visuals, which contains tabs for the various additional plots. (D) The active tab shows a histogram of binding peak widths. (E) A scatterplot of peak positions relative to their closest downstream gene. (F) The consensus binding sequence motifs.( G) Venn diagram comparing proChIPdb identified target genes versus literature regulon genes.

References

    1. Raha D., Hong M., Snyder M.. ChIP-Seq: a method for global identification of regulatory elements in the genome. Curr. Protoc. Mol. Biol. 2010; Chapter 21:Unit 21.19.1–Unit 21.19.14. - PubMed
    1. Rhee H.S., Pugh B.F.. ChIP-exo method for identifying genomic location of DNA-binding proteins with near-single-nucleotide accuracy. Curr. Protoc. Mol. Biol. 2012; Chapter 21:Unit 21.24. - PMC - PubMed
    1. Gao Y., Yurkovich J.T., Seo S.W., Kabimoldayev I., Dräger A., Chen K., Sastry A.V., Fang X., Mih N., Yang L.et al.. Systematic discovery of uncharacterized transcription factors in Escherichia coli K-12 MG1655. Nucleic Acids Res. 2018; 46:10682–10696. - PMC - PubMed
    1. Eckweiler D., Dudek C.-A., Hartlich J., Brötje D., Jahn D.. PRODORIC2: the bacterial gene regulation database in 2018. Nucleic Acids Res. 2018; 46:D320–D326. - PMC - PubMed
    1. Pachkov M., Balwierz P.J., Arnold P., Ozonov E., van Nimwegen E.. SwissRegulon, a database of genome-wide annotations of regulatory sites: recent updates. Nucleic Acids Res. 2013; 41:D214–D220. - PMC - PubMed

Publication types