Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004;5(8):R56.
doi: 10.1186/gb-2004-5-8-r56. Epub 2004 Jul 28.

Identifying combinatorial regulation of transcription factors and binding motifs

Affiliations

Identifying combinatorial regulation of transcription factors and binding motifs

Mamoru Kato et al. Genome Biol. 2004.

Abstract

Background: Combinatorial interaction of transcription factors (TFs) is important for gene regulation. Although various genomic datasets are relevant to this issue, each dataset provides relatively weak evidence on its own. Developing methods that can integrate different sequence, expression and localization data have become important.

Results: Here we use a novel method that integrates chromatin immunoprecipitation (ChIP) data with microarray expression data and with combinatorial TF-motif analysis. We systematically identify combinations of transcription factors and of motifs. The various combinations of TFs involved multiple binding mechanisms. We reconstruct a new combinatorial regulatory map of the yeast cell cycle in which cell-cycle regulation can be drawn as a chain of extended TF modules. We find that the pairwise combination of a TF for an early cell-cycle phase and a TF for a later phase is often used to control gene expression at intermediate times. Thus the number of distinct times of gene expression is greater than the number of transcription factors. We also see that some TF modules control branch points (cell-cycle entry and exit), and in the presence of appropriate signals they can allow progress along alternative pathways.

Conclusions: Combining different data sources can increase statistical power as demonstrated by detecting TF interactions and composite TF-binding motifs. The original picture of a chain of simple cell-cycle regulators can be extended to a chain of composite regulatory modules: different modules may share a common TF component in the same pathway or a TF component cross-talking to other pathways.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overview of the method. (a) Finding over-represented single motifs from ChIP data. Target promoters of each TF are determined from ChIP data, and these promoters are searched for over-represented motifs. (b) Finding over-represented motif combinations specific to each cell-cycle phase. For all over-represented motifs found in (a), all possible first (single)-, second (double)- and third (triple)-order combinations of motifs are searched against the promoters of the G1, S, S/G2, G2/M, M/G1 or 'All' groups of cell-cycle genes. (c) When an over-represented motif combination is found, we require that the genes possessing that combination have more coherent patterns of gene expression than the patterns for the phase-specific gene group.(d) From ChIP data, we find the set of TFs (over-represented TFs, see Materials and methods) binding to the promoters with a motif combination (from (b) and (c)), and the set of TFs that can bind over-represented motifs (from (a)) constituting a motif combination. For each motif combination and its component motifs, we take the intersection of TFs from these two sets. Such intersection TFs (in bold in the (d)) are assigned to the component motifs of a combination (see Materials and methods).
Figure 2
Figure 2
Over-represented motifs from ChIP data reflect three types of binding mechanisms. A solid arrow means that the TF binds directly to the motif. A dotted arrow means that a motif (for example, M1) is over-represented in TF1 ChIP data. This can occur (a) because of direct binding, or (b, c) because of two modes of indirect binding. (a) M1 is a direct-binding motif of TF1. (b) M1 is an indirect piggy-back binding motif of TF1 via TF2, where TF1 binds TF2, and TF2 binds the motif. For example, M1, TF1, and TF2 could correspond to GTAAACA (the Fkh2 motif), Ndd1, and Fkh2, respectively, as Ndd1 binds to Fkh2, which in turn binds to GTAAACA. (c) M1 is an indirect cross-binding motif of TF1 via TF2. For example, M1, TF1 and TF2 could correspond to GTAAACA (the Fkh2 motif), Mcm1, and Fkh2, respectively, since Mcm1 binds to its own motif (M2) on the same promoter as the Fkh2 motif.
Figure 3
Figure 3
Reconstructed transcriptional regulation model of the yeast cell cycle. Segments of the cycle contain motif combinations. The TF and motif combinations in black were known previously and are also confirmed here. Those in red are new combinations (or previously poorly characterized combinations). TFs and motifs in parentheses are those we could not detect (for example, because the TF was not included in the ChIP experiments) but whose presence is indicated by some other experimental work [25,46]. Almost all TFs appear as elements of a combination. {Swi5, Ste12, Dig1} shows a joint-process combination between cell cycle and pheromone response or filamentous growth (see Discussion).

References

    1. Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB, Brown PO, Botstein D, Futcher B. Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Mol Biol Cell . 1998;9:3273–3297. - PMC - PubMed
    1. Tavazoie S, Hughes JD, Campbell MJ, Cho RJ, Church GM. Systematic determination of genetic network architecture. Nat Genet. 1999;22:281–285. doi: 10.1038/10343. - DOI - PubMed
    1. Zhang MQ. Large-scale gene expression data analysis: a new challenge to computational biologists. Genome Res. 1999;9:681–688. - PubMed
    1. van Helden J, Andre B, Collado-Vides J. Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J Mol Biol. 1998;281:827–842. doi: 10.1006/jmbi.1998.1947. - DOI - PubMed
    1. Roth FP, Hughes JD, Estep PW, Church GM. Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nat Biotechnol. 1998;16:939–945. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources