Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Jul 1;34(Web Server issue):W541-5.
doi: 10.1093/nar/gkl342.

Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm

Affiliations

Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm

T Waleev et al. Nucleic Acids Res. .

Abstract

Composite Module Analyst (CMA) is a novel software tool aiming to identify promoter-enhancer models based on the composition of transcription factor (TF) binding sites and their pairs. CMA is closely interconnected with the TRANSFAC database. In particular, CMA uses the positional weight matrix (PWM) library collected in TRANSFAC and therefore provides the possibility to search for a large variety of different TF binding sites. We model the structure of the long gene regulatory regions by a Boolean function that joins several local modules, each consisting of co-localized TF binding sites. Having as an input a set of co-regulated genes, CMA builds the promoter model and optimizes the parameters of the model automatically by applying a genetic-regression algorithm. We use a multicomponent fitness function of the algorithm which includes several statistical criteria in a weighted linear function. We show examples of successful application of CMA to a microarray data on transcription profiling of TNF-alpha stimulated primary human endothelial cells. The CMA web server is freely accessible at http://www.gene-regulation.com/pub/programs/cma/CMA.html. An advanced version of CMA is also a part of the commercial system ExPlaintrade mark (www.biobase.de) designed for causal analysis of gene expression data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A schematic representation of a match of a CM in a particular promoter. Several TF sites and pairs of TF sites are found in a sequence window of the length w, which is located in an upstream region of a given gene.
Figure 2
Figure 2
CMA web interface and results of identification of the CM discriminating promoters of up-regulated genes from promoters of down-regulated genes upon TNF-alpha stimulation of primary human endothelial cells. (a) CMA web interface; (b) representation of the identified CM that consists of three single weight matrices and three pairs of matrices. The distance limits orientation of the sites in the pairs are schematically shown. (c) Two histograms of the fuzzy promoter score (fps) in the promoters of up-regulated genes (red) and in down-regulated genes (blue); (d) representation of TF sites found in the windows that correspond to the maximal score of the match of the CM in the promoters. Marked are identified NF-AT/EGR2 site pairs (red) and IK3/IRF1 site pairs (blue).

Similar articles

Cited by

References

    1. Matys V., Kel-Margoulis O., Fricke E., Liebich I., Land S., Barre-Dirrie A., Reuter I., Chekmenev D., Krull M., Hornischer K., et al. TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes. Nucleic Acids Res. 2006;34:D108–D110. - PMC - PubMed
    1. Hardison R., Slightom J.L., Gumucio D.L., Goodman M., Stojanovic N., Miller W. Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. Gene. 1997;205:73–94. - PubMed
    1. Wasserman W.W., Fickett J.W. Identification of regulatory regions which confer muscle-specific gene expression. J. Mol. Biol. 1998;278:167–181. - PubMed
    1. Frech K., Quandt K., Werner T. Muscle actin genes: a first step towards computational classification of tissue specific promoters. In Silico Biol. 1998;1:29–38. - PubMed
    1. Kel A.E., Kel-Margoulis O.V., Farnham P.J., Bartley S.M., Wingender E., Zhang M.Q. Computer-assisted identification of cell cycle-related genes: new targets for E2F transcription factors. J. Mol. Biol. 2001;309:99–120. - PubMed

Publication types

Substances