Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W195-8.
doi: 10.1093/nar/gkh387.

MSCAN: identification of functional clusters of transcription factor binding sites

Wynand B L Alkema et al. Nucleic Acids Res. 2004.

Abstract

Identification of functional transcription factor binding sites in genomic sequences is notoriously difficult. The critical problem is the low specificity of predictions, which directly reflects the low target specificity of DNA binding proteins. To overcome the noise produced in predictions of individual binding sites, a new generation of algorithms achieves better predictive specificity by focusing on locally dense clusters of binding sites. MSCAN is a leading method for binding site cluster detection that determines the significance of observed sites while correcting for local compositional bias of sequences. The algorithm is highly flexible, applying any set of input binding models to the analysis of a user-specified sequence. From the user's perspective, a key feature of the system is that no reference data sets of regulatory sequences from co-regulated genes are required to train the algorithm. The output from MSCAN consists of an ordered list of sequence segments that contain potential regulatory modules. We have chosen the features in MSCAN such that sequence and matrix retrieval is highly facilitated, resulting in a web server that is intuitive to use. MSCAN is available at http://mscan.cgb.ki.se/cgi-bin/MSCAN.
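
To make the idea of cluster detection with a correction for local composition more concrete, here is a minimal sketch in Python. It is not the MSCAN algorithm or its significance statistic: it only counts matrix hits in sliding windows, scoring each window against base frequencies estimated from that window itself. All function names, the toy profile, the toy sequence, the pseudocounts, and the threshold are hypothetical choices made for illustration. The sketch assumes an uppercase sequence containing only A, C, G and T, and it scans one strand only.

```python
import math
from collections import Counter

BASES = "ACGT"

def local_background(segment, pseudocount=1.0):
    """Estimate base frequencies from the segment itself (local composition)."""
    tally = Counter(segment)
    total = len(segment) + 4 * pseudocount
    return {b: (tally.get(b, 0) + pseudocount) / total for b in BASES}

def log_odds_matrix(counts, background, pseudocount=0.5):
    """Turn per-position base counts into log2-odds scores against a background."""
    matrix = []
    for column in counts:
        total = sum(column.values()) + 4 * pseudocount
        matrix.append({
            b: math.log2(((column.get(b, 0) + pseudocount) / total) / background[b])
            for b in BASES
        })
    return matrix

def find_hits(segment, counts, threshold):
    """Return start offsets in the segment whose log-odds score reaches the threshold."""
    matrix = log_odds_matrix(counts, local_background(segment))
    width = len(matrix)
    hits = []
    for start in range(len(segment) - width + 1):
        score = sum(matrix[i][segment[start + i]] for i in range(width))
        if score >= threshold:
            hits.append(start)
    return hits

def rank_windows(sequence, profiles, window=50, step=10, threshold=6.0):
    """Return (hit_count, window_start) pairs, densest windows first."""
    ranked = []
    last_start = max(len(sequence) - window, 0)
    for start in range(0, last_start + 1, step):
        segment = sequence[start:start + window]
        n_hits = sum(len(find_hits(segment, counts, threshold))
                     for counts in profiles.values())
        ranked.append((n_hits, start))
    # Sort by decreasing hit count, breaking ties by position.
    ranked.sort(key=lambda pair: (-pair[0], pair[1]))
    return ranked

# Hypothetical toy input: one 7-bp count matrix and a sequence with three
# closely spaced copies of its consensus embedded in a poly-A flank.
toy_profile = {
    "toy_motif": [{"T": 10}, {"G": 10}, {"A": 10}, {"C": 10},
                  {"T": 10}, {"C": 10}, {"A": 10}],
}
toy_sequence = "A" * 100 + "TGACTCACGTGACTCAGCTGACTCA" + "A" * 100

ranked = rank_windows(toy_sequence, toy_profile)
for n_hits, start in ranked[:3]:
    print(f"window starting at {start}: {n_hits} hit(s)")
```

Because the background is re-estimated per window, a base that is common locally contributes less to a match score than a rare one, which is one simple way to keep compositionally biased regions from dominating the ranking. A real tool would replace the raw hit count with a proper significance estimate.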


Figures

Figure 1. Screenshot of the MSCAN data input page. The initial input form contains fields for the provision of TF binding profiles (matrices), sequences and parameters. Links to the help file provide users with information about formats and description of the parameters.

Figure 2. Screenshot of the output page. Shown is a page with a predicted module in the regulatory region of a gene known to be selectively expressed in a context linked to the selected TFs. Every category label on the output page is hyperlinked to the help page.
