Footer: a quantitative comparative genomics method for efficient recognition of cis-regulatory elements
- PMID: 15930494
- PMCID: PMC1142474
- DOI: 10.1101/gr.2952005
Abstract
The search for mammalian DNA regulatory regions is a challenging problem in computational biology. Regulatory patterns are short relative to the promoter regions that contain them, and they are degenerate, which makes them difficult to identify. One way to overcome this problem is to use evolutionary information to reduce the number of false-positive predictions. We developed a novel method for pattern identification that compares a pair of putative binding sites in two species (e.g., human and mouse) and assigns two probability scores, one based on the relative position of the sites in the promoter and one based on their agreement with a known model of binding preferences. We tested the algorithm's ability to predict known binding sites on various promoters. Overall, it achieved 83% sensitivity and 72% specificity, a clear improvement over existing methods. Our algorithm also successfully predicted two novel NF-kappaB binding sites in the promoter region of the mouse autotaxin gene (ATX, ENPP2), which we verified using a chromatin immunoprecipitation assay coupled with quantitative real-time PCR.
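The abstract does not give the scoring functions, so the following is only an illustrative sketch of the general idea of scoring a pair of candidate sites from two species on two criteria: fit to a binding model and agreement in promoter position. The count matrix, the exponential position term, and every name here (`COUNTS`, `log_odds`, `position_agreement`, `score_pair`, `scale`) are assumptions made for illustration and are not Footer's actual statistics.

```python
import math

# Hypothetical count matrix for a 4-bp motif (one dict per position).
# These counts are made up for illustration; they are not from the paper.
COUNTS = [
    {"A": 1, "C": 0, "G": 9, "T": 0},
    {"A": 0, "C": 0, "G": 10, "T": 0},
    {"A": 8, "C": 1, "G": 0, "T": 1},
    {"A": 7, "C": 1, "G": 1, "T": 1},
]
BACKGROUND = 0.25   # assumed uniform background base frequency
PSEUDOCOUNT = 1.0   # avoids log(0) for bases never seen at a position


def log_odds(site, counts=COUNTS):
    """Sum of per-position log2(model frequency / background) for a site."""
    score = 0.0
    for base, column in zip(site, counts):
        total = sum(column.values()) + 4 * PSEUDOCOUNT
        freq = (column[base] + PSEUDOCOUNT) / total
        score += math.log2(freq / BACKGROUND)
    return score


def position_agreement(pos_a, pos_b, scale=50.0):
    """Value in (0, 1] that decays as the two promoter offsets diverge.

    The exponential form and the `scale` parameter are assumptions.
    """
    return math.exp(-abs(pos_a - pos_b) / scale)


def score_pair(site_a, pos_a, site_b, pos_b):
    """Return (model score, position score) for a cross-species site pair.

    The model score is the weaker of the two log-odds scores, so a pair
    only scores well when both species match the binding model.
    """
    model = min(log_odds(site_a), log_odds(site_b))
    return model, position_agreement(pos_a, pos_b)


if __name__ == "__main__":
    # Hypothetical human and mouse candidates at nearby promoter offsets.
    print(score_pair("GGAA", 120, "GGAT", 130))
```

A pair is kept only if both values pass chosen thresholds, which is how conservation between species can filter out matches that look plausible in one species alone.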
WEB SITE REFERENCES
- http://www.idtdna.com/; IDTDNA software for designing PCR primers.
- http://biodev.hgen.pitt.edu/cgi-bin/Footer/Footer.cgi; Footer Web server for analysis of mammalian promoters.
- http://biodev.hgen.pitt.edu/cgi-bin/enologos/enologos.cgi; enoLOGOS Web server for sequence LOGOS.