An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments
- PMID: 12101404
- DOI: 10.1038/nbt717
Abstract
Chromatin immunoprecipitation followed by cDNA microarray hybridization (ChIP-array) has become a popular procedure for studying genome-wide protein-DNA interactions and transcription regulation. However, it can map the probable protein-DNA interaction loci only at a resolution of 1-2 kilobases. To pinpoint interaction sites down to the base-pair level, we introduce a computational method, Motif Discovery scan (MDscan), that examines the ChIP-array-selected sequences and searches for DNA sequence motifs representing the protein-DNA interaction sites. MDscan combines the advantages of two widely adopted motif search strategies, word enumeration and position-specific weight matrix updating, and incorporates the ChIP-array ranking information to accelerate searches and enhance their success rates. MDscan correctly identified all the experimentally verified motifs from published ChIP-array experiments in yeast (STE12, GAL4, RAP1, SCB, MCB, MCM1, SFF, and SWI5), and predicted two motif patterns for the differential binding of Rap1 protein in telomere regions. In our studies, the method was faster and more accurate than several established motif-finding algorithms. MDscan can be used to find DNA motifs not only in ChIP-array experiments but also in other experiments in which a subgroup of the sequences can be inferred to contain relatively abundant motif sites. The MDscan web server can be accessed at http://BioProspector.stanford.edu/MDscan/.
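The combination of word enumeration, weight matrix updating, and ranking information described in the abstract can be illustrated with a small Python sketch. This is not the MDscan implementation: the abstract does not specify the scoring function, the seed-matching rule, or any parameter values, so the names `find_motif`, `n_top`, `max_mismatch`, `n_rounds`, the pseudocount, the uniform background, and the log-odds score are all assumptions made for illustration. Sequences are assumed to be uppercase A/C/G/T and sorted from strongest to weakest ChIP-array enrichment.

```python
import math
from collections import Counter

BASES = "ACGT"

def kmers(seq, w):
    """All overlapping words of width w in seq (empty if seq is shorter than w)."""
    return [seq[i:i + w] for i in range(len(seq) - w + 1)]

def n_matches(a, b):
    """Number of positions at which two equal-length words agree."""
    return sum(x == y for x, y in zip(a, b))

def build_pwm(sites, pseudo=0.5):
    """Column-wise base frequencies; the pseudocount value is an assumption."""
    total = len(sites) + 4 * pseudo
    pwm = []
    for j in range(len(sites[0])):
        counts = Counter(site[j] for site in sites)
        pwm.append({b: (counts[b] + pseudo) / total for b in BASES})
    return pwm

def log_odds(pwm, word, bg=0.25):
    """Log-odds of a word under the matrix versus a uniform background (assumed)."""
    return sum(math.log(col[ch] / bg) for col, ch in zip(pwm, word))

def consensus(pwm):
    """Most frequent base in each column."""
    return "".join(max(BASES, key=col.get) for col in pwm)

def find_motif(ranked_seqs, w, n_top=3, max_mismatch=1, n_rounds=3):
    """Hypothetical two-step search; ranked_seqs is ordered by enrichment."""
    top_words = [k for s in ranked_seqs[:n_top] for k in kmers(s, w)]
    if not top_words:
        raise ValueError("top-ranked sequences are shorter than the motif width")

    # Step 1, word enumeration: every word in the top-ranked sequences is a
    # seed; words within max_mismatch of the seed form a candidate matrix.
    best_score, best_pwm = -math.inf, None
    for seed in sorted(set(top_words)):  # sorted for reproducible tie-breaking
        sites = [k for k in top_words if n_matches(seed, k) >= w - max_mismatch]
        pwm = build_pwm(sites)
        score = sum(log_odds(pwm, k) for k in sites)  # simplified score
        if score > best_score:
            best_score, best_pwm = score, pwm

    # Step 2, matrix updating: rescan all sequences with the best matrix,
    # keep the best word per sequence if it beats background, then rebuild.
    pwm, sites = best_pwm, []
    for _ in range(n_rounds):
        hits = [max(kmers(s, w), key=lambda k: log_odds(pwm, k))
                for s in ranked_seqs if len(s) >= w]
        sites = [k for k in hits if log_odds(pwm, k) > 0]
        if not sites:
            break
        pwm = build_pwm(sites)
    return consensus(pwm), sites

# Toy, hand-made input (not real data): a 7-mer planted in four of five sequences.
toy_seqs = [
    "CCGTTGAAACAGGCT",
    "ATGAAACATCGGAC",
    "GGCATCTGAAACAT",
    "TACGGTCATGAAACA",
    "CGTACGATCGGTAC",
]
motif, motif_sites = find_motif(toy_seqs, w=7)
print(motif, len(motif_sites))
```

Restricting seed enumeration to the top-ranked sequences is what keeps the search small in this sketch: the candidate list grows with the number of top sequences rather than with the whole data set, and lower-ranked sequences only contribute during the refinement rounds.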