Localized motif discovery in gene regulatory sequences
- PMID: 20223835
- DOI: 10.1093/bioinformatics/btq106
Localized motif discovery in gene regulatory sequences
Abstract
Motivation: Discovery of nucleotide motifs that are localized with respect to a certain biological landmark is important in several appli-cations, such as in regulatory sequences flanking the transcription start site, in the neighborhood of known transcription factor binding sites, and in transcription factor binding regions discovered by massively parallel sequencing (ChIP-Seq).
Results: We report an algorithm called LocalMotif to discover such localized motifs. The algorithm is based on a novel scoring function, called spatial confinement score, which can determine the exact interval of localization of a motif. This score is combined with other existing scoring measures including over-representation and relative entropy to determine the overall prominence of the motif. The approach successfully discovers biologically relevant motifs and their intervals of localization in scenarios where the motifs cannot be discovered by general motif finding tools. It is especially useful for discovering multiple co-localized motifs in a set of regulatory sequences, such as those identified by ChIP-Seq.
Availability and implementation: The LocalMotif software is available at http://www.comp.nus.edu.sg/~bioinfo/LocalMotif.
Similar articles
-
A generic motif discovery algorithm for sequential data.Bioinformatics. 2006 Jan 1;22(1):21-8. doi: 10.1093/bioinformatics/bti745. Epub 2005 Oct 27. Bioinformatics. 2006. PMID: 16257985
-
Mining ChIP-chip data for transcription factor and cofactor binding sites.Bioinformatics. 2005 Jun;21 Suppl 1:i403-12. doi: 10.1093/bioinformatics/bti1043. Bioinformatics. 2005. PMID: 15961485
-
Computing the P-value of the information content from an alignment of multiple sequences.Bioinformatics. 2005 Jun;21 Suppl 1:i311-8. doi: 10.1093/bioinformatics/bti1044. Bioinformatics. 2005. PMID: 15961473
-
Discovering sequence motifs.Methods Mol Biol. 2008;452:231-51. doi: 10.1007/978-1-60327-159-2_12. Methods Mol Biol. 2008. PMID: 18566768 Review.
-
An algorithmic perspective of de novo cis-regulatory motif finding based on ChIP-seq data.Brief Bioinform. 2018 Sep 28;19(5):1069-1081. doi: 10.1093/bib/bbx026. Brief Bioinform. 2018. PMID: 28334268 Review.
Cited by
-
ChIP-Seq-Based Approach in Mouse Enteric Precursor Cells Reveals New Potential Genes with a Role in Enteric Nervous System Development and Hirschsprung Disease.Int J Mol Sci. 2020 Nov 28;21(23):9061. doi: 10.3390/ijms21239061. Int J Mol Sci. 2020. PMID: 33260622 Free PMC article.
-
A highly efficient and effective motif discovery method for ChIP-seq/ChIP-chip data using positional information.Nucleic Acids Res. 2012 Apr;40(7):e50. doi: 10.1093/nar/gkr1135. Epub 2012 Jan 6. Nucleic Acids Res. 2012. PMID: 22228832 Free PMC article.
-
POWRS: position-sensitive motif discovery.PLoS One. 2012;7(7):e40373. doi: 10.1371/journal.pone.0040373. Epub 2012 Jul 5. PLoS One. 2012. PMID: 22792292 Free PMC article.
-
Machine learning for epigenetics and future medical applications.Epigenetics. 2017 Jul 3;12(7):505-514. doi: 10.1080/15592294.2017.1329068. Epub 2017 May 19. Epigenetics. 2017. PMID: 28524769 Free PMC article. Review.
-
Motif discovery and transcription factor binding sites before and after the next-generation sequencing era.Brief Bioinform. 2013 Mar;14(2):225-37. doi: 10.1093/bib/bbs016. Epub 2012 Apr 19. Brief Bioinform. 2013. PMID: 22517426 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous