An algorithmic perspective of de novo cis-regulatory motif finding based on ChIP-seq data
- PMID: 28334268
- DOI: 10.1093/bib/bbx026
An algorithmic perspective of de novo cis-regulatory motif finding based on ChIP-seq data
Abstract
Transcription factors are proteins that bind to specific DNA sequences and play important roles in controlling the expression levels of their target genes. Hence, prediction of transcription factor binding sites (TFBSs) provides a solid foundation for inferring gene regulatory mechanisms and building regulatory networks for a genome. Chromatin immunoprecipitation sequencing (ChIP-seq) technology can generate large-scale experimental data for such protein-DNA interactions, providing an unprecedented opportunity to identify TFBSs (a.k.a. cis-regulatory motifs). The bottleneck, however, is the lack of robust mathematical models, as well as efficient computational methods for TFBS prediction to make effective use of massive ChIP-seq data sets in the public domain. The purpose of this study is to review existing motif-finding methods for ChIP-seq data from an algorithmic perspective and provide new computational insight into this field. The state-of-the-art methods were shown through summarizing eight representative motif-finding algorithms along with corresponding challenges, and introducing some important relative functions according to specific biological demands, including discriminative motif finding and cofactor motifs analysis. Finally, potential directions and plans for ChIP-seq-based motif-finding tools were showcased in support of future algorithm development.
Similar articles
-
De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets.BMC Genomics. 2014 Dec 2;15:1047. doi: 10.1186/1471-2164-15-1047. BMC Genomics. 2014. PMID: 25442502 Free PMC article.
-
Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment.BMC Genomics. 2014 Jun 13;15(1):472. doi: 10.1186/1471-2164-15-472. BMC Genomics. 2014. PMID: 24927817 Free PMC article.
-
FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets.DNA Res. 2019 Jun 1;26(3):231-242. doi: 10.1093/dnares/dsz004. DNA Res. 2019. PMID: 30957858 Free PMC article.
-
DNA sequence motif: a jack of all trades for ChIP-Seq data.Adv Protein Chem Struct Biol. 2013;91:135-71. doi: 10.1016/B978-0-12-411637-5.00005-6. Adv Protein Chem Struct Biol. 2013. PMID: 23790213 Review.
-
A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.Biol Direct. 2014 Feb 20;9:4. doi: 10.1186/1745-6150-9-4. Biol Direct. 2014. PMID: 24555784 Free PMC article. Review.
Cited by
-
CelEst: a unified gene regulatory network for estimating transcription factor activities in C. elegans.Genetics. 2025 Mar 17;229(3):iyae189. doi: 10.1093/genetics/iyae189. Genetics. 2025. PMID: 39705007 Free PMC article.
-
RECTA: Regulon Identification Based on Comparative Genomics and Transcriptomics Analysis.Genes (Basel). 2018 May 30;9(6):278. doi: 10.3390/genes9060278. Genes (Basel). 2018. PMID: 29849014 Free PMC article.
-
A single ChIP-seq dataset is sufficient for comprehensive analysis of motifs co-occurrence with MCOT package.Nucleic Acids Res. 2019 Dec 2;47(21):e139. doi: 10.1093/nar/gkz800. Nucleic Acids Res. 2019. PMID: 31750523 Free PMC article.
-
Contribution of ECT2 to Tubulointerstitial Fibrosis in the Progression of Chronic Kidney Disease.Curr Med Sci. 2024 Dec;44(6):1249-1258. doi: 10.1007/s11596-024-2948-1. Epub 2024 Oct 26. Curr Med Sci. 2024. PMID: 39460889
-
SamSelect: a sample sequence selection algorithm for quorum planted motif search on large DNA datasets.BMC Bioinformatics. 2018 Jun 18;19(1):228. doi: 10.1186/s12859-018-2242-y. BMC Bioinformatics. 2018. PMID: 29914360 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources