MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data
- PMID: 26971251
- DOI: 10.1016/j.compbiolchem.2016.01.014
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data
Abstract
Background: As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif.
Results: Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs.
Conclusions: By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions.
Keywords: ChIP-Seq; DNA binding motifs; Transcription factors.
Copyright © 2016 Elsevier Ltd. All rights reserved.
Similar articles
-
Transcription factor-binding k-mer analysis clarifies the cell type dependency of binding specificities and cis-regulatory SNPs in humans.BMC Genomics. 2023 Oct 7;24(1):597. doi: 10.1186/s12864-023-09692-9. BMC Genomics. 2023. PMID: 37805453 Free PMC article.
-
Improved linking of motifs to their TFs using domain information.Bioinformatics. 2020 Mar 1;36(6):1655-1662. doi: 10.1093/bioinformatics/btz855. Bioinformatics. 2020. PMID: 31742324 Free PMC article.
-
MEME-ChIP: motif analysis of large DNA datasets.Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12. Bioinformatics. 2011. PMID: 21486936 Free PMC article.
-
DNA sequence motif: a jack of all trades for ChIP-Seq data.Adv Protein Chem Struct Biol. 2013;91:135-71. doi: 10.1016/B978-0-12-411637-5.00005-6. Adv Protein Chem Struct Biol. 2013. PMID: 23790213 Review.
-
An algorithmic perspective of de novo cis-regulatory motif finding based on ChIP-seq data.Brief Bioinform. 2018 Sep 28;19(5):1069-1081. doi: 10.1093/bib/bbx026. Brief Bioinform. 2018. PMID: 28334268 Review.
Cited by
-
Functional D-box sequences reset the circadian clock and drive mRNA rhythms.Commun Biol. 2019 Aug 8;2:300. doi: 10.1038/s42003-019-0522-3. eCollection 2019. Commun Biol. 2019. PMID: 31428688 Free PMC article.
-
Six6 and Six7 coordinately regulate expression of middle-wavelength opsins in zebrafish.Proc Natl Acad Sci U S A. 2019 Mar 5;116(10):4651-4660. doi: 10.1073/pnas.1812884116. Epub 2019 Feb 14. Proc Natl Acad Sci U S A. 2019. PMID: 30765521 Free PMC article.
-
Transcription factor-binding k-mer analysis clarifies the cell type dependency of binding specificities and cis-regulatory SNPs in humans.BMC Genomics. 2023 Oct 7;24(1):597. doi: 10.1186/s12864-023-09692-9. BMC Genomics. 2023. PMID: 37805453 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous