SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences
- PMID: 25341390
- PMCID: PMC4223730
- DOI: 10.1186/1471-2164-15-925
SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences
Abstract
Background: The large amount of data produced by high-throughput sequencing poses new computational challenges. In the last decade, several tools have been developed for the identification of transcription and splicing factor binding sites.
Results: Here, we introduce the SeAMotE (Sequence Analysis of Motifs Enrichment) algorithm for discovery of regulatory regions in nucleic acid sequences. SeAMotE provides (i) a robust analysis of high-throughput sequence sets, (ii) a motif search based on pattern occurrences and (iii) an easy-to-use web-server interface. We applied our method to recently published data including 351 chromatin immunoprecipitation (ChIP) and 13 crosslinking immunoprecipitation (CLIP) experiments and compared our results with those of other well-established motif discovery tools. SeAMotE shows an average accuracy of 80% in finding discriminative motifs and outperforms other methods available in literature.
Conclusions: SeAMotE is a fast, accurate and flexible algorithm for the identification of sequence patterns involved in protein-DNA and protein-RNA recognition. The server can be freely accessed at http://s.tartaglialab.com/new_submission/seamote.
Figures




Similar articles
-
TrawlerWeb: an online de novo motif discovery tool for next-generation sequencing datasets.BMC Genomics. 2018 Apr 5;19(1):238. doi: 10.1186/s12864-018-4630-0. BMC Genomics. 2018. PMID: 29621972 Free PMC article.
-
MEME-ChIP: motif analysis of large DNA datasets.Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12. Bioinformatics. 2011. PMID: 21486936 Free PMC article.
-
An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments.Nat Biotechnol. 2002 Aug;20(8):835-9. doi: 10.1038/nbt717. Epub 2002 Jul 8. Nat Biotechnol. 2002. PMID: 12101404
-
A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.Biol Direct. 2014 Feb 20;9:4. doi: 10.1186/1745-6150-9-4. Biol Direct. 2014. PMID: 24555784 Free PMC article. Review.
-
An algorithmic perspective of de novo cis-regulatory motif finding based on ChIP-seq data.Brief Bioinform. 2018 Sep 28;19(5):1069-1081. doi: 10.1093/bib/bbx026. Brief Bioinform. 2018. PMID: 28334268 Review.
Cited by
-
Mechanisms and consequences of subcellular RNA localization across diverse cell types.Traffic. 2020 Jun;21(6):404-418. doi: 10.1111/tra.12730. Epub 2020 Apr 29. Traffic. 2020. PMID: 32291836 Free PMC article. Review.
-
Combining phylogenetic footprinting with motif models incorporating intra-motif dependencies.BMC Bioinformatics. 2017 Mar 1;18(1):141. doi: 10.1186/s12859-017-1495-1. BMC Bioinformatics. 2017. PMID: 28249564 Free PMC article.
-
WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data.Sci Rep. 2017 Jun 12;7(1):3217. doi: 10.1038/s41598-017-03554-7. Sci Rep. 2017. PMID: 28607381 Free PMC article.
-
By the company they keep: interaction networks define the binding ability of transcription factors.Nucleic Acids Res. 2015 Oct 30;43(19):e125. doi: 10.1093/nar/gkv607. Epub 2015 Jun 18. Nucleic Acids Res. 2015. PMID: 26089389 Free PMC article.
-
Zooming in on protein-RNA interactions: a multi-level workflow to identify interaction partners.Biochem Soc Trans. 2020 Aug 28;48(4):1529-1543. doi: 10.1042/BST20191059. Biochem Soc Trans. 2020. PMID: 32820806 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources