Using Trawler_standalone to discover overrepresented motifs in DNA and RNA sequences derived from various experiments including chromatin immunoprecipitation
- PMID: 20134431
- DOI: 10.1038/nprot.2009.158
Using Trawler_standalone to discover overrepresented motifs in DNA and RNA sequences derived from various experiments including chromatin immunoprecipitation
Abstract
Genome-wide location analysis has become a standard technology to unravel gene regulation networks. The accurate characterization of nucleotide signatures in sequences is key to uncovering the regulatory logic but remains a computational challenge. This protocol describes how to best characterize these signatures (motifs) using the new standalone version of Trawler, which was designed and optimized to analyze chromatin immunoprecipitation (ChIP) data sets. In particular, we describe the three main steps of Trawler_standalone (motif discovery, clustering and visualization) and discuss the appropriate parameters to be used in each step depending on the data set and the biological questions addressed. Compared to five other motif discovery programs, Trawler_standalone is in most cases the fastest algorithm to accurately predict the correct motifs especially for large data sets. Its running time ranges within few seconds to several minutes, depending on the size of the data set and the parameters used. This protocol is best suited for bioinformaticians seeking to use Trawler_standalone in a high-throughput manner.
Similar articles
-
TrawlerWeb: an online de novo motif discovery tool for next-generation sequencing datasets.BMC Genomics. 2018 Apr 5;19(1):238. doi: 10.1186/s12864-018-4630-0. BMC Genomics. 2018. PMID: 29621972 Free PMC article.
-
Trawler: de novo regulatory motif discovery pipeline for chromatin immunoprecipitation.Nat Methods. 2007 Jul;4(7):563-5. doi: 10.1038/nmeth1061. Epub 2007 Jun 24. Nat Methods. 2007. PMID: 17589518
-
SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences.BMC Genomics. 2014 Oct 23;15(1):925. doi: 10.1186/1471-2164-15-925. BMC Genomics. 2014. PMID: 25341390 Free PMC article.
-
Discovering sequence motifs.Methods Mol Biol. 2007;395:271-92. doi: 10.1007/978-1-59745-514-5_17. Methods Mol Biol. 2007. PMID: 17993680 Review.
-
An extension and novel solution to the (l,d)-motif challenge problem.Genome Inform. 2004;15(2):63-71. Genome Inform. 2004. PMID: 15706492 Review.
Cited by
-
An intuitionistic approach to scoring DNA sequences against transcription factor binding site motifs.BMC Bioinformatics. 2010 Nov 8;11:551. doi: 10.1186/1471-2105-11-551. BMC Bioinformatics. 2010. PMID: 21059262 Free PMC article.
-
The light responsive transcriptome of the zebrafish: function and regulation.PLoS One. 2011 Feb 15;6(2):e17080. doi: 10.1371/journal.pone.0017080. PLoS One. 2011. PMID: 21390203 Free PMC article.
-
TrawlerWeb: an online de novo motif discovery tool for next-generation sequencing datasets.BMC Genomics. 2018 Apr 5;19(1):238. doi: 10.1186/s12864-018-4630-0. BMC Genomics. 2018. PMID: 29621972 Free PMC article.
-
NKX2-5 congenital heart disease mutations show diverse loss and gain of epigenomic, biochemical and chromatin search functions underpinning pathogenicity.bioRxiv [Preprint]. 2025 Jun 20:2025.06.20.659510. doi: 10.1101/2025.06.20.659510. bioRxiv. 2025. PMID: 40568061 Free PMC article. Preprint.
-
NKX2-5 mutations causative for congenital heart disease retain functionality and are directed to hundreds of targets.Elife. 2015 Jul 6;4:e06942. doi: 10.7554/eLife.06942. Elife. 2015. PMID: 26146939 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources