Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Apr 9:11:179.
doi: 10.1186/1471-2105-11-179.

The value of position-specific priors in motif discovery using MEME

Affiliations

The value of position-specific priors in motif discovery using MEME

Timothy L Bailey et al. BMC Bioinformatics. .

Abstract

Background: Position-specific priors have been shown to be a flexible and elegant way to extend the power of Gibbs sampler-based motif discovery algorithms. Information of many types-including sequence conservation, nucleosome positioning, and negative examples-can be converted into a prior over the location of motif sites, which then guides the sequence motif discovery algorithm. This approach has been shown to confer many of the benefits of conservation-based and discriminative motif discovery approaches on Gibbs sampler-based motif discovery methods, but has not previously been studied with methods based on expectation maximization (EM).

Results: We extend the popular EM-based MEME algorithm to utilize position-specific priors and demonstrate their effectiveness for discovering transcription factor (TF) motifs in yeast and mouse DNA sequences. Utilizing a discriminative, conservation-based prior dramatically improves MEME's ability to discover motifs in 156 yeast TF ChIP-chip datasets, more than doubling the number of datasets where it finds the correct motif. On these datasets, MEME using the prior has a higher success rate than eight other conservation-based motif discovery approaches. We also show that the same type of prior improves the accuracy of motifs discovered by MEME in mouse TF ChIP-seq data, and that the motifs tend to be of slightly higher quality those found by a Gibbs sampling algorithm using the same prior.

Conclusions: We conclude that using position-specific priors can substantially increase the power of EM-based motif discovery algorithms such as MEME algorithm.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison of motifs found in mouse ChIP-seq datasets. The figure shows the motifs reported by Chen et al. [11] and those found by MEME in sequences identified as bound to the given transcription factor in 13 ChIP-seq experiments. The MEME motifs were found using 100 randomly chosen bound sequences and the OOPS-formula image prior. The inter-motif distance (scaled Euclidean distance) is computed as described in Additional file 1.

References

    1. Siddharthan R, Siggia ED, van Nimwegen E. PhyloGibbs: a gibbs sampling motif finder that incorporates phylogeny. PLoS Comput Biol. 2005;1(7):e67. doi: 10.1371/journal.pcbi.0010067. - DOI - PMC - PubMed
    1. Fang F, Blanchette M. FootPrinter3: phylogenetic footprinting in partially alignable sequences. Nucleic Acids Res. 2006. pp. W617–W620. - DOI - PMC - PubMed
    1. Redhead E, Bailey T. Discriminative motif discovery in DNA and protein sequences using the DEME Algorithm. BMC Bioinformatics. 2007;8:385. doi: 10.1186/1471-2105-8-385. - DOI - PMC - PubMed
    1. Barash Y, Bejerano G, Friedman N. In: Algorithms in Bioinformatics: Proc. First International Workshop, no. 2149 in LNCS. Gascuel O, Moret BME, editor. 2001. A simple hyper-geometric approach for discovering putative transcription factor binding sites; pp. 278–293.
    1. Foat BC, Houshmandi SS, Olivas WM, Bussemaker HJ. Profiling condition-specific, genome-wide regulation of mRNA stability in yeast. Proc Natl Acad Sci USA. 2005;102(49):17675–17680. doi: 10.1073/pnas.0503803102. - DOI - PMC - PubMed

Publication types

Substances