Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015:2015:853461.
doi: 10.1155/2015/853461. Epub 2015 Aug 10.

An Affinity Propagation-Based DNA Motif Discovery Algorithm

Affiliations

An Affinity Propagation-Based DNA Motif Discovery Algorithm

Chunxiao Sun et al. Biomed Res Int. 2015.

Abstract

The planted (l, d) motif search (PMS) is one of the fundamental problems in bioinformatics, which plays an important role in locating transcription factor binding sites (TFBSs) in DNA sequences. Nowadays, identifying weak motifs and reducing the effect of local optimum are still important but challenging tasks for motif discovery. To solve the tasks, we propose a new algorithm, APMotif, which first applies the Affinity Propagation (AP) clustering in DNA sequences to produce informative and good candidate motifs and then employs Expectation Maximization (EM) refinement to obtain the optimal motifs from the candidate motifs. Experimental results both on simulated data sets and real biological data sets show that APMotif usually outperforms four other widely used algorithms in terms of high prediction accuracy.

PubMed Disclaimer

Figures

Figure 1
Figure 1
AP clustering results.
Figure 2
Figure 2
Sequence logos of the predicted motifs.
Figure 3
Figure 3
Prediction accuracy on real biological data.
Figure 4
Figure 4
Prediction accuracy on Tompa data.
Algorithm 1
Algorithm 1
APMotif.

Similar articles

Cited by

References

    1. Stormo G. D. DNA binding sites: representation and discovery. Bioinformatics. 2000;16(1):16–23. doi: 10.1093/bioinformatics/16.1.16. - DOI - PubMed
    1. Pevzner P. A., Sze S.-H. Combinatorial approaches to finding subtle signals in DNA sequences. Proceedings of the 8th International Conference on Intelligent Systems for Molecular Biology; August 2000; San Diego, Calif, USA. pp. 269–278. - PubMed
    1. Schneider T. D. Consensus sequence zen. Applied Bioinformaitics. 2002;1(3):111–119. - PMC - PubMed
    1. Yu Q., Huo H., Zhang Y., Guo H. PairMotif: a new pattern-driven algorithm for planted (l, d) DNA motif search. PLoS ONE. 2012;7(10) doi: 10.1371/journal.pone.0048442.e48442 - DOI - PMC - PubMed
    1. Chin F. Y. L., Leung H. C. M. Voting algorithms for discovering long motifs. Proceedings of the 3rd Asia-Pacific Bioinformatics Conference (APBC '05); January 2005; pp. 261–271.

Publication types

LinkOut - more resources