A Spectral Rotation Method with Triplet Periodicity Property for Planted Motif Finding Problems
- PMID: 31782356
- DOI: 10.2174/1386207322666191129112433
A Spectral Rotation Method with Triplet Periodicity Property for Planted Motif Finding Problems
Abstract
Background: Genes are known as functional patterns in the genome and are presumed to have biological significance. They can indicate binding sites for transcription factors and they encode certain proteins. Finding genes from biological sequences is a major task in computational biology for unraveling the mechanisms of gene expression.
Objective: Planted motif finding problems are a class of mathematical models abstracted from the process of detecting genes from genome, in which a specific gene with a number of mutations is planted into a randomly generated background sequence, and then gene finding algorithms can be tested to check if the planted gene can be found in feasible time.
Methods: In this work, a spectral rotation method based on triplet periodicity property is proposed to solve planted motif finding problems.
Results: The proposed method gives significant tolerance of base mutations in genes. Specifically, genes having a number of substitutions can be detected from randomly generated background sequences. Experimental results on genomic data set from Saccharomyces cerevisiae reveal that genes can be visually distinguished. It is proposed that genes with about 50% mutations can be detected from randomly generated background sequences.
Conclusion: It is found that with about 5 insertions or deletions, this method fails in finding the planted genes. For a particular case, if the deletion of bases is located at the beginning of the gene, that is, bases are not randomly deleted, then the tolerance of the method for base deletion is increased.
Keywords: Gene detection; fast algorithm; fourier spectrums; motif finding; planted motif finding problem; visualization method..
Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.
Similar articles
-
Visualization of the protein-coding regions with a self adaptive spectral rotation approach.Nucleic Acids Res. 2011 Jan;39(1):e3. doi: 10.1093/nar/gkq891. Epub 2010 Oct 14. Nucleic Acids Res. 2011. PMID: 20947567 Free PMC article.
-
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9. PLoS Comput Biol. 2005. PMID: 16477324 Free PMC article.
-
Finding motifs in DNA sequences using low-dispersion sequences.J Comput Biol. 2014 Apr;21(4):320-9. doi: 10.1089/cmb.2013.0054. Epub 2014 Mar 5. J Comput Biol. 2014. PMID: 24597706 Free PMC article.
-
Voting algorithms for the motif finding problem.Comput Syst Bioinformatics Conf. 2008;7:37-47. Comput Syst Bioinformatics Conf. 2008. PMID: 19642267
-
Finding motifs using random projections.J Comput Biol. 2002;9(2):225-42. doi: 10.1089/10665270252935430. J Comput Biol. 2002. PMID: 12015879
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases