An adaptive window length strategy for eukaryotic CDS prediction
- PMID: 24384711
- DOI: 10.1109/TCBB.2013.76
Abstract
Signal processing-based algorithms for identification of coding sequences (CDS) in eukaryotes are non-data-driven and exploit the presence of three-base periodicity in these regions for their detection. Three-base periodicity is commonly detected using the short-time Fourier transform (STFT), which uses a window function of fixed length. Because the lengths of protein-coding and noncoding regions vary widely, the identification accuracy of STFT-based algorithms is poor. In this paper, a novel signal processing-based algorithm is developed that enables window length adaptation in the STFT of DNA sequences to improve the identification of three-base periodicity. The length of the window function is made adaptive: in coding regions it is chosen to maximize the magnitude of the period-3 measure, whereas in noncoding regions it is tailored to minimize this measure. Simulation results on benchmark data sets demonstrate the advantage of this algorithm when compared with other non-data-driven methods for CDS prediction.
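The period-3 measure mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not say how the method decides where to maximize and where to minimize, so the code below only evaluates a commonly used period-3 measure (the power of the four binary base-indicator sequences at frequency 1/3) over several candidate window lengths and reports the extremes. The function names, the normalization by the squared window length, and the candidate lengths are all assumptions made for illustration.

```python
import numpy as np

BASES = "ACGT"


def period3_measure(seq, center, length):
    """Normalized period-3 power of a window of about `length` bases centred at `center`.

    Assumption: rectangular window, binary indicator sequences for A, C, G, T,
    and normalization by the squared window length so that different window
    lengths are comparable.
    """
    half = length // 2
    start = max(0, center - half)
    stop = min(len(seq), center + half + 1)
    window = seq[start:stop]
    positions = np.arange(start, stop)
    # Complex exponential at frequency 1/3, i.e. the DFT bin k = N/3.
    phase = np.exp(-2j * np.pi * positions / 3)
    power = 0.0
    for base in BASES:
        indicator = np.array([c == base for c in window], dtype=float)
        power += abs(np.sum(indicator * phase)) ** 2
    return power / len(window) ** 2


def window_extremes(seq, center, lengths=(63, 123, 243, 351)):
    """Return (max, min) of the period-3 measure over hypothetical candidate lengths."""
    scores = [period3_measure(seq, center, n) for n in lengths]
    return max(scores), min(scores)
```

For a perfectly periodic sequence such as `"ATG" * 40` the measure is close to 1/3, while for a homopolymer such as `"A" * 120` it is close to zero. An adaptive scheme in the spirit of the abstract would pick the window length giving the larger value where a region is believed to be coding and the smaller value elsewhere.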
Similar articles
- A three-state model for DNA protein-coding regions. IEEE Trans Biomed Eng. 2006 Nov;53(11):2148-55. doi: 10.1109/TBME.2006.879477. PMID: 17073319
- Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence. J Theor Biol. 2007 Aug 21;247(4):687-94. doi: 10.1016/j.jtbi.2007.03.038. Epub 2007 Apr 10. PMID: 17509616
- Discrete Ramanujan transform for distinguishing the protein coding regions from other regions. Mol Cell Probes. 2014 Oct-Dec;28(5-6):228-36. doi: 10.1016/j.mcp.2014.04.002. Epub 2014 Apr 29. PMID: 24787059
- Gene prediction based on DNA spectral analysis: a literature review. J Comput Biol. 2011 Apr;18(4):639-76. doi: 10.1089/cmb.2010.0184. Epub 2011 Mar 7. PMID: 21381961 Review.
- Eukaryotic transcription factor binding sites--modeling and integrative search methods. Bioinformatics. 2008 Jun 1;24(11):1325-31. doi: 10.1093/bioinformatics/btn198. Epub 2008 Apr 21. PMID: 18426806 Review.
Cited by
- Cloud-based adaptive exon prediction for DNA analysis. Healthc Technol Lett. 2018 Jan 22;5(1):25-30. doi: 10.1049/htl.2017.0032. eCollection 2018 Feb. PMID: 29515813 Free PMC article.
- Optimized convolutional neural network using African vulture optimization algorithm for the detection of exons. Sci Rep. 2025 Jan 30;15(1):3810. doi: 10.1038/s41598-025-86672-x. PMID: 39885276 Free PMC article.
- Short Exon Detection via Wavelet Transform Modulus Maxima. PLoS One. 2016 Sep 16;11(9):e0163088. doi: 10.1371/journal.pone.0163088. eCollection 2016. PMID: 27635656 Free PMC article.
- Exon prediction based on multiscale products of a genomic-inspired multiscale bilateral filtering. PLoS One. 2019 Mar 21;14(3):e0205050. doi: 10.1371/journal.pone.0205050. eCollection 2019. PMID: 30897105 Free PMC article.