A novel approach to the detection of genomic approximate tandem repeats in the Levenshtein metric
- PMID: 17803368
- DOI: 10.1089/cmb.2007.0018
A novel approach to the detection of genomic approximate tandem repeats in the Levenshtein metric
Abstract
An efficient algorithm for detecting approximate tandem repeats in genomic sequences is presented. The algorithm is based on innovative statistical criteria to detect candidate regions which may include tandem repeats; these regions are subsequently verified by alignments based on dynamic programming. No prior information about the period size or pattern is needed. Also, the algorithm is virtually capable of detecting repeats with any period. An implementation of the algorithm is compared with the two state-of-the-art tandem repeats detection tools to demonstrate its effectiveness both on natural and synthetic data. The algorithm is available at www.cs.brown.edu/people/domanic/tandem/.
Similar articles
-
Finding approximate tandem repeats in genomic sequences.J Comput Biol. 2005 Sep;12(7):928-42. doi: 10.1089/cmb.2005.12.928. J Comput Biol. 2005. PMID: 16201913 Review.
-
Tandem repeats over the edit distance.Bioinformatics. 2007 Jan 15;23(2):e30-5. doi: 10.1093/bioinformatics/btl309. Bioinformatics. 2007. PMID: 17237101
-
Model of perfect tandem repeat with random pattern and empirical homogeneity testing poly-criteria for latent periodicity revelation in biological sequences.Math Biosci. 2008 Jan;211(1):186-204. doi: 10.1016/j.mbs.2007.10.008. Epub 2007 Nov 5. Math Biosci. 2008. PMID: 18062999
-
Repseek, a tool to retrieve approximate repeats from large DNA sequences.Bioinformatics. 2007 Jan 1;23(1):119-21. doi: 10.1093/bioinformatics/btl519. Epub 2006 Oct 11. Bioinformatics. 2007. PMID: 17038345
-
Key-string algorithm--novel approach to computational analysis of repetitive sequences in human centromeric DNA.Croat Med J. 2003 Aug;44(4):386-406. Croat Med J. 2003. PMID: 12950141 Review.
Cited by
-
NTRFinder: a software tool to find nested tandem repeats.Nucleic Acids Res. 2012 Feb;40(3):e17. doi: 10.1093/nar/gkr1070. Epub 2011 Nov 25. Nucleic Acids Res. 2012. PMID: 22121222 Free PMC article.
-
Database of Periodic DNA Regions in Major Genomes.Biomed Res Int. 2017;2017:7949287. doi: 10.1155/2017/7949287. Epub 2017 Jan 15. Biomed Res Int. 2017. PMID: 28182099 Free PMC article.
-
Finding long tandem repeats in long noisy reads.Bioinformatics. 2021 May 5;37(5):612-621. doi: 10.1093/bioinformatics/btaa865. Bioinformatics. 2021. PMID: 33031558 Free PMC article.
-
High-fidelity, large-scale targeted profiling of microsatellites.Genome Res. 2024 Aug 20;34(7):1008-1026. doi: 10.1101/gr.278785.123. Genome Res. 2024. PMID: 39013593 Free PMC article.
-
Rapid detection of expanded short tandem repeats in personal genomics using hybrid sequencing.Bioinformatics. 2014 Mar 15;30(6):815-22. doi: 10.1093/bioinformatics/btt647. Epub 2013 Nov 8. Bioinformatics. 2014. PMID: 24215022 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources