SAM: String-based sequence search algorithm for mitochondrial DNA database queries
- PMID: 21056022
- PMCID: PMC3064999
- DOI: 10.1016/j.fsigen.2010.10.006
SAM: String-based sequence search algorithm for mitochondrial DNA database queries
Abstract
The analysis of the haploid mitochondrial (mt) genome has numerous applications in forensic and population genetics, as well as in disease studies. Although mtDNA haplotypes are usually determined by sequencing, they are rarely reported as a nucleotide string. Traditionally they are presented in a difference-coded position-based format relative to the corrected version of the first sequenced mtDNA. This convention requires recommendations for standardized sequence alignment that is known to vary between scientific disciplines, even between laboratories. As a consequence, database searches that are vital for the interpretation of mtDNA data can suffer from biased results when query and database haplotypes are annotated differently. In the forensic context that would usually lead to underestimation of the absolute and relative frequencies. To address this issue we introduce SAM, a string-based search algorithm that converts query and database sequences to position-free nucleotide strings and thus eliminates the possibility that identical sequences will be missed in a database query. The mere application of a BLAST algorithm would not be a sufficient remedy as it uses a heuristic approach and does not address properties specific to mtDNA, such as phylogenetically stable but also rapidly evolving insertion and deletion events. The software presented here provides additional flexibility to incorporate phylogenetic data, site-specific mutation rates, and other biologically relevant information that would refine the interpretation of mitochondrial DNA data. The manuscript is accompanied by freeware and example data sets that can be used to evaluate the new software (http://stringvalidation.org).
Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Similar articles
-
Fine-Tuning Phylogenetic Alignment and Haplogrouping of mtDNA Sequences.Int J Mol Sci. 2021 May 27;22(11):5747. doi: 10.3390/ijms22115747. Int J Mol Sci. 2021. PMID: 34072215 Free PMC article.
-
Next generation database search algorithm for forensic mitogenome analyses.Forensic Sci Int Genet. 2018 Nov;37:204-214. doi: 10.1016/j.fsigen.2018.09.001. Epub 2018 Sep 9. Forensic Sci Int Genet. 2018. PMID: 30241075
-
DNA Commission of the International Society for Forensic Genetics: revised and extended guidelines for mitochondrial DNA typing.Forensic Sci Int Genet. 2014 Nov;13:134-42. doi: 10.1016/j.fsigen.2014.07.010. Epub 2014 Jul 29. Forensic Sci Int Genet. 2014. PMID: 25117402
-
Inspecting close maternal relatedness: Towards better mtDNA population samples in forensic databases.Forensic Sci Int Genet. 2011 Mar;5(2):138-41. doi: 10.1016/j.fsigen.2010.10.001. Epub 2010 Nov 9. Forensic Sci Int Genet. 2011. PMID: 21067986 Free PMC article.
-
HAPLOFIND: a new method for high-throughput mtDNA haplogroup assignment.Hum Mutat. 2013 Sep;34(9):1189-94. doi: 10.1002/humu.22356. Epub 2013 Jun 12. Hum Mutat. 2013. PMID: 23696374
Cited by
-
Fine-Tuning Phylogenetic Alignment and Haplogrouping of mtDNA Sequences.Int J Mol Sci. 2021 May 27;22(11):5747. doi: 10.3390/ijms22115747. Int J Mol Sci. 2021. PMID: 34072215 Free PMC article.
-
Length heteroplasmy of the polyC-polyT-polyC stretch in the dog mtDNA control region.Int J Legal Med. 2015 Sep;129(5):927-35. doi: 10.1007/s00414-014-1106-x. Epub 2014 Nov 14. Int J Legal Med. 2015. PMID: 25394743
-
Graph Algorithms for Mixture Interpretation.Genes (Basel). 2021 Jan 27;12(2):185. doi: 10.3390/genes12020185. Genes (Basel). 2021. PMID: 33514030 Free PMC article.
-
Claudin-7 indirectly regulates the integrin/FAK signaling pathway in human colon cancer tissue.J Hum Genet. 2016 Aug;61(8):711-20. doi: 10.1038/jhg.2016.35. Epub 2016 Apr 28. J Hum Genet. 2016. PMID: 27121327
-
mitoLEAF: mitochondrial DNA Lineage, Evolution, Annotation Framework.NAR Genom Bioinform. 2025 Jun 11;7(2):lqaf079. doi: 10.1093/nargab/lqaf079. eCollection 2025 Jun. NAR Genom Bioinform. 2025. PMID: 40503051 Free PMC article.
References
-
- Bär W., Brinkmann B., Budowle B., Carracedo A., Gill P., Holland M., Lincoln P.J., Mayr W., Morling N., Olaisen B., Schneider P.M., Tully G., Wilson M. DNA Commission of the International Society for Forensic Genetics: guidelines for mitochondrial DNA typing. Int. J. Legal Med. 2000;113:193–196. - PubMed
-
- Andrews R.M., Kubacka I., Chinnery P.F., Lightowlers R.N., Turnbull D.M., Howell N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat. Genet. 1999;23:147. - PubMed
-
- Anderson S., Bankier A.T., Barrell B.G., de Bruijn M.H., Coulson A.R., Drouin J., Eperon I.C., Nierlich D.P., Roe B.A., Sanger F., Schreier P.H., Smith A.J., Staden R., Young I.G. Sequence and organization of the human mitochondrial genome. Nature. 1981;290:457–465. - PubMed
-
- Tully G., Bär W., Brinkmann B., Carracedo A., Gill P., Morling N., Parson W., Schneider P. Considerations by the European DNA profiling (EDNAP) group on the working practices, nomenclature and interpretation of mitochondrial DNA profiles. Forensic Sci. Int. 2001;124:83–91. - PubMed
-
- Wilson M.R., Allard M.W., Monson K.L., Miller K.W., Budowle B. Recommendations for consistent treatment of length variants in the human mitochondrial DNA control region. Forensic Sci. Int. 2002;129:35–42. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials