Graph Algorithms for Mixture Interpretation
- PMID: 33514030
- PMCID: PMC7911948
- DOI: 10.3390/genes12020185
Graph Algorithms for Mixture Interpretation
Abstract
The scale of genetic methods are presently being expanded: forensic genetic assays previously were limited to tens of loci, but now technologies allow for a transition to forensic genomic approaches that assess thousands to millions of loci. However, there are subtle distinctions between genetic assays and their genomic counterparts (especially in the context of forensics). For instance, forensic genetic approaches tend to describe a locus as a haplotype, be it a microhaplotype or a short tandem repeat with its accompanying flanking information. In contrast, genomic assays tend to provide not haplotypes but sequence variants or differences, variants which in turn describe how the alleles apparently differ from the reference sequence. By the given construction, mitochondrial genetic assays can be thought of as genomic as they often describe genetic differences in a similar way. The mitochondrial genetics literature makes clear that sequence differences, unlike the haplotypes they encode, are not comparable to each other. Different alignment algorithms and different variant calling conventions may cause the same haplotype to be encoded in multiple ways. This ambiguity can affect evidence and reference profile comparisons as well as how "match" statistics are computed. In this study, a graph algorithm is described (and implemented in the MMDIT (Mitochondrial Mixture Database and Interpretation Tool) R package) that permits the assessment of forensic match statistics on mitochondrial DNA mixtures in a way that is invariant to both the variant calling conventions followed and the alignment parameters considered. The algorithm described, given a few modest constraints, can be used to compute the "random man not excluded" statistic or the likelihood ratio. The performance of the approach is assessed in in silico mitochondrial DNA mixtures.
Keywords: graph algorithm; massively parallel sequencing; mitochondrial mixtures; mixture interpretation; probabilistic genotyping.
Conflict of interest statement
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Figures











Similar articles
-
MMDIT: A tool for the deconvolution and interpretation of mitochondrial DNA mixtures.Forensic Sci Int Genet. 2021 Nov;55:102568. doi: 10.1016/j.fsigen.2021.102568. Epub 2021 Aug 8. Forensic Sci Int Genet. 2021. PMID: 34416654
-
Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data.Bioinformatics. 2013 Sep 15;29(18):2245-52. doi: 10.1093/bioinformatics/btt386. Epub 2013 Jul 3. Bioinformatics. 2013. PMID: 23825370 Free PMC article.
-
A phylogenetic approach for haplotype analysis of sequence data from complex mitochondrial mixtures.Forensic Sci Int Genet. 2017 Sep;30:93-105. doi: 10.1016/j.fsigen.2017.05.007. Epub 2017 May 29. Forensic Sci Int Genet. 2017. PMID: 28667863
-
Review of alignment and SNP calling algorithms for next-generation sequencing data.J Appl Genet. 2016 Feb;57(1):71-9. doi: 10.1007/s13353-015-0292-7. Epub 2015 Jun 9. J Appl Genet. 2016. PMID: 26055432 Review.
-
A review of bioinformatic methods for forensic DNA analyses.Forensic Sci Int Genet. 2018 Mar;33:117-128. doi: 10.1016/j.fsigen.2017.12.005. Epub 2017 Dec 12. Forensic Sci Int Genet. 2018. PMID: 29247928 Review.
References
-
- Krawczak M. Forensic interpretation of haploid DNA mixtures. Int. Congress Ser. 2006;1288:477–483. doi: 10.1016/j.ics.2005.10.041. - DOI
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources