Bioinformatic removal of NUMT-associated variants in mitotiling next-generation sequencing data from whole blood samples
- PMID: 30151910
- DOI: 10.1002/elps.201800135
Bioinformatic removal of NUMT-associated variants in mitotiling next-generation sequencing data from whole blood samples
Abstract
Nuclear mitochondrial DNA segments (NUMTs) have arisen because of the transposition of segments of the mitochondrial DNA genome (mitogenome) into the nuclear genome. When using a "mitotiling" strategy, NUMTs may be more readily amplified when targeting the entire mitogenome compared to the control region, as hundreds of primers are required for complete sequencing coverage. In samples with a high percentage of nuclear DNA copies per cell, such as whole blood, NUMT coenrichment may be exacerbated. The present study examined bioinformatic approaches for removing NUMTs and NUMT-associated variants (NAVs) from next-generation sequence data generated using two mitotiling kits (Precision ID and QIAseq). Across 16 samples with low mtDNA copy number, NUMT coenrichment produced 890 NAVs with >5% variant frequency. The use of the consensus sequence to eliminate NUMT reads proved to be effective for QIAseq data, and resulted in >85% NAV removal in Precision ID data. This method was bolstered by NAV filtering in Precision ID analysis. Alternative high stringency mapping to the revised Cambridge Reference Sequence (rCRS) and the human genome reference GRCh38 for the QIAseq data caused a reduction in mitogenome coverage without complete NUMT removal. These bioinformatic solutions facilitate mitotiling sequence data analysis for low-level variant detection.
Keywords: Bioinformatics; Mitochondrial DNA; Next-generation sequencing; Nuclear-mitochondrial DNA segments.
Published 2018. This article is a U.S. Government work and is in the public domain in the USA.
Similar articles
-
The Value of Whole-Genome Sequencing for Mitochondrial DNA Population Studies: Strategies and Criteria for Extracting High-Quality Mitogenome Haplotypes.Int J Mol Sci. 2022 Feb 17;23(4):2244. doi: 10.3390/ijms23042244. Int J Mol Sci. 2022. PMID: 35216360 Free PMC article.
-
The Mighty NUMT: Mitochondrial DNA Flexing Its Code in the Nuclear Genome.Biomolecules. 2023 Apr 27;13(5):753. doi: 10.3390/biom13050753. Biomolecules. 2023. PMID: 37238623 Free PMC article. Review.
-
Distinguishing mitochondrial DNA and NUMT sequences amplified with the precision ID mtDNA whole genome panel.Mitochondrion. 2020 Nov;55:122-133. doi: 10.1016/j.mito.2020.09.001. Epub 2020 Sep 17. Mitochondrion. 2020. PMID: 32949792
-
Numt Parser: Automated identification and removal of nuclear mitochondrial pseudogenes (numts) for accurate mitochondrial genome reconstruction in Panthera.J Hered. 2023 Apr 6;114(2):120-130. doi: 10.1093/jhered/esac065. J Hered. 2023. PMID: 36525576
-
Interpreting NUMTs in forensic genetics: Seeing the forest for the trees.Forensic Sci Int Genet. 2021 Jul;53:102497. doi: 10.1016/j.fsigen.2021.102497. Epub 2021 Mar 15. Forensic Sci Int Genet. 2021. PMID: 33740708 Review.
Cited by
-
The Value of Whole-Genome Sequencing for Mitochondrial DNA Population Studies: Strategies and Criteria for Extracting High-Quality Mitogenome Haplotypes.Int J Mol Sci. 2022 Feb 17;23(4):2244. doi: 10.3390/ijms23042244. Int J Mol Sci. 2022. PMID: 35216360 Free PMC article.
-
From Forensics to Clinical Research: Expanding the Variant Calling Pipeline for the Precision ID mtDNA Whole Genome Panel.Int J Mol Sci. 2021 Nov 6;22(21):12031. doi: 10.3390/ijms222112031. Int J Mol Sci. 2021. PMID: 34769461 Free PMC article.
-
Comparisons of aged samples and modern references provide algorithm for mtDNA analysis in challenging material.Sci Rep. 2025 Feb 24;15(1):6682. doi: 10.1038/s41598-025-90375-8. Sci Rep. 2025. PMID: 39994259 Free PMC article.
-
Developmental Validation of a MPS Workflow with a PCR-Based Short Amplicon Whole Mitochondrial Genome Panel.Genes (Basel). 2020 Nov 13;11(11):1345. doi: 10.3390/genes11111345. Genes (Basel). 2020. PMID: 33202822 Free PMC article.
-
Platinum-Quality Mitogenome Haplotypes from United States Populations.Genes (Basel). 2020 Oct 29;11(11):1290. doi: 10.3390/genes11111290. Genes (Basel). 2020. PMID: 33138247 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources