Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 8;8(1):575-587.
doi: 10.3233/ADR-230120. eCollection 2024.

MitoH3: Mitochondrial Haplogroup and Homoplasmic/Heteroplasmic Variant Calling Pipeline for Alzheimer's Disease Sequencing Project

Affiliations

MitoH3: Mitochondrial Haplogroup and Homoplasmic/Heteroplasmic Variant Calling Pipeline for Alzheimer's Disease Sequencing Project

Congcong Zhu et al. J Alzheimers Dis Rep. .

Abstract

Background: Mitochondrial DNA (mtDNA) is a double-stranded circular DNA and has multiple copies in each cell. Excess heteroplasmy, the coexistence of distinct variants in copies of mtDNA within a cell, may lead to mitochondrial impairments. Accurate determination of heteroplasmy in whole-genome sequencing (WGS) data has posed a significant challenge because mitochondria carrying heteroplasmic variants cannot be distinguished during library preparation. Moreover, sequencing errors, contamination, and nuclear mtDNA segments can reduce the accuracy of heteroplasmic variant calling.

Objective: To efficiently and accurately call mtDNA homoplasmic and heteroplasmic variants from the large-scale WGS data generated from the Alzheimer's Disease Sequencing Project (ADSP), and test their association with Alzheimer's disease (AD).

Methods: In this study, we present MitoH3-a comprehensive computational pipeline for calling mtDNA homoplasmic and heteroplasmic variants and inferring haplogroups in the ADSP WGS data. We first applied MitoH3 to 45 technical replicates from 6 subjects to define a threshold for detecting heteroplasmic variants. Then using the threshold of 5% ≤variant allele fraction≤95%, we further applied MitoH3 to call heteroplasmic variants from a total of 16,113 DNA samples with 6,742 samples from cognitively normal controls and 6,183 from AD cases.

Results: This pipeline is available through the Singularity container engine. For 4,311 heteroplasmic variants identified from 16,113 samples, no significant variant count difference was observed between AD cases and controls.

Conclusions: Our streamlined pipeline, MitoH3, enables computationally efficient and accurate analysis of a large number of samples.

Keywords: Alzheimer’s disease; haplogroup; homoplasmic and heteroplasmic variant calling; mitochondrial DNA; whole genome sequencing.

PubMed Disclaimer

Conflict of interest statement

The authors have no conflict of interest to report.

Figures

Fig. 1
Fig. 1
Sum of inconsistent ratios of (A) Homoplasmic calls and (B) Heteroplasmic calls among technical replicates of each subject (n = 6) according to 5 different VAF thresholds.
Fig. 2
Fig. 2
MT haplogroup distribution in African American (AA), Caribbean Hispanic (CH), and European ancestry (EA) cohorts. Bolded numbers indicate that the haplogroup is more predominant in the particular ancestry group compared to other groups. Red-highlighted numbers signify that the haplogroup constitutes more than 5% within the ancestry, while green-highlighted numbers indicate that the haplogroup comprises less than 5%, yet the frequency is 5-fold greater than in other ancestral groups at least.
Fig. 3
Fig. 3
Among individuals in the total sample, (A) variant allele fraction (VAF) distribution of mtDNA variant calls and (B) proportion of heteroplasmic calls and homoplasmic calls.
Fig. 4
Fig. 4
MT variant distributions in the total sample. A) Proportion of variants by plasmy status. B) Number of variants detected according to allele frequency strata. C) Proportion of variants by plasmy status in each frequency strata.
Fig. 5
Fig. 5
Boxplots showing the mean number of homoplasmic calls according to (A) ancestry group and (B) tissue type among European ancestry individuals. Bar plots show the proportion of heteroplasmic calls according to (C) ancestry group and (D) tissue type among European ancestry individuals.

Similar articles

Cited by

  • Alzheimer's Disease Sequencing Project Release 4 Whole Genome Sequencing Dataset.
    Leung YY, Lee WP, Kuzma AB, Nicaretta H, Valladares O, Gangadharan P, Qu L, Zhao Y, Ren Y, Cheng PL, Kuksa PP, Wang H, White H, Katanic Z, Bass L, Saravanan N, Greenfest-Allen E, Kirsch M, Cantwell L, Iqbal T, Wheeler NR, Farrell JJ, Zhu C, Turner SL, Gunasekaran TI, Mena PR, Jin J, Carter L; Alzheimer’s Disease Sequencing Project; Zhang X, Vardarajan BN, Toga A, Cuccaro M, Hohman TJ, Bush WS, Naj AC, Martin E, Dalgard C, Kunkle BW, Farrer LA, Mayeux RP, Haines JL, Pericak-Vance MA, Schellenberg GD, Wang LS. Leung YY, et al. medRxiv [Preprint]. 2024 Dec 6:2024.12.03.24317000. doi: 10.1101/2024.12.03.24317000. medRxiv. 2024. Update in: Alzheimers Dement. 2025 May;21(5):e70237. doi: 10.1002/alz.70237. PMID: 39677464 Free PMC article. Updated. Preprint.
  • Alzheimer's Disease Sequencing Project release 4 whole genome sequencing dataset.
    Leung YY, Lee WP, Kuzma AB, Nicaretta H, Valladares O, Gangadharan P, Qu L, Zhao Y, Ren Y, Cheng PL, Kuksa PP, Wang H, White H, Katanic Z, Bass L, Saravanan N, Greenfest-Allen E, Kirsch M, Cantwell L, Iqbal T, Wheeler NR, Farrell JJ, Zhu C, Turner SL, Gunasekaran TI, Mena PR, Jin Y, Carter L; Alzheimer's Disease Sequencing Project; Zhang X, Vardarajan BN, Toga A, Cuccaro M, Hohman TJ, Bush WS, Naj AC, Martin E, Dalgard CL, Kunkle BW, Farrer LA, Mayeux RP, Haines JL, Pericak-Vance MA, Schellenberg GD, Wang LS. Leung YY, et al. Alzheimers Dement. 2025 May;21(5):e70237. doi: 10.1002/alz.70237. Alzheimers Dement. 2025. PMID: 40407102 Free PMC article.

References

    1. Monzel AS, Enríquez JA, Picard M (2023) Multifaceted mitochondria: Moving mitochondrial science beyond function and dysfunction. Nat Metab 5, 546–562. - PMC - PubMed
    1. Swerdlow RH, Burns JM, Khan SM (2014) The Alzheimer’s disease mitochondrial cascade hypothesis: Progress and perspectives. Biochim Biophys Acta Mol Basis Dis 1842, 1219–1231. - PMC - PubMed
    1. Stewart JB, Chinnery PF (2015) The dynamics of mitochondrial DNA heteroplasmy: Implications for human health and disease. Nat Rev Genet 16, 530–542. - PubMed
    1. Yana MH, Wang X, Zhu X (2013) Mitochondrial defects and oxidative stress in Alzheimer disease and Parkinson disease. Free Radic Biol Med 62, 90–101. - PMC - PubMed
    1. Coskun PE, Beal MF, Wallace DC (2004) Alzheimer’s brains harbor somatic mtDNA control-region mutations that suppress mitochondrial transcription and replication. Proc Natl Acad Sci U S A 101, 10726–10731. - PMC - PubMed

LinkOut - more resources