Long-read genome sequencing and variant reanalysis increase diagnostic yield in neurodevelopmental disorders
- PMID: 39299904
- PMCID: PMC11610584
- DOI: 10.1101/gr.279227.124
Long-read genome sequencing and variant reanalysis increase diagnostic yield in neurodevelopmental disorders
Abstract
Variant detection from long-read genome sequencing (lrGS) has proven to be more accurate and comprehensive than variant detection from short-read genome sequencing (srGS). However, the rate at which lrGS can increase molecular diagnostic yield for rare disease is not yet precisely characterized. We performed lrGS using Pacific Biosciences "HiFi" technology on 96 short-read-negative probands with rare diseases that were suspected to be genetic. We generated hg38-aligned variants and de novo phased genome assemblies, and subsequently annotated, filtered, and curated variants using clinical standards. New disease-relevant or potentially relevant genetic findings were identified in 16/96 (16.7%) probands, nine of which (8/96, ∼9.4%) harbored pathogenic or likely pathogenic variants. Nine probands (∼9.4%) had variants that were accurately called in both srGS and lrGS and represent changes to clinical interpretation, mostly from recently published gene-disease associations. Seven cases included variants that were only correctly interpreted in lrGS, including copy-number variants (CNVs), an inversion, a mobile element insertion, two low-complexity repeat expansions, and a 1 bp deletion. While evidence for each of these variants is, in retrospect, visible in srGS, they were either not called within srGS data, were represented by calls with incorrect sizes or structures, or failed quality control and filtration. Thus, while reanalysis of older srGS data clearly increases diagnostic yield, we find that lrGS allows for substantial additional yield (7/96, 7.3%) beyond srGS. We anticipate that as lrGS analysis improves, and as lrGS data sets grow allowing for better variant-frequency annotation, the additional lrGS-only rare disease yield will grow over time.
© 2024 Hiatt et al.; Published by Cold Spring Harbor Laboratory Press.
Figures



Update of
-
Long-read genome sequencing and variant reanalysis increase diagnostic yield in neurodevelopmental disorders.medRxiv [Preprint]. 2024 Mar 26:2024.03.22.24304633. doi: 10.1101/2024.03.22.24304633. medRxiv. 2024. Update in: Genome Res. 2024 Nov 20;34(11):1747-1762. doi: 10.1101/gr.279227.124. PMID: 38585854 Free PMC article. Updated. Preprint.
References
-
- Amiel J, Laudier B, Attié-Bitach T, Trang H, De Pontual L, Gener B, Trochet D, Etchevers H, Ray P, Simonneau M, et al. 2003. Polyalanine expansion and frameshift mutations of the paired-like homeobox gene PHOX2B in congenital central hypoventilation syndrome. Nat Genet 33: 459–461. 10.1038/ng1130 - DOI - PubMed
-
- Aref-Eshghi E, Kerkhof J, Pedro VP, DI France G, Barat-Houari M, Ruiz-Pallares N, Andrau JC, Lacombe D, Van-Gils J, Fergelot P, et al. 2021. Evaluation of DNA methylation episignatures for diagnosis and phenotype correlations in 42 Mendelian neurodevelopmental disorders. Am J Hum Genet 108: 1161–1163. 10.1016/j.ajhg.2021.04.022 - DOI - PMC - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources