Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Mar 26:2024.03.22.24304633.
doi: 10.1101/2024.03.22.24304633.

Long-read genome sequencing and variant reanalysis increase diagnostic yield in neurodevelopmental disorders

Affiliations

Long-read genome sequencing and variant reanalysis increase diagnostic yield in neurodevelopmental disorders

Susan M Hiatt et al. medRxiv. .

Update in

Abstract

Variant detection from long-read genome sequencing (lrGS) has proven to be considerably more accurate and comprehensive than variant detection from short-read genome sequencing (srGS). However, the rate at which lrGS can increase molecular diagnostic yield for rare disease is not yet precisely characterized. We performed lrGS using Pacific Biosciences "HiFi" technology on 96 short-read-negative probands with rare disease that were suspected to be genetic. We generated hg38-aligned variants and de novo phased genome assemblies, and subsequently annotated, filtered, and curated variants using clinical standards. New disease-relevant or potentially relevant genetic findings were identified in 16/96 (16.7%) probands, eight of which (8/96, 8.33%) harbored pathogenic or likely pathogenic variants. Newly identified variants were visible in both srGS and lrGS in nine probands (~9.4%) and resulted from changes to interpretation mostly from recent gene-disease association discoveries. Seven cases included variants that were only interpretable in lrGS, including copy-number variants, an inversion, a mobile element insertion, two low-complexity repeat expansions, and a 1 bp deletion. While evidence for each of these variants is, in retrospect, visible in srGS, they were either: not called within srGS data, were represented by calls with incorrect sizes or structures, or failed quality-control and filtration. Thus, while reanalysis of older data clearly increases diagnostic yield, we find that lrGS allows for substantial additional yield (7/96, 7.3%) beyond srGS. We anticipate that as lrGS analysis improves, and as lrGS datasets grow allowing for better variant frequency annotation, the additional lrGS-only rare disease yield will grow over time.

Keywords: Clinical sequencing; LRS Special Issue; Long read sequencing; neurodevelopmental disorder; repeat expansion; structural variation.

PubMed Disclaimer

Conflict of interest statement

COMPETING INTEREST STATEMENT The authors declare no competing interests.

Figures

Figure 1.
Figure 1.. A de novo, 4 Mb paracentric inversion in proband 1, affecting ZBTB20.
A, B. Visualization of a subset of proband and parent reads in IGV at the 5’ (A) and 3’ (B) breakpoints (black arrowheads) indicate a de novo event. C. Alignment of the proband’s assembled paternal contig versus the reference genome supports the inversion. D. Visualization of the inverted region (highlighted in light blue) in the UCSC browser shows the inversion spans 35 protein-coding genes and likely disrupts the ZBTB20 gene (dark blue bar).
Figure 2.
Figure 2.. Two ALS2 deletions in trans in Proband 2.
A. Visualization of proband and parent reads in IGV indicate two overlapping deletions in ALS2; a smaller maternal deletion (pink bar) and a larger paternal deletion (blue bar/arrow). Alignment of the proband’s assembled maternal (B) or paternal contig (C) versus the reference genome support the two deletions (red dashed lines).
Figure 3.
Figure 3.. Proband 3 has a 4 kb insertion in the 3’ UTR of HCFC1.
A. The proband’s family has a history of X-linked intellectual disability, as the proband (not shown) and two other male relatives (gray squares) are affected. B. Model of the relative length of the insertion in the 3’ UTR. C. The insertion is likely inherited from a heterozygous carrier mother, as indicated by srGS reads.

References

    1. Amiel J, Laudier B, Attié-Bitach T, Trang H, De Pontual L, Gener B, Trochet D, Etchevers H, Ray P, Simonneau M, et al. 2003. Polyalanine expansion and frameshift mutations of the paired-like homeobox gene PHOX2B in congenital central hypoventilation syndrome. Nat Genet 33: 459–461. https://pubmed.ncbi.nlm.nih.gov/12640453/ (Accessed February 5, 2024). - PubMed
    1. Aref-Eshghi E, Kerkhof J, Pedro VP, France G DI, Barat-Houari M, Ruiz-Pallares N, Andrau JC, Lacombe D, Van-Gils J, Fergelot P, et al. 2021. Evaluation of DNA Methylation Episignatures for Diagnosis and Phenotype Correlations in 42 Mendelian Neurodevelopmental Disorders. Am J Hum Genet 108: 1161–1163. https://pubmed.ncbi.nlm.nih.gov/34087165/ (Accessed February 27, 2024). - PMC - PubMed
    1. Audet S, Triassi V, Gelinas M, Legault-Cadieux N, Ferraro V, Duquette A, Tetreault M. 2023. Integration of multi-omics technologies for molecular diagnosis in ataxia patients. Front Genet 14: 1304711. http://www.ncbi.nlm.nih.gov/pubmed/38239855 (Accessed February 13, 2024). - PMC - PubMed
    1. Bamshad MJ, Nickerson DA, Chong JX. 2019. Mendelian Gene Discovery: Fast and Furious with No End in Sight. Am J Hum Genet 105: 448–455. https://pubmed.ncbi.nlm.nih.gov/31491408/ (Accessed July 19, 2022). - PMC - PubMed
    1. Baxter SM, Posey JE, Lake NJ, Sobreira N, Chong JX, Buyske S, Blue EE, Chadwick LH, Coban-Akdemir ZH, Doheny KF, et al. 2022. Centers for Mendelian Genomics: A decade of facilitating gene discovery. Genet Med 24: 784–797. https://pubmed.ncbi.nlm.nih.gov/35148959/ (Accessed April 16, 2023). - PMC - PubMed

Publication types