Validation of a comprehensive long-read sequencing platform for broad clinical genetic diagnosis
- PMID: 40385982
- PMCID: PMC12082127
- DOI: 10.3389/fgene.2025.1499456
Abstract
Although short-read high-throughput sequencing, commonly known as next-generation sequencing (NGS), has revolutionized genomics and genetic testing, no single genetic test can accurately detect single nucleotide variants (SNVs), small insertions/deletions (indels), complex structural variants (SVs), repetitive genomic alterations, and variants in genes with highly homologous pseudogenes. Implementing a unified, comprehensive technique that simultaneously detects a broad spectrum of genetic variation would substantially increase the efficiency of the diagnostic process. The current study evaluated the clinical utility of long-read sequencing as a comprehensive genetic test for the diagnosis of inherited conditions. Using Oxford Nanopore Technologies long-read nanopore sequencing, we developed and validated a clinically deployable, integrated bioinformatics pipeline that combines eight publicly available variant callers. A concordance assessment compared the known variant calls for NA12878, a well-characterized benchmark sample from the National Institute of Standards and Technology (NIST), with the variants our pipeline detected in the same sample; it showed an analytical sensitivity of 98.87% and an analytical specificity exceeding 99.99%. We then evaluated our pipeline's ability to detect 167 clinically relevant variants from 72 clinical samples. This set consisted of 80 SNVs, 26 indels, 32 SVs, and 29 repeat expansions, including 14 variants in genes with highly homologous pseudogenes. The overall detection concordance for these clinically relevant variants was 99.4% (95% CI: 99.7%-99.9%). Importantly, in addition to detecting known clinically relevant variants, in four cases our pipeline yielded valuable additional information in support of clinical diagnoses that could not have been established using short-read NGS alone.
Our findings suggest that long-read sequencing is successful in identifying diverse genomic alterations and that our pipeline functions well as the basis for a single diagnostic test for patients with suspected genetic disease.
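As a rough illustration of the concordance arithmetic reported above, the sketch below tallies the four variant classes and computes a detection concordance with a 95% Wilson score interval. The abstract gives only the total (167) and the rate (99.4%); the count of 166 detected variants is an inference from those two figures, and the Wilson method is an assumption, since the abstract does not state how the published interval was derived. The function name `wilson_interval` is hypothetical, and the result is not expected to reproduce the published interval.

```python
from math import sqrt


def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 for ~95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= successes <= total:
        raise ValueError("successes must lie between 0 and total")
    p = successes / total
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half_width = z * sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return max(0.0, center - half_width), min(1.0, center + half_width)


# Variant classes and counts as listed in the abstract.
variant_counts = {"SNV": 80, "indel": 26, "SV": 32, "repeat expansion": 29}
total = sum(variant_counts.values())

# Assumption: 166 of 167 detected, the only integer count that rounds to 99.4%.
detected = total - 1
concordance = detected / total
low, high = wilson_interval(detected, total)

print(f"variants evaluated: {total}")
print(f"concordance: {concordance:.1%}")
print(f"95% Wilson interval: {low:.1%} to {high:.1%}")
```

With a single discordant call out of 167, the interval is strongly asymmetric around the point estimate, which is one reason a score-based interval is often preferred over a normal approximation for proportions this close to 100%.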
Keywords: Oxford Nanopore Technologies; Tandem repeat expansions; clinical genomics; complex structural variants; long-read sequencing; whole genome sequencing.
Copyright © 2025 Sen, Handler, Victorsen, Flaten, Ellison, Knutson, Munro, Martinez, Billington, Laffin, Bray, Mroz, Yohe, Nelson, Bower and Thyagarajan.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.