Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 10;21(1):378.
doi: 10.1186/s12967-023-04243-y.

Implementation of Nanopore sequencing as a pragmatic workflow for copy number variant confirmation in the clinic

Affiliations

Implementation of Nanopore sequencing as a pragmatic workflow for copy number variant confirmation in the clinic

Stephanie U Greer et al. J Transl Med. .

Abstract

Background: Diagnosis of rare genetic diseases can be a long, expensive and complex process, involving an array of tests in the hope of obtaining an actionable result. Long-read sequencing platforms offer the opportunity to make definitive molecular diagnoses using a single assay capable of detecting variants, characterizing methylation patterns, resolving complex rearrangements, and assigning findings to long-range haplotypes. Here, we demonstrate the clinical utility of Nanopore long-read sequencing by validating a confirmatory test for copy number variants (CNVs) in neurodevelopmental disorders and illustrate the broader applications of this platform to assess genomic features with significant clinical implications.

Methods: We used adaptive sampling on the Oxford Nanopore platform to sequence 25 genomic DNA samples and 5 blood samples collected from patients with known or false-positive copy number changes originally detected using short-read sequencing. Across the 30 samples (a total of 50 with replicates), we assayed 35 known unique CNVs (a total of 55 with replicates) and one false-positive CNV, ranging in size from 40 kb to 155 Mb, and assessed the presence or absence of suspected CNVs using normalized read depth.

Results: Across 50 samples (including replicates) sequenced on individual MinION flow cells, we achieved an average on-target mean depth of 9.5X and an average on-target read length of 4805 bp. Using a custom read depth-based analysis, we successfully confirmed the presence of all 55 known CNVs (including replicates) and the absence of one false-positive CNV. Using the same CNV-targeted data, we compared genotypes of single nucleotide variant loci to verify that no sample mix-ups occurred between assays. For one case, we also used methylation detection and phasing to investigate the parental origin of a 15q11.2-q13 duplication with implications for clinical prognosis.

Conclusions: We present an assay that efficiently targets genomic regions to confirm clinically relevant CNVs with a concordance rate of 100%. Furthermore, we demonstrate how integration of genotype, methylation, and phasing data from the Nanopore sequencing platform can potentially simplify and shorten the diagnostic odyssey.

Keywords: Adaptive sampling; Clinical testing; Copy number variants; Genome analysis; Long-read sequencing; Neurodevelopmental disorders; Oxford Nanopore Technologies; Targeted sequencing.

PubMed Disclaimer

Conflict of interest statement

S.U.G, J.B., D.H., P.S., M.R., K.I., and A.K. are either current or previous employees of MyOme, and B.L. and D.E.M. are either current or previous consultants with MyOme. D.E.M is engaged in a research agreement with Oxford Nanopore, and they have paid for him to travel to speak on their behalf. A provisional patent application has been filed on the route of sample identification described in this study.

Figures

Fig. 1
Fig. 1
Overview of the Oxford Nanopore Technologies (ONT) copy number variant (CNV) confirmation assay. CNV coordinates are initially detected with short-read sequencing and then targeted and confirmed using ONT adaptive sampling along with a custom bioinformatics pipeline
Fig. 2
Fig. 2
Examples of copy number variants (CNVs) identified by our pipeline. A Scatter plot of the regional mean depth for a Coriell sample (NA04099/SM4716) known to carry a deletion. A point was plotted for each of the five control regions, the two pad regions, and the deleted region. Each plotted point denotes the mean depth of the region normalized by either the mean depth across all control regions (left side of plot) or the mean depth across all control and pad regions (right side of plot). For each normalization approach, dashed lines are plotted to indicate three standard deviations from the mean of the normalized control regions (gray) and the normalized control and pad regions (green). B Read depth plot for a Coriell sample (NA04099/SM4716) known to carry a deletion. The mean depth in the deleted region was normalized by the mean depth across all control regions, calculated in non-overlapping windows that were 1% of the target size. The red dashed line indicates the mean depth ratio across the deleted region, and the green dashed lines indicate the mean depth ratios of the pad regions directly adjacent to the deleted region. C Scatter plot of the regional mean depth for a blood sample (Individual_5/SM7419) from a healthy individual known to carry a duplication. D Read depth plot for a blood sample (Individual_5/SM7419) from a healthy individual known to carry a duplication
Fig. 3
Fig. 3
Cutoff thresholds for copy number variant (CNV) confirmation. A Boxplot displays of the cutoffs used to confirm deletions and duplications across 50 samples and 56 CNVs (including the false-positive CNV). The cutoff thresholds for CNVs were set at three standard deviations from the mean of either the mean depth ratios of the five control regions or the mean depth ratios of the five control regions and two pad regions, whichever was larger or smaller for deletions and duplications, respectively. ***, p < 0.001. The dashed gray line denotes a mean depth ratio of 1, which would be the expected mean depth ratio for a region not affected by a CNV. B Scatter plot of the difference between the CNV mean depth ratio and its dynamically determined cutoff for each of the 56 deletion, duplication, or control variants. The difference for deletions was calculated as the CNV depth ratio subtracted from the deletion depth ratio cutoff, and the difference for duplications was calculated as the duplication depth ratio cutoff subtracted from the CNV depth ratio, such that confirmed CNVs appear above zero, denoted by the red dashed line
Fig. 4
Fig. 4
Scatter plot of the mean copy number variant (CNV) depth ratios for 56 CNVs (including the false-positive CNV), with differing expected copy numbers, across a range of mean control depths. The dashed horizontal lines indicate the expected mean depth ratio for each expected copy number
Fig. 5
Fig. 5
Heatmap of the proportion of four control regions that matched between Coriell samples sequenced previously with short-read sequencing and in the current study using long-read sequencing with adaptive sampling. The regions are matched based on single nucleotide variant content. The analysis also includes samples sequenced multiple times with long-read sequencing and short-read sequencing, as well as a sample sequenced using only short reads and thus with no long-read sequencing match
Fig. 6
Fig. 6
Integrative Genomics Viewer (IGV) snapshot of the SNRPN promoter CpG island region in Coriell sample NA22397, known to have a duplication of chromosome 15q. Reads assigned to haplotype 1 (Hap 1) are methylated (denoted by red) while reads assigned to haplotype 2 (Hap 2) are unmethylated (denoted by blue)

Similar articles

Cited by

References

    1. Zarrei M, MacDonald JR, Merico D, Scherer SW. A copy number variation map of the human genome. Nat Rev Genet. 2015;16(3):172–83. doi: 10.1038/nrg3871. - DOI - PubMed
    1. Marshall CR, Noor A, Vincent JB, Lionel AC, Feuk L, Skaug J, et al. Structural variation of chromosomes in autism spectrum disorder. Am J Hum Genet. 2008;82(2):477–88. doi: 10.1016/j.ajhg.2007.12.009. - DOI - PMC - PubMed
    1. Lee C, Scherer SW. The clinical context of copy number variation in the human genome. Expert Rev Mol Med. 2010;9(12):e8. doi: 10.1017/S1462399410001390. - DOI - PubMed
    1. Pinto D, Pagnamenta AT, Klei L, Anney R, Merico D, Regan R, et al. Functional impact of global rare copy number variation in autism spectrum disorders. Nature. 2010;466(7304):368–72. doi: 10.1038/nature09146. - DOI - PMC - PubMed
    1. Yuen C, Merico RK, Bookman D, Howe LM, Thiruvahindrapuram J, Patel B. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat Neurosci. 2017;20(4):602–11. doi: 10.1038/nn.4524. - DOI - PMC - PubMed

Publication types

LinkOut - more resources