Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Nov 20;34(11):1701-1718.
doi: 10.1101/gr.279168.124.

Leveraging the power of long reads for targeted sequencing

Affiliations
Review

Leveraging the power of long reads for targeted sequencing

Shruti V Iyer et al. Genome Res. .

Abstract

Long-read sequencing technologies have improved the contiguity and, as a result, the quality of genome assemblies by generating reads long enough to span and resolve complex or repetitive regions of the genome. Several groups have shown the power of long reads in detecting thousands of genomic and epigenomic features that were previously missed by short-read sequencing approaches. While these studies demonstrate how long reads can help resolve repetitive and complex regions of the genome, they also highlight the throughput and coverage requirements needed to accurately resolve variant alleles across large populations using these platforms. At the time of this review, whole-genome long-read sequencing is more expensive than short-read sequencing on the highest throughput short-read instruments; thus, achieving sufficient coverage to detect low-frequency variants (such as somatic variation) in heterogenous samples remains challenging. Targeted sequencing, on the other hand, provides the depth necessary to detect these low-frequency variants in heterogeneous populations. Here, we review currently used and recently developed targeted sequencing strategies that leverage existing long-read technologies to increase the resolution with which we can look at nucleic acids in a variety of biological contexts.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Long-range PCR enrichment. Primers are designed to flank ROI. PCR can be carried out as single reactions for single targets or with multiple targets in a single PCR reaction. Amplified targets can be optionally size-selected via gel if the target size is known. Amplicons are pooled together before library preparation (prep). (Created with BioRender; https://www.biorender.com/)
Figure 2.
Figure 2.
Hybridization-based capture. Biotinylated DNA or RNA guides are designed to be complementary to the ROI. The DNA is fragmented to ∼10 kb and amplified if more mass is needed. Next, the probes bind to the denatured DNA. The probe–ROI complex undergoes a bead-based pulldown to separate the target regions from the rest of the genome. The enriched fragments are amplified and size-selected to maintain the target length. The amplicons are then prepared for long-read sequencing. (Created with BioRender; https://www.biorender.com/)
Figure 3.
Figure 3.
In-gel Cas9-cleavage and target-specific electrophoresis purification. Cells are embedded in agarose and lysed in gel, maintaining DNA fragment length. Cas9 cleavage is carried out in gel using guides specific to the ROI. PFGE is used to separate the target(s) from background DNA based on size, which is known a priori. The purified target is then prepared for sequencing on either of the long-read platforms using appropriate adapters/kits. (Created with BioRender; https://www.biorender.com/)
Figure 4.
Figure 4.
nCATS. DNA is dephosphorylated to prevent sequencing adapter ligation. Cas9 RNPs with guides specific to the ROI are used to cleave the DNA upstream and downstream from the targets. This exposes phosphate groups at the ends of the target strands to which sequencing adapters are then ligated. Targets are, therefore, preferentially sequenced from a sequencing pool consisting of adapter-bound targets and dephosphorylated nontarget DNA. (Created with BioRender; https://www.biorender.com/)
Figure 5.
Figure 5.
Single cut and read-out approaches. Dephosphorylated DNA is split into two separate reactions. crRNA guides are designed in a strand-directed manner with separate guide pools prepared for guides cutting upstream versus downstream from each target. Upstream and downstream guide pools are then used to cleave dephosphorylated DNA in separate reactions. After Cas9 cleavage, both reactions are pooled together, and sequencing adapters are ligated. The prepared library is loaded on nanopore flowcells and sequenced. (Created with BioRender; https://www.biorender.com/)
Figure 6.
Figure 6.
Approaches with background reduction. (A) NE and CaBagE. DNA is cleaved upstream and downstream from the ROI (dashed lines represent cut sites) using target-specific crRNPs. Immediately after Cas9 cleavage, 5′ and 3′ exonucleases are used to digest background DNA while target ends are protected by the bound Cas9. Heat incubation is used to dissociate Cas9 from the targets and inactivate the exonucleases before sequencing adapter ligation. The prepared library is loaded on nanopore flowcells and sequenced. (B) Affinity-based Cas9-mediated enrichment (ACME). DNA is dephosphorylated to prevent sequencing adapter ligation and Cas9 RNPs with guides specific to the ROI are used to cleave the DNA upstream and downstream from the target(s). After cleavage, Cas9 remains bound to the nontarget side of the cut sites (PAM-distal end). The Cas9 enzyme has a C-terminal 6 Histidine Tag. HisTag Dynabeads are used to pull down Cas9 and the nontarget fragments bound to it from the sequencing pool. Adapters are then ligated to the exposed phosphate groups at the ends of the target strand(s). The prepared library is sequenced on ONT flowcells. (Created with BioRender; https://www.biorender.com/)
Figure 7.
Figure 7.
Amplification-free targeted sequencing on the PacBio platform (PacBio No-Amp). DNA is dephosphorylated to prevent sequencing adapter ligation. Cas9 RNPs with guides specific to the ROI are used to cleave the DNA upstream and downstream from the target(s). Sequencing adapters are ligated to the cleaved products. Since no background reduction has been performed yet, nontarget strands protected by Cas9 on both ends will also likely end up with SMRTbell adapters. Exonucleases are introduced to digest the rest of the background DNA. Only those fragments with SMRTbells attached on both sides survive the exonuclease digestion and make up the sequencing pool that is loaded on to PacBio flowcells. (Created with BioRender; https://www.biorender.com/)

Similar articles

Cited by

  • Long-read sequencing for diagnosis of genetic myopathies.
    Yeow D, Rudaks LI, Davis R, Ng K, Ghaoui R, Cheong PL, Ravenscroft G, Kennerson M, Deveson I, Kumar KR. Yeow D, et al. BMJ Neurol Open. 2025 May 11;7(1):e000990. doi: 10.1136/bmjno-2024-000990. eCollection 2025. BMJ Neurol Open. 2025. PMID: 40357124 Free PMC article. Review.
  • Unraveling the hidden complexity of cancer through long-read sequencing.
    Li Q, Keskus AG, Wagner J, Izydorczyk MB, Timp W, Sedlazeck FJ, Klein AP, Zook JM, Kolmogorov M, Schatz MC. Li Q, et al. Genome Res. 2025 Apr 14;35(4):599-620. doi: 10.1101/gr.280041.124. Genome Res. 2025. PMID: 40113261 Free PMC article. Review.
  • Early Genetic Evolution of Driver Mutations in Uveal Melanoma.
    Dollar JJ, Decatur CL, Weis E, Schefler AC, Materin MA, Fuller TS, Skalet AH, Reichstein DA, Kim I, Piggott KD, Demirci H, Aaberg TA Jr, Mruthyunjaya P, Williams BK Jr, Shildkrot E, Oliver SCN, Char DH, Capone A Jr, Mason J 3rd, Walter SD, Altaweel M, Wells J, Gombos DS, Duker J, Hovland P, Tsai T, Javid C, Durante MA, Covington KR, Zhang S, Correa ZM, Harbour JW. Dollar JJ, et al. medRxiv [Preprint]. 2025 Jun 12:2025.06.10.25329358. doi: 10.1101/2025.06.10.25329358. medRxiv. 2025. PMID: 40585173 Free PMC article. Preprint.
  • Targeted long-read sequencing to quantify methylation of the C9orf72 repeat expansion.
    Udine E, Finch NA, DeJesus-Hernandez M, Jackson JL, Baker MC, Saravanaperumal SA, Wieben E, Ebbert MTW, Shah J, Petrucelli L, Rademakers R, Oskarsson B, van Blitterswijk M. Udine E, et al. Mol Neurodegener. 2024 Dec 21;19(1):99. doi: 10.1186/s13024-024-00790-0. Mol Neurodegener. 2024. PMID: 39709476 Free PMC article.
  • Long-Read Sequencing and Structural Variant Detection: Unlocking the Hidden Genome in Rare Genetic Disorders.
    Moustakli E, Christopoulos P, Potiris A, Zikopoulos A, Mavrogianni D, Karampas G, Kathopoulis N, Anagnostaki I, Domali E, Tzallas AT, Drakakis P, Stavros S. Moustakli E, et al. Diagnostics (Basel). 2025 Jul 17;15(14):1803. doi: 10.3390/diagnostics15141803. Diagnostics (Basel). 2025. PMID: 40722552 Free PMC article. Review.

References

    1. Adamopoulos PG, Tsiakanikas P, Boti MA, Scorilas A. 2021. Targeted long-read sequencing decodes the transcriptional atlas of the founding RAS gene family members. Int J Mol Sci 22: 13298. 10.3390/ijms222413298 - DOI - PMC - PubMed
    1. Adli M. 2018. The CRISPR tool kit for genome editing and beyond. Nat Commun 9: 1911. 10.1038/s41467-018-04252-2 - DOI - PMC - PubMed
    1. Aganezov S, Yan SM, Soto DC, Kirsche M, Zarate S, Avdeyev P, Taylor DJ, Shafin K, Shumate A, Xiao C, et al. 2022. A complete reference genome improves analysis of human genetic variation. Science 376: eabl3533. 10.1126/science.abl3533 - DOI - PMC - PubMed
    1. Albert TJ, Molla MN, Muzny DM, Nazareth L, Wheeler D, Song X, Richmond TA, Middle CM, Rodesch MJ, Packard CJ, et al. 2007. Direct selection of human genomic loci by microarray hybridization. Nat Methods 4: 903–905. 10.1038/nmeth1111 - DOI - PubMed
    1. Albrecht V, Zweiniger C, Surendranath V, Lang K, Schöfl G, Dahl A, Winkler S, Lange V, Böhme I, Schmidt AH. 2017. Dual redundant sequencing strategy: full-length gene characterisation of 1056 novel and confirmatory HLA alleles. HLA 90: 79–87. 10.1111/tan.13057 - DOI - PMC - PubMed

LinkOut - more resources