Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Sep;24(9):627-641.
doi: 10.1038/s41576-023-00600-1. Epub 2023 May 9.

Beyond assembly: the increasing flexibility of single-molecule sequencing technology

Affiliations
Review

Beyond assembly: the increasing flexibility of single-molecule sequencing technology

Paul W Hook et al. Nat Rev Genet. 2023 Sep.

Abstract

The maturation of high-throughput short-read sequencing technology over the past two decades has shaped the way genomes are studied. Recently, single-molecule, long-read sequencing has emerged as an essential tool in deciphering genome structure and function, including filling gaps in the human reference genome, measuring the epigenome and characterizing splicing variants in the transcriptome. With recent technological developments, these single-molecule technologies have moved beyond genome assembly and are being used in a variety of ways, including to selectively sequence specific loci with long reads, measure chromatin state and protein-DNA binding in order to investigate the dynamics of gene regulation, and rapidly determine copy number variation. These increasingly flexible uses of single-molecule technologies highlight a young and fast-moving part of the field that is leading to a more accessible era of nucleic acid sequencing.

PubMed Disclaimer

Conflict of interest statement

W.T. has two patents (8,748,091 and 8,394,584) licensed to ONT. W.T. has received travel funds to speak at symposia organized by ONT. P.H. declares no competing interests.

Figures

Fig. 1
Fig. 1. Long-read targeted sequencing methods.
Long-read targeted enrichment methods fall within broad categories including PCR enrichment, hybridization capture, Cas-mediated enrichment and adaptive sampling. a, PCR enrichment uses specific primers to amplify regions of interest before library preparation. b, Hybridization capture uses biotinylated antisense probes designed against regions of interest to isolate DNA fragments containing the targets. PCR and hybridization capture enrichment methods are both commonly used with short-read sequencing and have been adapted to long-read sequencing. c, Cas-mediated enrichment uses Cas ribonuclear complexes (most commonly Cas9) to cut on either side of regions of interest. Cut fragments are selectively sequenced owing to preferential adapter ligation to the freshly cut ends. Targeted fragments can be further enriched through depletion of off-target fragments. d, Enrichment using adaptive sampling is a nanopore sequencing method in which regions of interest are selectively sequenced by controlling the voltage at individual pores to eject unwanted fragments. ONT, Oxford Nanopore Technologies.
Fig. 2
Fig. 2. Long-read, single-molecule methyltransferase footprinting methods can reveal heterogeneity and coordination of chromatin states.
a, In methyltransferase footprinting assays, a methyltransferase enzyme deposits exogenous methylation on accessible DNA, which may include linker DNA between histones, open chromatin regions or regions surrounding transcription factors bound to DNA. b, When this exogenous labelling is performed on long, single molecules, the heterogeneity of nucleosome positioning, open or closed chromatin and protein–DNA binding can be measured on single molecules. c, With long molecules that span multiple regulatory elements, the coordination between adjacent sites can be measured, potentially revealing unknown aspects of gene regulation. d, Antibody-directed methyltransferase labelling builds on methyltransferase footprinting by concentrating labelling around binding sites of specific proteins. The methyltransferase is fused to protein A, protein G or both, which bind to IgG antibodies.
Fig. 3
Fig. 3. Applications of shorter-read sequencing (<5 kb) on single-molecule platforms.
a, Short reads can be quickly sequenced on portable Oxford Nanopore sequencing devices, returning real-time information about copy number variants and aneuploidy in 3 h or less. b, Primary sequence, fragment patterns and endogenous methylation can be measured simultaneously with single-molecule platforms, and that information can be used to assign reads to tissues of origin. c, Accuracy of short reads on single-molecule platforms can be improved by correcting for errors by reading the same molecule multiple times. d, The cost of sequencing short fragments on single-molecule platforms can be decreased by combining multiple different short molecules into a single, long molecule.

References

    1. Goodwin S, McPherson JD, McCombie WR. Coming of age: ten years of next-generation sequencing technologies. Nat. Rev. Genet. 2016;17:333–351. doi: 10.1038/nrg.2016.49. - DOI - PMC - PubMed
    1. Erlich Y, Mitra PP, delaBastide M, McCombie WR, Hannon GJ. Alta-Cyclic: a self-optimizing base caller for next-generation sequencing. Nat. Methods. 2008;5:679–682. doi: 10.1038/nmeth.1230. - DOI - PMC - PubMed
    1. Metzker ML. Sequencing technologies — the next generation. Nat. Rev. Genet. 2010;11:31–46. doi: 10.1038/nrg2626. - DOI - PubMed
    1. Logsdon GA, Vollger MR, Eichler EE. Long-read human genome sequencing and its applications. Nat. Rev. Genet. 2020;21:597–614. doi: 10.1038/s41576-020-0236-x. - DOI - PMC - PubMed
    1. Rhoads A, Au KF. PacBio sequencing and its applications. Genomics Proteomics Bioinformatics. 2015;13:278–289. doi: 10.1016/j.gpb.2015.08.002. - DOI - PMC - PubMed

Publication types