Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Aug 16:10:709.
doi: 10.3389/fgene.2019.00709. eCollection 2019.

Getting the Entire Message: Progress in Isoform Sequencing

Affiliations
Review

Getting the Entire Message: Progress in Isoform Sequencing

Simon A Hardwick et al. Front Genet. .

Abstract

The advent of second-generation sequencing and its application to RNA sequencing have revolutionized the field of genomics by allowing quantification of gene expression, as well as the definition of transcription start/end sites, exons, splice sites and RNA editing sites. However, due to the sequencing of fragments of cDNAs, these methods have not given a reliable picture of complete RNA isoforms. Third-generation sequencing has filled this gap and allows end-to-end sequencing of entire RNA/cDNA molecules. This approach to transcriptomics has been a "niche" technology for a couple of years but now is becoming mainstream with many different applications. Here, we review the background and progress made to date in this rapidly growing field. We start by reviewing the progressive realization that alternative splicing is omnipresent. We then focus on long-noncoding RNA isoforms and the distinct combination patterns of exons in noncoding and coding genes. We consider the implications of the recent technologies of direct RNA sequencing and single-cell isoform RNA sequencing. Finally, we discuss the parameters that define the success of long-read RNA sequencing experiments and strategies commonly used to make the most of such data.

Keywords: RNA; epitranscriptome; isoforms; long-read; splicing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Progress in isoform sequencing. Timeline highlights some of the key milestones in the history of isoform sequencing, dating back to the advent of short-read RNA-seq back in 2008. Note that this is presented as a summary only and is not intended to be exhaustive of all work done in the field. RNA-seq: RNA sequencing; PacBio: Pacific Biosciences; SLR: synthetic long-read; lncRNA: long noncoding RNA; ONT: Oxford Nanopore Technologies.
Figure 2
Figure 2
Resolution of alternative splicing events with long-read sequencing. (A) Schematic illustration of the structure of a hypothetical gene undergoing alternative splicing. The gene contains two alternatively spliced exons (red and blue) separated by constitutive exons (gray). In theory, if we let ‘n’ equal the number of alternative exons, then there are 2 n different combinations of these exons. (B) Under random pairing, we would expect to see all of these 2 n combinations, each at a relative abundance of 1/2 n. In this case, short-read RNA-seq would be sufficient, as it can accurately quantify percent spliced-in (PSI) scores for individual exons. (C, D) However, coordinated exon pairing can result in a situation whereby the alternative exons are mutually exclusive (C) or mutually associated (D). (E) With short-read RNA-seq, these three scenarios are indistinguishable, as the information regarding the connectivity of the alternative exons is lost. Conversely, with long-read sequencing, it is trivial to determine which scenario is present.

References

    1. Abdel-Ghany S. E., Hamilton M., Jacobi J. L., Ngam P., Devitt N., Schilkey F., et al. (2016). A survey of the sorghum transcriptome using single-molecule long reads. Nat. Commun. 7, 11706. 10.1038/ncomms11706 - DOI - PMC - PubMed
    1. Ameur A., Zaghlool A., Halvardson J., Wetterbom A., Gyllensten U., Cavelier L., et al. (2011). Total RNA sequencing reveals nascent transcription and widespread co-transcriptional splicing in the human brain. Nat. Struct. Mol. Biol. 18, 1435–1440. 10.1038/nsmb.2143 - DOI - PubMed
    1. Anvar S. Y., Allard G., Tseng E., Sheynkman G. M., de Klerk E., Vermaat M., et al. (2018). Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing. Genome Biol. 19, 46. 10.1186/s13059-018-1418-0 - DOI - PMC - PubMed
    1. Au K. F., Sebastiano V., Afshar P. T., Durruthy J. D., Lee L., Williams B. A., et al. (2013). Characterization of the human ESC transcriptome by hybrid sequencing. . Natl. Acad. Sci. U.S.A. 110, E4821–E4830. 10.1073/pnas.1320101110. - DOI - PMC - PubMed
    1. Au K. F., Underwood J. G., Lee L., Wong W. H. (2012). Improving PacBio long read accuracy by short read alignment. PLoS One 7, e46679. 10.1371/journal.pone.0046679 - DOI - PMC - PubMed