Fine mapping of RNA isoform diversity using an innovative targeted long-read RNA sequencing protocol with novel dedicated bioinformatics pipeline
- PMID: 39350015
- PMCID: PMC11440762
- DOI: 10.1186/s12864-024-10741-0
Fine mapping of RNA isoform diversity using an innovative targeted long-read RNA sequencing protocol with novel dedicated bioinformatics pipeline
Abstract
Background: Solving the structure of mRNA transcripts is a major challenge for both research and molecular diagnostic purposes. Current approaches based on short-read RNA sequencing and RT-PCR techniques cannot fully explore the complexity of transcript structure. The emergence of third-generation long-read sequencing addresses this problem by solving this sequence directly. However, genes with low expression levels are difficult to study with the whole transcriptome sequencing approach. To fix this technical limitation, we propose a novel method to capture transcripts of a gene panel using a targeted enrichment approach suitable for Pacific Biosciences and Oxford Nanopore Technologies platforms.
Results: We designed a set of probes to capture transcripts of a panel of genes involved in hereditary breast and ovarian cancer syndrome. We present SOSTAR (iSofOrmS annoTAtoR), a versatile pipeline to assemble, quantify and annotate isoforms from long read sequencing using a new tool specially designed for this application. The significant enrichment of transcripts by our capture protocol, together with the SOSTAR annotation, allowed the identification of 1,231 unique transcripts within the gene panel from the eight patients sequenced. The structure of these transcripts was annotated with a resolution of one base relative to a reference transcript. All major alternative splicing events of the BRCA1 and BRCA2 genes described in the literature were found. Complex splicing events such as pseudoexons were correctly annotated. SOSTAR enabled the identification of abnormal transcripts in the positive controls. In addition, a case of unexplained inheritance in a family with a history of breast and ovarian cancer was solved by identifying an SVA retrotransposon in intron 13 of the BRCA1 gene.
Conclusions: We have validated a new protocol for the enrichment of transcripts of interest using probes adapted to the ONT and PacBio platforms. This protocol allows a complete description of the alternative structures of transcripts, the estimation of their expression and the identification of aberrant transcripts in a single experiment. This proof-of-concept opens new possibilities for RNA structure exploration in both research and molecular diagnostics.
Keywords: Automatic annotation; HBOC; Isoform assembly; Long read sequencing; RNA splicing.
© 2024. The Author(s).
Conflict of interest statement
All authors except N.S. and D.B. declare that they have no competing interests. N.S. is employed by SeqOne Genomics for the time period October 2020 to present in the context of a public-private PhD project (CIFRE fellowship #2020/0103) partnership between INSERM and SeqOne Genomics. D.B. is employed by SeqOne Genomics as Head Bioinformatics.
Figures







References
-
- Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ. ‘Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing’, Nat. Genet., vol. 40, no. 12, pp. 1413–1415, Dec. 2008, 10.1038/ng.259 - PubMed
-
- Navaratnam DS, Bell TJ, Tu TD, Cohen EL, Oberholtzer JC. ‘Differential Distribution of Ca2+-Activated K + Channel Splice Variants among Hair Cells along the Tonotopic Axis of the Chick Cochlea’, Neuron, vol. 19, no. 5, pp. 1077–1085, Nov. 1997, 10.1016/S0896-6273(00)80398-0 - PubMed
-
- Rosenblatt KP, Sun Z-P, Heller S, Hudspeth AJ. ‘Distribution of Ca2+-Activated K + Channel Isoforms along the Tonotopic Gradient of the Chicken’s Cochlea’, Neuron, vol. 19, no. 5, pp. 1061–1075, Nov. 1997, 10.1016/S0896-6273(00)80397-9 - PubMed
-
- Bonnal SC, López-Oreja I, Valcárcel J. Roles and mechanisms of alternative splicing in cancer — implications for care. Nat Rev Clin Oncol. 2020;17. 10.1038/s41571-020-0350-x. 8, Art. 8, Aug. - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous