Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures
- PMID: 37783886
- DOI: 10.1038/s41592-023-02026-3
Abstract
The lack of benchmark data sets with inbuilt ground truth makes it challenging to compare the performance of existing long-read isoform detection and differential expression analysis workflows. Here, we present a benchmark experiment using two human lung adenocarcinoma cell lines that were each profiled in triplicate together with synthetic, spliced, spike-in RNAs (sequins). Samples were deeply sequenced on both Illumina short-read and Oxford Nanopore Technologies long-read platforms. Alongside the ground truth available via the sequins, we created in silico mixture samples to allow performance assessment in the absence of true positives or true negatives. Our results show that StringTie2 and bambu outperformed the other tools among the six isoform detection tools tested; DESeq2, edgeR and limma-voom were best among the five differential transcript expression tools tested; and there was no clear front-runner among the five tools compared for differential transcript usage analysis, which suggests that further methods development is needed for this application.
© 2023. The Author(s), under exclusive licence to Springer Nature America, Inc.
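The abstract does not describe how the in silico mixtures were built, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes that a mixture is formed by subsampling reads from two pure samples at a chosen proportion and pooling them, so that the expected composition of the mixture is known by construction. The function name `in_silico_mixture`, the read identifiers, and the 75:25 proportion are all hypothetical and chosen for illustration.

```python
import random


def in_silico_mixture(reads_a, reads_b, proportion_a, total_reads, seed=0):
    """Pool reads from two pure samples at a fixed proportion.

    Hypothetical helper: draws round(total_reads * proportion_a) reads from
    sample A and the remainder from sample B, both without replacement.
    """
    if not 0.0 <= proportion_a <= 1.0:
        raise ValueError("proportion_a must lie between 0 and 1")

    n_a = round(total_reads * proportion_a)
    n_b = total_reads - n_a
    if n_a > len(reads_a) or n_b > len(reads_b):
        raise ValueError("not enough reads in one of the pure samples")

    rng = random.Random(seed)  # fixed seed keeps the mixture reproducible
    mixture = rng.sample(reads_a, n_a) + rng.sample(reads_b, n_b)
    rng.shuffle(mixture)  # remove the A-then-B ordering
    return mixture


# Illustrative read identifiers standing in for two pure cell-line samples.
reads_a = [f"A_read{i}" for i in range(1000)]
reads_b = [f"B_read{i}" for i in range(1000)]

# A 75:25 mixture of 200 reads (proportion chosen only as an example).
mix = in_silico_mixture(reads_a, reads_b, proportion_a=0.75, total_reads=200, seed=1)
n_from_a = sum(read.startswith("A_") for read in mix)
print(len(mix), n_from_a)
```

Because the mixing proportion is fixed in advance, the expected abundance of any transcript in the mixture is a weighted combination of its abundance in the two pure samples, which gives a reference point for judging tools even when no individual transcript is known to be truly changed or unchanged.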
Similar articles
- The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools. NAR Genom Bioinform. 2021 Apr 26;3(2):lqab028. doi: 10.1093/nargab/lqab028. eCollection 2021 Jun. PMID: 33937765. Free PMC article.
- Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data. Nat Commun. 2024 May 10;15(1):3972. doi: 10.1038/s41467-024-48117-3. PMID: 38730241. Free PMC article.
- A benchmarking of workflows for detecting differential splicing and differential expression at isoform level in human RNA-seq studies. Brief Bioinform. 2019 Mar 22;20(2):471-481. doi: 10.1093/bib/bbx122. PMID: 29040385.
- Long-Read Sequencing - A Powerful Tool in Viral Transcriptome Research. Trends Microbiol. 2019 Jul;27(7):578-592. doi: 10.1016/j.tim.2019.01.010. Epub 2019 Feb 26. PMID: 30824172. Review.
- Comparative assessment of long-read error correction software applied to Nanopore RNA-sequencing data. Brief Bioinform. 2020 Jul 15;21(4):1164-1181. doi: 10.1093/bib/bbz058. PMID: 31232449. Review.
Cited by
- RNApysoforms: Fast rendering interactive visualization of RNA isoform structure and expression in Python. bioRxiv [Preprint]. 2024 Nov 8:2024.11.06.622357. doi: 10.1101/2024.11.06.622357. Update in: Bioinform Adv. 2025 Mar 14;5(1):vbaf057. doi: 10.1093/bioadv/vbaf057. PMID: 39574649. Free PMC article. Preprint.
- Using deep long-read RNAseq in Alzheimer's disease brain to assess medical relevance of RNA isoform diversity. bioRxiv [Preprint]. 2023 Dec 11:2023.08.06.552162. doi: 10.1101/2023.08.06.552162. Update in: Nat Biotechnol. 2025 Apr;43(4):635-646. doi: 10.1038/s41587-024-02245-9. PMID: 37609156. Free PMC article. Preprint.
- Long-read RNA sequencing unveils a novel cryptic exon in MNAT1 along with its full-length transcript structure in TDP-43 proteinopathy. Commun Biol. 2025 Jul 16;8(1):1056. doi: 10.1038/s42003-025-08463-4. PMID: 40670663. Free PMC article.
- Improving gene isoform quantification with miniQuant. Nat Biotechnol. 2025 Jun 3. doi: 10.1038/s41587-025-02633-9. Online ahead of print. PMID: 40461779. Free PMC article.
- Benchmarking transcriptome deconvolution methods for estimating tissue- and cell-type-specific extracellular vesicle abundances. J Extracell Vesicles. 2024 Sep;13(9):e12511. doi: 10.1002/jev2.12511. PMID: 39320021. Free PMC article.