Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 1;21(6):2052-2065.
doi: 10.1093/bib/bbz126.

Systematic evaluation of differential splicing tools for RNA-seq studies

Affiliations

Systematic evaluation of differential splicing tools for RNA-seq studies

Arfa Mehmood et al. Brief Bioinform. .

Abstract

Differential splicing (DS) is a post-transcriptional biological process with critical, wide-ranging effects on a plethora of cellular activities and disease processes. To date, a number of computational approaches have been developed to identify and quantify differentially spliced genes from RNA-seq data, but a comprehensive intercomparison and appraisal of these approaches is currently lacking. In this study, we systematically evaluated 10 DS analysis tools for consistency and reproducibility, precision, recall and false discovery rate, agreement upon reported differentially spliced genes and functional enrichment. The tools were selected to represent the three different methodological categories: exon-based (DEXSeq, edgeR, JunctionSeq, limma), isoform-based (cuffdiff2, DiffSplice) and event-based methods (dSpliceType, MAJIQ, rMATS, SUPPA). Overall, all the exon-based methods and two event-based methods (MAJIQ and rMATS) scored well on the selected measures. Of the 10 tools tested, the exon-based methods performed generally better than the isoform-based and event-based methods. However, overall, the different data analysis tools performed strikingly differently across different data sets or numbers of samples.

Keywords: RNA-seq; differential splicing; event-based methods; exon-based methods; isoform-based methods; splicing events.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic illustration of the comparison of DS tools. In total, 10 different tools were assessed in four different RNA-seq data sets.
Figure 2
Figure 2
Median and standard error of the number of detections, precision, recall and FDR of the 10 compared methods in PCa and HCa data sets with different numbers of replicates. Number of DS genes in the (A) PCa and (B) HCa data set. Precision in the (C) PCa and (D) HCa data set. Recall in the (E) PCa and (F) HCa data set. FDR in the (G) PCa and (H) HCa data set. The points and error bars correspond to the median and the standard error of the 10 randomly sampled subsets for the different numbers of replicates.
Figure 3
Figure 3
Similarity between the methods in the complete PCa and HCa data sets. Overlap of top 500 ranked DS genes between the methods in the (A) PCa and (B) HCa data set. Genes were ranked based on the FDR in all methods except for DiffSplice, which provided its own test statistic to rank the genes instead. dSpliceType was excluded from this comparison as it allowed listing genes only until FDR of 0.05 which provided less than 500 genes in both data sets.
Figure 4
Figure 4
Heatmap of the P-values of the top enriched GO biological processes across the methods in the complete (A) PCa and (B) HCa data sets. Grey colour represents missing values.
Figure 5
Figure 5
Memory usage and run time of the methods with different numbers of replicates in HCa data set. (A) Run time and (B) maximum memory required, as measured by Linux process accounting tool acct. Values are on log10 scale.

Similar articles

Cited by

References

    1. Keren H, Lev-Maor G, Ast G. Alternative splicing and evolution: diversification, exon definition and function. Nat Rev Genet 2010;11:345–55. - PubMed
    1. Pan Q, Shai O, Lee LJ, et al. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet 2008;40:1413–5. - PubMed
    1. Wang ET, Sandberg R, Luo S, et al. Alternative isoform regulation in human tissue transcriptomes. Nature 2008;456:470–6. - PMC - PubMed
    1. Chen L. Statistical and computational methods for high-throughput sequencing data analysis of alternative splicing. Stat Biosci 2013;5:138–55. - PMC - PubMed
    1. Sveen A, Kilpinen S, Ruusulehto A, et al. Aberrant RNA splicing in cancer; expression changes and driver mutations of splicing factor genes. Oncogene 2016;35:2413–27. - PubMed

Publication types

Substances