Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 3;39(4):btad179.
doi: 10.1093/bioinformatics/btad179.

SpliceAI-10k calculator for the prediction of pseudoexonization, intron retention, and exon deletion

Affiliations

SpliceAI-10k calculator for the prediction of pseudoexonization, intron retention, and exon deletion

Daffodil M Canson et al. Bioinformatics. .

Abstract

Summary: SpliceAI is a widely used splicing prediction tool and its most common application relies on the maximum delta score to assign variant impact on splicing. We developed the SpliceAI-10k calculator (SAI-10k-calc) to extend use of this tool to predict: the splicing aberration type including pseudoexonization, intron retention, partial exon deletion, and (multi)exon skipping using a 10 kb analysis window; the size of inserted or deleted sequence; the effect on reading frame; and the altered amino acid sequence. SAI-10k-calc has 95% sensitivity and 96% specificity for predicting variants that impact splicing, computed from a control dataset of 1212 single-nucleotide variants (SNVs) with curated splicing assay results. Notably, it has high performance (≥84% accuracy) for predicting pseudoexon and partial intron retention. The automated amino acid sequence prediction allows for efficient identification of variants that are expected to result in mRNA nonsense-mediated decay or translation of truncated proteins.

Availability and implementation: SAI-10k-calc is implemented in R (https://github.com/adavi4/SAI-10k-calc) and also available as a Microsoft Excel spreadsheet. Users can adjust the default thresholds to suit their target performance values.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Types of splicing aberrations predicted by the SpliceAI-10k calculator. Six SNVs that were experimentally confirmed to alter splicing are correctly predicted by SAI-10k-calc (A–F). Of these, two represent correct prediction of combinations of splicing aberration types (D, E). Amino acid sequence predictions include three amino acids preceding the first variant amino acid, followed by the modified sequence inside square brackets. In-frame deletions that do not introduce any missense amino acid change (example in panel E) are indicated by blank square brackets flanked by three wild type amino acids preceding and following the deleted sequence. In all cases, deleted amino acids are not shown

References

    1. Ha C, Kim J-W, Jang J-H.. Performance evaluation of SpliceAI for the prediction of splicing of NF1 variants. Genes 2021;12:1308. - PMC - PubMed
    1. Jaganathan K, Kyriazopoulou Panagiotopoulou S, McRae JF. et al. Predicting splicing from primary sequence with deep learning. Cell 2019;176:535–48.e24. - PubMed
    1. Moles-Fernández A, Domènech-Vivó J, Tenés A. et al. Role of splicing regulatory elements and in silico tools usage in the identification of deep intronic splicing variants in hereditary breast/ovarian cancer genes. Cancers 2021;13:3341. - PMC - PubMed
    1. Movassat M, Forouzmand E, Reese F. et al. Exon size and sequence conservation improves identification of splice-altering nucleotides. RNA 2019;25:1793–805. - PMC - PubMed
    1. Qian X, Wang J, Wang M. et al. Identification of deep-intronic splice mutations in a large cohort of patients with inherited retinal diseases. Front Genet 2021;12:647400. - PMC - PubMed

Publication types