SpliceTransformer predicts tissue-specific splicing linked to human diseases
- PMID: 39443442
- PMCID: PMC11500173
- DOI: 10.1038/s41467-024-53088-6
SpliceTransformer predicts tissue-specific splicing linked to human diseases
Abstract
We present SpliceTransformer (SpTransformer), a deep-learning framework that predicts tissue-specific RNA splicing alterations linked to human diseases based on genomic sequence. SpTransformer outperforms all previous methods on splicing prediction. Application to approximately 1.3 million genetic variants in the ClinVar database reveals that splicing alterations account for 60% of intronic and synonymous pathogenic mutations, and occur at different frequencies across tissue types. Importantly, tissue-specific splicing alterations match their clinical manifestations independent of gene expression variation. We validate the enrichment in three brain disease datasets involving over 164,000 individuals. Additionally, we identify single nucleotide variations that cause brain-specific splicing alterations, and find disease-associated genes harboring these single nucleotide variations with distinct expression patterns involved in diverse biological processes. Finally, SpTransformer analysis of whole exon sequencing data from blood samples of patients with diabetic nephropathy predicts kidney-specific RNA splicing alterations with 83% accuracy, demonstrating the potential to infer disease-causing tissue-specific splicing events. SpTransformer provides a powerful tool to guide biological and clinical interpretations of human diseases.
© 2024. The Author(s).
Conflict of interest statement
The authors have submitted a patent application for the method. Other than this, the authors declare that they do not have any competing interests.
Figures






Similar articles
-
Predicting Splicing from Primary Sequence with Deep Learning.Cell. 2019 Jan 24;176(3):535-548.e24. doi: 10.1016/j.cell.2018.12.015. Epub 2019 Jan 17. Cell. 2019. PMID: 30661751
-
IntSplice: prediction of the splicing consequences of intronic single-nucleotide variations in the human genome.J Hum Genet. 2016 Jul;61(7):633-40. doi: 10.1038/jhg.2016.23. Epub 2016 Mar 24. J Hum Genet. 2016. PMID: 27009626
-
Genomic features defining exonic variants that modulate splicing.Genome Biol. 2010;11(2):R20. doi: 10.1186/gb-2010-11-2-r20. Epub 2010 Feb 16. Genome Biol. 2010. PMID: 20158892 Free PMC article.
-
Rules and tools to predict the splicing effects of exonic and intronic mutations.Wiley Interdiscip Rev RNA. 2018 Jan;9(1). doi: 10.1002/wrna.1451. Epub 2017 Sep 26. Wiley Interdiscip Rev RNA. 2018. PMID: 28949076 Review.
-
Splicing mutations in human genetic disorders: examples, detection, and confirmation.J Appl Genet. 2018 Aug;59(3):253-268. doi: 10.1007/s13353-018-0444-7. Epub 2018 Apr 21. J Appl Genet. 2018. PMID: 29680930 Free PMC article. Review.
Cited by
-
Generative modeling for RNA splicing predictions and design.bioRxiv [Preprint]. 2025 Jan 24:2025.01.20.633986. doi: 10.1101/2025.01.20.633986. bioRxiv. 2025. PMID: 39896553 Free PMC article. Preprint.
-
Translating Muscle RNAseq Into the Clinic for the Diagnosis of Muscle Diseases.Ann Clin Transl Neurol. 2025 Jul;12(7):1465-1479. doi: 10.1002/acn3.70078. Epub 2025 May 25. Ann Clin Transl Neurol. 2025. PMID: 40413734 Free PMC article.
References
-
- Pagani, F. & Baralle, F. Genomic variants in exons and introns: identifying the splicing spoilers. Nat. Rev. Genet.5, 389–96 (2004). - PubMed
-
- Ahmed, M. S., Ikram, S., Bibi, N. & Mir, A. Hutchinson–Gilford progeria syndrome: a premature aging disease. Mol. Neurobiol.55, 4417–4427 (2018). - PubMed
-
- Yeo, G. & Burge, C. Maximum entropy modeling of short sequence motifs with applications to rna splicing signals. J. Comput. Biol.11, 377–94 (2004). - PubMed
Publication types
MeSH terms
Associated data
LinkOut - more resources
Full Text Sources