Method of predicting splice sites based on signal interactions
- PMID: 16584568
- PMCID: PMC1526722
- DOI: 10.1186/1745-6150-1-10
Method of predicting splice sites based on signal interactions
Abstract
Background: Predicting and proper ranking of canonical splice sites (SSs) is a challenging problem in bioinformatics and machine learning communities. Any progress in SSs recognition will lead to better understanding of splicing mechanism. We introduce several new approaches of combining a priori knowledge for improved SS detection. First, we design our new Bayesian SS sensor based on oligonucleotide counting. To further enhance prediction quality, we applied our new de novo motif detection tool MHMMotif to intronic ends and exons. We combine elements found with sensor information using Naive Bayesian Network, as implemented in our new tool SpliceScan.
Results: According to our tests, the Bayesian sensor outperforms the contemporary Maximum Entropy sensor for 5' SS detection. We report a number of putative Exonic (ESE) and Intronic (ISE) Splicing Enhancers found by MHMMotif tool. T-test statistics on mouse/rat intronic alignments indicates, that detected elements are on average more conserved as compared to other oligos, which supports our assumption of their functional importance. The tool has been shown to outperform the SpliceView, GeneSplicer, NNSplice, Genio and NetUTR tools for the test set of human genes. SpliceScan outperforms all contemporary ab initio gene structural prediction tools on the set of 5' UTR gene fragments.
Conclusion: Designed methods have many attractive properties, compared to existing approaches. Bayesian sensor, MHMMotif program and SpliceScan tools are freely available on our web site.
Reviewers: This article was reviewed by Manyuan Long, Arcady Mushegian and Mikhail Gelfand.
Figures

















Similar articles
-
A method of predicting changes in human gene splicing induced by genetic variants in context of cis-acting elements.BMC Bioinformatics. 2010 Jan 12;11:22. doi: 10.1186/1471-2105-11-22. BMC Bioinformatics. 2010. PMID: 20067640 Free PMC article.
-
Rules and tools to predict the splicing effects of exonic and intronic mutations.Wiley Interdiscip Rev RNA. 2018 Jan;9(1). doi: 10.1002/wrna.1451. Epub 2017 Sep 26. Wiley Interdiscip Rev RNA. 2018. PMID: 28949076 Review.
-
GeneSplicer: a new computational method for splice site prediction.Nucleic Acids Res. 2001 Mar 1;29(5):1185-90. doi: 10.1093/nar/29.5.1185. Nucleic Acids Res. 2001. PMID: 11222768 Free PMC article.
-
Computational prediction of splicing regulatory elements shared by Tetrapoda organisms.BMC Genomics. 2009 Nov 4;10:508. doi: 10.1186/1471-2164-10-508. BMC Genomics. 2009. PMID: 19889216 Free PMC article.
-
What's Wrong in a Jump? Prediction and Validation of Splice Site Variants.Methods Protoc. 2021 Sep 5;4(3):62. doi: 10.3390/mps4030062. Methods Protoc. 2021. PMID: 34564308 Free PMC article. Review.
Cited by
-
Aberrant RNA splicing and its functional consequences in cancer cells.Dis Model Mech. 2008 Jul-Aug;1(1):37-42. doi: 10.1242/dmm.000331. Dis Model Mech. 2008. PMID: 19048051 Free PMC article. Review.
-
Clustering ionic flow blockade toggles with a mixture of HMMs.BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S13. doi: 10.1186/1471-2105-9-S9-S13. BMC Bioinformatics. 2008. PMID: 18793458 Free PMC article.
-
Human Splicing Finder: an online bioinformatics tool to predict splicing signals.Nucleic Acids Res. 2009 May;37(9):e67. doi: 10.1093/nar/gkp215. Epub 2009 Apr 1. Nucleic Acids Res. 2009. PMID: 19339519 Free PMC article.
-
Interpretation, stratification and evidence for sequence variants affecting mRNA splicing in complete human genome sequences.Genomics Proteomics Bioinformatics. 2013 Apr;11(2):77-85. doi: 10.1016/j.gpb.2013.01.008. Epub 2013 Mar 14. Genomics Proteomics Bioinformatics. 2013. PMID: 23499923 Free PMC article.
-
A method of predicting changes in human gene splicing induced by genetic variants in context of cis-acting elements.BMC Bioinformatics. 2010 Jan 12;11:22. doi: 10.1186/1471-2105-11-22. BMC Bioinformatics. 2010. PMID: 20067640 Free PMC article.
References
-
- Krogh A. Gene finding: putting the parts together. In: Bishop MJ, editor. Guide to Human Genome Computing. 2. Academic Press, San Diego, CA; 1998. pp. 261–274.
-
- Burge C, Karlin S. Predictions of complete gene structures in human genomic DNA. Journal of Molecular Biology. 1997;268:78–94. - PubMed
-
- Krogh A. Two methods for improving performance of an HMM and their application for gene-finding. In: Gaasterland T et al, editor. Proceedings of the Fifth International Conference on Intelligent Systems for Molecular Biology. AAAI Press, Menlo Park, CA; 1997. pp. 179–186. - PubMed
-
- Rogozin I, Milanesi L. Analysis of Donor Splice Sites in Different Eukaryotic Organisms. Journal of Molecular Evolution. 1997;45:50–59. - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources