Efficient prediction of alternative splice forms using protein domain homology
- PMID: 15107023
Abstract
Alternative splicing can yield many different mature mRNAs from a single precursor. Recent findings indicate that alternative splicing occurs much more often than previously assumed. A major goal of functional genomics is to elucidate and characterize the entire spectrum of alternative splice forms. Existing approaches such as EST alignments consider only the mRNA sequence when detecting alternative splice forms; they do not take into account the function and characteristics of the resulting proteins. One important example of such a functional characterization is homology to a known protein domain family. Profile Hidden Markov models (HMMs), as stored in the Pfam database, provide a powerful description of protein domains. In this paper we address the problem of identifying the splice form with the highest similarity to a protein domain family. To do so, we take all possible splice forms into consideration. As demonstrated here for a number of genes, this homology-based approach can be used successfully for predicting partial gene structures. Furthermore, we present some novel splice form predictions with high-scoring protein domain homology and point out that the detection of splice-form-specific protein domains helps to answer questions concerning hereditary diseases. Simple approaches based on a BLASTP search cannot be applied here, since the number of possible splice forms increases exponentially with the number of exons. We have therefore developed an efficient polynomial-time algorithm, called ASFPred (Alternative Splice Form Prediction). This algorithm needs only a set of exons as input.
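The contrast between the exponential number of splice forms and a polynomial-time search can be illustrated with a toy example. The sketch below is not ASFPred and does not use profile HMMs; it assumes, purely for illustration, that a splice form is an ordered subset of exons and that its score is the sum of hypothetical per-exon scores minus a hypothetical penalty for each junction between consecutive chosen exons. Under that simplifying assumption, dynamic programming over "best chain ending at exon j" finds the same optimum as exhaustive enumeration. All function names and numbers are made up.

```python
from itertools import combinations


def count_splice_forms(n_exons):
    """Number of non-empty ordered exon subsets: grows as 2^n - 1."""
    return 2 ** n_exons - 1


def chain_score(chain, exon_scores, junction_penalty):
    """Toy score: exon scores minus penalties for each consecutive junction."""
    score = sum(exon_scores[i] for i in chain)
    score -= sum(junction_penalty(a, b) for a, b in zip(chain, chain[1:]))
    return score


def best_chain_bruteforce(exon_scores, junction_penalty):
    """Enumerate every non-empty exon subset (exponential time)."""
    n = len(exon_scores)
    best = None
    for k in range(1, n + 1):
        for chain in combinations(range(n), k):
            score = chain_score(chain, exon_scores, junction_penalty)
            if best is None or score > best[0]:
                best = (score, chain)
    return best


def best_chain_dp(exon_scores, junction_penalty):
    """O(n^2) dynamic programme: best_end[j] is the best chain ending at exon j."""
    n = len(exon_scores)
    best_end = [0.0] * n
    prev = [None] * n
    for j in range(n):
        best_end[j] = exon_scores[j]  # chain consisting of exon j alone
        for i in range(j):
            cand = best_end[i] + exon_scores[j] - junction_penalty(i, j)
            if cand > best_end[j]:
                best_end[j] = cand
                prev[j] = i
    # Trace back from the best-scoring end exon.
    j = max(range(n), key=best_end.__getitem__)
    score = best_end[j]
    chain = []
    while j is not None:
        chain.append(j)
        j = prev[j]
    return score, tuple(reversed(chain))


# Hypothetical inputs: five exons, penalty of 0.5 per skipped exon.
scores = [3.0, -2.0, 4.0, -1.0, 2.5]
penalty = lambda i, j: 0.5 * (j - i - 1)

print(count_splice_forms(len(scores)))
print(best_chain_bruteforce(scores, penalty))
print(best_chain_dp(scores, penalty))
```

The dynamic programme works here only because the toy score decomposes over adjacent exon pairs; scoring against a real domain model involves alignment state that this sketch leaves out.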