Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans
- PMID: 16595562
- DOI: 10.1093/bioinformatics/btl076
Abstract
Motivation: Computational gene prediction methods are an important component of whole-genome analyses. While ab initio gene finders have demonstrated major improvements in accuracy, the most reliable methods are evidence-based gene predictors. These algorithms can draw on several sources of evidence to predict the final product, including predictions from multiple ab initio gene finders, matches to known proteins, sequence conservation and partial cDNAs. Despite the success of these algorithms, prediction of complete gene structures, especially for alternatively spliced products, remains a difficult task.
Results: LOCUS (Length Optimized Characterization of Unknown Spliceforms) is a new evidence-based gene finding algorithm that integrates a length constraint into a dynamic programming framework for prediction of gene products. On a Caenorhabditis elegans test set of alternatively spliced internal exons, its performance exceeds that of current ab initio gene finders, and in most cases it accurately predicts the correct form of all the alternative products. As the length information used by the algorithm can be obtained in a high-throughput fashion, we propose that integrating such information into a gene-prediction pipeline is feasible and may improve our ability to fully characterize the complete set of mRNAs for a genome.
Availability: LOCUS is available from http://ural.wustl.edu/software.html
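The idea of a length constraint inside a dynamic program can be illustrated with a small sketch. This is not the LOCUS implementation: the `Exon` type, the `best_chain` function, the additive exon scores and the tolerance parameter `tol` are all assumptions made for illustration. The sketch chains non-overlapping candidate exons and keeps, for each exon and each accumulated length, the best-scoring chain, so that only chains whose total length matches a measured mRNA length are reported.

```python
from typing import NamedTuple


class Exon(NamedTuple):
    """Hypothetical candidate exon; coordinates are inclusive."""
    start: int
    end: int
    score: float  # assumed additive evidence score, higher is better

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def best_chain(exons, target_len, tol=0):
    """Return the highest-scoring chain of non-overlapping exons whose
    summed length is within `tol` of `target_len` (empty list if none)."""
    exons = sorted(exons)  # order by start coordinate
    limit = target_len + tol
    # table[i][L] = (best score of a chain ending at exon i with total
    #                length L, predecessor state (j, L_prev) or None)
    table = []
    for i, ex in enumerate(exons):
        cell = {}
        if ex.length <= limit:
            cell[ex.length] = (ex.score, None)  # chain starting at exon i
        for j in range(i):
            if exons[j].end >= ex.start:
                continue  # overlapping exons cannot be in the same product
            for prev_len, (prev_score, _) in table[j].items():
                total = prev_len + ex.length
                if total > limit:
                    continue  # the length constraint prunes this extension
                cand = prev_score + ex.score
                if total not in cell or cand > cell[total][0]:
                    cell[total] = (cand, (j, prev_len))
        table.append(cell)

    # Choose the best end state that satisfies the length constraint.
    best = None
    for i, cell in enumerate(table):
        for total, (score, _) in cell.items():
            if abs(total - target_len) <= tol:
                if best is None or score > best[0]:
                    best = (score, i, total)
    if best is None:
        return []

    # Trace back through the stored predecessor states.
    chain = []
    state = (best[1], best[2])
    while state is not None:
        i, total = state
        chain.append(exons[i])
        state = table[i][total][1]
    return chain[::-1]


if __name__ == "__main__":
    # Toy locus: B and C are mutually exclusive alternatives for the middle exon.
    a = Exon(1, 100, 5.0)
    b = Exon(201, 260, 2.0)
    c = Exon(201, 290, 1.5)
    d = Exon(401, 500, 5.0)
    for observed in (200, 260, 290):
        print(observed, best_chain([a, b, c, d], observed))
```

In this toy locus, three different observed lengths select three different products (exon skipping, the shorter middle exon, the longer middle exon), which is the sense in which length evidence can separate alternative spliceforms that sequence-based scores alone might not. The sketch ignores intron length limits, reading frame and fixed terminal exons, all of which a real gene finder would need to model.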