Assisted transcriptome reconstruction and splicing orthology
- PMID: 28185551
- PMCID: PMC5123294
- DOI: 10.1186/s12864-016-3103-6
Assisted transcriptome reconstruction and splicing orthology
Abstract
Background: Transcriptome reconstruction, defined as the identification of all protein isoforms that may be expressed by a gene, is a notably difficult computational task. With real data, the best methods based on RNA-seq data identify barely 21 % of the expressed transcripts. While waiting for algorithms and sequencing techniques to improve - as has been strongly suggested in the literature - it is important to evaluate assisted transcriptome prediction; this is the question of how alternative transcription in one species performs as a predictor of protein isoforms in another relatively close species. Most evidence-based gene predictors use transcripts from other species to annotate a genome, but the predictive power of procedures that use exclusively transcripts from external species has never been quantified. The cornerstone of such an evaluation is the correct identification of pairs of transcripts with the same splicing patterns, called splicing orthologs.
Results: We propose a rigorous procedural definition of splicing orthologs, based on the identification of all ortholog pairs of splicing sites in the nucleotide sequences, and alignments at the protein level. Using our definition, we compared 24 382 human transcripts and 17 909 mouse transcripts from the highly curated CCDS database, and identified 11 122 splicing orthologs. In prediction mode, we show that human transcripts can be used to infer over 62 % of mouse protein isoforms. When restricting the predictions to transcripts known eight years ago, the percentage grows to 74 %. Using CCDS timestamped releases, we also analyze the evolution of the number of splicing orthologs over the last decade.
Conclusions: Alternative splicing is now recognized to play a major role in the protein diversity of eukaryotic organisms, but definitions of spliced isoform orthologs are still approximate. Here we propose a definition adapted to the subtle variations of conserved alternative splicing sites, and use it to validate numerous accurate orthologous isoform predictions.
Keywords: Eukaryotes; Splicing orthologs; Transcriptome prediction.
Figures



Similar articles
-
Identifying genes with conserved splicing structure and orthologous isoforms in human, mouse and dog.BMC Genomics. 2022 Mar 18;23(1):216. doi: 10.1186/s12864-022-08429-4. BMC Genomics. 2022. PMID: 35303798 Free PMC article.
-
Significant variations in alternative splicing patterns and expression profiles between human-mouse orthologs in early embryos.Sci China Life Sci. 2017 Feb;60(2):178-188. doi: 10.1007/s11427-015-0348-5. Epub 2016 Jul 4. Sci China Life Sci. 2017. PMID: 27378339
-
Assessment of orthologous splicing isoforms in human and mouse orthologous genes.BMC Genomics. 2010 Oct 1;11:534. doi: 10.1186/1471-2164-11-534. BMC Genomics. 2010. PMID: 20920313 Free PMC article.
-
Alternative Splicing May Not Be the Key to Proteome Complexity.Trends Biochem Sci. 2017 Feb;42(2):98-110. doi: 10.1016/j.tibs.2016.08.008. Epub 2016 Oct 3. Trends Biochem Sci. 2017. PMID: 27712956 Free PMC article. Review.
-
The emerging era of genomic data integration for analyzing splice isoform function.Trends Genet. 2014 Aug;30(8):340-7. doi: 10.1016/j.tig.2014.05.005. Epub 2014 Jun 17. Trends Genet. 2014. PMID: 24951248 Free PMC article. Review.
Cited by
-
ExceS-A: an exon-centric split aligner.J Integr Bioinform. 2022 Mar 7;19(1):20210040. doi: 10.1515/jib-2021-0040. J Integr Bioinform. 2022. PMID: 35254744 Free PMC article.
-
SimSpliceEvol2: alternative splicing-aware simulation of biological sequence evolution and transcript phylogenies.BMC Bioinformatics. 2024 Jul 11;25(1):235. doi: 10.1186/s12859-024-05853-z. BMC Bioinformatics. 2024. PMID: 38992593 Free PMC article.
-
SimSpliceEvol: alternative splicing-aware simulation of biological sequence evolution.BMC Bioinformatics. 2019 Dec 17;20(Suppl 20):640. doi: 10.1186/s12859-019-3207-5. BMC Bioinformatics. 2019. PMID: 31842741 Free PMC article.
-
Identifying genes with conserved splicing structure and orthologous isoforms in human, mouse and dog.BMC Genomics. 2022 Mar 18;23(1):216. doi: 10.1186/s12864-022-08429-4. BMC Genomics. 2022. PMID: 35303798 Free PMC article.
-
Insights Into the Albinism Mechanism for Two Distinct Color Morphs of Northern Snakehead, Channa argus Through Histological and Transcriptome Analyses.Front Genet. 2020 Sep 18;11:830. doi: 10.3389/fgene.2020.00830. eCollection 2020. Front Genet. 2020. PMID: 33193565 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources