Splicing graphs and EST assembly problem
- PMID: 12169546
- DOI: 10.1093/bioinformatics/18.suppl_1.s181
Splicing graphs and EST assembly problem
Abstract
Motivation: The traditional approach to annotate alternative splicing is to investigate every splicing variant of the gene in a case-by-case fashion. This approach, while useful, has some serious shortcomings. Recent studies indicate that alternative splicing is more frequent than previously thought and some genes may produce tens of thousands of different transcripts. A list of alternatively spliced variants for such genes would be difficult to build and hard to analyse. Moreover, such a list does not show the relationships between different transcripts and does not show the overall structure of all transcripts. A better approach would be to represent all splicing variants for a given gene in a way that captures the relationships between different splicing variants.
Results: We introduce the notion of the splicing graph that is a natural and convenient representation of all splicing variants. The key difference with the existing approaches is that we abandon the linear (sequence) representation of each transcript and replace it with a graph representation where each transcript corresponds to a path in the graph. We further design an algorithm to assemble EST reads into the splicing graph rather than assembling them into each splicing variant in a case-by-case fashion.
Similar articles
-
A graph based algorithm for generating EST consensus sequences.Bioinformatics. 2005 Apr 15;21(8):1371-5. doi: 10.1093/bioinformatics/bti184. Epub 2004 Nov 30. Bioinformatics. 2005. PMID: 15572463
-
Genome wide identification and classification of alternative splicing based on EST data.Bioinformatics. 2004 Nov 1;20(16):2579-85. doi: 10.1093/bioinformatics/bth288. Epub 2004 Apr 29. Bioinformatics. 2004. PMID: 15117759
-
A comparative method for identification of gene structures and alternatively spliced variants.Bioinformatics. 2004 Nov 22;20(17):3064-79. doi: 10.1093/bioinformatics/bth368. Epub 2004 Jun 24. Bioinformatics. 2004. PMID: 15217819
-
Gene identification through large-scale EST sequence processing.Appl Bioinformatics. 2003;2(3):123-9. Appl Bioinformatics. 2003. PMID: 15130797 Review.
-
Reconstruction of full-length isoforms from splice graphs.Methods Mol Biol. 2008;452:199-205. doi: 10.1007/978-1-60327-159-2_10. Methods Mol Biol. 2008. PMID: 18566766 Review.
Cited by
-
Inference of alternative splicing from RNA-Seq data with probabilistic splice graphs.Bioinformatics. 2013 Sep 15;29(18):2300-10. doi: 10.1093/bioinformatics/btt396. Epub 2013 Jul 11. Bioinformatics. 2013. PMID: 23846746 Free PMC article.
-
Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels.Bioinformatics. 2012 Apr 15;28(8):1086-92. doi: 10.1093/bioinformatics/bts094. Epub 2012 Feb 24. Bioinformatics. 2012. PMID: 22368243 Free PMC article.
-
SpliceGrapher: detecting patterns of alternative splicing from RNA-Seq data in the context of gene models and EST data.Genome Biol. 2012 Jan 31;13(1):R4. doi: 10.1186/gb-2012-13-1-r4. Genome Biol. 2012. PMID: 22293517 Free PMC article.
-
Accurate isoform discovery with IsoQuant using long reads.Nat Biotechnol. 2023 Jul;41(7):915-918. doi: 10.1038/s41587-022-01565-y. Epub 2023 Jan 2. Nat Biotechnol. 2023. PMID: 36593406 Free PMC article.
-
SpliceDetector: a software for detection of alternative splicing events in human and model organisms directly from transcript IDs.Sci Rep. 2018 Mar 22;8(1):5063. doi: 10.1038/s41598-018-23245-1. Sci Rep. 2018. PMID: 29567976 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials