Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels
- PMID: 22368243
- PMCID: PMC3324515
- DOI: 10.1093/bioinformatics/bts094
Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels
Abstract
Motivation: High-throughput sequencing has made the analysis of new model organisms more affordable. Although assembling a new genome can still be costly and difficult, it is possible to use RNA-seq to sequence mRNA. In the absence of a known genome, it is necessary to assemble these sequences de novo, taking into account possible alternative isoforms and the dynamic range of expression values.
Results: We present a software package named Oases designed to heuristically assemble RNA-seq reads in the absence of a reference genome, across a broad spectrum of expression values and in presence of alternative isoforms. It achieves this by using an array of hash lengths, a dynamic filtering of noise, a robust resolution of alternative splicing events and the efficient merging of multiple assemblies. It was tested on human and mouse RNA-seq data and is shown to improve significantly on the transABySS and Trinity de novo transcriptome assemblers.
Availability and implementation: Oases is freely available under the GPL license at www.ebi.ac.uk/~zerbino/oases/.
Figures




References
-
- Birol I., et al. De novo transcriptome assembly with ABySS. Bioinformatics. 2009;25:2872–2877. - PubMed
-
- Blencowe B.J., et al. Current-generation high-throughput sequencing: deepening insights into mammalian transcriptomes. Gene. Dev. 2009;23:1379–1386. - PubMed
-
- Collins L.J., et al. An approach to transcriptome analysis of non-model organisms using short-read sequences. Genome Inform. 2008;21:3–14. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources