De novo transcriptome assembly of RNA-Seq reads with different strategies
- PMID: 22227905
- DOI: 10.1007/s11427-011-4256-9
De novo transcriptome assembly of RNA-Seq reads with different strategies
Abstract
De novo transcriptome assembly is an important approach in RNA-Seq data analysis and it can help us to reconstruct the transcriptome and investigate gene expression profiles without reference genome sequences. We carried out transcriptome assemblies with two RNA-Seq datasets generated from human brain and cell line, respectively. We then determined an efficient way to yield an optimal overall assembly using three different strategies. We first assembled brain and cell line transcriptome using a single k-mer length. Next we tested a range of values of k-mer length and coverage cutoff in assembling. Lastly, we combined the assembled contigs from a range of k values to generate a final assembly. By comparing these assembly results, we found that using only one k-mer value for assembly is not enough to generate good assembly results, but combining the contigs from different k-mer values could yield longer contigs and greatly improve the overall assembly.
Similar articles
-
Comparative study of de novo assembly and genome-guided assembly strategies for transcriptome reconstruction based on RNA-Seq.Sci China Life Sci. 2013 Feb;56(2):143-55. doi: 10.1007/s11427-013-4442-z. Epub 2013 Feb 8. Sci China Life Sci. 2013. PMID: 23393030
-
A differential k-mer analysis pipeline for comparing RNA-Seq transcriptome and meta-transcriptome datasets without a reference.Funct Integr Genomics. 2019 Mar;19(2):363-371. doi: 10.1007/s10142-018-0647-3. Epub 2018 Nov 27. Funct Integr Genomics. 2019. PMID: 30483906
-
Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C(3) and C(4) species.J Exp Bot. 2011 May;62(9):3093-102. doi: 10.1093/jxb/err029. Epub 2011 Mar 11. J Exp Bot. 2011. PMID: 21398430
-
Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence.Nat Prod Rep. 2013 Apr;30(4):490-500. doi: 10.1039/c3np20099j. Nat Prod Rep. 2013. PMID: 23377493 Review.
-
Overview of available methods for diverse RNA-Seq data analyses.Sci China Life Sci. 2011 Dec;54(12):1121-8. doi: 10.1007/s11427-011-4255-x. Epub 2012 Jan 7. Sci China Life Sci. 2011. PMID: 22227904 Review.
Cited by
-
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers.Gigascience. 2019 May 1;8(5):giz039. doi: 10.1093/gigascience/giz039. Gigascience. 2019. PMID: 31077315 Free PMC article.
-
IDP-denovo: de novo transcriptome assembly and isoform annotation by hybrid sequencing.Bioinformatics. 2018 Jul 1;34(13):2168-2176. doi: 10.1093/bioinformatics/bty098. Bioinformatics. 2018. PMID: 29905763 Free PMC article.
-
Comprehensively identifying and characterizing the missing gene sequences in human reference genome with integrated analytic approaches.Hum Genet. 2013 Aug;132(8):899-911. doi: 10.1007/s00439-013-1300-9. Epub 2013 Apr 10. Hum Genet. 2013. PMID: 23572138
-
Functional genomics by integrated analysis of transcriptome of sweet potato (Ipomoea batatas (L.) Lam.) during root formation.Genes Genomics. 2020 May;42(5):581-596. doi: 10.1007/s13258-020-00927-7. Epub 2020 Apr 2. Genes Genomics. 2020. PMID: 32240514
-
Exploring the pathogenetic association between schizophrenia and type 2 diabetes mellitus diseases based on pathway analysis.BMC Med Genomics. 2013;6 Suppl 1(Suppl 1):S17. doi: 10.1186/1755-8794-6-S1-S17. Epub 2013 Jan 23. BMC Med Genomics. 2013. PMID: 23369358 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous