DNA-protein quasi-mapping for rapid differential gene expression analysis in non-model organisms
- PMID: 39448913
- PMCID: PMC11515663
- DOI: 10.1186/s12859-024-05924-1
DNA-protein quasi-mapping for rapid differential gene expression analysis in non-model organisms
Abstract
Background: Conventional differential gene expression analysis pipelines for non-model organisms require computationally expensive transcriptome assembly. We recently proposed an alternative strategy of directly aligning RNA-seq reads to a protein database, and demonstrated drastic improvements in speed, memory usage, and accuracy in identifying differentially expressed genes.
Result: Here we report a further speed-up by replacing DNA-protein alignment by quasi-mapping, making our pipeline > 1000× faster than assembly-based approach, and still more accurate. We also compare quasi-mapping to other mapping techniques, and show that it is faster but at the cost of sensitivity.
Conclusion: We provide a quick-and-dirty differential gene expression analysis pipeline for non-model organisms without a reference transcriptome, which directly quasi-maps RNA-seq reads to a reference protein database, avoiding computationally expensive transcriptome assembly.
Keywords: DNA-protein alignment; Differential gene expression analysis; Non-model organism; Quasi-mapping; RNA-seq.
© 2024. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests
Figures






Similar articles
-
Assembly-free rapid differential gene expression analysis in non-model organisms using DNA-protein alignment.BMC Genomics. 2022 Feb 4;23(1):97. doi: 10.1186/s12864-021-08278-7. BMC Genomics. 2022. PMID: 35120462 Free PMC article.
-
A differential k-mer analysis pipeline for comparing RNA-Seq transcriptome and meta-transcriptome datasets without a reference.Funct Integr Genomics. 2019 Mar;19(2):363-371. doi: 10.1007/s10142-018-0647-3. Epub 2018 Nov 27. Funct Integr Genomics. 2019. PMID: 30483906
-
RNA-Seq Data Analysis Pipeline for Plants: Transcriptome Assembly, Alignment, and Differential Expression Analysis.Methods Mol Biol. 2022;2396:47-60. doi: 10.1007/978-1-0716-1822-6_5. Methods Mol Biol. 2022. PMID: 34786675
-
Mapping RNA-seq reads to transcriptomes efficiently based on learning to hash method.Comput Biol Med. 2020 Jan;116:103539. doi: 10.1016/j.compbiomed.2019.103539. Epub 2019 Nov 13. Comput Biol Med. 2020. PMID: 31765913 Review.
-
Mapping RNA-seq Reads with STAR.Curr Protoc Bioinformatics. 2015 Sep 3;51:11.14.1-11.14.19. doi: 10.1002/0471250953.bi1114s51. Curr Protoc Bioinformatics. 2015. PMID: 26334920 Free PMC article. Review.
References
-
- Vijay N, Poelstra JW, Künstner A, Wolf JBW. Challenges and strategies in transcriptome assembly and differential gene expression quantification a. comprehensive in-silico assessment of RNA-seq experiments. Mol Ecol. 2012;22(3):620–34. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources