RNA-Seq Data Analysis Pipeline for Plants: Transcriptome Assembly, Alignment, and Differential Expression Analysis
- PMID: 34786675
- DOI: 10.1007/978-1-0716-1822-6_5
RNA-Seq Data Analysis Pipeline for Plants: Transcriptome Assembly, Alignment, and Differential Expression Analysis
Abstract
In this chapter, we describe methods for analyzing RNA-Seq data, presented as a flow along a pipeline beginning with raw data from a sequencer and ending with an output of differentially expressed genes and their functional characterization. The first section covers de novo transcriptome assembly for organisms lacking reference genomes or for those interested in probing against the background of organism-specific transcriptomes assembled from RNA-Seq data. Section 2 covers both gene- and transcript-level quantifications, leading to the third and final section on differential expression analysis between two or more conditions. The pipeline starts with raw sequence reads, followed by quality assessment and preprocessing of the input data to ensure a robust estimate of the transcripts and their differential regulation. The preprocessed data can be inputted into the de novo transcriptome flow to assemble transcripts, functionally annotated using tools such as InterProScan or Blast2Go and then forwarded to differential expression analysis flow, or directly inputted into the differential expression analysis flow if a reference genome is available. An online repository containing sample data has also been made available, as well as custom Python scripts to modify the output of the programs within the pipeline for various downstream analyses.
Keywords: Alignment; Differential expression analysis; RNA-Seq data analysis; Transcriptome assembly; Transcriptomics.
© 2022. Springer Science+Business Media, LLC, part of Springer Nature.
Similar articles
-
An RNA-Seq Data Analysis Pipeline.Methods Mol Biol. 2024;2812:1-9. doi: 10.1007/978-1-0716-3886-6_1. Methods Mol Biol. 2024. PMID: 39068354
-
EMPathways2: Estimation of Enzyme Expression and Metabolic Pathway Activity Using RNA-Seq Reads.Methods Mol Biol. 2024;2812:39-46. doi: 10.1007/978-1-0716-3886-6_3. Methods Mol Biol. 2024. PMID: 39068356
-
RNA-Seq in Nonmodel Organisms.Methods Mol Biol. 2021;2243:143-167. doi: 10.1007/978-1-0716-1103-6_8. Methods Mol Biol. 2021. PMID: 33606257
-
Characterizing and annotating the genome using RNA-seq data.Sci China Life Sci. 2017 Feb;60(2):116-125. doi: 10.1007/s11427-015-0349-4. Epub 2016 Jun 13. Sci China Life Sci. 2017. PMID: 27294835 Review.
-
RNA-Seq differential expression analysis: An extended review and a software tool.PLoS One. 2017 Dec 21;12(12):e0190152. doi: 10.1371/journal.pone.0190152. eCollection 2017. PLoS One. 2017. PMID: 29267363 Free PMC article. Review.
Cited by
-
Integrative Transcriptomic and Metabolic Analyses Reveal That Flavonoid Biosynthesis Is the Key Pathway Regulating Pigment Deposition in Naturally Brown Cotton Fibers.Plants (Basel). 2024 Jul 24;13(15):2028. doi: 10.3390/plants13152028. Plants (Basel). 2024. PMID: 39124145 Free PMC article.
-
Combined Analysis of Transcriptomes and Metabolomes Reveals That MeJA-Mediated Flavonoid Biosynthesis Is Crucial for Pigment Deposition in Naturally Colored Green Cotton Fibers.Genes (Basel). 2025 May 19;16(5):599. doi: 10.3390/genes16050599. Genes (Basel). 2025. PMID: 40428421 Free PMC article.
-
Molecular Cytological Analysis and Specific Marker Development in Wheat-Psathyrostachys huashanica Keng 3Ns Additional Line with Elongated Glume.Int J Mol Sci. 2023 Apr 4;24(7):6726. doi: 10.3390/ijms24076726. Int J Mol Sci. 2023. PMID: 37047699 Free PMC article.
-
Genome-Wide Identification of the Geranylgeranyl Pyrophosphate Synthase (GGPS) Gene Family Associated with Natural Rubber Synthesis in Taraxacum kok-saghyz L. Rodin.Plants (Basel). 2024 Oct 4;13(19):2788. doi: 10.3390/plants13192788. Plants (Basel). 2024. PMID: 39409658 Free PMC article.
References
-
- Costa-Silva J, Domingues D, Lopes FM (2017) RNA-Seq differential expression analysis: an extended review and a software tool. PLoS One 12:1–18 - DOI
-
- Moreton J, Izquierdo A, Emes RD (2016) Assembly, assessment, and availability of De novo generated eukaryotic transcriptomes. Front Genet 6:1–9 - DOI
-
- Müller M, Seifert S, Lübbe T et al (2017) De novo transcriptome assembly and analysis of differential gene expression in response to drought in European beech. PLoS One 12:1–20
-
- Wang X, Yang S, Dong Y et al (2018) De novo transcriptome characterization of Rhodomyrtus tomentosa leaves and identification of genes involved in a/ß-pinene and ß-caryophyllene biosynthesis. Front Plant Sci 9:1231 - DOI
-
- Li QS, Li XM, Qiao RY et al (2018) Data descriptor: De novo transcriptome assembly of fluorine accumulator tea plant camellia sinensis with fluoride treatments. Sci Data 5:1–9 - DOI
MeSH terms
LinkOut - more resources
Full Text Sources