Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022:2396:47-60.
doi: 10.1007/978-1-0716-1822-6_5.

RNA-Seq Data Analysis Pipeline for Plants: Transcriptome Assembly, Alignment, and Differential Expression Analysis

Affiliations

RNA-Seq Data Analysis Pipeline for Plants: Transcriptome Assembly, Alignment, and Differential Expression Analysis

David J Burks et al. Methods Mol Biol. 2022.

Abstract

In this chapter, we describe methods for analyzing RNA-Seq data, presented as a flow along a pipeline beginning with raw data from a sequencer and ending with an output of differentially expressed genes and their functional characterization. The first section covers de novo transcriptome assembly for organisms lacking reference genomes or for those interested in probing against the background of organism-specific transcriptomes assembled from RNA-Seq data. Section 2 covers both gene- and transcript-level quantifications, leading to the third and final section on differential expression analysis between two or more conditions. The pipeline starts with raw sequence reads, followed by quality assessment and preprocessing of the input data to ensure a robust estimate of the transcripts and their differential regulation. The preprocessed data can be inputted into the de novo transcriptome flow to assemble transcripts, functionally annotated using tools such as InterProScan or Blast2Go and then forwarded to differential expression analysis flow, or directly inputted into the differential expression analysis flow if a reference genome is available. An online repository containing sample data has also been made available, as well as custom Python scripts to modify the output of the programs within the pipeline for various downstream analyses.

Keywords: Alignment; Differential expression analysis; RNA-Seq data analysis; Transcriptome assembly; Transcriptomics.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Costa-Silva J, Domingues D, Lopes FM (2017) RNA-Seq differential expression analysis: an extended review and a software tool. PLoS One 12:1–18 - DOI
    1. Moreton J, Izquierdo A, Emes RD (2016) Assembly, assessment, and availability of De novo generated eukaryotic transcriptomes. Front Genet 6:1–9 - DOI
    1. Müller M, Seifert S, Lübbe T et al (2017) De novo transcriptome assembly and analysis of differential gene expression in response to drought in European beech. PLoS One 12:1–20
    1. Wang X, Yang S, Dong Y et al (2018) De novo transcriptome characterization of Rhodomyrtus tomentosa leaves and identification of genes involved in a/ß-pinene and ß-caryophyllene biosynthesis. Front Plant Sci 9:1231 - DOI
    1. Li QS, Li XM, Qiao RY et al (2018) Data descriptor: De novo transcriptome assembly of fluorine accumulator tea plant camellia sinensis with fluoride treatments. Sci Data 5:1–9 - DOI

LinkOut - more resources