Count-based differential expression analysis of RNA sequencing data using R and Bioconductor
- PMID: 23975260
- DOI: 10.1038/nprot.2013.099
Count-based differential expression analysis of RNA sequencing data using R and Bioconductor
Abstract
RNA sequencing (RNA-seq) has been rapidly adopted for the profiling of transcriptomes in many areas of biology, including studies into gene regulation, development and disease. Of particular interest is the discovery of differentially expressed genes across different conditions (e.g., tissues, perturbations) while optionally adjusting for other systematic factors that affect the data-collection process. There are a number of subtle yet crucial aspects of these analyses, such as read counting, appropriate treatment of biological variability, quality control checks and appropriate setup of statistical modeling. Several variations have been presented in the literature, and there is a need for guidance on current best practices. This protocol presents a state-of-the-art computational and statistical RNA-seq differential expression analysis workflow largely based on the free open-source R language and Bioconductor software and, in particular, on two widely used tools, DESeq and edgeR. Hands-on time for typical small experiments (e.g., 4-10 samples) can be <1 h, with computation time <1 d using a standard desktop PC.
Similar articles
-
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y. BMC Bioinformatics. 2016. PMID: 26847232 Free PMC article.
-
easyRNASeq: a bioconductor package for processing RNA-Seq data.Bioinformatics. 2012 Oct 1;28(19):2532-3. doi: 10.1093/bioinformatics/bts477. Epub 2012 Jul 30. Bioinformatics. 2012. PMID: 22847932 Free PMC article.
-
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254. BMC Bioinformatics. 2013. PMID: 23965047 Free PMC article.
-
Differential Expression Analysis of RNA-seq Reads: Overview, Taxonomy, and Tools.IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):566-586. doi: 10.1109/TCBB.2018.2873010. Epub 2018 Oct 1. IEEE/ACM Trans Comput Biol Bioinform. 2020. PMID: 30281477 Review.
-
Using R and Bioconductor in Clinical Genomics and Transcriptomics.J Mol Diagn. 2020 Jan;22(1):3-20. doi: 10.1016/j.jmoldx.2019.08.006. Epub 2019 Oct 9. J Mol Diagn. 2020. PMID: 31605800 Review.
Cited by
-
Dynamics of MBD2 deposition across methylated DNA regions during malignant transformation of human mammary epithelial cells.Nucleic Acids Res. 2015 Jul 13;43(12):5838-54. doi: 10.1093/nar/gkv508. Epub 2015 May 24. Nucleic Acids Res. 2015. PMID: 26007656 Free PMC article.
-
Spatio-temporal transcriptome and storage compound profiles of developing faba bean (Vicia faba) seed tissues.Front Plant Sci. 2024 Feb 6;15:1284997. doi: 10.3389/fpls.2024.1284997. eCollection 2024. Front Plant Sci. 2024. PMID: 38379954 Free PMC article.
-
Systems analysis of the prostate transcriptome in African-American men compared with European-American men.Pharmacogenomics. 2016 Jul;17(10):1129-1143. doi: 10.2217/pgs-2016-0025. Epub 2016 Jun 30. Pharmacogenomics. 2016. PMID: 27359067 Free PMC article.
-
Ocean acidification induces distinct transcriptomic responses across life history stages of the sea urchin Heliocidaris erythrogramma.Mol Ecol. 2020 Dec;29(23):4618-4636. doi: 10.1111/mec.15664. Epub 2020 Nov 16. Mol Ecol. 2020. PMID: 33002253 Free PMC article.
-
Comparative RNA-seq analysis reveals a critical role for ethylene in rose (Rosa hybrida) susceptible response to Podosphera pannosa.Front Plant Sci. 2022 Sep 27;13:1018427. doi: 10.3389/fpls.2022.1018427. eCollection 2022. Front Plant Sci. 2022. PMID: 36237514 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources