Comparative Study

BMC Bioinformatics. 2013 Mar 9;14:91.

doi: 10.1186/1471-2105-14-91.

A comparison of methods for differential expression analysis of RNA-seq data

Charlotte Soneson¹, Mauro Delorenzi

PMID: 23497356
PMCID: PMC3608160
DOI: 10.1186/1471-2105-14-91

Charlotte Soneson et al. BMC Bioinformatics. 2013.

Affiliation

¹ Bioinformatics Core Facility, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland. Charlotte.Soneson@isb-sib.ch

Abstract

Background: Finding genes that are differentially expressed between conditions is an integral part of understanding the molecular basis of phenotypic variation. In the past decades, DNA microarrays have been used extensively to quantify the abundance of mRNA corresponding to different genes, and more recently high-throughput sequencing of cDNA (RNA-seq) has emerged as a powerful competitor. As the cost of sequencing decreases, it is conceivable that the use of RNA-seq for differential expression analysis will increase rapidly. To exploit the possibilities and address the challenges posed by this relatively new type of data, a number of software packages have been developed especially for differential expression analysis of RNA-seq data.

Results: We conducted an extensive comparison of eleven methods for differential expression analysis of RNA-seq data. All methods are freely available within the R framework and take as input a matrix of counts, i.e. the number of reads mapping to each genomic feature of interest in each of a number of samples. We evaluated the methods on both simulated data and real RNA-seq data.

Conclusions: Very small sample sizes, which are still common in RNA-seq experiments, impose problems for all evaluated methods and any results obtained under such conditions should be interpreted with caution. For larger sample sizes, the methods combining a variance-stabilizing transformation with the 'limma' method for differential expression analysis perform well under many different conditions, as does the nonparametric SAMseq method.

Figures

**Figure 1**
**Area under the ROC curve (AUC).** Area under the ROC curve (AUC) for the eleven evaluated methods, in simulation studies $B_{0}^{1250}$ (panel A), $B_{625}^{625}$ (panel B), $B_{0}^{4000}$ (panel C), $B_{2000}^{2000}$ (panel D), $S_{625}^{625}$ (panel E) and $R_{625}^{625}$ (panel F). The boxplots summarize the AUCs obtained across 10 independently simulated instances of each simulation study. Each panel shows the AUCs across three sample sizes (|S₁| = |S₂| = 2, 5 and 10, respectively, signified by the last number in the tick labels). The methods are ordered according to their median AUC for the largest sample size. When all DE genes were regulated in the same direction, increasing the number of DE genes from 1,250 (panel A) to 4,000 (panel C) impaired the performance of all methods. In contrast, when the DE genes were regulated in different directions (panels B and D), the number of DE genes had much less impact. The variability of the performance of baySeq was much higher when all genes were regulated in the same direction (panels A and C) compared to when the DE genes were regulated in different directions (panels B and D). Including outliers (panels E and F) decreased the AUC for most methods (compare to panel B), but less so for the transformation-based methods (voom+limma and vst+limma) and SAMseq.
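
The AUC values summarized in Figure 1 can be illustrated with a minimal Python sketch. It assumes that each method yields one ranking score per gene (higher meaning stronger evidence for DE) and that the simulated truth is known. The function name `auc_from_scores` and the toy data are hypothetical, and the pairwise (Mann-Whitney) formulation is one standard way to obtain the area, not necessarily the implementation used in the paper.

```python
def auc_from_scores(scores, is_de):
    """Area under the ROC curve from per-gene scores and true DE labels.

    Computed as the probability that a randomly chosen DE gene receives a
    higher score than a randomly chosen non-DE gene (ties count as 1/2).
    """
    de_scores = [s for s, d in zip(scores, is_de) if d]
    null_scores = [s for s, d in zip(scores, is_de) if not d]
    if not de_scores or not null_scores:
        raise ValueError("need at least one DE and one non-DE gene")
    wins = 0.0
    for p in de_scores:
        for n in null_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(de_scores) * len(null_scores))


# Hypothetical toy data: five genes, two of them truly DE.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.2]
toy_truth = [True, False, True, False, False]
# 5 of the 6 (DE, non-DE) pairs are ordered correctly, so the AUC is 5/6.
toy_auc = auc_from_scores(toy_scores, toy_truth)
```

The quadratic pairwise loop is kept for readability; for tens of thousands of genes a rank-based computation would be preferable.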

**Figure 2**
**False discovery curves.** Representative false discovery curves, depicting the number of false positives encountered among the T top-ranked genes by the eleven evaluated methods, for T between 0 and 1,500. In all cases, there were 5 samples per condition. A: Simulation study $B_{0}^{1250}$. B: Simulation study $B_{625}^{625}$. C: Simulation study $B_{0}^{4000}$. D: Simulation study $B_{2000}^{2000}$. E: Simulation study $S_{625}^{625}$. F: Simulation study $R_{625}^{625}$. Some of the curves do not pass through the origin, since many genes obtained the same ranking score and had to be called simultaneously.
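
A false discovery curve of the kind shown in Figure 2 can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: genes are ranked by a per-gene score (higher meaning more significant), the true DE status is known from the simulation, and genes with identical scores are called together, which is why a curve may start at T > 0. The function name `false_discovery_curve` and the toy data are hypothetical.

```python
def false_discovery_curve(scores, is_de, max_t):
    """Return (T, number of false positives among the top T genes) points.

    Genes sharing the same score are called simultaneously, so T advances
    by the size of each tie block rather than one gene at a time.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    points = []
    false_pos = 0
    i = 0
    while i < len(order):
        j = i
        # Extend j over the whole block of genes tied with gene order[i].
        while j < len(order) and scores[order[j]] == scores[order[i]]:
            if not is_de[order[j]]:
                false_pos += 1
            j += 1
        if j > max_t:
            break  # calling this block would exceed the requested range
        points.append((j, false_pos))
        i = j
    return points


# Hypothetical toy data: the two top genes are tied, so the curve starts at T = 2.
toy_curve = false_discovery_curve(
    scores=[5, 5, 4, 3, 3, 1],
    is_de=[True, False, True, False, True, False],
    max_t=5,
)
```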

**Figure 3**
**Type I error rates.** Type I error rates, for the six methods providing nominal p-values, in simulation studies $B_{0}^{0}$ (panel A), $P_{0}^{0}$ (panel B), $S_{0}^{0}$ (panel C) and $R_{0}^{0}$ (panel D). Letting some counts follow a Poisson distribution (panel B) reduced the type I error rates for TSPM slightly but had overall a small effect. Including outliers with abnormally high counts (panels C and D) had a detrimental effect on the ability to control the type I error for edgeR and NBPSeq, while DESeq became slightly more conservative.
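
The type I error rate in a null simulation (no truly DE genes) is the fraction of genes whose nominal p-value falls below the chosen significance level. The sketch below assumes a level of 0.05, which the caption does not state, and skips genes for which a method returned no p-value. The function name `type_i_error_rate` and the toy p-values are hypothetical.

```python
def type_i_error_rate(p_values, alpha=0.05):
    """Fraction of null genes with nominal p-value below alpha.

    Entries equal to None (no p-value returned for that gene) are ignored.
    alpha=0.05 is an assumed nominal level, not taken from the caption.
    """
    valid = [p for p in p_values if p is not None]
    if not valid:
        raise ValueError("no valid p-values")
    return sum(1 for p in valid if p < alpha) / len(valid)


# Hypothetical p-values from a data set without any truly DE genes.
toy_null_p = [0.01, 0.20, 0.04, 0.50, 0.90, 0.03, 0.70, 0.60, 0.10, 0.049]
toy_rate = type_i_error_rate(toy_null_p)
```

A well-calibrated method should give a rate close to `alpha`; values clearly above it indicate a failure to control the type I error, and values clearly below it indicate a conservative test.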

**Figure 4**
**True false discovery rates.** True false discovery rates (FDR) observed for an imposed FDR threshold of 0.05, for the nine methods returning adjusted p-values or FDR estimates, in simulation studies $B_{0}^{1250}$ (panel A), $B_{625}^{625}$ (panel B), $B_{0}^{4000}$ (panel C), $B_{2000}^{2000}$ (panel D), $S_{625}^{625}$ (panel E) and $R_{625}^{625}$ (panel F). With only two samples per condition, three of the methods (vst+limma, voom+limma and SAMseq) did not call any DE genes, and the FDR was considered to be undefined.
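
The true FDR at an imposed threshold can be computed as the share of false positives among the genes called DE. The sketch below assumes Benjamini-Hochberg adjustment of nominal p-values, which is only one of the ways the evaluated methods arrive at adjusted p-values or FDR estimates, and it returns `None` when no gene is called, mirroring the "undefined" case in the caption. The function names and toy data are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def true_fdr(adjusted_p, is_de, threshold=0.05):
    """False positives divided by all calls; None if nothing is called."""
    called = [d for q, d in zip(adjusted_p, is_de) if q < threshold]
    if not called:
        return None  # FDR is undefined when no gene is called DE
    return sum(1 for d in called if not d) / len(called)


# Hypothetical toy data: five genes, three truly DE.
toy_p = [0.001, 0.008, 0.012, 0.5, 0.9]
toy_is_de = [True, False, True, True, False]
toy_adjusted = benjamini_hochberg(toy_p)
toy_fdr = true_fdr(toy_adjusted, toy_is_de)
```

In this toy example three genes pass the 0.05 threshold and one of them is not truly DE, so the observed FDR exceeds the imposed one.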

**Figure 5**
**Analysis of the Bottomly data set.** **A**: The number of genes found to be significantly DE between the two mouse strains in the Bottomly data set. **B-C**: Overlap among the sets of DE genes found by different methods. **D**: The average number of genes found to be significantly DE when contrasting two subsets of mice from the same strain, in which case we expect no truly DE genes.
