A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium
- PMID: 25150838
- PMCID: PMC4321899
- DOI: 10.1038/nbt.2957
A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium
Abstract
We present primary results from the Sequencing Quality Control (SEQC) project, coordinated by the US Food and Drug Administration. Examining Illumina HiSeq, Life Technologies SOLiD and Roche 454 platforms at multiple laboratory sites using reference RNA samples with built-in controls, we assess RNA sequencing (RNA-seq) performance for junction discovery and differential expression profiling and compare it to microarray and quantitative PCR (qPCR) data using complementary metrics. At all sequencing depths, we discover unannotated exon-exon junctions, with >80% validated by qPCR. We find that measurements of relative expression are accurate and reproducible across sites and platforms if specific filters are used. In contrast, RNA-seq and microarrays do not provide accurate absolute measurements, and gene-specific biases are observed for all examined platforms, including qPCR. Measurement performance depends on the platform and data analysis pipeline, and variation is large for transcript-level profiling. The complete SEQC data sets, comprising >100 billion reads (10Tb), provide unique resources for evaluating RNA-seq analyses for clinical and regulatory settings.
Conflict of interest statement
Some of the SEQC (MAQC-III) consortium members are employed by companies that provide services or manufacture products or equipment related to gene expression profiling, as can be seen from the provided affiliations of the manuscript authors.
Figures






References
-
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat Methods. 2008;5:621–628. - PubMed
Publication types
MeSH terms
Associated data
- Actions
Grants and funding
- R24 GM102656/GM/NIGMS NIH HHS/United States
- P01 HG000205/HG/NHGRI NIH HHS/United States
- R44HG005297/HG/NHGRI NIH HHS/United States
- Z01 ES102345-04/ES/NIEHS NIH HHS/United States
- P30 CA015083/CA/NCI NIH HHS/United States
- R01 HL109054/HL/NHLBI NIH HHS/United States
- U54 CA119338/CA/NCI NIH HHS/United States
- R01CA163256/CA/NCI NIH HHS/United States
- R01 NS076465/NS/NINDS NIH HHS/United States
- R01 CA163256/CA/NCI NIH HHS/United States
- R24GM102656/GM/NIGMS NIH HHS/United States
- P01HG00205/HG/NHGRI NIH HHS/United States
- R44 HG005297/HG/NHGRI NIH HHS/United States
- R01NS076465/NS/NINDS NIH HHS/United States
- Z01 ES102345/ImNIH/Intramural NIH HHS/United States
- U54CA119338/CA/NCI NIH HHS/United States
- BB/I000771/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- R01HG006798/HG/NHGRI NIH HHS/United States
- R01 HG006798/HG/NHGRI NIH HHS/United States
- P50 GM021700/GM/NIGMS NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases