Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jun 1;28(11):1530-2.
doi: 10.1093/bioinformatics/bts196. Epub 2012 Apr 25.

RNA-SeQC: RNA-seq metrics for quality control and process optimization

Affiliations

RNA-SeQC: RNA-seq metrics for quality control and process optimization

David S DeLuca et al. Bioinformatics. .

Abstract

RNA-seq, the application of next-generation sequencing to RNA, provides transcriptome-wide characterization of cellular activity. Assessment of sequencing performance and library quality is critical to the interpretation of RNA-seq data, yet few tools exist to address this issue. We introduce RNA-SeQC, a program which provides key measures of data quality. These metrics include yield, alignment and duplication rates; GC bias, rRNA content, regions of alignment (exon, intron and intragenic), continuity of coverage, 3'/5' bias and count of detectable transcripts, among others. The software provides multi-sample evaluation of library construction protocols, input materials and other experimental parameters. The modularity of the software enables pipeline integration and the routine monitoring of key measures of data quality such as the number of alignable reads, duplication rates and rRNA contamination. RNA-SeQC allows investigators to make informed decisions about sample inclusion in downstream analysis. In summary, RNA-SeQC provides quality control measures critical to experiment design, process optimization and downstream computational analysis.

Availability and implementation: See www.genepattern.org to run online, or www.broadinstitute.org/rna-seqc/ for a command line tool.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Overview of the RNA-SeQC process. (a) RNA-SeQC will work with one or more input samples to produce both a comparative summary across samples as well as a more detailed report for each sample. (b) The comparative summary report includes an extensive range of metrics (in addition to those shown) as well as coverage plots. (c) For each sample, additional reports quantify the coverage profile (variation, gaps, etc.) for individual transcripts

References

    1. Garber M., et al. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat. Meth. 2011;8:469–477. - PubMed
    1. Harrow J., et al. GENCODE: producing a reference annotation for ENCODE. Genome Biol. 2006;7(Suppl. 1):S4.1–S4.9. - PMC - PubMed
    1. Levin J.Z., et al. Comprehensive comparative analysis of strand-specific RNA sequencing methods. Nat. Meth. 2010;7:709–715. - PMC - PubMed
    1. Li H., Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. - PMC - PubMed
    1. Li H., et al. The sequence alignment/map (SAM) format and SAMtools. Bioinformatics. 2009;25:2078–2079. - PMC - PubMed

Publication types