Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep;38(16):e165.
doi: 10.1093/nar/gkq602. Epub 2010 Jul 7.

Tissue-specific transcript annotation and expression profiling with complementary next-generation sequencing technologies

Affiliations

Tissue-specific transcript annotation and expression profiling with complementary next-generation sequencing technologies

Matthew S Hestand et al. Nucleic Acids Res. 2010 Sep.

Abstract

Next-generation sequencing is excellently suited to evaluate the abundance of mRNAs to study gene expression. Here we compare two alternative technologies, cap analysis of gene expression (CAGE) and serial analysis of gene expression (SAGE), for the same RNA samples. Along with quantifying gene expression levels, CAGE can be used to identify tissue-specific transcription start sites, while SAGE monitors 3'-end usage. We used both methods to get more insight into the transcriptional control of myogenesis, studying differential gene expression in differentiated and proliferating C2C12 myoblast cells with statistical evaluation of reproducibility and differential gene expression. Both CAGE and SAGE provided highly reproducible data (Pearson's correlations >0.92 among biological triplicates). With both methods we found around 10,000 genes expressed at levels >2 transcripts per million (approximately 0.3 copies per cell), with an overlap of 86%. We identified 4304 and 3846 genes differentially expressed between proliferating and differentiated C2C12 cells by CAGE and SAGE, respectively, with an overlap of 2144. We identified 196 novel regulatory regions with preferential use in proliferating or differentiated cells. Next-generation sequencing of CAGE and SAGE libraries provides consistent expression levels and can enrich current genome annotations with tissue-specific promoters and alternative 3'-UTR usage.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
CAGE and SAGE wiggle tracks for proliferating (Prolif) and differentiated (Diff) cells in the UCSC Genome Browser for the myogenic marker MyoD. We only display reads aligning to the forward strand, the coding direction for MyoD. Chromosomal positions are indicated at the top. For each track the Y-axis scale corresponds to the number of tags aligned at that genomic position. Scales use a maximum from each relevant technique in this viewing window (129 for CAGE and 3912 for SAGE). There is 5formula image and 3formula image concordance for CAGE and SAGE samples, respectively. CAGE provides broader peaks, reflecting TSSs plus formula image26 nt of downstream sequence, wheres SAGE provides discrete peaks. A higher number of tags are in differentiated compared to proliferating samples.
Figure 2.
Figure 2.
High reproducibility was found in CAGE regions between sequencing duplicates (A) and biological replicates (B). (C) Shows correlation between CAGE samples from proliferating and differentiated cells. High reproducibility can also be found between SAGE biological replicates (D). (E) Shows the correlation between SAGE samples from proliferating and differentiated cells. The plotted values represent the square root of the number of tags per region.
Figure 3.
Figure 3.
Correlation of CAGE versus SAGE for proliferating samples (A), differentiated samples (B), and the ratio of proliferating and differentiated cells (C). Values are the square root of the number of tags per gene for A and B. For C, the values are the log ratio of the normalized number of tags per gene in differentiated over proliferating cells. The overlap of detectable genes (D) and differentially expressed genes (E) between CAGE and SAGE is indicated.
Figure 4.
Figure 4.
The UCSC display of (A) UCSC/Ensembl-defined first exon and an upstream Myl1 CAGE region (reverse strand reads only, on which the gene lies) for samples Prolif-1 and Diff-1. The Y-axis indicates the number of tags aligned at each position in the genome. We also display additional track information (UCSC genes, Ensembl genes, Vega genes, Other RefSeq, AceView Genes, N-SCAN and Transcriptome), several of which confirm the presence of the upstream CAGE region. (B) qPCR with primers within the CAGE region for Prolif, Prolif-C (reverse transcriptase control), Diff and Diff-C (reverse transcriptase control). The qPCR results are plotted as threshold cycle (Cp) values (lower = higher expression), with bars indicating a range of one SD between technical duplicates. (C) Standard PCR on agarose gel with forward primer in the novel CAGE region and reverse primer in the conventional exon 1. Comparison with the genomic control verifies the presence of an intron of 200 bases. A 100-bp ladder is included. (A–C) are consistent with higher expression in differentiated than proliferating cells. (D) Cross-species conserved muscle-specific transcription factor binding sites around and upstream of the Myl1 CAGE region support its role as a promoter for this region.

References

    1. Velculescu VE, Zhang L, Vogelstein B, Kinzler KW. Serial analysis of gene expression. Science. 1995;270:484–487. - PubMed
    1. Harbers M, Carninci P. Tag-based approaches for transcriptome research and genome annotation. Nat. Methods. 2005;2:495–502. - PubMed
    1. Nielsen KL, Hogh AL, Emmersen J. DeepSAGE–digital transcriptomics with high sensitivity, simple experimental protocol and multiplexing of samples. Nucleic Acids Res. 2006;34:e133. - PMC - PubMed
    1. Shiraki T, Kondo S, Katayama S, Waki K, Kasukawa T, Kawaji H, Kodzius R, Watahiki A, Nakamura M, Arakawa T. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl Acad. Sci. USA. 2003;100:15776–15781. - PMC - PubMed
    1. Valen E, Pascarella G, Chalk A, Maeda N, Kojima M, Kawazu C, Murata M, Nishiyori H, Lazarevic D. Genome-wide detection and analysis of hippocampus core promoters using DeepCAGE. Genome Res. 2009;19:255–265. - PMC - PubMed

Publication types

Substances