Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Nov 23;32(20):e163.
doi: 10.1093/nar/gnh161.

Mining SAGE data allows large-scale, sensitive screening of antisense transcript expression

Affiliations

Mining SAGE data allows large-scale, sensitive screening of antisense transcript expression

Ronan Quéré et al. Nucleic Acids Res. .

Abstract

As a growing number of complementary transcripts, susceptible to exert various regulatory functions, are being found in eukaryotes, high throughput analytical methods are needed to investigate their expression in multiple biological samples. Serial Analysis of Gene Expression (SAGE), based on the enumeration of directionally reliable short cDNA sequences (tags), is capable of revealing antisense transcripts. We initially detected them by observing tags that mapped on to the reverse complement of known mRNAs. The presence of such tags in individual SAGE libraries suggested that SAGE datasets contain latent information on antisense transcripts. We raised a collection of virtual tags for mining these data. Tag pairs were assembled by searching for complementarities between 24-nt long sequences centered on the potential SAGE-anchoring sites of well-annotated human expressed sequences. An analysis of their presence in a large collection of published SAGE libraries revealed transcripts expressed at high levels from both strands of two adjacent, oppositely oriented, transcription units. In other cases, the respective transcripts of such cis-oriented genes displayed a mutually exclusive expression pattern or were co-expressed in a small number of libraries. Other tag pairs revealed overlapping transcripts of trans-encoded unique genes. Finally, we isolated a group of tags shared by multiple transcripts. Most of them mapped on to retroelements, essentially represented in humans by Alu sequences inserted in opposite orientations in the 3'UTR of otherwise different mRNAs. Registering these tags in separate files makes possible computational searches focused on unique sense-antisense pairs. The method developed in the present work shows that SAGE datasets constitute a major resource of rapidly investigating with high sensitivity the expression of antisense transcripts, so that a single tag may be detected in one library when screening a large number of biological samples.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Analysis of SAGE tags mapping to the reverse complement of known transcripts. (A) Conceptual scheme and tag distribution in a U937-cell library (15). (B) Scatter plot showing sense versus antisense frequencies; the regression coefficient was calculated for 146 duplexes (gray dots). (C) Strand-specific RT–PCR of selected transcripts using RNAs from untreated (NT) and differentiated (Diff) U937 cells prepared as for SAGE libraries (15). Tag frequencies are indicated below. First-strand cDNA synthesis initiated with antisense (lanes 1 and 5) and sense-specific primers (lanes 2 and 6). Controls without reverse transcriptase for DNA contamination (lanes 3, 4, 7 and 8). (D) Strand-specific RT–PCR of beta-globin (HBB) mRNA antisense (lane a), sense (lane b) and controls without reverse transcriptase (lanes c and d) using reticulocytes RNA. Scatter plot of globin tag levels in the reticulocyte library, compared to levels in 260 SAGE libraries.
Figure 2
Figure 2
Search for complementary mRNAs pairs in the UniGene collection of cluster-representative sequences. (A) Illustration of the ‘Janus’ tags concept. (B) Flow chart and results of the selection. (C) Numbers of well-oriented transcripts (from UniGene) sharing an Alu-matching J-tag in sense (x-axis) and antisense (y-axis) orientation.
Figure 3
Figure 3
Comparative expression levels of SAGE tags mapping on to transcripts sharing complementary J-tags. Each dot corresponds to one SAGE library. (A) Expression levels of selected pairs in 260 SAGE libraries for cis-encoded, widely expressed genes. (B) Tissue-specific cis-encoded genes. (C) Tissue-specific trans-encoded genes including pseudogenes expressed in inverted orientation. (D) Co-expression of sense and antisense TNNI3 transcripts. (E) Alu elements inserted in the 3′UTR of multiple mRNAs.
Figure 4
Figure 4
Comparative expression levels of SAGE tags mapping on to syncoilin and RBBP4 cis-encoded, oppositely oriented transcripts. White dots, normal tissue samples; black dots, cancer tissue samples. Comparative expression levels in 260 SAGE libraries (A) as in Figure 3, and (B) in libraries prepared from breast normal and cancer samples.

References

    1. Vanhee-Brossollet C. and Vaquero,C. (1998) Do natural antisense transcripts make sense in eukaryotes? Gene, 211, 1–9. - PubMed
    1. Shendure J. and Church,G.M. (2002) Computational discovery of sense-antisense transcription in the human and mouse genomes. Genome Biol., 3, RESEARCH0044. - PMC - PubMed
    1. Lehner B., Williams,G., Campbell,R.D. and Sanderson,C.M. (2002) Antisense transcripts in the human genome. Trends Genet., 18, 63–65. - PubMed
    1. Yelin R., Dahary,D., Sorek,R., Levanon,E.Y., Goldstein,O., Shoshan,A., Diber,A., Biton,S., Tamir,Y., Khosravi,R. et al. (2003) Widespread occurrence of antisense transcription in the human genome. Nat. Biotechnol., 21, 379–386. - PubMed
    1. Kampa D., Cheng,J., Kapranov,P., Yamanaka,M., Brubaker,S., Cawley,S., Drenkow,J., Piccolboni,A., Bekiranov,S., Helt,G. et al. (2004) Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22. Genome Res., 14, 331–342. - PMC - PubMed

Publication types