Comparative Study
Plant Physiol. 2004 Oct;136(2):3223-33.
doi: 10.1104/pp.104.043406.

Maximizing the efficacy of SAGE analysis identifies novel transcripts in Arabidopsis

Stephen J Robinson et al. Plant Physiol. 2004 Oct.

Abstract

The efficacy of using Serial Analysis of Gene Expression (SAGE) to analyze the transcriptome of the model dicotyledonous plant Arabidopsis was assessed. We describe an iterative tag-to-gene matching process that exploits the availability of the whole genome sequence of Arabidopsis. The expression patterns of 98% of the annotated Arabidopsis genes could theoretically be evaluated through SAGE, and, using the iterative matching process, 79% could be identified by a tag found at a unique site in the genome. A total of 145,170 reliable experimental tags from two Arabidopsis leaf tissue SAGE libraries were analyzed, of which 29,632 were distinct. The majority (93%) of the 12,988 experimental tags observed more than once could be matched within the Arabidopsis genome. However, only 78% were matched to a single locus within the genome, reflecting the complexities associated with working in a highly duplicated genome. In addition to a comprehensive assessment of gene expression in Arabidopsis leaf tissue, we describe evidence of transcription from pseudo-genes as well as evidence of alternative mRNA processing and anti-sense transcription. This collection of experimental SAGE tags could be exploited to assist in the on-going annotation of the Arabidopsis genome.
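
The abstract does not spell out the iterative tag-to-gene matching process, so the following is only a minimal sketch of one plausible reading of it, not the authors' implementation. It assumes the conventions of classic SAGE (an NlaIII anchor site `CATG` followed by a 10-base tag), works on transcript sequences rather than the whole genome, and uses hypothetical names (`virtual_tags`, `iterative_match`, `max_rounds`) and toy sequences. Each gene is first tried with the tag at its 3'-most anchor site; genes whose tag is shared with another gene move on to the next site upstream.

```python
from collections import defaultdict

ANCHOR = "CATG"  # NlaIII site; assumed here, not stated in the abstract
TAG_LEN = 10     # tag length of classic SAGE; also an assumption


def virtual_tags(transcript):
    """Return the tag at every anchor site, ordered from the 3'-most site inward."""
    tags = []
    pos = transcript.rfind(ANCHOR)
    while pos != -1:
        start = pos + len(ANCHOR)
        tag = transcript[start:start + TAG_LEN]
        if len(tag) == TAG_LEN:  # skip sites too close to the 3' end
            tags.append(tag)
        # look for the next anchor site strictly upstream of this one
        pos = transcript.rfind(ANCHOR, 0, pos + len(ANCHOR) - 1)
    return tags


def iterative_match(transcripts, max_rounds=3):
    """Give each gene the first tag (3' to 5') that no other gene carries."""
    tag_lists = {gene: virtual_tags(seq) for gene, seq in transcripts.items()}

    # Record which genes carry each tag at any anchor site.
    owners = defaultdict(set)
    for gene, tags in tag_lists.items():
        for tag in tags:
            owners[tag].add(gene)

    assigned = {}
    for round_idx in range(max_rounds):
        for gene, tags in tag_lists.items():
            if gene in assigned or round_idx >= len(tags):
                continue
            tag = tags[round_idx]
            if len(owners[tag]) == 1:  # tag is unique to this gene
                assigned[gene] = tag
    return assigned


# Toy transcripts (hypothetical): geneA and geneB share their 3'-most tag.
toy = {
    "geneA": "TTCATGACGTACGTACTTCATGGGGGGGGGGGAA",
    "geneB": "TTCATGTTTTTTTTTTCCCATGGGGGGGGGGGAA",
    "geneC": "AACATGCCCCCCCCCCAA",
}
matches = iterative_match(toy)
```

In this toy set, `geneC` is resolved in the first round, while `geneA` and `geneB` are resolved only in the second round, by the tags at their upstream anchor sites. A genome-wide version would also need to count tag occurrences on both strands and in intergenic sequence, which this sketch omits.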

Figures

Figure 1. Distributions for the length of TIGR defined Arabidopsis UTR sequences. A, 5′ UTRs; B, 3′ UTRs.
Figure 2. Comparison of the number of genes with anti-sense transcripts detected by three expression profiling technologies. Total number of annotated Arabidopsis genes with anti-sense transcripts expressed in leaf tissue detected using SAGE (247), MPSS (5,200), and microarray (7,544) technologies.
Figure 3. Comparison of the number of genes detected by three expression profiling technologies. Total number of annotated Arabidopsis genes with sense transcripts expressed in leaf tissue detected using SAGE (12,934), MPSS (15,759), and microarray (13,317) technologies.
