Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Jul;12(7):1068-74.
doi: 10.1101/gr.62002.

Long-range heterogeneity at the 3' ends of human mRNAs

Affiliations

Long-range heterogeneity at the 3' ends of human mRNAs

Christian Iseli et al. Genome Res. 2002 Jul.

Abstract

The publication of a draft of the human genome and of large collections of transcribed sequences has made it possible to study the complex relationship between the transcriptome and the genome. In the work presented here, we have focused on mapping mRNA 3' ends onto the genome by use of the raw data generated by the expressed sequence tag (EST) sequencing projects. We find that at least half of the human genes encode multiple transcripts whose polyadenylation is driven by multiple signals. The corresponding transcript 3' ends are spread over distances in the kilobase range. This finding has profound implications for our understanding of gene expression regulation and of the diversity of human transcripts, for the design of cDNA microarray probes, and for the interpretation of gene expression profiling experiments.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Examples of extended and overlapping 3′ untranslated region (UTR). Alignments of transcripts to the genome (yellow bar) were visualized using ACEDB. The direction of transcription is bottom to top on the left and top to bottom on the right. The direction of transcription of unspliced expressed sequence tags (ESTs) was based on their annotation, except for the unspliced ORESTES sequences, which were arbitrarily assigned to the right-hand strand. The orientation of 3′ tags was deduced from the polarity of the poly(A) tract. Light blue: RefSeq sequences; dark blue: full-length cDNA sequences; green: ORESTES sequences; and red: EST sequences. 3′ tags are represented by black boxes, with one box per cluster member. Regions covered by UniGene clusters are indicated on the right. (A) 3′ terminal exons of the NCAM2 gene. (B) Overlapping 3′ ends of the COL18A1 and SCL19A1 genes. Many ESTs derived from the COL18A1 gene were omitted for clarity.

References

    1. Aaronson JS, Eckman B, Blevins RA, Borkowski JA, Myerson J, Imran S, Elliston KO. Toward the development of a gene index to the human genome: An assessment of the nature of high-throughput EST sequence data. Genome Res. 1996;6:829–845. - PubMed
    1. Beaudoing E, Gautheret D. Identification of alternate polyadenylation sites and analysis of their tissue distribution using EST data. Genome Res. 2001;11:1520–1526. - PMC - PubMed
    1. Burge CB. Chipping away at the transcriptome. Nat Genet. 2001;27:232–234. - PubMed
    1. Camargo AA, Samaia HPB, Dias-Neto E, Simão DF, Migotto IA, Briones MR, Costa FF, Nagai MA, Verjovski-Almeida S, Zago MA, et al. The contribution of 700,000 “ORF sequence tags” to the definition of the human transcriptome. Proc Natl Acad Sci. 2001;98:12103–12108. - PMC - PubMed
    1. Caron H, van Schaik B, van der Mee M, Baas F, Riggins G, van Sluis P, Hermus MC, van Asperen R, Boon K, Voute PA, et al. The human transcriptome map: Clustering of highly expressed genes in chromosomal domains. Science. 2001;291:1289–1292. - PubMed