Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Jan;16(1):45-54.
doi: 10.1101/gr.3883606. Epub 2005 Dec 12.

Abundant novel transcriptional units and unconventional gene pairs on human chromosome 22

Affiliations

Abundant novel transcriptional units and unconventional gene pairs on human chromosome 22

Leonard Lipovich et al. Genome Res. 2006 Jan.

Abstract

Novel transcriptional units (TUs) are EST-supported transcribed features not corresponding to known genes. Unconventional gene pairs (UGPs) are pairs of genes and/or TUs sharing exon-to-exon cis-antisense overlaps or putative bidirectional promoters. Computational TU and UGP discovery followed by manual curation was performed in the entire published 34.9-Mb human chromosome 22 euchromatic sequence. Novel TUs (n = 517) were as abundant as known genes (n = 492) and typically did not have nonprimate DNA and protein homologies. One hundred seventy-one (33%) of TUs, but only 13 (3%) of genes, both lacked nonprimate conservation and localized to gaps in the human-mouse BLASTZ alignment. Novel TUs were richer in exonic primate-specific interspersed repetitive elements (P = 0.001) and were more likely to rely on splice junctions provided by them, than were known genes: 19% of spliced TUs, versus 5% of spliced genes, had a splice site within a primate-specific repeat. Hence, novel TUs and known genes may represent different portions of the transcriptome. Two hundred nine (21%) of chromosome 22 transcripts participated in 77 cis-antisense and 42 promoter-sharing UGPs. Transcripts involved simultaneously in both UGP types were more common than was expected (P = 0.01). UGPs were nonrandomly distributed along the sequence: 89 (75%) clustered in distinct regions, the sum of which equaled 4.4 Mb (<13% of the chromosome). Eighty (67%) of the UGPs possessed significant locus structure differences between primates and rodents. Since some TUs may be functional noncoding transcripts and since the cis-regulatory potential of UGPs is well recognized, TUs and UGPs specific to the primate lineage may contribute to the genomic basis for primate-specific phenotypes.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic illustrations of principal UGP genomic structures. Each arrow represents one transcribed feature. Only the orientation of the transcribed features is shown. Individual exons of the features are not shown. (A) Three types of exon-to-exon cis-antisense pairs. (Left) A tail-to-tail cis-antisense pair of transcribed features (genes, TUs, or one gene and one TU) on opposite strands in the same locus, with overlap of the last exons (“tails”) of the two features. Only the last exons are involved in the overlap. (Center) A head-to-head cis-antisense pair of transcribed features on opposite strands in the same locus, with overlap of the first exons (“heads”) of the two features. Only the first exons are involved in the overlap. (Right) cis-antisense pairs that are neither tail-to-tail nor head-to-head are collectively referred to as “other” or “complex” in this article. Only one of the many possible configurations of cis-antisense pairs is shown (B) A pair of transcribed features with a putative bidirectional promoter. (C) A gene (gray) that is antisense to multiple other genes (black). (D) A gene (gray) that shares a bidirectional promoter with another gene (solid black) and harbors an antisense transcript (hatched black).
Figure 2.
Figure 2.
A contiguous chain of five genes and one TU linked by five UGPs on chr22 (BC004346-SLC7A4). Arrows indicate the direction of transcription of the labeled genes and TU. Cis-antisense overlaps are represented by hatched black vertical rectangles. Putative bidirectional promoters are represented by solid gray vertical rectangles. For clarity, individual exons are not shown. Drawing is not to scale.
Figure 3.
Figure 3.
Clustering of UGPs along 35 Mb of chr22q. UGPs cluster near one another more frequently than expected by chance on the 35 Mb of human chr22q. Cis-antisense gene pairs (A) are defined as forming an island (B) if two pairs lie within 250 kb of one another. Similarly, pairs of genes that may share a putative bidirectional promoter (C) are defined as forming an island (D) if they lie within 250 kb of one another. Sixty-three of 77 (82%) cis-antisense gene pairs and 26 of 42 (62%) of gene pairs sharing a putative bidirectional promoter lie in islands. Ten thousand simulations of distributions of these features into 35 Mb of genomic space did not yield any distribution with clustering as great as that observed (see text for details). Five regions of 22q harbor both cis-antisense islands and putative-bidirectional promoter islands (E).

Similar articles

Cited by

References

    1. Adachi, N. and Lieber, M.R. 2002. Bidirectional gene organization: A common architectural feature of the human genome. Cell 109 807-809. - PubMed
    1. Andres, A.M., Soldevila, M., Saitou, N., Volpini, V., Calafell, F., and Bertranpetit, J. 2003. Understanding the dynamics of Spinocerebellar ataxia 8 (SCA8) locus through a comparative genetic approach in humans and apes. Neurosci. Lett. 336 143-146. - PubMed
    1. Angiolillo, A., Russo, G., Porcellini, A., Smaldone, S., D'Alessandro, F., and Pietropaolo, C. 2002. The human homologue of the mouse Surf5 gene encodes multiple alternatively spliced transcripts. Gene 284 169-178. - PubMed
    1. Burset, M., Seledtsov, I.A., and Solovyev, V.V. 2000. Analysis of canonical and non-canonical splice sites in mammalian genomes. Nucleic Acids Res. 28 4364-4375. - PMC - PubMed
    1. Caffrey, J.J., Safrany, S.T., Yang, X., and Shears, S.B. 2000. Discovery of molecular and catalytic diversity among human diphosphoinositol-polyphosphate phosphohydrolases: An expanding Nudt family. J. Biol. Chem. 275 12730-12736. - PubMed

Publication types