Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jan 24:10:47.
doi: 10.1186/1471-2164-10-47.

Patterns of exon-intron architecture variation of genes in eukaryotic genomes

Affiliations

Patterns of exon-intron architecture variation of genes in eukaryotic genomes

Liucun Zhu et al. BMC Genomics. .

Abstract

Background: The origin and importance of exon-intron architecture comprises one of the remaining mysteries of gene evolution. Several studies have investigated the variations of intron length, GC content, ordinal position in a gene and divergence. However, there is little study about the structural variation of exons and introns.

Results: We investigated the length, GC content, ordinal position and divergence in both exons and introns of 13 eukaryotic genomes, representing plant and animal. Our analyses revealed that three basic patterns of exon-intron variation were present in nearly all analyzed genomes (P < 0.001 in most cases): an ordinal reduction of length and divergence in both exon and intron, a co-variation between exon and its flanking introns in their length, GC content and divergence, and a decrease of average exon (or intron) length, GC content and divergence as the total exon numbers of a gene increased. In addition, we observed that the shorter introns had either low or high GC content, and the GC content of long introns was intermediate.

Conclusion: Although the factors contributing to these patterns have not been identified, our results provide three important clues: common factor(s) exist and may shape both exons and introns; the ordinal reduction patterns may reflect a time-orderly evolution; and the larger first and last exons may be splicing-required. These clues provide a framework for elucidating mechanisms involved in the organization of eukaryotic genomes and particularly in building exon-intron structures.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Variation patterns of introns' length, as a function of GC content. Each dot contains 1000 introns (A-D). The species in A-C were grouped based on ranges of intron lengths, e.g., up to 16000 bp for mammals. L1, L2, H1, H2 and H3 in D represent five GC-isochore families in human genome [22] which are different in GC content (0~0.37, 0.37~0.41, 0.41~0.46, 0.46~0.53, 0.53~1, respectively). Genes that cover two or more family regions were excluded in D. The x-axis scale in A – D is based on the actual GC content in different genomes.
Figure 2
Figure 2
Variation patterns of intron (A) or exon length (B), as a function of its ordinal number. Only introns or exons with ordinal numbers ≤ 40 were shown in the figures; each dot contains > 1000 samples.
Figure 3
Figure 3
Variation patterns of exon GC content, as a function of flanking intron GC content (A~D), and exon length, as a function of flanking logarithmical intron length (E~H). Each dot contains 1000 introns and 1000 exons (1000 left exons flanking the corresponding introns). Introns used in E~H are > 100 bp.
Figure 4
Figure 4
Correlation between nucleotide divergence (or exon's Ka) and ordinal position of introns (A & B) or of exons (C & D) and between intron and flanking exon divergence (or Ka) (E & F). The left panel shows the results from the comparison between human and chimpanzee, and the right panel shows the comparison between two rice lines.
Figure 5
Figure 5
Correlations between the length and intron (A) or exon (B) count, between GC content and intron (C) or exon (D) count and between the nucleotide divergence and intron (E) or exon (F) count. The number (count) in the horizontal axis stands for the 1, 2-,..., 9 and 10-intron (or -exon) genes.

References

    1. Gilbert W. Why genes in pieces? Nature. 1978;271:501. doi: 10.1038/271501a0. - DOI - PubMed
    1. Costas J, Pereira PS, Vieira CP, Pinho S, Vieira J, Casares F. Dynamics and function of intron sequences of the wingless gene during the evolution of the Drosophila genus. Evol Dev. 2004;6:325–335. doi: 10.1111/j.1525-142X.2004.04040.x. - DOI - PubMed
    1. Robart AR, Zimmerly S. Group II intron retroelements: function and diversity. Cytogenet Genome Res. 2005;110:589–597. doi: 10.1159/000084992. - DOI - PubMed
    1. Rogozin IB, Sverdlov AV, Babenko VN, Koonin EV. Analysis of evolution of exon-intron structure of eukaryotic genes. Brief Bioinform. 2005;6:118–134. doi: 10.1093/bib/6.2.118. - DOI - PubMed
    1. Ohno S. So much "junk" DNA in our genome. Brookhaven Symp Biol. 1972;23:366–370. - PubMed

Publication types

LinkOut - more resources