Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;4(10):1033-43.
doi: 10.1093/gbe/evs070.

The evolution of intron size in amniotes: a role for powered flight?

Affiliations

The evolution of intron size in amniotes: a role for powered flight?

Qu Zhang et al. Genome Biol Evol. 2012.

Abstract

Intronic DNA is a major component of eukaryotic genes and genomes and can be subject to selective constraint and have functions in gene regulation. Intron size is of particular interest given that it is thought to be the target of a variety of evolutionary forces and has been suggested to be linked ultimately to various phenotypic traits, such as powered flight. Using whole-genome analyses and comparative approaches that account for phylogenetic nonindependence, we examined interspecific variation in intron size variation in three data sets encompassing from 12 to 30 amniotes genomes and allowing for different levels of genome coverage. In addition to confirming that intron size is negatively associated with intron position and correlates with genome size, we found that on average mammals have longer introns than birds and nonavian reptiles, a trend that is correlated with the proliferation of repetitive elements in mammals. Two independent comparisons between flying and nonflying sister groups both showed a reduction of intron size in volant species, supporting an association between powered flight, or possibly the high metabolic rates associated with flight, and reduced intron/genome size. Small intron size in volant lineages is less easily explained as a neutral consequence of large effective population size. In conclusion, we found that the evolution of intron size in amniotes appears to be non-neutral, is correlated with genome size, and is likely influenced by powered flight and associated high metabolic rates.

PubMed Disclaimer

Figures

F<sc>ig</sc>.
1.—
Fig. 1.—
Distribution of intron median size in 11 species used in data set A. “Other introns” include all other introns after the fourth intron. (A) Introns identified in data set A. (B) Introns from genes with at least five introns in each species.
F<sc>ig</sc>.
2.—
Fig. 2.—
Intron size distributions in different data sets. Boxplot is used to display the logarithmized size distribution of introns in each data set. Species names in black represent mammals, names in red represent reptiles/birds, and names in dark green represent amphibians. (A) Data set A; (B) data set B; and (C) data set C.
F<sc>ig</sc>. 3.—
Fig. 3.—
The influence of greater taxon sampling on the significance of PGLS-based t-tests. We generated four larger phylogenetic trees with more bird species (A03 and A12 derived from data set A and B12 and B23 derived from data set B). Then we used the median size of a specific intron class in each species as node values in a phylogenetic tree and performed PGLS analysis. For newly added bird species, node values were generated by normal distribution (see text for details). To get a hypothetical distribution, this procedure was repeated 5,000 times. In each diagram, the red line denotes the P value from PGLS analysis in the original data set, and the blue and green bars denote the 5,000-time simulation of such P value in two simulated data sets derived from a same original data set. (A) Simulation based on the median size of first introns in data set A. (B) Simulation based on the median size of first introns in data set B.
F<sc>ig</sc>.
4.—
Fig. 4.—
Correlation between genome size and intron size. Light-blue lines indicate regression lines derived from normal linear regression model; and brown lines indicate regression lines derived from PGLS model, which accounts for nonindependence among data points. (A) Median size of first introns in data set A; (B) median size of other introns (introns except first introns) in data set A; (C) median size of first introns in data set B; and (D) median size of other introns (introns except first introns) in data set B.
F<sc>ig</sc>.
5.—
Fig. 5.—
Correlations between the proportion of repetitive elements in introns and genomes. Brown lines indicate regression lines from normal linear regression model. (A) Data from data set A and (B) data from data set B.

Comment in

References

    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. - PubMed
    1. Barker D, Pagel M. Predicting functional gene links from phylogenetic-statistical analyses of whole genomes. PLoS Comput Biol. 2005;1:e3. - PMC - PubMed
    1. Bennett MD. Ordered disposition of parental genomes and individual chromosomes in reconstructed plant nuclei, and their implications. Somat Cell Mol Genet. 1987;13:463–466. - PubMed
    1. Benton M, Donoghue P, Asher R. Calibrating and constraining molecular clocks. In: Hedges S, Kumar S, editors. The timetree of life. New York: Oxford University Press; 2009. pp. 35–86.
    1. Berget SM, Moore C, Sharp PA. Spliced segments at the 5' terminus of adenovirus 2 late mRNA. Proc Natl Acad Sci U S A. 1977;74:3171–3175. - PMC - PubMed

Publication types