Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jan;31(1):59-69.
doi: 10.1093/molbev/mst166. Epub 2013 Sep 25.

Presence-absence variation in A. thaliana is primarily associated with genomic signatures consistent with relaxed selective constraints

Affiliations

Presence-absence variation in A. thaliana is primarily associated with genomic signatures consistent with relaxed selective constraints

Stephen J Bush et al. Mol Biol Evol. 2014 Jan.

Abstract

The sequencing of multiple genomes of the same plant species has revealed polymorphic gene and exon loss. Genes associated with disease resistance are overrepresented among those showing structural variations, suggesting an adaptive role for gene and exon presence-absence variation (PAV). To shed light on the possible functional relevance of polymorphic coding region loss and the mechanisms driving this process, we characterized genes that have lost entire exons or their whole coding regions in 17 fully sequenced Arabidopsis thaliana accessions. We found that although a significant enrichment in genes associated with certain functional categories is observed, PAV events are largely restricted to genes with signatures of reduced essentiality: PAV genes tend to be newer additions to the genome, tissue specific, and lowly expressed. In addition, PAV genes are located in regions of lower gene density and higher transposable element density. Partial coding region PAV events were associated with only a marginal reduction in gene expression level in the affected accession and occurred in genes with higher levels of alternative splicing in the Col-0 accession. Together, these results suggest that although adaptive scenarios cannot be ruled out, PAV events can be explained without invoking them.

Keywords: Arabidopsis; adaptive evolution; exon deletion; presence–absence variation; transposable elements; whole genome evolution.

PubMed Disclaimer

Figures

F<sc>ig</sc><sc>.</sc> 1.
Fig. 1.
Distribution of E-PAV genes (n = 330)—those with at least one, but not all, exons missing in at least one accession—by GOslim categories for molecular function (a), biological process (b) and cellular component (c), and by family (d). Both expected and observed number of E-PAV genes per category represented on each bar. Where there is a significant enrichment (P ≤ 0.05) between the amount of observed and expected E-PAV genes for a particular category, an asterisk is shown over the bars. Only categories with at least one E-PAV gene are shown.
F<sc>ig</sc><sc>.</sc> 2.
Fig. 2.
Genetic features associated with intact (having no exons under P/A variation), E-PAV (having at least one, but not all, exons missing in at least one accession), and CDS-PAV (having the entire CDS missing in at least one accession) genes. From left to right, top to bottom: dN/dS, age, expression level, Tajima’s D, number of paralogs, and tau. See supplementary table S3, Supplementary Material online, for values of means and statistical analysis.
F<sc>ig</sc><sc>.</sc> 3.
Fig. 3.
Genomic context for intact (having no exons under P/A variation), E-PAV (having at least one, but not all, exons missing in at least one accession), and CDS-PAV (having the entire CDS missing in at least one accession) genes. Averaged values for the genes in each set are given for, from top to bottom, the intergenic distance, the percentage of TE bases in the nongenic sequence of a 10-kb window centered on that gene’s midpoint, and the percentage of recombinogenic motifs in the genic sequence of a 1-kb window centered on that gene’s midpoint. See also supplementary tables 3 and 4 (Supplementary Material online) for the values of specific TE families and other window sizes.
F<sc>ig</sc><sc>.</sc> 4.
Fig. 4.
Distribution of Z scores for standardized transcript abundance data in the affected accession. Data show that 210 genes that have one or more missing exons in only one of 17 A. thaliana accessions (relative to Col-0).

References

    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. - PubMed
    1. Bakker EG, Toomajian C, Kreitman M, Bergelson J. A genome-wide survey of R gene polymorphisms in Arabidopsis. Plant Cell. 2006;18:1803–1818. - PMC - PubMed
    1. Boguski MS, Lowe TMJ, Tolstoshev CM. dbEST—database for expressed sequence tags. Nat Genet. 1993;4:332–333. - PubMed
    1. Brenner S, Johnson M, Bridgham J, et al. (24 co-authors) Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nat Biotechnol. 2000;18:630–634. - PubMed
    1. Cao J, Schneeberger K, Ossowski S, et al. (17 co-authors) Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat Genet. 2011;43:956–963. - PubMed

Publication types

Substances

LinkOut - more resources