Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Aug 1;37(8):2197-2210.
doi: 10.1093/molbev/msaa072.

Runaway GC Evolution in Gerbil Genomes

Affiliations

Runaway GC Evolution in Gerbil Genomes

Rodrigo Pracana et al. Mol Biol Evol. .

Abstract

Recombination increases the local GC-content in genomic regions through GC-biased gene conversion (gBGC). The recent discovery of a large genomic region with extreme GC-content in the fat sand rat Psammomys obesus provides a model to study the effects of gBGC on chromosome evolution. Here, we compare the GC-content and GC-to-AT substitution patterns across protein-coding genes of four gerbil species and two murine rodents (mouse and rat). We find that the known high-GC region is present in all the gerbils, and is characterized by high substitution rates for all mutational categories (AT-to-GC, GC-to-AT, and GC-conservative) both at synonymous and nonsynonymous sites. A higher AT-to-GC than GC-to-AT rate is consistent with the high GC-content. Additionally, we find more than 300 genes outside the known region with outlying values of AT-to-GC synonymous substitution rates in gerbils. Of these, over 30% are organized into at least 17 large clusters observable at the megabase-scale. The unusual GC-skewed substitution pattern suggests the evolution of genomic regions with very high recombination rates in the gerbil lineage, which can lead to a runaway increase in GC-content. Our results imply that rapid evolution of GC-content is possible in mammals, with gerbil species providing a powerful model to study the mechanisms of gBGC.

Keywords: GC-biased gene conversion; GC-content; biased substitution, fixation bias; gBGC; genome evolution; recombination.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
GC-skewed synonymous evolution in protein-coding genes located in the known high-GC region. (A) Topology of the phylogenetic relationships between the species analyzed in this study; (B) GC-content in the third-codon position (GC3) per species per gene; (C) the rate of synonymous substitution (dS) per mutational category: weak-to-strong (WS), strong-to-weak (SW), weak-to-weak (WW), and strong-to-strong (SS); (D) comparison between weak-to-strong dS (dSWS) and strong-to-weak dS (dSSW).
<sc>Fig</sc>. 2.
Fig. 2.
GC-skewed nonsynonymous evolution in protein-coding genes located in the known high-GC region. (A) GC-content in the first and second codon positions (GC12) per species per gene, (B) and nonsynonymous substitution rate (dN) per mutational category: weak-to-strong (WS), strong-to-weak (SW), weak-to-weak (WW), and strong-to-strong (SS), (C) comparison between weak-to-strong dN (dNWS) and strong-to-weak dN (dNSW).
<sc>Fig</sc>. 3.
Fig. 3.
Outliers in the rate of synonymous substitution (dS) measured from the gerbil-murine split for 8,809 groups of orthologous genes outside the known high-GC region. (A) Topography of the tree of the species included in the alignments of each group of orthologous genes; our focal species include two gerbil species (in pink) and two murine species (in green); dS was measured in the colored branches, that is, from the point of gerbil-murine divergence to the tip of each of the four focal species. (B) Comparison between the rate of weak-to-strong synonymous substitution (dSWS) and the strong-to-weak synonymous substitution (dSSW) in the two gerbil and the two murine species, normalized by dividing each value by the average rate of the respective species and mutation category; colored points represent genes above the dS threshold chosen to define outliers (dS >2.5 times the average for the respective species and mutational category); only values under 6 are shown. (C) Euler diagram of the dSWS outliers, where the area of each segment is approximately proportional to the number of overlapping outlier genes per species (only gene numbers >1 included), shown also as a percentage of the 524 genes that were outliers in at least one species. (D) Pairwise comparisons of the GC-content at the third-codon position (GC3) between the gerbil Psammomys obesus and the mouse Mus musculus for 8,797 groups of orthologous genes (out of 8,809) for which both species have a GC3 measurement.
<sc>Fig</sc>. 4.
Fig. 4.
Clustering of outliers in the weak-to-strong dS. (A) Normalized rate of synonymous substitution (dS) for two mutational categories, strong-to-weak (SW) and weak-to-strong (WS), for 8,809 genes; (B) average normalized weak-to-strong dS (dSWS) in sliding windows of 1 Mb with a step of 0.25 Mb, showing only windows with >3 genes, and with outlying regions marked with a vertical gray bar. The genes are mapped by row to the chromosomes of the Mus musculus reference genome assembly. The horizontal gray line represents the dS threshold chosen to define outlier genes (dS >2.5 times the average for the respective species and mutational category). The dark blue horizontal bars represent the location of the previously known high-GC region.

References

    1. Altenhoff AM, Gil M, Gonnet GH, Dessimoz C.. 2013. Inferring hierarchical orthologous groups from orthologous gene pairs. PLoS One 8(1):e53786. - PMC - PubMed
    1. Altenhoff AM, Glover NM, Train C-M, Kaleb K, Warwick Vesztrocy A, Dylus D, de Farias TM, Zile K, Stevenson C, Long J, et al. 2018. The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces. Nucleic Acids Res. 46(D1):D477–D485. - PMC - PubMed
    1. Arbeithuber B, Betancourt AJ, Ebner T, Tiemann-Boege I.. 2015. Crossovers are associated with mutation and biased gene conversion at recombination hotspots. Proc Natl Acad Sci USA. 112(7):2109–2114. - PMC - PubMed
    1. Arnheim N, Calabrese P, Tiemann-Boege I.. 2007. Mammalian meiotic recombination hot spots. Annu Rev Genet. 41(1):369–399. - PubMed
    1. Auton A, Fledel-Alon A, Pfeifer S, Venn O, Ségurel L, Street T, Leffler EM, Bowden R, Aneas I, Broxholme J, et al. 2012. A fine-scale chimpanzee genetic map from population sequencing. Science 336(6078):193–198. - PMC - PubMed

Publication types