Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar;28(3):1307-17.
doi: 10.1093/molbev/msq318. Epub 2010 Dec 6.

Phylogenomic analysis of the uracil-DNA glycosylase superfamily

Affiliations

Phylogenomic analysis of the uracil-DNA glycosylase superfamily

J Ignacio Lucas-Lledó et al. Mol Biol Evol. 2011 Mar.

Abstract

The spontaneous deamination of cytosine produces uracil mispaired with guanine in DNA, which will produce a mutation, unless repaired. In all domains of life, uracil-DNA glycosylases (UDGs) are responsible for the elimination of uracil from DNA. Thus, UDGs contribute to the integrity of the genetic information and their loss results in mutator phenotypes. We are interested in understanding the role of UDG genes in the evolutionary variation of the rate and the spectrum of spontaneous mutations. To this end, we determined the presence or absence of the five main UDG families in more than 1,000 completely sequenced genomes and analyzed their patterns of gene loss and gain in eubacterial lineages. We observe nonindependent patterns of gene loss and gain between UDG families in Eubacteria, suggesting extensive functional overlap in an evolutionary timescale. Given that UDGs prevent transitions at G:C sites, we expected the loss of UDG genes to bias the mutational spectrum toward a lower equilibrium G + C content. To test this hypothesis, we used phylogenetically independent contrasts to compare the G + C content at intergenic and 4-fold redundant sites between lineages where UDG genes have been lost and their sister clades. None of the main UDG families present in Eubacteria was associated with a higher G + C content at intergenic or 4-fold redundant sites. We discuss the reasons of this negative result and report several features of the evolution of the UDG superfamily with implications for their functional study. uracil-DNA glycosylase, mutation rate evolution, mutational bias, GC content, DNA repair, mutator gene.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.
FIG. 1.
Conditional spontaneous base substitution rates per base per generation for Drosophila melanogaster (white, Keightley et al. 2009), Caenorhabditis elegans (pattern, Denver et al. 2009), and Arabidopsis thaliana (black, Ossowski et al. 2010). Horizontal lines indicate total mutation rates for each species with their standard errors in grey. Total mutation rates are equal to the average of the sum of mutation rates at A:T sites and the sum of mutation rates at G:C sites, weighted by the proportion of each kind of sites in the genome. Data for D. melanogaster and for C. elegans were derived from the original publications assuming that the proportion of A:T sites surveyed is the same as the genomic proportion of A + T. Data for D. melanogaster were not corrected for a small downward bias described in Keightley et al. (2009).
F<sc>IG</sc>. 2.
FIG. 2.
ML phylogenies of the main UDG families, and the sequence logos of their most conserved homologous motif. The branch color indicates eubacterial (black), archaeal (green), or eukaryotic (red) sequences. A putative family 6, not functionally described in the literature, is indicated with a question mark.
F<sc>IG</sc>. 3.
FIG. 3.
Summary of values of the Michaelis constant (KM) and the rate of uracil excision (kcat; see text for details) from U:A pairs by family 1 UDGs of different species found in the literature (see raw data and references in the Supplementary table S4, Supplementary Material online). Boxes are interquartile ranges; horizontal lines within the boxes are medians; whiskers extend to the most extreme data points that are no more than 1.5 times the interquartile range away from the box; data points further than 1.5 times the interquartile range from the box are drawn as empty circles. Horizontal lines without a box represent single values.
F<sc>IG</sc>. 4.
FIG. 4.
Posterior distributions of the rates of loss of UDG family 1 and of extended family 4, in the absence or presence of families 2 or 1. The meanings of boxes, whiskers, and empty circles are the same as in figure 3.

References

    1. Abascal F, Zardoya R, Posada D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005;21:2104–2105. - PubMed
    1. Abu M, Waters TR. The main role of human thymine-DNA glycosylase is removal of thymine produced by deamination of 5-methylcytosine and not removal of ethenocytosine. J Biol Chem. 2003;278:8739–8744. - PubMed
    1. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. - PMC - PubMed
    1. An Q, Robins P, Lindahl T, Barnes D. C → T mutagenesis and γ-radiation sensitivity due to deficiency in the Smug1 and Ung DNA glycosylases. EMBO J. 2005;24:2205–2213. - PMC - PubMed
    1. Aravind L, Koonin EV. The α/β uracil DNA glycosylases: a common origin with diverse fates. Genome Biol. 2000 1:research0007.1–research0007.8. - PMC - PubMed

Publication types

MeSH terms