Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Aug;28(8):2351-7.
doi: 10.1093/molbev/msr055. Epub 2011 Feb 25.

Genome-wide evolutionary conservation of N-glycosylation sites

Affiliations

Genome-wide evolutionary conservation of N-glycosylation sites

Chungoo Park et al. Mol Biol Evol. 2011 Aug.

Abstract

Although posttranslational protein modifications are generally thought to perform important cellular functions, recent studies showed that a large fraction of phosphorylation sites are not evolutionarily conserved. Whether the same is true for other protein modifications, such as N-glycosylation is an open question. N-glycosylation is a form of cotranslational and posttranslational modification that occurs by enzymatic addition of a polysaccharide, or glycan, to an asparagine (N) residue of a protein. Examining a large set of experimentally determined mouse N-glycosylation sites, we find that the evolutionary rate of glycosylated asparagines is significantly lower than that of nonglycosylated asparagines of the same proteins. We further confirm that the conservation of glycosylated asparagines is accompanied by the conservation of the canonical motif sequence for glycosylation, suggesting that the above substitution rate difference is related to glycosylation. Interestingly, when solvent accessibility is considered, the substitution rate disparity between glycosylated and nonglycosylated asparagines is highly significant at solvent accessible sites but not at solvent inaccessible sites. Thus, although the solvent inaccessible glycosylation sites were experimentally identified, they are unlikely to be genuine or physiologically important. For solvent accessible asparagines, our analysis reveals a widespread and strong functional constraint on glycosylation, unlike what has been observed for phosphorylation sites in most studies, including our own analysis. Because the majority of N-glycosylation occurs at solvent accessible sites, our results show an overall functional importance for N-glycosylation.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.
FIG. 1.
Fraction of unconserved asparagine (N) residues in glycoproteins identified from (A) any of five mouse tissues and (B) each of the five mouse tissues. Error bars indicate standard errors. P values are from χ2 tests. N+ sites are experimentally identified glycosylated sites, N0 sites are all other N sites in the same proteins, and N sites are those N0 sites that are not followed by canonical glycosylation motifs.
F<sc>IG</sc>. 2.
FIG. 2.
Proportion of conserved canonical glycosylation motifs. Error bars indicate standard errors.
F<sc>IG</sc>. 3.
FIG. 3.
Variation in substitution rate in the neighborhood of mouse N sites. Error bars indicate standard errors.
F<sc>IG</sc>. 4.
FIG. 4.
Substitution patterns of mouse N+ and N0 sites in (A) mouse–rat, (B) mouse–human, and (C) mouse–chicken comparisons. The dot indicates a deletion. P values are from χ2 tests.
F<sc>IG</sc>. 5.
FIG. 5.
Correlation between the number of N+ sites per glycoprotein and the substitution rate ratio between N+ and N0 sites.
F<sc>IG</sc>. 6.
FIG. 6.
N-glycosylation at solvent accessible and inaccessible sites. (A) Frequency distributions of solvent accessibility scores for N+ and N0 sites. (B) Substitution rate ratio between N+ and N0 sites varies with solvent accessibility. *P < 0.05; **P < 0.01. P values are from χ2 tests. (C) Fraction of unconserved canonical motifs. P values are from χ2 tests.
F<sc>IG</sc>. 7.
FIG. 7.
Fraction of unconserved asparagine (N) residues depends on glycosylation, whereas that of unconserved serine (S), threonine (T), and tyrosine (Y) residues does not depend on phosphorylation. The comparison is made between mouse proteins that are subject to both glycosylation and phosphorylation and their rat orthologs. The number of residues considered in each sample is provided in each bar. Error bars indicate standard errors.

References

    1. Balla S, Thapar V, Verma S, et al. (15 co-authors) Minimotif Miner: a tool for investigating protein function. Nat Methods. 2006;3:175–177. - PubMed
    1. Bustamante CD, Townsend JP, Hartl DL. Solvent accessibility and purifying selection within proteins of Escherichia coli and Salmonella enterica. Mol Biol Evol. 2000;17:301–308. - PubMed
    1. Chen SC, Chen FC, Li WH. Phosphorylated and non-phosphorylated serine and threonine residues evolve at different rates in mammals. Mol Biol Evol. 2010;27:2548–2554. - PMC - PubMed
    1. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. - PMC - PubMed
    1. Freeze HH. Genetic defects in the human glycome. Nat Rev Genet. 2006;7:537–551. - PubMed

Publication types