Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Feb 26;6(2):144-154.
doi: 10.1021/acscentsci.9b00021. Epub 2020 Feb 19.

Sequential Glycosylation of Proteins with Substrate-Specific N-Glycosyltransferases

Affiliations

Sequential Glycosylation of Proteins with Substrate-Specific N-Glycosyltransferases

Liang Lin et al. ACS Cent Sci. .

Abstract

Protein glycosylation is a common post-translational modification that influences the functions and properties of proteins. Despite advances in methods to produce defined glycoproteins by chemoenzymatic elaboration of monosaccharides, the understanding and engineering of glycoproteins remain challenging, in part, due to the difficulty of site-specifically controlling glycosylation at each of several positions within a protein. Here, we address this limitation by discovering and exploiting the unique, conditionally orthogonal peptide acceptor specificities of N-glycosyltransferases (NGTs). We used cell-free protein synthesis and mass spectrometry of self-assembled monolayers to rapidly screen 41 putative NGTs and rigorously characterize the unique acceptor sequence preferences of four NGT variants using 1254 acceptor peptides and 8306 reaction conditions. We then used the optimized NGT-acceptor sequence pairs to sequentially install monosaccharides at four sites within one target protein. This strategy to site-specifically control the installation of N-linked monosaccharides for elaboration to a variety of functional N-glycans overcomes a major limitation in synthesizing defined glycoproteins for research and therapeutic applications.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Site-specific control of glycosylation by sequential enzymatic addition of glycans. The GlycoSCORES workflow was used to screen 41 putative NGTs and discover differences in peptide acceptor specificities. Conditionally orthogonal NGT-substrate sequence pairs were further optimized, such that the first NGT only glycosylates one of the four substrate sequences, the second NGT glycosylates one of the remaining three substrate sequences, and the third NGT only glycosylates one of the remaining two substrate sequences, with the last substrate sequence glycosylated by the fourth NGT. The four acceptor sites were then incorporated as GlycTags into a single protein. Sequential treatment of the protein with the four NGTs gave stepwise glycosylation of each of the four glycosylation sites. Glycans, modification sites, and NGTs have been color coded for illustration purposes.
Figure 2
Figure 2
GlycoSCORES screening of NGT homologues for unique peptide specificities. (a) Screening for active NGTs from putative bacterial glycosyltransferases. Forty-one putative glycosyltransferases from the CAZY database were screened against six representative peptide substrates for N-glucosyltransferase activity. The phylogenetic tree is rooted with human OGT (black dot) as the outgroup. Six enzymes showed N-glucosyltransferase activity, with strong activity from ApNGT, EcNGT, HdNGT, HiNGT, and MhNGT. Relative MS intensities of peptide substrates and glucose modified peptide products are shown when treated with 5% (v/v) CFPS NGTs and 2.5 mM UDP-Glc at 30 °C for 21 h (n = 1). (b) Scheme for GlycoSCORES workflow for discovery and characterization of the peptide specificities for the NGTs. NGTs were produced in CFPS and mixed with UDP-Glc sugar donor and cysteine-containing peptide substrates, which were then immobilized to a maleimide functionalized self-assembled monolayer and characterized by SAMDI-MS. (c) Peptide acceptor specificity comparison of HiNGT (gray), EcNGT (orange), ApNGT (blue), and ApNGTQ469A (red) using a peptide array of X–1NX+1TRC sequence. Expanded X–1NX+1(T/S)RC substrate libraries with numerical annotation are shown in Supplementary Figure 4. Enzyme concentrations were controlled to obtain a maximum yield of ∼85% in each heatmap to facilitate the comparison of specificities. The heatmap for ApNGT is taken from a previous report. Each heatmap shows percentage glucose modification from n = 1 experiment using the conditions: 0.42 μM CFPS HiNGT or 0.75 μM CFPS EcNGT, 30 °C for 21 h; 0.055 μM CFPS ApNGT or 0.014 μM CFPS ApNGTQ469A, 30 °C for 1 h.
Figure 3
Figure 3
Optimization of conditionally orthogonal GlycTags for NGTs. (a) Screening to improve the conditional orthogonality of NGT-GlycTag pairs for HiNGT, EcNGT, ApNGT, and ApNGTQ469A using an X–1NX+1(T/S)RC peptide library. Peptides were screened with HiNGT, EcNGT, ApNGT, and then ApNGTQ469A, respectively. Sequences are arranged by decreasing conditional orthogonality (differences in modification between specific NGT and the sum of preceding NGTs). The sequences are also divided into regions that are preferred by ApNGTQ469A only (red), ApNGT but not EcNGT or HiNGT (blue), EcNGT but not HiNGT (orange), and HiNGT (gray). GlycTags showing strong conditional orthogonality which were identified in each region used for additional optimization are highlighted (colored boxes). Heatmaps with experimental conditions and numerical annotation from n = 2 separate peptide IVG reactions are shown in Supplementary Figure 7. (b) Conditional orthogonality screening of selected sequences from a which were resynthesized with 19 amino acids in the X–2 position in 5-mer X–2(X–1NX+1T/S)RC library. GlycTags identified from each region and used for additional optimization are highlighted. (c) Conditional orthogonality screening of selected sequences from (b) which were resynthesized with 19 amino acids in the X–3 position. Heatmaps with experimental conditions and numerical annotation from n = 1 separate peptide IVG reactions for (b) and (c) are shown in Supplementary Figures 8 and 9, respectively. The corresponding colors of sequences originating from previous screens are shown above the heatmaps. A set of four GlycTags with >95% conditional orthogonality was selected for site-specific control of sequential glycosylation at four sites within one target protein.
Figure 4
Figure 4
Optimized GlycTag sequences showing conditional orthogonality as peptides and within a single Im7 target protein. (a) The conditional orthogonality of optimized 6-mer GlycTags. HiNGT, EcNGT, ApNGT, and ApNGTQ469A modification of selected GlycTags from Figure 3c under optimized conditions were resynthesized and reanalyzed by GlycoSCORES with purified NGTs. The modification efficiency of each peptide was calculated using individually measured RIFs (Supplementary Table 4). Experimental conditions (n = 3 individual IVGs): 0.2 μM HiNGT or 0.67 μM EcNGT, 30 °C for 21 h; 0.45 μM ApNGT or 0.1 μM ApNGTQ469A, 30 °C for 3 h. (b) Optimized 6-mer GlycTags were inserted into the N-terminus, C-terminus, and two exposed loops of the glycosylation model protein Im7, with flanking sequences of RATT-GlycTag-RAGG to facilitate trypsinization and quantitative LC-qTOF analysis. 10 μM of purified 4gIm7 bearing all four optimized GlycTags was reacted with 2.5 mM UDP-Glc and various concentrations of each purified NGT for 4 h in n = 3 IVG reactions. After modification, 4gIm7 was purified from the reaction, treated with trypsin, and analyzed by LC-qTOF. The modification efficiency (occupancy) of each site was calculated using individually measured RIFs (shown in Supplementary Table 5). A vertical dotted line in each graph denotes optimal conditions for each NGT which provide the greatest conditional orthogonality. Representative LC chromatograms and related MS spectra are presented in Supplementary Figure 13. (c) The conditional orthogonality of each optimized 6-mer GlycTag within 4gIm7 under optimized conditions. Bar graphs with errors for heatmaps in (a) and (c) are shown in Supplementary Figure 10. Similar modification patterns were observed for peptides and GlycTags within engineered 4gIm7.
Figure 5
Figure 5
Site-specific control of sequential glycosylation at four distinct GlycTag sequences within a single Im7 protein. (a) A workflow for site-specific control of glycosylation at four sequences within Im7 by sequential addition of NGT enzymes. (b) Representative deconvoluted MS spectra for intact 4gIm7 of n = 3 IVG reactions showing SUMO-4gIm7 after each NGT treatment, cleaved by UIp1 and analyzed by LC-qTOF. Two μM purified SUMO-4gIm7 was incubated with 0.3 μM HiNGT and then immobilized to magnetic beads and sequentially washed and treated with 0.3 μM EcNGT, 0.4 μM ApNGT, and then 2 μM ApNGTQ469A. Reactions (n = 3) were performed for 4 h at 30 °C containing 2.5 mM UDP-Glc and monitored by elution and LC-qTOF after each wash step. These data show that the dominant forms of 4gIm7 are those modified with one glucose per NGT treatment. The optimization of reaction conditions for SUMO-4gIm7 are shown in Supplementary Figure 15. One representative LC chromatogram and related deconvoluted MS spectra are presented in Supplementary Figure 16. (c) A bar graph of site-occupancy at each of the four GlycTags in the same n = 3 experiments in (b) after each NGT treatment, cleavage by trypsin, and analysis by LC-qTOF. Each GlycTag was mainly glycosylated by its corresponding NGT.

Similar articles

Cited by

References

    1. Khoury G. A.; Baliban R. C.; Floudas C. A. Proteome-wide post-translational modification statistics: Frequency analysis and curation of the swiss-prot database. Sci. Rep. 2011, 1, 90.10.1038/srep00090. - DOI - PMC - PubMed
    1. Elliott S.; Lorenzini T.; Asher S.; Aoki K.; Brankow D.; Buck L.; Busse L.; Chang D.; Fuller J.; Grant J.; Hernday N.; Hokum M.; Hu S.; Knudten A.; Levin N.; Komorowski R.; Martin F.; Navarro R.; Osslund T.; Rogers G.; Rogers N.; Trail G.; Egrie J. Enhancement of therapeutic protein in vivo activities through glycoengineering. Nat. Biotechnol. 2003, 21 (4), 414–421. 10.1038/nbt799. - DOI - PubMed
    1. Lin C.-W.; Tsai M.-H.; Li S.-T.; Tsai T.-I.; Chu K.-C.; Liu Y.-C.; Lai M.-Y.; Wu C.-Y.; Tseng Y.-C.; Shivatare S. S.; Wang C.-H.; Chao P.; Wang S.-Y.; Shih H.-W.; Zeng Y.-F.; You T.-H.; Liao J.-Y.; Tu Y.-C.; Lin Y.-S.; Chuang H.-Y.; Chen C.-L.; Tsai C.-S.; Huang C.-C.; Lin N.-H.; Ma C.; Wu C.-Y.; Wong C.-H. A common glycan structure on immunoglobulin g for enhancement of effector functions. Proc. Natl. Acad. Sci. U. S. A. 2015, 112 (34), 10611–10616. 10.1073/pnas.1513456112. - DOI - PMC - PubMed
    1. Murakami M.; Kiuchi T.; Nishihara M.; Tezuka K.; Okamoto R.; Izumi M.; Kajihara Y. Chemical synthesis of erythropoietin glycoforms for insights into the relationship between glycosylation pattern and bioactivity. Sci. Adv. 2016, 2 (1), e1500678.10.1126/sciadv.1500678. - DOI - PMC - PubMed
    1. Sethuraman N.; Stadheim T. A. Challenges in therapeutic glycoprotein production. Curr. Opin. Biotechnol. 2006, 17 (4), 341–346. 10.1016/j.copbio.2006.06.010. - DOI - PubMed