Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr 7:8:235.
doi: 10.3389/fbioe.2020.00235. eCollection 2020.

Linker and N-Terminal Domain Engineering of Pyrrolysyl-tRNA Synthetase for Substrate Range Shifting and Activity Enhancement

Affiliations

Linker and N-Terminal Domain Engineering of Pyrrolysyl-tRNA Synthetase for Substrate Range Shifting and Activity Enhancement

Han-Kai Jiang et al. Front Bioeng Biotechnol. .

Abstract

The Methanosarcina mazei pyrrolysyl-tRNA synthetase (PylRS)⋅tRNAPyl pair can be used to incorporate non-canonical amino acids (ncAAs) into proteins at installed amber stop codons. Although engineering of the PylRS active site generates diverse binding pockets, the substrate ranges are found similar in charging lysine and phenylalanine analogs. To expand the diversity of the ncAA side chains that can be incorporated via the PylRS⋅tRNAPyl pair, exploring remote interactions beyond the active site is an emerging approach in expanding the genetic code research. In this work, remote interactions between tRNAPyl, the tRNA binding domain of PylRS, and/or an introduced non-structured linker between the N- and C-terminus of PylRS were studied. The substrate range of the PylRS⋅tRNAPyl pair was visualized by producing sfGFP-UAG gene products, which also indicated amber suppression efficiencies and substrate specificity. The unstructured loop linking the N-terminal and C-terminal domains (CTDs) of PylRS has been suggested to regulate the interaction between PylRS and tRNAPyl. In exploring the detailed role of the loop region, different lengths of the linker were inserted into the junction between the N-terminal and the C-terminal domains of PylRS to unearth the impact on remote effects. Our findings suggest that the insertion of a moderate-length linker tunes the interface between PylRS and tRNAPyl and subsequently leads to improved suppression efficiencies. The suppression activity and the substrate specificity of PylRS were altered by introducing three mutations at or near the N-terminal domain of PylRS (N-PylRS). Using a N-PylRS⋅tRNAPyl pair, three ncAA substrates, two S-benzyl cysteine and a histidine analog, were incorporated into the protein site specifically.

Keywords: amber codon suppression; linker engineering; non-canonical amino acids; pyrrolysyl-tRNA synthetase; tRNA binding domain.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Protein designs of evolved MmPylRS and their linker engineering. (A) Illustration of the interaction between MmPylRS and tRNAPyl. R61K/H63Y/S193R mutations (labeled in open red circles) are distributed on the tRNA binding domain of MmPylRS NTD or CTD (labeled as filled blue circle). The detailed interactions from X-ray crystal structures in PylRS-NTD/tRNA (Suzuki et al., 2017) and PylRS-CTD/tRNA (Nozawa et al., 2009) are indicated in Figures 2, 3. tRNA 2D topology and sequence are labeled in green filled circles. The orange dashed line denotes the inserted linker between P149 and A150. (B) Engineered MmPylRS constructs used in this work. N-PylRS represents MmPylRS with R61K/H63Y/S193R mutations; PylRS-D1 represents MmPylRS with insertion of the spacer containing the stop and the start codons between P149 and A150; PylRS-L1 indicates the insertion of SGGGGS linker between P149 and A150; PylRS-L2 for S(GGGGS)2 linker insertion; PylRS-L3 for S(GGGGS)3 linker insertion.
SCHEME 1
SCHEME 1
Chemical structures of the non-canonical amino acids described in this study. ncAA 1: Nε-(tert-butoxycarbonyl)-L-lysine (BocK); 2: Nε- (alloyloxycarbonyl)-L-lysine (AlloK); 3: S-(4-methylbenzyl)-L-cysteine (MbzlC); 4: S-(4-methoxylbenzyl)-L-cysteine (MeObzlC); 5: N-π-methyl-L-histidine (3MeH); 6: Nε-(carbobenzyloxy)-L-lysine (CbzK); 7: Nε-(2-chlorocarbobenzyloxy)-L-lysine (ClCbzK); 8: Nε-(carbobenzyloxy)-D-lysine (DCbzK); 9: Nε-(2-chlorocarbobenzyloxy)- D-lysine (DClCbzK); 10: (R)-2-amino-3-(2-benzyloxycarbonylaminoethylselanyl)propanoic acid (SeCbzK); 11: (S)-6-(benzyloxycarbonylamino)-2-hydroxygexanoic acid (CbzKOH).
FIGURE 2
FIGURE 2
Structure of MmPylRS NTD complex with tRNAPyl. R61 and H63 located in the PylRS NTD; the yellow dashed lines represent potential hydrogen-bonding interactions. Three such interactions are illustrated between the side chain of R61 and R52 within PylRS NTD and the phosphodiester backbone of 58A and 59U in tRNAPyl. One hydrogen-bonding interaction was found between the side chain of H63 and K85 within the PylRS NTD. The structure is based on the PDB entry 5UD5.
FIGURE 3
FIGURE 3
Structure of MmPylRS CTD and its superimposition with the DhPylRS CTD/tRNAPyl complex. (A) S193 is located in the tRNA binding domain of MmPylRS CTD (green). (B) The superimposed structure of MmPylRS CTD (green) and DhPylRS CTD⋅DhtRNAPyl complex (cyan). The structures are based on the PDB entry 2Q7H and 2ZNI.
FIGURE 4
FIGURE 4
The sfGFP-UAG2 and sfGFP-UAG27 gene suppression efficiencies of PylRS enzyme variants. Incorporation efficiencies of PylRS variants (Figure 1B) as measured by fluorescence intensities of sfGFP with an amber mutation at position 2 (A) and 27 (B). The proteins were expressed in 1 mM ncAA and IPTG in GMML medium at 37°C for 12 h. The cells were excited at 485 nm and the fluorescence intensities were detected at 535 nm. The cell density was monitored by absorbance at 595 nm. “C” indicates the Control experiments of cells with the supplement of 1 mM IPTG; 1–5 denote the supplement of 1 mM IPTG and ncAA 15 (Scheme 1). The background signals from cells without adding IPTG were subtracted from each group. The error bars represent the standard deviation of sfGFP production from four repeated experiments.
FIGURE 5
FIGURE 5
The sfGFP-UAG2 and sfGFP-UAG27 gene suppression efficiencies of ZRS enzyme variants. Incorporation efficiencies of ZRS variants (Figure 1B) as measured by the fluorescence intensities of sfGFP with amber mutation at position 2 (A) and 27 (B). The proteins were expressed in 1 mM ncAA and IPTG in GMML medium at 37°C for 12 h. The cells were excited at 485 nm and the fluorescence intensities were detected at 535 nm. The cell density was monitored by absorbance at 595 nm. “C” indicate the Control experiments of cells with the supplement of 1 mM IPTG; 3, 4, 611 denote the supplement of 1 mM IPTG and ncAA 3, 4, 611 (Scheme 1). Error bars represent the standard deviation of sfGFP production from four repeated experiments.
FIGURE 6
FIGURE 6
sfGFP production by ZRS variants and mass characterization. (A) Amber suppression of the sfGFP-UAG27 gene (N-ZRS⋅tRNAPyl pair) and the sfGFP-UAG2 gene (ZRS-D1⋅tRNAPyl pair) product with ncAA 34 and 611. The sfGFP proteins were produced in E. coli BL21 (DE3) coding N-ZRS⋅tRNAPyl or ZRS-D1⋅tRNAPyl pair with the supplement of 1 mM IPTG and ncAAs in GMML medium at 37°C for 12 h. The whole-cell lysate was analyzed by SDS-PAGE and western blotting by anti-His tag antibody indicated as α-His6X. (B) ESI-MS determination of sfGFP-UAG27 proteins with ncAA 6, 7, and 11. Full-length sfGFP-6 and sfGFP-7 proteins were produced by N-ZRS⋅tRNAPyl pair in E. coli BL21 (DE3) with the supplement of 1 mM IPTG and ncAA 6 or 7 in LB medium at 37°C for 12 h. Full-length sfGFP-11 proteins were produced with the same condition but with 1 mM ncAA 11 and in GMML minimal medium. The calculated molecular masses of sfGFP-7 are 28,121 and 27,989 Da (–Met); the observed molecular masses are 28,120 and 27,989 Da (–Met). The calculated molecular masses of sfGFP-6 are 28,085 and 27,954 Da (–Met); the observed molecular masses are 28,085 and 27,955 Da (–Met). The calculated molecular masses of sfGFP-11 are 28,086, 27,955 (–Met), and 25,320 Da (truncated sfGFP at 27 position); the observed molecular masses are 27,955 (–Met) and 27,820 Da (without Cbz group at 27 position and N-terminal Met residues) and 25,036 Da. The detailed electrospray and deconvoluted mass spectra are shown in Supplementary Figures S4–S6. ESI-MS determination of sfGFP-UAG2 protein. sfGFP-11* with ncAA 11 (ZRS-D1⋅tRNAPyl pair) is shown in Supplementary Figure S7.
FIGURE 7
FIGURE 7
Deconvoluted ESI-MS spectra of sfGFP containing ncAAs at position 27. Full-length sfGFP-3, sfGFP-4, and sfGFP-5 were expressed using the N-PylRS⋅tRNAPyl pair in the presence of ncAA 3, 4 (2 mM) or 5 (1 mM), respectively, in GMML medium (sfGFP-3 and sfGFP-4) or LB medium (sfGFP-5). The purified protein samples were subjected to ESI-MS for identification of the incorporation of defined ncAA. Deconvoluted ESI-MS of ncAA 35 incorporated sfGFP at position 27 are shown above. sfGFP-X in which X represents one of the ncAA 35 was incorporated into sfGFP at position 27. The calculated molecular masses of sfGFP-3 are 28,030 and 27,899 Da (-Met); the observed molecular masses are 28,030 and 27,899 Da (-Met). The calculated molecular masses of sfGFP-4 are 28,046 and 27,915 Da (-Met); the observed molecular masses are 28,046 and 27,915 Da (-Met). The calculated molecular masses of sfGFP-5 are 27,974 and 27,843 Da (-Met); the observed molecular masses are 27,974 and 27,843 Da (-Met). The detailed electrospray and deconvoluted mass spectra are shown in Supplementary Figures S1–S3.
FIGURE 8
FIGURE 8
MALDI-TOF-MS/MS analysis of sfGFP-3 and sfGFP-5. The sfGFP-3 and sfGFP-5 proteins were in-gel digested with trypsin and subjected to MALDI-TOF-MS/MS analysis. (A) The tandem mass spectrum of the X(3)SVR; X denotes a fragment from sfGFP-3, with ncAA 3 incorporated at position 27. The calculated molecular mass of the X(3)SVR peptide fragment is 567.284 Da and the actual molecular mass is 568.327 Da. Full-length sfGFP-3 was expressed using N-PylRS in the presence of 2 mM ncAA 3 in GMML medium. (B) The tandem mass spectrum of the X(5)SVR; X denotes a fragment from sfGFP-5 with ncAA 5 incorporated at position 27. The calculated molecular mass of the X(5)SVR peptide fragment is 511.287 Da and the actual molecular mass is 512.609 Da. Full-length sfGFP-5 was produced using N-PylRS⋅tRNAPyl pair in the presence of 1 mM ncAA 5 in LB medium.

References

    1. Ambrogelly A., Gundllapalli S., Herring S., Polycarpo C., Frauer C., Söll D. (2007). Pyrrolysine is not hardwired for cotranslational insertion at UAG codons. Proc. Natl. Acad. Sci. U.S.A. 104 3141–3146. 10.1073/pnas.0611634104 - DOI - PMC - PubMed
    1. Cavarelli J., Delagoutte B., Eriani G., Gangloff J., Moras D. (1998). L-arginine recognition by yeast arginyl-tRNA synthetase. EMBO J. 17 5438–5448. 10.1093/emboj/17.18.5438 - DOI - PMC - PubMed
    1. Guo L. T., Wang Y. S., Nakamura A., Eiler D., Kavran J. M., Wong M., et al. (2014). Polyspecific pyrrolysyl-tRNA synthetases from directed evolution. Proc. Natl. Acad. Sci. U. S. A. 111 16724–16729. 10.1073/pnas.1419737111 - DOI - PMC - PubMed
    1. Han S., Yang A., Lee S., Lee H.-W., Park C. B., Park H.-S. (2017). Expanding the genetic code of Mus musculus. Nat. Commun. 8:14568. 10.1038/ncomms14568 - DOI - PMC - PubMed
    1. Hao B., Gong W., Ferguson T. K., James C. M., Krzycki J. A., Chan M. K. (2002). A new UAG-encoded residue in the structure of a methanogen methyltransferase. Science 296 1462–1466. 10.1126/science.1069556 - DOI - PubMed

LinkOut - more resources