Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Feb 27;378(1871):20220030.
doi: 10.1098/rstb.2022.0030. Epub 2023 Jan 11.

Discovery, implications and initial use of semi-synthetic organisms with an expanded genetic alphabet/code

Affiliations
Review

Discovery, implications and initial use of semi-synthetic organisms with an expanded genetic alphabet/code

Floyd E Romesberg. Philos Trans R Soc Lond B Biol Sci. .

Abstract

Much recent interest has focused on developing proteins for human use, such as in medicine. However, natural proteins are made up of only a limited number of canonical amino acids with limited functionalities, and this makes the discovery of variants with some functions difficult. The ability to recombinantly express proteins containing non-canonical amino acids (ncAAs) with properties selected to impart the protein with desired properties is expected to dramatically improve the discovery of proteins with different functions. Perhaps the most straightforward approach to such an expansion of the genetic code is through expansion of the genetic alphabet, so that new codon/anticodon pairs can be created to assign to ncAAs. In this review, I briefly summarize more than 20 years of effort leading ultimately to the discovery of synthetic nucleotides that pair to form an unnatural base pair, which when incorporated into DNA, is stably maintained, transcribed and used to translate proteins in Escherichia coli. In addition to discussing wide ranging conceptual implications, I also describe ongoing efforts at the pharmaceutical company Sanofi to employ the resulting 'semi-synthetic organisms' or SSOs, for the production of next-generation protein therapeutics. This article is part of the theme issue 'Reactivity and mechanism in chemical and synthetic biology'.

Keywords: expanded genetic alphabet; expanded genetic code; protein therapeutics; semi-synthetic organisms; unnatural base pair.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The natural base pairs, dG–dC and dA–dT and three UBPs, dNaM–d5SICS, dNaM–dTPT3 and dCNMO–dTPT3, developed to expand the genetic alphabet. TAT1 is only used in the ribonucleoside triphosphate form shown (see near end of §4 for details).
Figure 2.
Figure 2.
Structure of UBP (a) in free duplex DNA [67] and (b) during formation in the active site of a polymerase [54]. (Online version in colour.)
Figure 3.
Figure 3.
(a) Error elimination system to increase UBP (XY) retention, where Cas9 cleavage is directed by single-guide RNAs that target it to sequences that have lost the UBP (lower left plasmid (red); N = natural nucleotide), leading to degradation of the corresponding plasmid [78]. (b) Proposed mechanism of in vivo replication of DNA containing a UBP. In well replicated sequences, replication is mediated by the main replicative polymerase, Pol III (tan sphere), with errors eliminated by methyl-directed mismatch repair (MMR). In more challenging sequences, the stalled Pol III replisome is rescued by Pol II (blue sphere), which results in retention of the UBP, or by RecA (light pink hexagons), after which the UBP is lost during recombinational repair of the stalled fork. Pathways through which retention is increased (YX UBP product) or decreased (N-X product) in the optimized SSO are shown in bold green and orange arrows, respectively. Adapted with permission from [80]. (Online version in colour.)
Figure 4.
Figure 4.
Components of an SSO with an expanded genetic alphabet and code. The unnatural nucleotide triphosphates of the UBP are indicated as (d)XTP and (d)YTP; the encoded non-canonical amino acid (ncAA) and nucleotide transporter (PtNTT2) are indicated; RNAP, RNA polymerase. (Online version in colour.)
Figure 5.
Figure 5.
Natural IL-2 (left) preferentially binds the trimeric IL-2Rαβγ receptor relative to the dimeric IL-2Rβγ receptor (indicated by thick and thin arrows, respectively). The site-specifically incorporated PEG of THOR-707 (right) interferes with IL-2Rα binding and thus better stimulates cell populations bearing the dimeric IL-2Rβγ. (Online version in colour.)

Similar articles

Cited by

References

    1. Furter R. 1998. Expansion of the genetic code: site-directed p-fluoro-phenylalanine incorporation in Escherichia coli. Protein Sci. 7, 419-426. (10.1002/pro.5560070223) - DOI - PMC - PubMed
    1. Wang L, Magliery TJ, Liu DR, Schultz PG. 2000. A new functional suppressor tRNA/aminoacyl–tRNA synthetase pair for the in vivo incorporation of unnatural amino acids into proteins. J. Am. Chem. Soc. 122, 5010-5011. (10.1021/ja000595y) - DOI
    1. Srinivasan G, James CM, Krzycki JA. 2002. Pyrrolysine encoded by UAG in archaea: charging of a UAG-decoding specialized tRNA. Science 296, 1459-1462. (10.1126/science.1069588) - DOI - PubMed
    1. Blight SK, et al. 2004. Direct charging of tRNACUA with pyrrolysine in vitro and in vivo. Nature 431, 333-335. (10.1038/nature02895) - DOI - PubMed
    1. Polycarpo C, Ambrogelly A, Berube A, Winbush SM, McCloskey JA, Crain PF, Wood JL, Soll D. 2004. An aminoacyl-tRNA synthetase that specifically activates pyrrolysine. Proc. Natl Acad. Sci. USA 101, 12 450-12 454. (10.1073/pnas.0405362101) - DOI - PMC - PubMed

LinkOut - more resources