Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 5;52(D1):D476-D482.
doi: 10.1093/nar/gkad1090.

iNClusive: a database collecting useful information on non-canonical amino acids and their incorporation into proteins for easier genetic code expansion implementation

Affiliations

iNClusive: a database collecting useful information on non-canonical amino acids and their incorporation into proteins for easier genetic code expansion implementation

Leon-Samuel Icking et al. Nucleic Acids Res. .

Abstract

The incorporation of non-canonical amino acids (ncAAs) into proteins is a powerful technique used in various research fields. Genetic code expansion (GCE) is the most common way to achieve this: a specific codon is selected to be decoded by a dedicated tRNA orthogonal to the endogenous ones. In the past 30 years, great progress has been made to obtain novel tRNA synthetases (aaRSs) accepting a variety of ncAAs with distinct physicochemical properties, to develop robust in vitro assays or approaches for codon reassignment. This sparked the use of the technique, leading to the accumulation of publications, from which gathering all relevant information can appear daunting. Here we present iNClusive (https://non-canonical-aas.biologie.uni-freiburg.de/), a manually curated, extensive repository using standardized nomenclature that provides organized information on ncAAs successfully incorporated into target proteins as verified by mass spectrometry. Since we focused on tRNA synthetase-based tRNA loading, we provide the sequence of the tRNA and aaRS used for the incorporation. Derived from more than 687 peer-reviewed publications, it currently contains 2432 entries about 466 ncAAs, 569 protein targets, 500 aaRSs and 144 tRNAs. We foresee iNClusive will encourage more researchers to experiment with ncAA incorporation thus contributing to the further development of this exciting technique.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
Scheme of the orthogonal translation system for the incorporation of ncAAs into target proteins. The heterologous aaRS and its cognate tRNA (blue) do not interact with the endogenous aaRSs and tRNAs (grey). The orthogonal aaRS recognizes the ncAA and loads it onto its cognate tRNA. The ncAA-loaded tRNA then interacts with endogenous ribosomes to allow for the translation of the target mRNA. In this example, the amber STOP codon (UAG) codes for the ncAA. aaRS, aminoacyl tRNA synthetase. ncAA, non-canonical amino acid. For simplicity only one endogenous aaRS–tRNA pair is shown (grey box).
Figure 2.
Figure 2.
Scheme depicting the workflow used to generate the iNClusive database. The hits were generated using the ‘Publish or Perish’ software (31) upstream of Google Scholar. The publications were then manually reviewed and inserted into a preliminary database. The SMILES of the ncAAs were generated with the Optical Structure Recognition (OSRA) software (32) and compared with those given in the PubChem database (33). The retrieved information was confirmed manually by a different team member and only then added into the iNClusive database. A second review process done by a third distinct person was additionally carried out on 10% of randomly picked entries.
Figure 3.
Figure 3.
Pie charts showing the number of entries in the database for the indicated entities. For instance, the number 396 for sfGFP indicates that this protein has been modified 396 times. Two entries might differ only for the organism used for the experiment or for the aminoacyl tRNA synthetase employed. Similarly, the number 191 for 4-azido-l-phenylalanine does not mean this ncAA has been incorporated in 191 different target proteins. Entries are distinct if any of the other categories in the database are different, such as tRNA, protein, organism in which the incorporation was tested, etc.

Similar articles

Cited by

References

    1. Hao B., Gong W., Ferguson T.K., James C.M., Krzycki J.A., Chan M.K.. A new UAG-encoded residue in the structure of a methanogen methyltransferase. Science. 2002; 296:1462–1466. - PubMed
    1. Böck A., Forchhammer K., Heider J., Leinfelder W., Sawers G., Veprek B., Zinoni F.. Selenocysteine: the 21st amino acid. Mol. Microbiol. 1991; 5:515–520. - PubMed
    1. Mody I. GABAA receptor synaptic functions. Encyclopedia of Neuroscience. 2009; Elsevier; 441–445.
    1. Gupta R., Gupta N., Bindal S.. Bacterial cell wall biosynthesis and inhibitors. Fundamentals of Bacterial Physiology and Metabolism. 2021; Singapore: Springer Singapore; 81–98.
    1. Morris S.M. Regulation of enzymes of urea and arginine synthesis. Annu. Rev. Nutr. 1992; 12:81–101. - PubMed