Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 May 6;81(9):1868-1878.
doi: 10.1016/j.molcel.2021.03.015. Epub 2021 Apr 1.

Discovering the landscape of protein modifications

Affiliations
Review

Discovering the landscape of protein modifications

E Keith Keenan et al. Mol Cell. .

Abstract

Protein modifications modulate nearly every aspect of cell biology in organisms, ranging from Archaea to Eukaryotes. The earliest evidence of covalent protein modifications was found in the early 20th century by studying the amino acid composition of proteins by chemical hydrolysis. These discoveries challenged what defined a canonical amino acid. The advent and rapid adoption of mass-spectrometry-based proteomics in the latter part of the 20th century enabled a veritable explosion in the number of known protein modifications, with more than 500 discrete modifications counted today. Now, new computational tools in data science, machine learning, and artificial intelligence are poised to allow researchers to make significant progress in discovering new protein modifications and determining their function. In this review, we take an opportunity to revisit the historical discovery of key post-translational modifications, quantify the current landscape of covalent protein adducts, and assess the role that new computational tools will play in the future of this field.

Keywords: amino acid; metabolism; post-translational modifications; protein acylation; protein modifications.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests The authors declare no competing interests.

Figures

Figure 1.
Figure 1.
Twenty proteinogenic amino acid structures depicted using standard Corey–Pauling–Koltun coloring convention: black, carbon; red, oxygen; blue, nitrogen, yellow, sulfur; thin lines, single bond; medium lines, double bond; hydrogens omitted for clarity.
Figure 2.
Figure 2.
Timeline of protein modification discovery. Cumulative number of protein modifications known at each year. For methods of calculation and code, see www.github.com/matthewhirschey.
Figure 3.
Figure 3.
Current landscape of protein modifications. A. Approximately 500 protein modifications have been described across all 20 protein amino acids; colors represent most frequent modifications by ‘keyword’; protein crosslinks were removed from analysis. Data accessed from www.uniprot.org (UniProt Consortium, 2018). For methods of calculation and code, see www.github.com/hirscheylab. B. Distribution of masses of modifications appended to proteins. Line represents frequency of masses, scaled to 1; rug-plot hashes represent individual masses. Protein crosslinks were removed from analysis. For methods of calculation and code, see www.github.com/matthewhirschey.
Figure 4.
Figure 4.
Known acyl-CoA species. Chain length and chemical properties of all known acyl-CoA metabolites in humans. Data accessed from www.hmdb.ca (Wishart et al., 2018). For methods of calculation and code, see www.github.com/matthewhirschey.
Figure 5.
Figure 5.
Predicted protein modifications. A. Acyl-CoA species from the human metabolome database were compared to known masses matching lysine modifications; B. The human metabolome database was queried for known reactive acyl-phosphates, thioesters, or aldehydes, and plotted for number of possible carbons appended to proteins. Data accessed from www.hmdb.ca (Wishart et al., 2018). For methods of calculation and code, see www.github.com/matthewhirschey.

References

    1. Aebersold R, and Mann M (2003). Mass spectrometry-based proteomics. Nature 422, 198–207. - PubMed
    1. Alderson NL, Wang Y, Blatnik M, Frizzell N, Walla MD, Lyons TJ, Alt N, Carson JA, Nagai R, Thorpe SR, et al. (2006). S-(2-Succinyl)cysteine: a novel chemical modification of tissue proteins by a Krebs cycle intermediate. Arch Biochem Biophys 450, 1–8. - PubMed
    1. Allfrey VG, Faulkner R, and Mirsky AE (1964). Acetylation and Methylation of Histones and Their Possible Role in the Regulation of Rna Synthesis. Proc Natl Acad Sci U S A 51, 786–794. - PMC - PubMed
    1. Alquezar C, Arya S, and Kao AW (2020). Tau Post-translational Modifications: Dynamic Transformers of Tau Function, Degradation, and Aggregation. Front Neurol 11, 595532. - PMC - PubMed
    1. Backus KM, Correia BE, Lum KM, Forli S, Horning BD, Gonzalez-Paez GE, Chatterjee S, Lanning BR, Teijaro JR, Olson AJ, et al. (2016). Proteome-wide covalent ligand discovery in native biological systems. Nature 534, 570–574. - PMC - PubMed

Publication types