. 2004 Aug;13(8):2059-77.

doi: 10.1110/ps.04637904.

Evolutionary constraints associated with functional specificity of the CMGC protein kinases MAPK, CDK, GSK, SRPK, DYRK, and CK2alpha

Natarajan Kannan¹, Andrew F Neuwald

Affiliations

PMID: 15273306
PMCID: PMC2279817
DOI: 10.1110/ps.04637904

Evolutionary constraints associated with functional specificity of the CMGC protein kinases MAPK, CDK, GSK, SRPK, DYRK, and CK2alpha

Natarajan Kannan et al. Protein Sci. 2004 Aug.

. 2004 Aug;13(8):2059-77.

doi: 10.1110/ps.04637904.

Authors

Natarajan Kannan¹, Andrew F Neuwald

Affiliation

¹ Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA.

PMID: 15273306
PMCID: PMC2279817
DOI: 10.1110/ps.04637904

Abstract

Amino acid residues associated with functional specificity of cyclin-dependent kinases (CDKs), mitogen-activated protein kinases (MAPKs), glycogen synthase kinases (GSKs), and CDK-like kinases (CLKs), which are collectively termed the CMGC group, were identified by categorizing and quantifying the selective constraints acting upon these proteins during evolution. Many constraints specific to CMGC kinases correspond to residues between the N-terminal end of the activation segment and a CMGC-conserved insert segment associated with coprotein binding. The strongest such constraint is imposed on a "CMGC-arginine" near the substrate phosphorylation site with a side chain that plays a role both in substrate recognition and in kinase activation. Two nearby buried waters, which are also present in non-CMGC kinases, typically position the main chain of this arginine relative to the catalytic loop. These and other CMGC-specific features suggest a structural linkage between coprotein binding, substrate recognition, and kinase activation. Constraints specific to individual subfamilies point to mechanisms for CMGC kinase specialization. Within casein kinase 2alpha (CK2alpha), for example, the binding of one of the buried waters appears prohibited by the side chain of a leucine that is highly conserved within CK2alpha and that, along with substitution of lysine for the CMGC-arginine, may contribute to the broad substrate specificity of CK2alpha by relaxing characteristically conserved, precise interactions near the active site. This leucine is replaced by a conserved isoleucine or valine in other CMGC kinases, thereby illustrating the potential functional significance of subtle amino acid substitutions. Analysis of other CMGC kinases similarly suggests candidate family-specific residues for experimental follow-up.

PubMed Disclaimer

Figures

**Figure 1.**
Contrast hierarchical alignments of various CMGC kinase families with representative sequences from distinct eukaryotic kingdoms for each family. The structural regions shown in Figures 3 ▶–6 ▶ and discussed in the text are indicated at the *top*. The histograms *above* the alignments plot the strength of the category-specific selective constraints imposed at each position (essentially using logarithmic scaling); these constraints also are indicated qualitatively through residue highlighting (with biochemically similar residues colored similarly). Dots *below* the histograms indicate residues assigned to that alignment’s category. Secondary structure is indicated directly *above* the aligned sequences, with β strands indicated by their number designations (i.e., 6–9 correspond to the β6–β9 strands, respectively) and helices by their letter designations (i.e., F and G correspond to the F-helix and the G-helix, respectively). The *leftmost* column of each alignment gives protein descriptions using the following color code to specify major eukaryotic taxa: metazoans, red; plants, green; fungi, dark yellow; and protozoans, cyan. See Materials and Methods for sequence identifiers. The foreground sequences (see text) are shown indirectly via the consensus patterns and corresponding weighted residue frequencies (wt_res_freqs) *below* the aligned sequences actually displayed. (Such sequence weighting adjusts for overrepresented families in the alignment.) Foreground alignment residue frequencies are indicated in integer tenths where, for example, a 5 indicates that the corresponding residue directly above it occurs in 50% to 60% of the weighted sequences. (A) Contrast hierarchical alignment of kinase-shared constraints. All protein kinases (10,583 sequences) comprise the foreground and all proteins, the background (see text for details). (B) Contrast hierarchical alignment of CMGC-specific constraints. CMGC kinases (1309 sequences) comprise the foreground and all protein kinases (i.e., the foreground set in A), the background sequences. Note that the alignment of SRPKs residues against P227 and G228 of the query (ERK2) is uncertain due to SRPK-specific insertions in this region; their structural locations are likewise uncertain due to disordering of this region in the Sky1p crystal structure. This may be related to SRPK-specific functions (see text).

**Figure 2.**
Critical regions of contrast hierarchical alignments showing various family-specific constraints within CMGC kinases. CMGC kinases (operationally defined as protein kinases with either an arginine or a lysine at the CMGC-arginine position) constitute the background sequence set for each alignment. See legend to Figure 1 ▶ for a further description of alignment highlighting and notation. See Materials and Methods for sequence identifiers. For each alignment, the sequences displayed are from distinct phyla. See text for details. (A) MAP-specific constraints. (B) CDK-specific constraints. (C) GSK-specific constraints. (D) SRPK-specific constraints. (E) DYRK-specific constraints. (F) CK2-specific constraints.

**Figure 3.**
Structural features of CMGC protein kinases discussed in the text. The structure of p38 MAP kinase (PDB 1cm8) is shown as a prototype of the CMGC group. Color scheme is as follows: Main-chain traces of key regions, colored as indicated at the *top* of Figure 1 ▶; main-chain traces of other regions are light gray; ATPs are cyan; phosphate moieties use the standard CPK color scheme; oxygen, nitrogen, and hydrogen atoms establishing hydrogen bonds are red, blue, and white, respectively; side chains of kinase-shared residues are pale magenta; and CMGC-specific residues are light yellow. Hydrogen bonds are depicted as dotted lines; CH–π interactions (Weiss et al. 2001) are depicted as dotted lines into dot clouds. (A) Key regions within the protein kinase N- and C-terminal domains. The locations of the αC, αF, and αG helices are indicated. (B) Close-up of regions that stabilize and interact with the CMGC-arginine (R192). Modeled substrate with a P + 1 proline is shown.

**Figure 4.**
Structural features surrounding buried waters located between the APE region and the catalytic loop in CMGC and other protein kinases. In the main figures, the buried waters are shown as oxygens surrounded by turquoise dot clouds; in the *inset*, predicted hydrogen bonds formed by these waters are shown. Residue side chains and canonical glycine main chains are colored by functional categories as follows: kinase-shared, pale magenta; CMGC-specific, light yellow; intermediate between kinase-shared and CMGC-specific, light orange to pale red; family-specific, pale cyan; intermediate between CMGC-specific and family-specific, pale green; and subfamily-specific, light blue. Hydrogen bonding carbons are colored as their corresponding side chains, and van der Waals interactions are depicted as dot clouds. See the legend to Figure 3 ▶ for other bond representations and coloring conventions; see text for details. (A) Erk2 MAP kinase (PDB 2erk). (B) CDK2 (PDB 1qmz). (C) GSK (PDB 1gng). (D) CK2α (PDB 1lp4). In CK2α the water nearer the APE region is missing, presumably due to filling of its binding cavity by the side chain of L213, which is distinctively conserved within CK2α. (E) The SRPK-related Sky1p kinase (PDB 1how). The water nearer the APE region is replaced by the side chain of Q566, which occupies the conformationally strained position in SRPKs. Note that both the nitrogen and the oxygen of the glutamine side chain participate in hydrogen bonds that, in other protein kinases, are typically established by this water. (F) The (non-CMGC) phosphorylase kinase (PDB 2phk). As shown here, these buried waters also occur in non-CMGC protein kinases.

**Figure 5.**
Category-specific structural features within and surrounding the activation segment. Electrostatic interactions are depicted as dot clouds. See the legends to Figures 3 ▶ and 4 ▶ for other bond representations and coloring conventions; see text for details. (A) ERK2 MAP kinase (PDB 2erk). (B) CDK2 (PDB 1qmz). (C) GSK (PDB 1gng). (D) CK2α (PDB 1ds5). (E) Model of substrate-bound SRPKs. This model is based on the non–substrate-bound form of the SRPK-related Sky1p protein (PDB 1how) and on CMGC-specific interactions revealed by our analysis. Residue numbering is based on the Sky1p structure. (F) Hypothetical model of DYRK (constructed as described in the text).

**Figure 6.**
CMGC canonical structural features near the αF-to-αG region and the CMGC insert. See the legends to Figures 3 ▶ and 4 ▶ for bond representations and coloring conventions; see text for details and discussion. (A) ERK2 MAP kinase (PDB 2erk). (B) CDK2 (PDB 1qmz). (C) GSK (PDB 1gng). (D) CK2α (PDB 1lp4). (E) Sky1p (PDB 1how). (F) Sky1p structural features near a histidine (H618) that replaces the CMGC-glutamine residue within SRPKs.

See this image and copyright information in PMC

Cited by

Co-conserved MAPK features couple D-domain docking groove to distal allosteric sites via the C-terminal flanking tail.
Nguyen T, Ruan Z, Oruganty K, Kannan N. Nguyen T, et al. PLoS One. 2015 Mar 23;10(3):e0119636. doi: 10.1371/journal.pone.0119636. eCollection 2015. PLoS One. 2015. PMID: 25799139 Free PMC article.
Structural and Enzymological Evidence for an Altered Substrate Specificity in Okur-Chung Neurodevelopmental Syndrome Mutant CK2α^Lys198Arg.
Werner C, Gast A, Lindenblatt D, Nickelsen A, Niefind K, Jose J, Hochscherf J. Werner C, et al. Front Mol Biosci. 2022 Apr 4;9:831693. doi: 10.3389/fmolb.2022.831693. eCollection 2022. Front Mol Biosci. 2022. PMID: 35445078 Free PMC article.
Functional flexibility of human cyclin-dependent kinase-2 and its evolutionary conservation.
Bártová I, Koca J, Otyepka M. Bártová I, et al. Protein Sci. 2008 Jan;17(1):22-33. doi: 10.1110/ps.072951208. Epub 2007 Nov 27. Protein Sci. 2008. PMID: 18042686 Free PMC article.
SUMOylation of the Forkhead transcription factor FOXL2 promotes its stabilization/activation through transient recruitment to PML bodies.
Georges A, Benayoun BA, Marongiu M, Dipietromaria A, L'Hôte D, Todeschini AL, Auer J, Crisponi L, Veitia RA. Georges A, et al. PLoS One. 2011;6(10):e25463. doi: 10.1371/journal.pone.0025463. Epub 2011 Oct 12. PLoS One. 2011. PMID: 22022399 Free PMC article.
Phosphorylation of the transcription activator CLOCK regulates progression through a ∼ 24-h feedback loop to influence the circadian period in Drosophila.
Mahesh G, Jeong E, Ng FS, Liu Y, Gunawardhana K, Houl JH, Yildirim E, Amunugama R, Jones R, Allen DL, Edery I, Kim EY, Hardin PE. Mahesh G, et al. J Biol Chem. 2014 Jul 11;289(28):19681-93. doi: 10.1074/jbc.M114.568493. Epub 2014 May 28. J Biol Chem. 2014. PMID: 24872414 Free PMC article.

See all "Cited by" articles

References

1. Adams, J.A. 2003. Activation loop phosphorylation and catalysis in protein kinases: Is there functional evidence for the autoinhibitor model? Biochemistry 42 601–607. - PubMed
1. Aubol, B.E., Chakrabarti, S., Ngo, J., Shaffer, J., Nolen, B., Fu, X.D., Ghosh, G., and Adams, J.A. 2003. Processive phosphorylation of alternative splicing factor/splicing factor 2. Proc. Natl. Acad. Sci. 100 12601–12606. - PMC - PubMed
1. Battistutta, R., Sarno, S., De Moliner, E., Marin, O., Issinger, O.G., Zanotti, G., and Pinna, L.A. 2000. The crystal structure of the complex of Zea mays α subunit with a fragment of human β subunit provides the clue to the architecture of protein kinase CK2 holoenzyme. Eur. J. Biochem. 267 5184–5190. - PubMed
1. Bax, B., Carter, P.S., Lewis, C., Guy, A.R., Bridges, A., Tanner, R., Pettman, G., Mannix, C., Culbert, A.A., Brown, M.J., et al. 2001. The structure of phosphorylated GSK-3β complexed with a peptide, FRATtide, that inhibits β-catenin phosphorylation. Structure 9 1143–1152. - PubMed
1. Becker, W. and Joost, H.G. 1999. Structural and functional characteristics of DYRK, a novel subfamily of protein kinases with dual specificity. Prog. Nucleic Acid Res. Mol. Biol. 62 1–17. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Evolutionary constraints associated with functional specificity of the CMGC protein kinases MAPK, CDK, GSK, SRPK, DYRK, and CK2alpha

Affiliation

Evolutionary constraints associated with functional specificity of the CMGC protein kinases MAPK, CDK, GSK, SRPK, DYRK, and CK2alpha

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources