Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2002 Jul 1;21(13):3213-24.
doi: 10.1093/emboj/cdf327.

Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain

Affiliations
Comparative Study

Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain

Kanchan Anand et al. EMBO J. .

Abstract

The key enzyme in coronavirus polyprotein processing is the viral main proteinase, M(pro), a protein with extremely low sequence similarity to other viral and cellular proteinases. Here, the crystal structure of the 33.1 kDa transmissible gastroenteritis (corona)virus M(pro) is reported. The structure was refined to 1.96 A resolution and revealed three dimers in the asymmetric unit. The mutual arrangement of the protomers in each of the dimers suggests that M(pro) self-processing occurs in trans. The active site, comprised of Cys144 and His41, is part of a chymotrypsin-like fold that is connected by a 16 residue loop to an extra domain featuring a novel alpha-helical fold. Molecular modelling and mutagenesis data implicate the loop in substrate binding and elucidate S1 and S2 subsites suitable to accommodate the side chains of the P1 glutamine and P2 leucine residues of M(pro) substrates. Interactions involving the N-terminus and the alpha-helical domain stabilize the loop in the orientation required for trans-cleavage activity. The study illustrates that RNA viruses have evolved unprecedented variations of the classical chymotrypsin fold.

PubMed Disclaimer

Figures

None
Fig. 1. Sequence comparison of coronavirus main proteinases. The alignment was produced using CLUSTAL X, version 1.81 (Thompson et al., 1997), and corrected manually on the basis of the three-dimensional structure of TGEV Mpro. The corresponding sequences of FIPV (strain 79–1146), HCoV (strain 229E), bovine coronavirus (BCoV, isolate LUN), MHV (strain JHM) and IBV (strain Beaudette) were derived from the replicative polyproteins of the respective viruses whose sequences are deposited at the DDBJ/EMBL/GenBank database (accession Nos: FIPV, AF326575; HCoV, X69721; BCoV, AF391542; MHV, M55148; IBV, M95169; TGEV, AJ271965). The β-strands and α-helices as revealed in the TGEV Mpro crystal structure (this study) are shown above the sequence alignment (see also Figures 4 and 5). Black background colour indicates the catalytic cysteine and histidine residues. Grey background colour indicates the key residue of the S1 subsite (TGEV Mpro His162) and its equivalents in other coronavirus main proteinases. Also shown in grey are the phenylalanine and tyrosine residues (TGEV Mpro Phe139 and Tyr160) that are proposed to stabilize the neutral state of His162 (see text for details).
None
Fig. 2. Stereo view of a representative part of the electron density map. The 2|Fo| – |Fc| electron density map (1.96 Å resolution, contoured at 1σ above the mean) corresponds to Mpro residues 160–162 (Tyr–Met–His), a conserved motif in coronavirus main proteinases. The strong hydrogen bonding interaction between the Tyr160 hydroxyl group and His162 Nδ1 is indicated.
None
Fig. 3. Stereo depiction of the six molecules (three dimers) of TGEV Mpro in the asymmetric unit. The monomers A–F are shown in different colours; A = red, B = black, C = green, D = orange-red, E = yellow and F = cyan. Note the 2-fold symmetry axes between the monomers in each of the dimers, and between the two lower dimers in the figure (AB and EF). Each of the monomers measures ∼70 Å × 22 Å × 40 Å.
None
Fig. 4. A MOLSCRIPT diagram (Kraulis, 1991) showing the overall fold of TGEV Mpro (A) with the two β-barrel domains and the α-helical C-terminal domain. β-strands and helices are represented as arrows and cylinders, respectively. The β-barrels of each domain I and II are composed of six-stranded β-sheets (green). Domain III is composed mainly of α-helices (red). The structures of HAV 3Cpro (PDB code: 1HAV) (B) and α-chymotrypsin (4CHA, residues 12–15 and 147–148 are excised) (C) are shown for comparison.
None
Fig. 5. Topological representation of the secondary structure elements of a TGEV Mpro monomer. α-helices and β-strands are represented as cylinders and arrows, respectively. Numbers indicate the N- and C-terminal residues of the secondary structure elements. Strands bI and cI are adjacent. Cys144 (yellow) and His41 (blue) are shown by circles. The positions of the N- and C-termini are indicated. Also, the presumed localization of the P5–P1 region of a model substrate is shown (blue) (for details, see text and Figure 7).
None
Fig. 6. Active site of the TGEV Mpro. (A) Difference electron density (|Fo| – |Fc| at 3.0σ above the mean; red) for the oxidized active site Cys144, indicating three oxygen atoms bound to the sulfur. (B) The catalytic Cys144 and His41 residues are shown. The region forming the oxyanion hole (main chain amides of Gly142, Thr143 and Cys144) is highlighted in pink. The water molecule, which occupies a position equivalent to that of the catalytic aspartate of serine proteinases, is shown together with its hydrogen-bonding partners, His41, His163 and Asp186. (C) Superposition of the active site residues of chymotrypsin (shown in red) with the spatially equivalent residues of TGEV Mpro (blue) and HAV 3Cpro (green). The equivalent to the third catalytic residue (Asp102) of chymotrypsin is Asp84 in HAV 3Cpro (side chain oriented differently) and Val84 in TGEV Mpro.
None
Fig. 7. Stereo diagram of a P5–P1 substrate (Asn–Ser–Thr–Leu–Gln, red; corresponding to the TGEV Mpro N-terminal autoprocessing site) modelled into the active site cleft of the TGEV Mpro. Hydrogen bonds are depicted by dotted lines.
None
Fig. 8. Intra- and intermolecular contacts of the TGEV Mpro N-terminus. (A) MOLSCRIPT stereo representation of a TGEV Mpro dimer. Molecule A is coloured from blue at the N-terminus, via green (domain II), to red (C-terminus), while molecule B is shown in grey. The catalytic Cys144 and His41 residues are labelled in both monomers. (B) Detailed view of the interactions made by the N-terminal segment (blue) and domains II/III of monomer A as well as domains II/III of monomer B. Residues critically involved in these interactions are designated by the single-letter code and shown in ball-and-stick representation (see text for details). The N- and C-termini of molecule A are indicated.

References

    1. Allaire M., Chernaia,M.M., Malcolm,B.A. and James,M.N. (1994) Picornaviral 3C cysteine proteinases have a fold similar to chymotrypsin-like serine proteinases. Nature, 369, 72–76. - PubMed
    1. Andino R., Rieckhof,G.E., Achacoso,P.L. and Baltimore,D. (1993) Poliovirus RNA synthesis utilizes an RNP complex formed around the 5′-end of viral RNA. EMBO J., 12, 3587–3598. - PMC - PubMed
    1. Bacon D.J. and Anderson,W.F. (1988) A fast algorithm for rendering space-filling molecule pictures. J. Mol. Graphics, 6, 219–220.
    1. Bazan J.F. and Fletterick,R.J. (1988) Viral cysteine proteases are homologous to the trypsin-like family of serine proteases: structural and functional implications. Proc. Natl Acad. Sci. USA, 85, 7872–7876. - PMC - PubMed
    1. Bergmann E.M., Mosimann,S.C., Chernaia,M.M., Malcolm,B.A. and James,M.N. (1997) The refined crystal structure of the 3C gene product from hepatitis A virus: specific proteinase activity and RNA recognition. J. Virol., 71, 2436–2448. - PMC - PubMed

Publication types

MeSH terms