Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Dec;14(12):2972-81.
doi: 10.1110/ps.051759005.

Protein surface analysis for function annotation in high-throughput structural genomics pipeline

Affiliations

Protein surface analysis for function annotation in high-throughput structural genomics pipeline

T Andrew Binkowski et al. Protein Sci. 2005 Dec.

Abstract

Structural genomics (SG) initiatives are expanding the universe of protein fold space by rapidly determining structures of proteins that were intentionally selected on the basis of low sequence similarity to proteins of known structure. Often these proteins have no associated biochemical or cellular functions. The SG success has resulted in an accelerated deposition of novel structures. In some cases the structural bioinformatics analysis applied to these novel structures has provided specific functional assignment. However, this approach has also uncovered limitations in the functional analysis of uncharacterized proteins using traditional sequence and backbone structure methodologies. A novel method, named pvSOAR (pocket and void Surface of Amino Acid Residues), of comparing the protein surfaces of geometrically defined pockets and voids was developed. pvSOAR was able to detect previously unrecognized and novel functional relationships between surface features of proteins. In this study, pvSOAR is applied to several structural genomics proteins. We examined the surfaces of YecM, BioH, and RpiB from Escherichia coli as well as the CBS domains from inosine-5'-monosphate dehydrogenase from Streptococcus pyogenes, conserved hypothetical protein Ta549 from Thermoplasm acidophilum, and CBS domain protein mt1622 from Methanobacterium thermoautotrophicum with the goal to infer information about their biochemical function.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
An overview of the pvSOAR search methodology using YecM from E. coli. The structure of YecM (A) is submitted to the CASTp Web server (B) for pocket and void identification. A surface is identified, in this case CASTp pocket 18, and selected as the query template (C, green). The query is submitted to the pvSOAR Web server (D) and statistically significant surfaces matches are returned: neurolysin (PDB ID 1i1i, CASTp ID 85) from R. norvegicus (E, red), thermolysin (PDB ID 1lnd, CASTp ID 47) from B. thermoproteolyticus (F, orange), and from lethal toxin factor (PDB ID 1j7n, CASTp ID 198) from B. anthracis (G, yellow). The conserved residues between the query and library surfaces are superimposed (H). Figures were generated using PyMol (DeLano 2002) and CASTpyMol plugin.
Figure 2.
Figure 2.
The surface pocket (CASTp ID 20, orange) on BioH from E. coli (PDB ID 1m33) contains a grouping of basic residues (yellow) (A). The CoA binding from aminoglycoside 2′-N-acetyltransferase (AAC(2′)-lc) from M. tuberculosis (PDB ID 1m4g, E.C.2.3.1.-) (B). CoA ligand has been modeled into the surface of BioH (D), based on the superposition with AAC(2′)-lc (C). Using a smaller solvent probe radius (1.2 Å) to define the pocket, the surface reveals an additional channel protruding into the domain interface which contains the buried catalytic triad (yellow) and Phe143 (magenta) (E).
Figure 3.
Figure 3.
The surfaces containing the proposed catalytic residues of RpiB from E. coli (PDB ID 1nn4, CASTp ID 67) (A) and M. tuberculosis (PDB ID 1usl, CASTp ID 67) (B) and RpiA from E. coli (PDB ID 1o8b, CASTp ID 49) (C). The distribution of amino acid residues in each surface (D).
Figure 4.
Figure 4.
Structures containing CBS domains from CBS domain protein mt1622 from M. thermoautotrophicum (PDB ID 1pbj) (A), inosine-5′-monosphate dehydrogenase (IMPDH) from S. pyogenes (PDB ID 1zfj, E.C.1.1.1.205) (Zhang et al. 1999) (B), and conserved hypothetical protein Ta549 from T. acidophilum (PDB ID 1pvm) (C). Surfaces are colored by element type. The proposed nucleotide bindings surface of mt1622 (CASTp ID 9) has AMP modeled into it based on the superposition to a flavoprotein (PDB ID 1efp) (A). The proposed nucleotide bindings surface of IMPDH (CASTp ID 31) has ATP modeled into it based on the superposition to cyclin-dependent kinase 2 (PDB ID 1b38) (B). Ta549 contains an additional C terminus CBS domain (C, orange) opposite the tandem domain interface surface (C, green). The domain insert creates a novel surface that shares similarity to an ATP binding surface from saicarsynthase (PDB ID 1obd). An ATP molecule has been modeled into the surface of Ta549 (orange) based on the superposition of the conserved residues (D). A stereo representation of the surfaces from Ta549 (orange) and saicarsynthase (white) with the modeled ATP (E).

References

    1. Bateman, A. 1997. The structure of a domain common to archaebacteria and the homocystinuria disease protein. Trends Biochem. Sci. 22 12–13. - PubMed
    1. Berman, H.M., Battistuz, T., Bhat, T.N., Bluhm, W.F., Bourne, P.E., Burkhardt, K., Feng, Z., Gilliland, G.L., Iype, L., Jain, S., et al. 2002. The Protein Data Bank. Acta Crystallogr. D Biol. Crystallogr. 58 899–907. - PubMed
    1. Binkowski, T.A., Adamian, L., and Liang, J. 2003a. Inferring functional relationships of proteins from local sequence and spatial surface patterns. J. Mol. Biol. 332 505–526. - PubMed
    1. Binkowski, T.A., Naghibzadeh, S., and Liang, J. 2003b. CASTp: Computed atlas of surface topography of proteins. Nucleic Acids Res. 31 3352–3355. - PMC - PubMed
    1. Binkowski, T.A., Freeman, P., and Liang, J. 2004. pvSOAR: Detecting similar surface patterns of pocket and void surfaces of amino acid residues on proteins. Nucleic Acids Res. 32 W555–W558. - PMC - PubMed

Publication types

LinkOut - more resources