Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Apr 25:13:6.
doi: 10.1186/1472-6807-13-6.

Structural and functional studies of S-adenosyl-L-methionine binding proteins: a ligand-centric approach

Affiliations

Structural and functional studies of S-adenosyl-L-methionine binding proteins: a ligand-centric approach

Rajaram Gana et al. BMC Struct Biol. .

Abstract

Background: The post-genomic era poses several challenges. The biggest is the identification of biochemical function for protein sequences and structures resulting from genomic initiatives. Most sequences lack a characterized function and are annotated as hypothetical or uncharacterized. While homology-based methods are useful, and work well for sequences with sequence identities above 50%, they fail for sequences in the twilight zone (<30%) of sequence identity. For cases where sequence methods fail, structural approaches are often used, based on the premise that structure preserves function for longer evolutionary time-frames than sequence alone. It is now clear that no single method can be used successfully for functional inference. Given the growing need for functional assignments, we describe here a systematic new approach, designated ligand-centric, which is primarily based on analysis of ligand-bound/unbound structures in the PDB. Results of applying our approach to S-adenosyl-L-methionine (SAM) binding proteins are presented.

Results: Our analysis included 1,224 structures that belong to 172 unique families of the Protein Information Resource Superfamily system. Our ligand-centric approach was divided into four levels: residue, protein/domain, ligand, and family levels. The residue level included the identification of conserved binding site residues based on structure-guided sequence alignments of representative members of a family, and the identification of conserved structural motifs. The protein/domain level included structural classification of proteins, Pfam domains, domain architectures, and protein topologies. The ligand level included ligand conformations, ribose sugar puckering, and the identification of conserved ligand-atom interactions. The family level included phylogenetic analysis.

Conclusion: We found that SAM bound to a total of 18 different fold types (I-XVIII). We identified 4 new fold types and 11 additional topological arrangements of strands within the well-studied Rossmann fold Methyltransferases (MTases). This extends the existing structural classification of SAM binding proteins. A striking correlation between fold type and the conformation of the bound SAM (classified as types) was found across the 18 fold types. Several site-specific rules were created for the assignment of functional residues to families and proteins that do not have a bound SAM or a solved structure.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Ligand-centric approach. This approach involves a multipronged analysis at various sequence and structural levels. These include analysis at the residue level, analysis at the protein/domain level, and analysis at the family level. At the residue level, this analysis includes identification of conserved binding site residues based on structure-guided sequence alignments of representative members of a family and identification of conserved motifs. At the protein/domain level, analysis includes examination of SCOP folds, pfam domains, and protein topologies. At the ligand level, the analysis includes ligand conformations, ribose sugar puckering (when applicable), and identification of conserved ligand-atom interactions. Finally, at the family level, the approach includes phylogenetic analysis.
Figure 2
Figure 2
Fold types of SAM-binding proteins. The folds follow SCOP classification, except for Helical Bundle, which we have assigned. A total of 18 folds include 9 Mtases and 9 non-MTases indicated by #. Structures belonging to the Rossmann fold methylases have evolved to become MTases and non- MTases and are indicated with a yellow box. SAM-dependent MTases have been previously categorized into five classes by Cheng et al.[38]. We have extended this to include a total of nine classes. The four added classes are indicated by **.
Figure 3
Figure 3
Topological classes within fold type I. The classification is based on defining the strands that form the core. The strands are numbered from the N-terminus to the C-terminus and read from left to right. Only the SAM/SAH domain is included. Bound SAM is shown as a ball and sticks, and the structures are represented in cartoon diagrams. The strands that form the core are colored in red. The labels list the PDB-ID followed by the topology. The corresponding two-dimensional topological arrangement is provided in Additional file 3. The figures were generated using PyMOL visualization software (http://www.pymol.org).
Figure 4
Figure 4
Ligand conformations across all 18 fold types. A striking correlation between fold type and ligand conformation was noted. One representative structure was selected from each of the different folds. The structure with the highest resolution was chosen. The ligand SAM/SAH is indicated as a ball and stick. The figure was generated using Chimera visualization software (http://www.cgl.ucsf.edu/chimera/), and atoms are labeled. **Beside Type VII (PDB-ID: 4A2N) indicates an average temperature factor of >80Å2 for the ligand and hence may not be reliable. Conformation can be confirmed as more structures become available.
Figure 5
Figure 5
Superposition of all fold type I SAM/SAH ligands of representative structures from each family that have a mean B-factor of <80Å2. A. Superposition via the ribose moiety. B. Superposition of all SAM atoms. Figure was generated using Chimera Visualization Software (http://www.cgl.ucsf.edu/chimera/).
Figure 6
Figure 6
Structure-guided alignment of representative structures for fold type I. Only the aligned core is shown. The alignment was completed using the Cn3d tool. The structural representation is shown as tubes.
Figure 7
Figure 7
Taxonomic distributions of SAM-binding proteins. Families that have representative members from all three branches of life [Archaea (A), Bacteria (B), and Eukaryotes (E)] are indicated within the rectangular boxes. The corresponding fold type is indicated for each of these families along with the circumference of the circle. A total of 29 families that belong to 10 different fold types contain members in all three branches of life. This information may help to identify the last universal common ancestor of SAM-binding proteins.

References

    1. Casari G, Andrade MA, Bork P, Boyle J, Daruvar A, Ouzounis C, Schneider R, Tamames J, Valencia A, Sander C. Challenging times for bioinformatics. Nature. 1995;376(6542):647–648. doi: 10.1038/376647a0. - DOI - PubMed
    1. Blundell TL, Mizuguchi K. Structural genomics: an overview. Prog Biophys Mol Biol. 2000;73(5):289–295. doi: 10.1016/S0079-6107(00)00008-0. - DOI - PubMed
    1. Watson JD, Todd AE, Bray J, Laskowski RA, Edwards A, Joachimiak A, Orengo CA, Thornton JM. Target selection and determination of function in structural genomics. IUBMB Life. 2003;55(4–5):249–255. - PMC - PubMed
    1. UniProt C. The universal protein resource (UniProt) Nucleic Acids Res. 2008;36:D190–D195. doi: 10.1093/nar/gkn141. Database issue. - DOI - PMC - PubMed
    1. Catoni GL. S-Adenosylmethionine; a new intermediate formed enzymatically from L-methionine and adenosinetriphosphate. J Biol Chem. 1953;204(1):403–416. - PubMed

Publication types

LinkOut - more resources