. 2009 Jul;8(7):1516-26.

doi: 10.1074/mcp.M900025-MCP200. Epub 2009 Apr 7.

Multiple Motif Scanning to identify methyltransferases from the yeast proteome

Tanya C Petrossian¹, Steven G Clarke

Affiliations

PMID: 19351663
PMCID: PMC2709183
DOI: 10.1074/mcp.M900025-MCP200

Multiple Motif Scanning to identify methyltransferases from the yeast proteome

Tanya C Petrossian et al. Mol Cell Proteomics. 2009 Jul.

. 2009 Jul;8(7):1516-26.

doi: 10.1074/mcp.M900025-MCP200. Epub 2009 Apr 7.

Authors

Tanya C Petrossian¹, Steven G Clarke

Affiliation

¹ Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095, USA.

PMID: 19351663
PMCID: PMC2709183
DOI: 10.1074/mcp.M900025-MCP200

Abstract

A new program (Multiple Motif Scanning) was developed to scan the Saccharomyces cerevisiae proteome for Class I S-adenosylmethionine-dependent methyltransferases. Conserved Motifs I, Post I, II, and III were identified and expanded in known methyltransferases by primary sequence and secondary structural analysis through hidden Markov model profiling of both a yeast reference database and a reference database of methyltransferases with solved three-dimensional structures. The roles of the conserved amino acids in the four motifs of the methyltransferase structure and function were then analyzed to expand the previously defined motifs. Fisher-based negative log statistical matrix sets were developed from the prevalence of amino acids in the motifs. Multiple Motif Scanning is able to scan the proteome and score different combinations of the top fitting sequences for each motif. In addition, the program takes into account the conserved number of amino acids between the motifs. The output of the program is a ranked list of proteins that can be used to identify new methyltransferases and to reevaluate the assignment of previously identified putative methyltransferases. The Multiple Motif Scanning program can be used to develop a putative list of enzymes for any type of protein that has one or more motifs conserved at variable spacings and is freely available (www.chem.ucla.edu/files/MotifSetup.Zip). Finally hidden Markov model profile clustering analysis was used to subgroup Class I methyltransferases into groups that reflect their methyl-accepting substrate specificity.

PubMed Disclaimer

Figures

**Fig. 1.**
The four signature motifs (*underlined*) of the Class I methyltransferases shown in their primary, predicted secondary, and actual secondary structure in three known methyltransferase proteins in *S.cerevisiae*. The motifs were identified using HHpred with HHsearch 1.5, which utilizes HMM profile *versus* profile searches to align the proteins. HHpred also generates secondary structural predictions (seen here) that also aid in the alignment and identification of the motifs, denoted C for random coil, E for β sheet, and H for helical structures. Because the crystal structures have been solved for Dot1, Hmt1, and Ppm1 proteins, the secondary structures of the crystals have been used for comparison with those predicted by HHpred. Although a crystal structure is known for the Mtf1/YMR228w, the reaction that it catalyzes is unknown. As seen here, predicted secondary structures are very similar to the actual secondary structures, especially in the motif regions.

**Fig. 2.**
**Crystal structure of Dot1 (*top*) and schematic (*bottom*) that depicts the secondary structures present in Class I methyltransferases.** Class I methyltransferases are distinguished by a common three-dimensional structural core, which includes a seven-strand twisted β sheet that provides the major binding interactions for AdoMet. β strands are in *yellow*, helices are in *red*, and non-strand/non-helices are shown in *green*. The S-adenosylhomocysteine molecule in the crystal structure is depicted in the *stick* model (Protein Data Bank code 1U2Z, Ref. 29).

**Fig. 3.**
**Development of a Fisher-based sequence scoring matrix used for Multiple Motif Scanning analysis.** The scoring matrix for each amino acid was compiled using statistical analysis from the two known databases of methyltransferases. The number of each amino acid residue at each position of each motif was tallied. p values from χ² tests were obtained by comparing the actual count of each amino acid with the “expected” count, which was calculated from frequencies found in the Pseudogene.org database. Values were then converted to the absolute value of the scores by taking the negative log, and scores were designated with a negative value if the actual amino acid count was less than the expected.

**Fig. 4.**
**Conservation of spacing between motifs.** The number of amino acids between the extended Motifs I and Post I and Motifs II and III for known methyltransferases are depicted. Scores for spacing were based on the frequency of yeast and structural database combined with penalties for excessive gap distance as described under “Experimental Procedures.”

**Fig. 5.**
**Conserved amino acid residues in and adjacent to the four methyltransferase motifs.** The motifs are arranged as they occur in the β sheet. Conserved amino acids along with the χ² p values from the yeast data set are shown in *boxes*. The residues that composed the originally described motifs are highlighted in *bold* print with a *gray background*. The secondary structure as determined from both the yeast and crystal data sets is denoted by an *arrow* (β strand), *purple box* (helix), or *line* (non-helix or strand). *Dotted* representations indicate structures that are not present in all methyltransferases. Residues involved in turns are also shown. The chemical interactions of key amino acids residues within the protein (*blue lines*) and/or with cofactor AdoMet (*) are shown. The *thick black lines* between each of the β strands indicate amino acids contributing to two hydrophobic pockets created by side chains of residues coming into (*solid*) and out of (*dashed*) the plane of the β sheet. Side chains from helical regions contributing to these hydrophobic pockets are indicated similarly by *solid* and *dashed circles* and *ovals*.

**Fig. 6.**
**Extended Motifs I, Post I, II, and III used for Multiple Motif Scanning analysis.** The amino acid frequency among known yeast methyltransferases is depicted using WebLogo (30) for the expanded Motifs I, Post I, II, and III determined in this work. A larger number of bits for each letter designates the importance of each amino acid in the motif.

**Fig. 7.**
**Multiple Motif Scanning software.** The Multiple Motif Scanning program was developed to scan the proteome for novel methyltransferases to resolve the problems encountered by Katz *et al.* (6). Yeast matrix and crystal structure matrix were independently used to scan the *S. cerevisiae* proteome. Multiple Motif Scanning recognizes the top five best matches for the first motif entered and subsequently identifies the top five plausible second motifs for each of the matches of the first motifs. The program continues to find the top five sequences that fit the motif for every previous combination to produce 5ⁿ matches for n number of motifs. All combinations are scored, and the top 10 combinations are saved. Proteins are ranked among each other based on the top score. The extended Motifs I, Post I, II, and III were used for analysis with gap considerations between motifs.

**Fig. 8.**
**Accuracy of Multiple Motif Scanning.**Motifs found by the Multiple Motif Scanning program using the yeast matrix were compared with the HMM profile alignments to calculate the percentage of inaccuracy in identifying the motifs. The program outputs the overall top score calculated from the combination of sequence-fitting motifs and spacing between them combined along with the next nine top scoring combinations.

**Fig. 9.**
**HMM profile clustering to determine putative substrates of potential methyltransferases.** HMM profile *versus* profile clustering was utilized to find the calculated homology between proteins. Proteins clustered in several groups (*A\NK*) that display similarity in type of substrate as well as the atomic nucleophile of the substrate in the methyltransferase reaction (carbon, nitrogen, oxygen, etc.). Known methyltransferases are depicted in *black*, known non-methyltransferases are depicted in *magenta*, and unknown ORFs are depicted in *green*

See this image and copyright information in PMC

Cited by

A novel small molecule methyltransferase is important for virulence in Candida albicans.
Lissina E, Weiss D, Young B, Rella A, Cheung-Ong K, Del Poeta M, Clarke SG, Giaever G, Nislow C. Lissina E, et al. ACS Chem Biol. 2013 Dec 20;8(12):2785-93. doi: 10.1021/cb400607h. Epub 2013 Oct 16. ACS Chem Biol. 2013. PMID: 24083538 Free PMC article.
A novel automethylation reaction in the Aspergillus nidulans LaeA protein generates S-methylmethionine.
Patananan AN, Palmer JM, Garvey GS, Keller NP, Clarke SG. Patananan AN, et al. J Biol Chem. 2013 May 17;288(20):14032-14045. doi: 10.1074/jbc.M113.465765. Epub 2013 Mar 26. J Biol Chem. 2013. PMID: 23532849 Free PMC article.
Closing in on human methylation-the versatile family of seven-β-strand (METTL) methyltransferases.
Falnes PØ. Falnes PØ. Nucleic Acids Res. 2024 Oct 28;52(19):11423-11441. doi: 10.1093/nar/gkae816. Nucleic Acids Res. 2024. PMID: 39351878 Free PMC article. Review.
Probabilistic approach to predicting substrate specificity of methyltransferases.
Szczepińska T, Kutner J, Kopczyński M, Pawłowski K, Dziembowski A, Kudlicki A, Ginalski K, Rowicka M. Szczepińska T, et al. PLoS Comput Biol. 2014 Mar 20;10(3):e1003514. doi: 10.1371/journal.pcbi.1003514. eCollection 2014 Mar. PLoS Comput Biol. 2014. PMID: 24651469 Free PMC article.
Emerging technologies to map the protein methylome.
Carlson SM, Gozani O. Carlson SM, et al. J Mol Biol. 2014 Oct 9;426(20):3350-62. doi: 10.1016/j.jmb.2014.04.024. Epub 2014 May 5. J Mol Biol. 2014. PMID: 24805349 Free PMC article. Review.

See all "Cited by" articles

References

1. Cheng X., Blumenthal R. M. ( 1999) S-Adenosylmethionine-Dependent Methyltransferases: Structures and Functions, World Scientific, Singapore
1. Djordjevic S., Stock A. M. ( 1997) Crystal structure of the chemotaxis receptor methyltransferase CheR suggests a conserved structural motif for binding S-adenosylmethionine. Structure 5, 545– 558 - PubMed
1. Martin J. L., McMillan F. M. ( 2002) SAM (dependent) I AM: the S-adenosylmethionine-dependent methyltransferase fold. Curr. Opin. Struct. Biol. 12, 783– 793 - PubMed
1. Schluckebier G., O'Gara M., Saenger W, Cheng X. ( 1995) Universal catalytic domain structure of AdoMet-dependent methyltransferases. J. Mol. Biol. 247, 16– 20 - PubMed
1. Schubert H. L., Blumenthal R. M., Cheng X. ( 2003) Many paths to methyltransfer: a chronicle of convergence. Trends Biochem. Sci. 28, 329– 335 - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Multiple Motif Scanning to identify methyltransferases from the yeast proteome

Affiliation

Multiple Motif Scanning to identify methyltransferases from the yeast proteome

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases