Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Dec 18;7 Suppl 5(Suppl 5):S13.
doi: 10.1186/1471-2105-7-S5-S13.

Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach

Affiliations

Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach

H H Lin et al. BMC Bioinformatics. .

Abstract

Metal-binding proteins play important roles in structural stability, signaling, regulation, transport, immune response, metabolism control, and metal homeostasis. Because of their functional and sequence diversity, it is desirable to explore additional methods for predicting metal-binding proteins irrespective of sequence similarity. This work explores support vector machines (SVM) as such a method. SVM prediction systems were developed by using 53,333 metal-binding and 147,347 non-metal-binding proteins, and evaluated by an independent set of 31,448 metal-binding and 79,051 non-metal-binding proteins. The computed prediction accuracy is 86.3%, 81.6%, 83.5%, 94.0%, 81.2%, 85.4%, 77.6%, 90.4%, 90.9%, 74.9% and 78.1% for calcium-binding, cobalt-binding, copper-binding, iron-binding, magnesium-binding, manganese-binding, nickel-binding, potassium-binding, sodium-binding, zinc-binding, and all metal-binding proteins respectively. The accuracy for the non-member proteins of each class is 88.2%, 99.9%, 98.1%, 91.4%, 87.9%, 94.5%, 99.2%, 99.9%, 99.9%, 98.0%, and 88.0% respectively. Comparable accuracies were obtained by using a different SVM kernel function. Our method predicts 67% of the 87 metal-binding proteins non-homologous to any protein in the Swissprot database and 85.3% of the 333 proteins of known metal-binding domains as metal-binding. These suggest the usefulness of SVM for facilitating the prediction of metal-binding proteins. Our software can be accessed at the SVMProt server http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The sequence of a hypothetic protein for illustration of derivation of the feature vector of a protein. Sequence index indicates the position of an amino acid in the sequence. The index for each type of amino acids in the sequence (A or E) indicates the position of the first, second, third, ... of that type of amino acid (The position of the first, second, third, ..., A is at 1, 3, 4, ...). A/E transition indicates the position of AE or EA pairs in the sequence.

Similar articles

Cited by

References

    1. Wintz H, Fox T, Wu YY, Feng V, Chen W, Chang HS, Zhu T, Vulpe C. Expression profiles of Arabidopsis thaliana in mineral deficiencies reveal novel transporters involved in metal homeostasis. J Biol Chem. 2003;278:47644–47653. doi: 10.1074/jbc.M309338200. - DOI - PubMed
    1. Cox EH, McLendon GL. Zinc-dependent protein folding. Curr Opin Chem Biol. 2000;4:162–165. doi: 10.1016/S1367-5931(99)00070-8. - DOI - PubMed
    1. Michel SL, Berg JM. Building a metal binding domain, one half at a time. Chem Biol. 2002;9:667–668. doi: 10.1016/S1074-5521(02)00160-6. - DOI - PubMed
    1. de la Calle Guntinas MB, Bordin G, Rodriguez AR. Identification, characterization and determination of metal-binding proteins by liquid chromatography. A review. Anal Bioanal Chem. 2002;374:369–378. doi: 10.1007/s00216-002-1508-3. - DOI - PubMed
    1. Yang W, Lee HW, Hellinga H, Yang JJ. Structural analysis, identification, and design of calcium-binding sites in proteins. Proteins. 2002;47:344–356. doi: 10.1002/prot.10093. - DOI - PubMed

Publication types

LinkOut - more resources