Incorporating support vector machine for identifying protein tyrosine sulfation sites
- PMID: 19373826
- DOI: 10.1002/jcc.21258
Abstract
Tyrosine sulfation is a post-translational modification of many secreted and membrane-bound proteins. It governs protein-protein interactions that are involved in leukocyte adhesion, hemostasis, and chemokine signaling. However, the intrinsic features of sulfated proteins remain elusive. This investigation presents SulfoSite, a computational method based on a support vector machine (SVM) for predicting protein sulfotyrosine sites. The approach considers structural information, such as the secondary structure and solvent accessibility of the amino acids that surround the sulfotyrosine sites. One hundred sixty-two experimentally verified tyrosine sulfation sites were obtained from UniProtKB/SwissProt release 53.0. The results of a five-fold cross-validation evaluation suggest that solvent accessibility around the sulfotyrosine sites contributes substantially to predictive accuracy. The SVM classifier achieves an accuracy of 94.2% in five-fold cross-validation when a sequence positional weighted matrix (PWM) is coupled with values of the accessible surface area (ASA). The proposed method significantly outperforms previous methods in predicting the location of tyrosine sulfation sites.
Copyright 2009 Wiley Periodicals, Inc.
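The feature construction described in the abstract, a sequence PWM coupled with ASA values, can be illustrated with a minimal Python sketch. This is not SulfoSite's implementation: the window size (`flank=4`), the uniform background, the pseudocount, the toy windows, and the ASA numbers are all assumptions made for illustration, and the function names are hypothetical. The resulting vectors are the kind of input one would pass to an SVM classifier.

```python
from collections import Counter
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAD = "-"  # placeholder for window positions that fall off the sequence ends


def extract_windows(sequence, flank=4):
    """Return (position, window) for every tyrosine, padded at the termini.

    The flank size is an assumption; the abstract does not state it.
    """
    padded = PAD * flank + sequence + PAD * flank
    windows = []
    for i, residue in enumerate(sequence):
        if residue == "Y":
            # Index i in the sequence is index i + flank in the padded string,
            # so the window centred on it starts at padded index i.
            windows.append((i, padded[i:i + 2 * flank + 1]))
    return windows


def build_pwm(windows, pseudocount=1.0):
    """Per-position log-odds scores against a uniform background (assumed)."""
    length = len(windows[0])
    background = 1.0 / len(AMINO_ACIDS)
    pwm = []
    for pos in range(length):
        counts = Counter(w[pos] for w in windows if w[pos] != PAD)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        pwm.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return pwm


def encode(window, pwm, asa):
    """Concatenate per-position PWM scores with per-position ASA values."""
    # Padding characters have no PWM entry and contribute a neutral 0.0.
    scores = [pwm[pos].get(aa, 0.0) for pos, aa in enumerate(window)]
    return scores + list(asa)


# Made-up windows standing in for verified sulfation sites (illustration only).
toy_positives = ["DEEDYEDDE", "EDADYDEEL", "GDDEYDTEE"]
pwm = build_pwm(toy_positives)

# Hypothetical relative ASA values, one per window position.
toy_asa = [0.6, 0.7, 0.5, 0.8, 0.9, 0.8, 0.6, 0.7, 0.5]
features = encode("DEEDYEDDE", pwm, toy_asa)
```

Each candidate tyrosine then yields one fixed-length vector (here 9 PWM scores followed by 9 ASA values), which keeps the sequence and structural signals side by side so a classifier can weigh them jointly.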