Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jun 25;19(1):237.
doi: 10.1186/s12859-018-2249-4.

Detecting Succinylation sites from protein sequences using ensemble support vector machine

Affiliations

Detecting Succinylation sites from protein sequences using ensemble support vector machine

Qiao Ning et al. BMC Bioinformatics. .

Abstract

Background: Lysine succinylation is a new kind of post-translational modification which plays a key role in protein conformation regulation and cellular function control. To understand the mechanism of succinylation profoundly, it is necessary to identify succinylation sites in proteins accurately. However, traditional methods, experimental approaches, are labor-intensive and time-consuming. Computational prediction methods have been proposed recent years, and they are popular because of their convenience and high speed. In this study, we developed a new method to predict succinylation sites in protein combining multiple features, including amino acid composition, binary encoding, physicochemical property and grey pseudo amino acid composition, with a feature selection scheme (information gain). And then, it was trained using SVM (Support Vector Machine) and an ensemble learning algorithm.

Results: The performance of this method was measured with an accuracy of 89.14% and a MCC (Matthew Correlation Coefficient) of 0.79 using 10-fold cross validation on training dataset and an accuracy of 84.5% and a MCC of 0.2 on independent dataset.

Conclusions: The conclusions made from this study can help to understand more of the succinylation mechanism. These results suggest that our method was very promising for predicting succinylation sites. The source code and data of this paper are freely available at https://github.com/ningq669/PSuccE .

Keywords: Ensemble learning algorithm; Grey pseudo amino acid composition; Information gain; Multiple features; Predict succinylation sites; SVM.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

All authors approval and consent to participate.

Consent for publication

All authors read and consent to publish the manuscript.

Competing interests

The authors declare that they no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
The flow chart of PSuccE
Fig. 2
Fig. 2
The information entropy value of positions around the central residue
Fig. 3
Fig. 3
Two sample logos of the compositional biases around succinylation sites compared to non-succinylation sites. Statistically significant symbols are plotted using the size of the symbol that is proportional to the difference between the two samples. Residues are separated in two groups: (1) enriched in the positive samples; and (2) depleted in the positive samples. Color of the symbols was classified according to the polarity of the residue side chain
Fig. 4
Fig. 4
ROC curves (AUC) of predictions based on 10-fold cross validation

Similar articles

Cited by

References

    1. Weinert B, Schölz C, Wagner S, Iesmantavicius V, Su D, Daniel J, Choudhary C. Lysine Succinylation is a frequently occurring modification in prokaryotes and eukaryotes and extensively overlaps with acetylation [J] Cell Rep. 2013;4(4):842–851. doi: 10.1016/j.celrep.2013.07.024. - DOI - PubMed
    1. Xie Z, Dai J, Dai L, Tan M, Cheng Z, Wu Y, Boeke J, Zhao Y. Lysine Succinylation and lysine Malonylation in histones [J] Mol Cell Proteomics Mcp. 2012;11(5):100–107. doi: 10.1074/mcp.M111.015875. - DOI - PMC - PubMed
    1. Tan M, Peng C, Anderson K, Chhoy P, Xie Z, Dai L, Park J, Chen Y, Huang H, Zhang Y, Ro J, Wagner GR, Green MF, Madsen AS, Schmiesing J, Peterson BS, Xu G, Ilkayeva OR, Muehlbauer MJ, Braulke T, Mühlhausen C, Backos DS, Olsen CA, McGuire PJ, Pletcher SD, Lombard DB, Hirschey MD, Zhao Y. Lysine Glutarylation is a protein posttranslational modification regulated by SIRT5 [J] Cell Metab. 2014;19(4):605–617. doi: 10.1016/j.cmet.2014.03.014. - DOI - PMC - PubMed
    1. Papanicolaou KN, O'Rourke B, Foster DB. Metabolism leaves its mark on the powerhouse: recent progress in post-translational modifications of lysine in mitochondria [J] Front Physiol. 2013;5(5):301. - PMC - PubMed
    1. Zhang Z, Tan M, Xie Z, Dai L, Chen Y, Zhao T. Identification of lysine succinylation as a new post-translational modification [J] Nat Chem Biol. 2011;7(1):58–63. doi: 10.1038/nchembio.495. - DOI - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources