Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 May 13:6:25801.
doi: 10.1038/srep25801.

A homology-based pipeline for global prediction of post-translational modification sites

Affiliations

A homology-based pipeline for global prediction of post-translational modification sites

Xiang Chen et al. Sci Rep. .

Abstract

The pathways of protein post-translational modifications (PTMs) have been shown to play particularly important roles for almost any biological process. Identification of PTM substrates along with information on the exact sites is fundamental for fully understanding or controlling biological processes. Alternative computational strategies would help to annotate PTMs in a high-throughput manner. Traditional algorithms are suited for identifying the common organisms and tissues that have a complete PTM atlas or extensive experimental data. While annotation of rare PTMs in most organisms is a clear challenge. In this work, to this end we have developed a novel homology-based pipeline named PTMProber that allows identification of potential modification sites for most of the proteomes lacking PTMs data. Cross-promotion E-value (CPE) as stringent benchmark has been used in our pipeline to evaluate homology to known modification sites. Independent-validation tests show that PTMProber achieves over 58.8% recall with high precision by CPE benchmark. Comparisons with other machine-learning tools show that PTMProber pipeline performs better on general predictions. In addition, we developed a web-based tool to integrate this pipeline at http://bioinfo.ncu.edu.cn/PTMProber/index.aspx. In addition to pre-constructed prediction models of PTM, the website provides an extensional functionality to allow users to customize models.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Method comparison.
(A) Comparing the number of predicted PTM sites (y-axis) in p53 family proteins between MLA (machine-learning approach, yellow bars) and SSA (sequence similarity-based approach for PTMProber, peacock blue bars) for five PTM types. Our five MLAs works include SubPhosPred tool for phosphorylation, PSKAcePred tool for acetylation, UbiProber tool for ubiquitination, PMeS tool for methylation and SUMOAMVR for sumoylation. (B) Distribution of the number of phosphorylation sites (x-axis) predicted on the 21 proteins of p53 family by PTMProber.
Figure 2
Figure 2. Extensible contents of PTMProber.
The extensible source (yellow box) includes data of known proteome that is constructed known proteome BLAST database and known peptides BLAST database and data of query proteome that yields query proteome BLAST database. PTMProber pipeline runs PTM prediction by invoking these databases (green box) updating from various data source (red box). The databases generated from makeblastdb programs in the blast-2.2.28+ package.
Figure 3
Figure 3. System flow of PTMProber pipeline.
The gray box represents the computing node, while red characters correspond with the numbered steps in PTMProber pipeline. The step 0a represents that the protein as a query must correspond to query proteome database; and the query protein database must contain query protein. The step 0b acquires the known peptide from the known proteome, and vice versa. The steps 1–6 represent that search or obtain the corresponding symbol contained in gray box. The steps 5–6 (BLAST-2 and BLAST-3) ascertain whether user’s query proteins contain PTM sites.

Similar articles

Cited by

References

    1. Baker P. R., Trinidad J. C. & Chalkley R. J. Modification site localization scoring integrated into a search engine. Mol Cell Proteomics 10 M111, 008078 (2011). - PMC - PubMed
    1. Jensen O. N. Modification-specific proteomics: characterization of post-translational modifications by mass spectrometry. Curr Opin Chem Biol 8, 33–41 (2004). - PubMed
    1. Schwartz D., Chou M. F. & Church G. M. Predicting protein post-translational modifications using meta-analysis of proteome scale data sets. Mol Cell Proteomics 8, 365–379 (2009). - PMC - PubMed
    1. Liu C. & Li H. In silico prediction of post-translational modifications. Methods in molecular biology (Clifton, N.J.) 760, 325–340 (2011). - PubMed
    1. Li Y. et al.. Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features. Sci Rep 4, 5765 (2014). - PMC - PubMed

Publication types