Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 4:7:e7046.
doi: 10.7717/peerj.7046. eCollection 2019.

PTMphinder: an R package for PTM site localization and motif extraction from proteomic datasets

Affiliations

PTMphinder: an R package for PTM site localization and motif extraction from proteomic datasets

Jacob M Wozniak et al. PeerJ. .

Abstract

Background: Mass-spectrometry-based proteomics is a prominent field of study that allows for the unbiased quantification of thousands of proteins from a particular sample. A key advantage of these techniques is the ability to detect protein post-translational modifications (PTMs) and localize them to specific amino acid residues. These approaches have led to many significant findings in a wide range of biological disciplines, from developmental biology to cancer and infectious diseases. However, there is a current lack of tools available to connect raw PTM site information to biologically meaningful results in a high-throughput manner. Furthermore, many of the available tools require significant programming knowledge to implement.

Results: The R package PTMphinder was designed to enable researchers, particularly those with minimal programming background, to thoroughly analyze PTMs in proteomic data sets. The package contains three functions: parseDB, phindPTMs and extractBackground. Together, these functions allow users to reformat proteome databases for easier analysis, localize PTMs within full proteins, extract motifs surrounding the identified sites and create proteome-specific motif backgrounds for statistical purposes. Beta-testing of this R package has demonstrated its simplicity and ease of integration with existing tools.

Conclusion: PTMphinder empowers researchers to fully analyze and interpret PTMs derived from proteomic data. This package is simple enough for researchers with limited programming experience to understand and implement. The data produced from this package can inform subsequent research by itself and also be used in conjunction with other tools, such as motif-x, for further analysis.

Keywords: Mass spectrometry; Motifs; PTMs; Post-translational modifications; Proteomics; R package.

PubMed Disclaimer

Conflict of interest statement

The authors declare there are no competing interests.

Figures

Figure 1
Figure 1. Workflow of PTMphinder Package.
Blue boxes indicate PTMphinder functions while grey bocks indicate functions outside of PTMphinder. Pipeline simply requires the user’s proteomic dataset, a proteome database and minor, additional user-defined parameters.
Figure 2
Figure 2. Assessment of PTMphinder functionality.
(A) Results of running rmotif-x on flanking peptide sequences already extracted in a previous publication12. (B) Results of running rmotif-x on flanking peptide sequences extracted using PTMphinder from the raw phospho-proteomics data in the same publication. Motif enrichment scores are represented in the parentheses. Depictions of motifs were generated using WebLogo (https://weblogo.berkeley.edu/). (C) Time of motif extraction for various methods (n = 5 modified peptides).

Similar articles

Cited by

References

    1. Chou MF, Schwartz D. Biological sequence motif discovery using motif-x. Current Protocols in Bioinformatics. 2011;Chapter 13:15–24. doi: 10.1002/0471250953.bi1315s35. Unit 13. - DOI - PubMed
    1. Cohen P. The role of protein phosphorylation in human health and disease. The Sir Hans Krebs Medal Lecture. European Journal of Biochemistry. 2001;268:5001–5010. doi: 10.1046/j.0014-2956.2001.02473.x. - DOI - PubMed
    1. Cohen P. The origins of protein phosphorylation. Nature Cell Biology. 2002;4:E127–130. doi: 10.1038/ncb0502-e127. - DOI - PubMed
    1. Dennis JW, Granovsky M, Warren CE. Protein glycosylation in development and disease. Bioessays. 1999;21:412–421. doi: 10.1002/(SICI)1521-1878(199905)21:5<412::AID-BIES8>3.0.CO;2-5. - DOI - PubMed
    1. He Z, Yang C, Guo G, Li N, Yu W. Motif-All: discovering all phosphorylation motifs. BMC Bioinformatics. 2011;12(Suppl 1):S22. doi: 10.1186/1471-2105-12-S1-S22. - DOI - PMC - PubMed