Is It Possible to Find Needles in a Haystack? Meta-Analysis of 1000+ MS/MS Files Provided by the Russian Proteomic Consortium for Mining Missing Proteins
- PMID: 32456206
- PMCID: PMC7356824
- DOI: 10.3390/proteomes8020012
Abstract
Despite direct and indirect efforts by the proteomic community, a significant fraction of the protein map remains a blind spot. Almost 11% of human genes encode missing proteins, that is, proteins whose existence is still in doubt. Proteomics appears to have reached a stage at which the identification of every novel protein demands more attention and curiosity, and at which the range of unusual biomaterials and/or conditions examined needs to be expanded. It seems that we have exhausted the current conventional approaches to the discovery of missing proteins and may need to investigate alternatives. Here, we present an approach to deciphering missing proteins that is based on non-standard methodological solutions and encompasses diverse MS/MS data obtained for rare types of biological samples by members of the Russian Proteomic community over the last five years. These data were re-analyzed in a uniform manner by three search engines that are part of the SearchGUI package. The study resulted in the identification of two missing and five uncertain proteins, each detected with two peptides. Moreover, 149 proteins were detected with a single proteotypic peptide. Finally, we analyzed gene expression levels to suggest feasible targets for further validation of the missing and uncertain protein observations, which will fully meet the requirements of the international consortium. The MS data are available on the ProteomeXchange platform (PXD014300).
Keywords: Chromosome-Centric Human Proteome Project (C-HPP); human proteome; mass spectrometry; missing proteins; neXtProt; proteotypic peptide; uncertain proteins.
Conflict of interest statement
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.