Evolution of Protein Functional Annotation: Text Mining Study
- PMID: 35330478
- PMCID: PMC8952229
- DOI: 10.3390/jpm12030479
Evolution of Protein Functional Annotation: Text Mining Study
Abstract
Within the Human Proteome Project initiative framework for creating functional annotations of uPE1 proteins, the neXt-CP50 Challenge was launched in 2018. In analogy with the missing-protein challenge, each command deciphers the functional features of the proteins in the chromosome-centric mode. However, the neXt-CP50 Challenge is more complicated than the missing-protein challenge: the approaches and methods for solving the problem are clear, but neither the concept of protein function nor specific experimental and/or bioinformatics protocols have been standardized to address it. We proposed using a retrospective analysis of the key HPP repository, the neXtProt database, to identify the most frequently used experimental and bioinformatic methods for analyzing protein functions, and the dynamics of accumulation of functional annotations. It has been shown that the dynamics of the increase in the number of proteins with known functions are greater than the progress made in the experimental confirmation of the existence of questionable proteins in the framework of the missing-protein challenge. At the same time, the functional annotation is based on the guilty-by-association postulate, according to which, based on large-scale experiments on API-MS and Y2H, proteins with unknown functions are most likely mapped through "handshakes" to biochemical processes.
Keywords: CHPP; Human Proteome Project; missing proteins; neXt-MP50; neXtCP-50; neXtProt; protein function; text-mining; uPE1 proteins.
Conflict of interest statement
The authors declare no competing financial interest.
Figures


Similar articles
-
Launching the C-HPP neXt-CP50 Pilot Project for Functional Characterization of Identified Proteins with No Known Function.J Proteome Res. 2018 Dec 7;17(12):4042-4050. doi: 10.1021/acs.jproteome.8b00383. Epub 2018 Nov 29. J Proteome Res. 2018. PMID: 30269496 Free PMC article.
-
Progress on Identifying and Characterizing the Human Proteome: 2019 Metrics from the HUPO Human Proteome Project.J Proteome Res. 2019 Dec 6;18(12):4098-4107. doi: 10.1021/acs.jproteome.9b00434. Epub 2019 Sep 13. J Proteome Res. 2019. PMID: 31430157 Free PMC article.
-
The 2023 Report on the Proteome from the HUPO Human Proteome Project.J Proteome Res. 2024 Feb 2;23(2):532-549. doi: 10.1021/acs.jproteome.3c00591. Epub 2024 Jan 17. J Proteome Res. 2024. PMID: 38232391 Free PMC article. Review.
-
AlphaFun: Structural-Alignment-Based Proteome Annotation Reveals why the Functionally Unknown Proteins (uPE1) Are So Understudied.J Proteome Res. 2024 May 3;23(5):1593-1602. doi: 10.1021/acs.jproteome.3c00678. Epub 2024 Apr 16. J Proteome Res. 2024. PMID: 38626392 Free PMC article.
-
The 2024 Report on the Human Proteome from the HUPO Human Proteome Project.J Proteome Res. 2024 Dec 6;23(12):5296-5311. doi: 10.1021/acs.jproteome.4c00776. Epub 2024 Nov 8. J Proteome Res. 2024. PMID: 39514846 Free PMC article. Review.
Cited by
-
Identification of Potential Therapeutic Targets on the Level of DNA/mRNAs, Proteins and Metabolites: A Systematic Mapping Review of Scientific Texts' Fragments from Open Targets.Curr Issues Mol Biol. 2023 Apr 13;45(4):3406-3418. doi: 10.3390/cimb45040223. Curr Issues Mol Biol. 2023. PMID: 37185747 Free PMC article. Review.
-
Microbial Interactions in Food Fermentation: Interactions, Analysis Strategies, and Quality Enhancement.Foods. 2025 Jul 17;14(14):2515. doi: 10.3390/foods14142515. Foods. 2025. PMID: 40724333 Free PMC article. Review.
References
-
- Omenn G.S., Lane L., Overall C.M., Cristea I.M., Corrales F.J., Lindskog C., Paik Y.-K., Van Eyk J.E., Liu S., Pennington S.R., et al. Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project. J. Proteome Res. 2020;19:4735. doi: 10.1021/acs.jproteome.0c00485. - DOI - PMC - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources