Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 16;12(3):479.
doi: 10.3390/jpm12030479.

Evolution of Protein Functional Annotation: Text Mining Study

Affiliations

Evolution of Protein Functional Annotation: Text Mining Study

Ekaterina V Ilgisonis et al. J Pers Med. .

Abstract

Within the Human Proteome Project initiative framework for creating functional annotations of uPE1 proteins, the neXt-CP50 Challenge was launched in 2018. In analogy with the missing-protein challenge, each command deciphers the functional features of the proteins in the chromosome-centric mode. However, the neXt-CP50 Challenge is more complicated than the missing-protein challenge: the approaches and methods for solving the problem are clear, but neither the concept of protein function nor specific experimental and/or bioinformatics protocols have been standardized to address it. We proposed using a retrospective analysis of the key HPP repository, the neXtProt database, to identify the most frequently used experimental and bioinformatic methods for analyzing protein functions, and the dynamics of accumulation of functional annotations. It has been shown that the dynamics of the increase in the number of proteins with known functions are greater than the progress made in the experimental confirmation of the existence of questionable proteins in the framework of the missing-protein challenge. At the same time, the functional annotation is based on the guilty-by-association postulate, according to which, based on large-scale experiments on API-MS and Y2H, proteins with unknown functions are most likely mapped through "handshakes" to biochemical processes.

Keywords: CHPP; Human Proteome Project; missing proteins; neXt-MP50; neXtCP-50; neXtProt; protein function; text-mining; uPE1 proteins.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Change in the completeness of human protein data according to neXtProt. (a) Chronology of changes in the number of protein identifications; (b) chronology of replenishment of neXtProt with information on protein functions.
Figure 2
Figure 2
A cloud of biological functions for proteins. The data presented refer to 1392 proteins whose functional annotation appeared in neXtProt from the beginning of 2016 to the beginning of 2021.

Similar articles

Cited by

References

    1. Auton A., Brooks L., Durbin R., Garrison E., Kang H., Korbel J., Marchini J., McCarthy S., McVean G., Abecasis G. A Global Reference for Human Genetic Variation. Nature. 2015;526:68–74. doi: 10.1038/NATURE15393. - DOI - PMC - PubMed
    1. Lek M., Karczewski K., Minikel K., Samocha K., Banks E., Fennell T., O’Donnell-Luria A., Ware J., Hill A., Cummings B., et al. Analysis of Protein-Coding Genetic Variation in 60,706 Humans. Nature. 2016;536:285–291. doi: 10.1038/nature19057. - DOI - PMC - PubMed
    1. Adhikari S., Nice E.C., Deutsch E.W., Lane L., Omenn G.S., Pennington S.R., Paik Y.-K., Overall C.M., Corrales F.J., Cristea I.M., et al. A High-Stringency Blueprint of the Human Proteome. Nat. Commun. 2020;11:5301. doi: 10.1038/s41467-020-19045-9. - DOI - PMC - PubMed
    1. Omenn G.S., Lane L., Overall C.M., Cristea I.M., Corrales F.J., Lindskog C., Paik Y.-K., Van Eyk J.E., Liu S., Pennington S.R., et al. Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project. J. Proteome Res. 2020;19:4735. doi: 10.1021/acs.jproteome.0c00485. - DOI - PMC - PubMed
    1. Omenn G.S. Reflections on the HUPO Human Proteome Project, the Flagship Project of the Human Proteome Organization, at 10 Years. Mol. Cell. Proteom. 2021;20:100062. doi: 10.1016/j.mcpro.2021.100062. - DOI - PMC - PubMed

LinkOut - more resources