Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 4;19(12):4735-4746.
doi: 10.1021/acs.jproteome.0c00485. Epub 2020 Oct 19.

Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project

Affiliations

Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project

Gilbert S Omenn et al. J Proteome Res. .

Abstract

According to the 2020 Metrics of the HUPO Human Proteome Project (HPP), expression has now been detected at the protein level for >90% of the 19 773 predicted proteins coded in the human genome. The HPP annually reports on progress made throughout the world toward credibly identifying and characterizing the complete human protein parts list and promoting proteomics as an integral part of multiomics studies in medicine and the life sciences. NeXtProt release 2020-01 classified 17 874 proteins as PE1, having strong protein-level evidence, up 180 from 17 694 one year earlier. These represent 90.4% of the 19 773 predicted coding genes (all PE1,2,3,4 proteins in neXtProt). Conversely, the number of neXtProt PE2,3,4 proteins, termed the "missing proteins" (MPs), was reduced by 230 from 2129 to 1899 since the neXtProt 2019-01 release. PeptideAtlas is the primary source of uniform reanalysis of raw mass spectrometry data for neXtProt, supplemented this year with extensive data from MassIVE. PeptideAtlas 2020-01 added 362 canonical proteins between 2019 and 2020 and MassIVE contributed 84 more, many of which converted PE1 entries based on non-MS evidence to the MS-based subgroup. The 19 Biology and Disease-driven B/D-HPP teams continue to pursue the identification of driver proteins that underlie disease states, the characterization of regulatory mechanisms controlling the functions of these proteins, their proteoforms, and their interactions, and the progression of transitions from correlation to coexpression to causal networks after system perturbations. And the Human Protein Atlas published Blood, Brain, and Metabolic Atlases.

Keywords: Biology and Disease-HPP (B/D-HPP); Chromosome-Centric HPP (C-HPP); Human Protein Atlas; Human Proteome Project (HPP); Mass Spectrometry Interactive Virtual Environment (MassIVE); PeptideAtlas; missing proteins (MPs); neXtProt protein existence (PE) metrics; non-MS PE1 proteins; uncharacterized protein existence 1 (uPE1).

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1.
Figure 1.
Bar chart showing the progression of the state of protein validation for the past five years, since the HPP Guidelines 2.1 have been in effect. The PE1 proteins validated with mass spectrometry (MS) evidence are depicted in green, PE1 proteins based on other types of protein evidence in yellow, and the PE2,3,4 missing proteins (MPs) that have no or insufficient protein-level evidence in human specimens and cell lines in red. The green bars for 2016–2019 are from PeptideAtlas, while for 2020 it is a combination with MassIVE. Data for the green + yellow (PE1) and red (PE2,3,4) bars are provided in Table 1.
Figure 2.
Figure 2.
Top 10 data sets that contributed to the increment in canonical proteins in between 2019 and 2020 builds of PeptideAtlas. These new MS-based canonical proteins represent both new PE1 proteins that previously were PE2,3,4 (red in Figure 1) and previous non-MS-based PE1 proteins (yellow in Figure 1) that are now validated with MS evidence in PeptideAtlas. Each data set is labeled with the ProteomeXchange identifier (PXD), first*last authors, publication year, number of new canonical proteins, and method highlighted.
Figure 3.
Figure 3.
Flowchart depicts the detailed changes in neXtProt PE1–4 categories (plus PE5) from neXtProt release 2019–01 to neXtProt release 2020–01 (see text).

Similar articles

Cited by

References

    1. Legrain P; Aebersold R; Archakov A; Bairoch A; Bala K; Beretta L; Bergeron J; Borchers CH; Corthals GL; Costello CE; Deutsch EW; Domon B; Hancock W; He F; Hochstrasser D; Marko-Varga G; Salekdeh GH; Sechi S; Snyder M; Srivastava S; Uhlen M; Wu CH; Yamamoto T; Paik YK; Omenn GS The human proteome project: current state and future direction. Mol. Cell. Proteomics 2011, 10 (7), M111.009993. - PMC - PubMed
    1. Marko-Varga G; Omenn GS; Paik YK; Hancock WS A first step toward completion of a genome-wide characterization of the human proteome. J. Proteome Res 2013, 12 (1), 1–5. - PubMed
    1. Lane L; Bairoch A; Beavis RC; Deutsch EW; Gaudet P; Lundberg E; Omenn GS Metrics for the human proteome project 2013–2014 and strategies for finding missing proteins. J. Proteome Res 2014, 13 (1), 15–20. - PMC - PubMed
    1. Omenn GS; Lane L; Lundberg EK; Beavis RC; Nesvizhskii AI; Deutsch EW Metrics for the human proteome project 2015: Progress on the human proteome and guidelines for high-confidence protein identification. J. Proteome Res 2015, 14 (9), 3452–60. - PMC - PubMed
    1. Omenn GS; Lane L; Lundberg EK; Beavis RC; Overall CM; Deutsch EW Metrics for the human proteome project 2016: Progress on identifying and characterizing the human proteome, including post-translational modifications. J. Proteome Res 2016, 15(11), 3951–3960. - PMC - PubMed

Publication types