Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2016:2016:7436849.
doi: 10.1155/2016/7436849. Epub 2016 May 19.

The Size of the Human Proteome: The Width and Depth

Affiliations
Review

The Size of the Human Proteome: The Width and Depth

Elena A Ponomarenko et al. Int J Anal Chem. 2016.

Abstract

This work discusses bioinformatics and experimental approaches to explore the human proteome, a constellation of proteins expressed in different tissues and organs. As the human proteome is not a static entity, it seems necessary to estimate the number of different protein species (proteoforms) and measure the number of copies of the same protein in a specific tissue. Here, meta-analysis of neXtProt knowledge base is proposed for theoretical prediction of the number of different proteoforms that arise from alternative splicing (AS), single amino acid polymorphisms (SAPs), and posttranslational modifications (PTMs). Three possible cases are considered: (1) PTMs and SAPs appear exclusively in the canonical sequences of proteins, but not in splice variants; (2) PTMs and SAPs can occur in both proteins encoded by canonical sequences and in splice variants; (3) all modification types (AS, SAP, and PTM) occur as independent events. Experimental validation of proteoforms is limited by the analytical sensitivity of proteomic technology. A bell-shaped distribution histogram was generated for proteins encoded by a single chromosome, with the estimation of copy numbers in plasma, liver, and HepG2 cell line. The proposed metabioinformatics approaches can be used for estimation of the number of different proteoforms for any group of protein-coding genes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(a) Distribution of the copy numbers of master proteins of chromosome 18 normalized per single HepG2/liver cell or 1 µL of plasma. (b) Share as a function of the detected proteins (in % to the total number of chromosome 18-coded proteins) and the analytical sensitivity.

References

    1. Collins F. S., Lander E. S., Rogers J., Waterston R. H. Finishing the euchromatic sequence of the human genome. Nature. 2005;50:162–168. - PubMed
    1. Wilhelm M., Schlegl J., Hahne H., et al. Mass-spectrometry-based draft of the human proteome. Nature. 2014;509(7502):582–587. doi: 10.1038/nature13319. - DOI - PubMed
    1. Kim M.-S., Pinto S. M., Getnet D., et al. A draft map of the human proteome. Nature. 2014;509(7502):575–581. doi: 10.1038/nature13302. - DOI - PMC - PubMed
    1. Karlsson C., Malmström L., Aebersold R., Malmström J. Proteome-wide selected reaction monitoring assays for the human pathogen Streptococcus pyogenes . Nature Communications. 2012;3, article 1301 doi: 10.1038/ncomms2297. - DOI - PMC - PubMed
    1. Roth M. J., Forbes A. J., Boyne M. T., II, Kim Y.-B., Robinson D. E., Kelleher N. L. Precise and parallel characterization of coding polymorphisms, alternative splicing, and modifications in human proteins by mass spectrometry. Molecular and Cellular Proteomics. 2005;4(7):1002–1008. doi: 10.1074/mcp.M500064-MCP200. - DOI - PMC - PubMed