Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 3;22(3):729-742.
doi: 10.1021/acs.jproteome.2c00406. Epub 2022 Dec 28.

Integrated View of Baseline Protein Expression in Human Tissues

Affiliations

Integrated View of Baseline Protein Expression in Human Tissues

Ananth Prakash et al. J Proteome Res. .

Abstract

The availability of proteomics datasets in the public domain, and in the PRIDE database, in particular, has increased dramatically in recent years. This unprecedented large-scale availability of data provides an opportunity for combined analyses of datasets to get organism-wide protein abundance data in a consistent manner. We have reanalyzed 24 public proteomics datasets from healthy human individuals to assess baseline protein abundance in 31 organs. We defined tissue as a distinct functional or structural region within an organ. Overall, the aggregated dataset contains 67 healthy tissues, corresponding to 3,119 mass spectrometry runs covering 498 samples from 489 individuals. We compared protein abundances between different organs and studied the distribution of proteins across these organs. We also compared the results with data generated in analogous studies. Additionally, we performed gene ontology and pathway-enrichment analyses to identify organ-specific enriched biological processes and pathways. As a key point, we have integrated the protein abundance results into the resource Expression Atlas, where they can be accessed and visualized either individually or together with gene expression data coming from transcriptomics datasets. We believe this is a good mechanism to make proteomics data more accessible for life scientists.

Keywords: human proteome; mass spectrometry; public data re-use; quantitative proteomics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Overview of the study design and reanalysis pipeline. QA: Quality assessment. Reprinted (Adapted or Reprinted in part) with permission from . Copyright 2022 EMBL-EBI.
Figure 2
Figure 2
(A) Number of canonical proteins identified across different organs. The number within the parenthesis indicates the number of samples. (B) Range of normalized iBAQ protein abundances across different organs. The number within the parenthesis indicates the number of samples. In panels (A) and (B), the term heart is used in a broader sense to mean the cardiovascular system. (C) Canonical proteins identified across different datasets. The number within the parenthesis indicates the number of unique tissues in the dataset. (D) Range of normalized iBAQ protein abundances across different datasets. The number within the parenthesis indicates the number of unique tissues in the dataset. (E) Comparison of total spectral data with the number of canonical proteins identified in each dataset and organ. (F) Distribution of canonical proteins identified across organs.
Figure 3
Figure 3
Heatmap of pairwise Pearson correlation coefficients across all samples. The color on the heatmap represents the correlation coefficient, which was calculated using the bin-transformed iBAQ values. The samples are hierarchically clustered on columns and rows using Euclidean distances. The clusters composed of the brain and cardiovascular system (heart) samples are highlighted with black borders. The abbreviations used in the organs’ header are B: brain, C: colon, H: heart, L: liver, O: ovary, and P: pancreas.
Figure 4
Figure 4
(A) PCA of brain samples colored by the tissue types. (B) PCA of brain samples colored by their respective dataset identifiers. (C) PCA of cardiovascular system (heart) samples colored by the tissue types. (D) PCA of cardiovascular system (heart) samples colored by their respective dataset identifiers. The numbers in parenthesis indicate the number of datasets for each tissue. Binned values of canonical proteins quantified in at least 50% of the samples were used to perform the PCA.
Figure 5
Figure 5
(A) Analysis of organ-specific canonical proteins. The analysis comprises the number of canonical proteins found in 31 organs, classified in three groups: “organ-enriched”, “group-enriched”, and “group mixed”. (B) Pathway analysis of the over-represented canonical proteins, showing the statistically significant representative pathways (p-value <0.05) in 31 organs. In panels (A) and (B), the term heart is used in a broader sense to mean the cardiovascular system.

Similar articles

Cited by

References

    1. Nesvizhskii A. I.; Aebersold R. Interpretation of shotgun proteomic data: the protein inference problem. Mol. Cell. Proteomics 2005, 4, 1419–1440. 10.1074/mcp.R500012-MCP200. - DOI - PubMed
    1. Kim M. S.; Pinto S. M.; Getnet D.; Nirujogi R. S.; Manda S. S.; Chaerkady R.; Madugundu A. K.; Kelkar D. S.; Isserlin R.; Jain S.; et al. A draft map of the human proteome. Nature 2014, 509, 575–581. 10.1038/nature13302. - DOI - PMC - PubMed
    1. Jiang L.; Wang M.; Lin S.; Jian R.; Li X.; Chan J.; Dong G.; Fang H.; Robinson A. E.; Consortium G. T.; et al. A Quantitative Proteome Map of the Human Body. Cell 2020, 183, 269–283.e19. 10.1016/j.cell.2020.08.036. - DOI - PMC - PubMed
    1. Uhlén M.; Fagerberg L.; Hallström B. M.; Lindskog C.; Oksvold P.; Mardinoglu A.; Sivertsson A.; Kampf C.; Sjöstedt E.; Asplund A.; et al. Proteomics. Tissue-based map of the human proteome. Science 2015, 347, 1260419.10.1126/science.1260419. - DOI - PubMed
    1. Wilhelm M.; Schlegl J.; Hahne H.; Gholami A. M.; Lieberenz M.; Savitski M. M.; Ziegler E.; Butzmann L.; Gessulat S.; Marx H.; et al. Mass-spectrometry-based draft of the human proteome. Nature 2014, 509, 582–587. 10.1038/nature13319. - DOI - PubMed

Publication types