Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Nov;102(11):1170-1181.
doi: 10.1038/s41374-022-00830-7. Epub 2022 Aug 3.

High-throughput proteomics: a methodological mini-review

Affiliations
Review

High-throughput proteomics: a methodological mini-review

Miao Cui et al. Lab Invest. 2022 Nov.

Abstract

Proteomics plays a vital role in biomedical research in the post-genomic era. With the technological revolution and emerging computational and statistic models, proteomic methodology has evolved rapidly in the past decade and shed light on solving complicated biomedical problems. Here, we summarize scientific research and clinical practice of existing and emerging high-throughput proteomics approaches, including mass spectrometry, protein pathway array, next-generation tissue microarrays, single-cell proteomics, single-molecule proteomics, Luminex, Simoa and Olink Proteomics. We also discuss important computational methods and statistical algorithms that can maximize the mining of proteomic data with clinical and/or other 'omics data. Various principles and precautions are provided for better utilization of these tools. In summary, the advances in high-throughput proteomics will not only help better understand the molecular mechanisms of pathogenesis, but also to identify the signature signaling networks of specific diseases. Thus, modern proteomics have a range of potential applications in basic research, prognostic oncology, precision medicine, and drug discovery.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. The process of proteomics “from bench to bedside”.
The mass spectrometry (MS)-based methods, single-molecule proteomics (SMP) and single-cell proteomics (SCP) have been widely used to identify and quantify new proteins in the initial discovery stage. Protein pathway array (PPA) is a high-throughput technique to explore the regulation of protein-protein interactions, pathway-pathway interactions, and biological functions to find the position of newly discovered protein in the cell signaling networks. Luminex, Meso-scale Discovery (MSD), Simoa and Olink are effective high-throughput methods for clinical validation after the proteomic markers are verified using tissue microarray (TMA).
Fig. 2
Fig. 2. The flow chart of data analysis.
Normalization is the most significant step after acquiring the raw data. Data can be analyzed according to specific study design and available clinical information and it can be based on the raw data after normalization or a result from other analyses. For example, the clustering analysis can be performed on raw data or the proteins that have significant changes after SAM.
Fig. 3
Fig. 3. Plotsheet generated by the significance analysis of microarrays: data are presented as a scatter plot of expected (x-axis) vs observed (y-axis) and the solid line indicates the relative difference expression of group.
Red color indicates upgrade and green color indicates downgrade. The data points that exceed a threshold from expected relative differences have significant different.
Fig. 4
Fig. 4. Examples of hierarchical clustering analysis (HCA) and grid analysis of time-series expression (GATE).
a The heatmap of a two-way hierarchical clustering analysis was performed by the Multi Experiment Viewer (MeV) (http://mev.tm4.org/). The color in each square represents a numerical value and the bar is on the top. All samples (x-axis) were clustered into three groups, while all protein markers were clustered into four groups. b Protein markers were cluster by multiple data points using GATE. The multiple data points can be divided by time, dosages, or stages of the disease.
Fig. 5
Fig. 5. Principal component analysis (PCA).
a The PCA mapping was performed using the Partek Genomics Suite (Partek, St. Louis, MO) (https://www.partek.com/partek-genomics-suite/). Patients with different survival status (red represents dead and blue represents alive) were separated by eight proteins. The first principal component is plotted on the X-axis and captures 34.9% of the variance. The second principal component is plotted on the Y-axis and achieves 15.6 % of the variance. b The scree plot represents the contribution of each principal component in PCA, and each principal component’s contribution decreases sequentially.
Fig. 6
Fig. 6. Examples of ingenuity pathway analysis (IPA) and gene-set enrichment analysis (GSEA).
a The signaling networks generated by database-based Ingenuity Pathway Analysis (IPA). The up- and downregulated proteins are represented by molecules in red and green color, respectively. The pathways were labeled outside of the network. b The top canonical pathways that were most significant to the dataset were identified by the IPA. The score assigned to each pathway was presented in –log (p value) using Fisher’s exact test. c The enrichment plot generated by the database-based gene set enrichment analysis (GSEA). The bar in the middle of the figure was labeled in red and blue from left to right, which means risk factor and protective factor separately. The enriched gene set is the IVANOVA_HEMATOPOIESIS_EARLY_PROGENITOR (https://www.gsea-msigdb.org/gsea/msigdb/cards/IVANOVA_HEMATOPOIESIS_EARLY_PROGENITOR), which is a protective factor in this figure.
Fig. 7
Fig. 7. Circos plot: Among all eight clinicopathological categories, the gender occupied the most significant proportion of the distribution, suggesting that it is the clinical factor that has the most impact on the signaling network.
Among 20 canonical pathways altered in the disease, the HER-2 and p53 are affected most, suggested that they play essential roles in the pathogenesis of the disease.

Similar articles

Cited by

References

    1. Collins FS, McKusick VA. Implications of the Human Genome Project for medical science. JAMA. 2001;285:540–544. doi: 10.1001/jama.285.5.540. - DOI - PubMed
    1. Wang K, Huang C, Nice E. Recent advances in proteomics: towards the human proteome. Biomed Chromatogr. 2014;28:848–857. doi: 10.1002/bmc.3157. - DOI - PubMed
    1. Mathivanan S. Integrated Bioinformatics Analysis of the Publicly Available Protein Data Shows Evidence for 96% of the Human Proteome. J Proteomics Bioinform. 2014;7:41–49. doi: 10.4172/jpb.1000301. - DOI
    1. Deutsch EW, Lane L, Overall CM, Bandeira N, Baker MS, Pineau C, et al. Human Proteome Project Mass Spectrometry Data Interpretation Guidelines 3.0. J Proteome Res. 2019;18:4108–4116. doi: 10.1021/acs.jproteome.9b00542. - DOI - PMC - PubMed
    1. Wilhelm M, Schlegl J, Hahne H, Gholami AM, Lieberenz M, Savitski MM, et al. Mass-spectrometry-based draft of the human proteome. Nature. 2014;509:582–587. doi: 10.1038/nature13319. - DOI - PubMed

Publication types