Review

ACS Meas Sci Au. 2024 Jun 4;4(4):338-417.

doi: 10.1021/acsmeasuresciau.3c00068. eCollection 2024 Aug 21.

Comprehensive Overview of Bottom-Up Proteomics Using Mass Spectrometry

Yuming Jiang¹ ² ³, Devasahayam Arokia Balaya Rex⁴, Dina Schuster⁵ ⁶ ⁷, Benjamin A Neely⁸, Germán L Rosano⁹, Norbert Volkmar⁵, Amanda Momenzadeh¹ ² ³, Trenton M Peters-Clarke¹⁰, Susan B Egbert¹¹, Simion Kreimer² ³, Emma H Doud¹², Oliver M Crook¹³, Amit Kumar Yadav¹⁴, Muralidharan Vanuopadath¹⁵, Adrian D Hegeman¹⁶, Martín L Mayta¹⁷ ¹⁸, Anna G Duboff¹⁹, Nicholas M Riley¹⁹, Robert L Moritz²⁰, Jesse G Meyer¹ ² ³

Affiliations

¹ Department of Computational Biomedicine, Cedars Sinai Medical Center, Los Angeles, California 90048, United States.
² Smidt Heart Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States.
³ Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States.
⁴ Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India.
⁵ Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland.
⁶ Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Zurich 8093, Switzerland.
⁷ Laboratory of Biomolecular Research, Division of Biology and Chemistry, Paul Scherrer Institute, Villigen 5232, Switzerland.
⁸ Chemical Sciences Division, National Institute of Standards and Technology, NIST, Charleston, South Carolina 29412, United States.
⁹ Mass Spectrometry Unit, Institute of Molecular and Cellular Biology of Rosario, Rosario, 2000 Argentina.
¹⁰ Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, California, 94158, United States.
¹¹ Department of Chemistry, University of Manitoba, Winnipeg, Manitoba, R3T 2N2 Canada.
¹² Center for Proteome Analysis, Indiana University School of Medicine, Indianapolis, Indiana, 46202-3082, United States.
¹³ Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, United Kingdom.
¹⁴ Translational Health Science and Technology Institute, NCR Biotech Science Cluster 3rd Milestone Faridabad-Gurgaon Expressway, Faridabad, Haryana 121001, India.
¹⁵ School of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam-690 525, Kerala, India.
¹⁶ Departments of Horticultural Science and Plant and Microbial Biology, University of Minnesota, Twin Cities, Minnesota 55108, United States.
¹⁷ School of Medicine and Health Sciences, Center for Health Sciences Research, Universidad Adventista del Plata, Libertador San Martin 3103, Argentina.
¹⁸ Molecular Biology Department, School of Pharmacy and Biochemistry, Universidad Nacional de Rosario, Rosario 2000, Argentina.
¹⁹ Department of Chemistry, University of Washington, Seattle, Washington 98195, United States.
²⁰ Institute for Systems biology, Seattle, Washington 98109, United States.

PMID: 39193565
PMCID: PMC11348894
DOI: 10.1021/acsmeasuresciau.3c00068

Yuming Jiang et al. ACS Meas Sci Au. 2024.

Abstract

Proteomics is the large-scale study of protein structure and function in biological systems through protein identification and quantification. "Shotgun proteomics" or "bottom-up proteomics" is the prevailing strategy, in which proteins are hydrolyzed into peptides that are analyzed by mass spectrometry. Proteomics can be applied to diverse studies ranging from simple protein identification to studies of proteoforms, protein-protein interactions, protein structural alterations, absolute and relative protein quantification, post-translational modifications, and protein stability. To enable this range of experiments, there are diverse strategies for proteome analysis. The nuances of how proteomic workflows differ may be challenging for new practitioners to understand. Here, we provide a comprehensive overview of different proteomics methods, covering topics from biochemistry basics and protein extraction to biological interpretation and orthogonal validation. We expect this Review will serve as a handbook for researchers who are new to the field of bottom-up proteomics.

Conflict of interest statement

The authors declare no competing financial interest.

Figures

**Figure 1**
Proteome complexity. Each gene may be expressed in the form of multiple protein products, or proteoforms, through alternative splicing and incorporation of post-translational modifications. As such, there are many more unique proteoforms than genes. While there exist 20,000–23,000 coding genes in the human genome, upwards of 1,000,000 unique human proteoforms may exist. The study of the structure, function, and spatial and temporal regulation of these proteins is the subject of mass spectrometry-based proteomics.

**Figure 2**
Multiple protease proteolysis improves protein inference. The use of other proteases beyond trypsin such as lysyl endopeptidase (Lys-C), peptidyl-Asp metallopeptidase (Asp-N), glutamyl peptidase I (Glu-C), chymotrypsin, clostripain (Arg-C), or peptidyl-Lys metalloendopeptidase (Lys-N) can generate a greater diversity of peptides. This improves protein sequence coverage and allows for the correct identification of their N-termini. Increasing the number of complementary enzymes used increases the number of proteins identified by single peptides and decreases the ambiguity of the assignment of protein groups. This allows more protein isoforms and post-translational modifications to be identified than with trypsin alone.
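
The effect of combining proteases on sequence coverage can be sketched with a small in silico digestion. This is a minimal illustration, not a validated digestion tool: the cleavage rules are simplified textbook approximations, missed cleavages are ignored, and the toy sequence, function names, and peptide length limits are hypothetical choices made for this example.

```python
import re

# Simplified cleavage rules, written as zero-width regular expressions.
# These are textbook approximations (assumptions for this sketch); real
# enzymes show further sequence preferences and missed cleavages.
CLEAVAGE_RULES = {
    "trypsin": r"(?<=[KR])(?!P)",  # after K or R, unless followed by P
    "lys_c": r"(?<=K)",            # after K
    "glu_c": r"(?<=E)",            # after E
    "asp_n": r"(?=D)",             # before D
}


def digest(sequence, enzyme):
    """Return (start, end) index pairs of fully cleaved peptides."""
    cuts = [m.start() for m in re.finditer(CLEAVAGE_RULES[enzyme], sequence)]
    # Cuts at position 0 or at the end of the sequence create no new peptide.
    bounds = [0] + [c for c in cuts if 0 < c < len(sequence)] + [len(sequence)]
    return list(zip(bounds[:-1], bounds[1:]))


def peptides(sequence, enzyme):
    """Return the peptide strings produced by one enzyme."""
    return [sequence[start:end] for start, end in digest(sequence, enzyme)]


def coverage(sequence, enzymes, min_len=4, max_len=30):
    """Fraction of residues covered by peptides within the length limits."""
    covered = [False] * len(sequence)
    for enzyme in enzymes:
        for start, end in digest(sequence, enzyme):
            if min_len <= end - start <= max_len:
                for i in range(start, end):
                    covered[i] = True
    return sum(covered) / len(sequence)


if __name__ == "__main__":
    seq = "AAKPEDRGGK"  # hypothetical toy sequence
    print(peptides(seq, "trypsin"))
    print(peptides(seq, "lys_c"))
    print(coverage(seq, ["trypsin"]))
    print(coverage(seq, ["trypsin", "lys_c"]))
```

In this toy case the short tryptic peptide falls below the length limit, and the Lys-C digest supplies a longer peptide that covers the same residues.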

**Figure 3**
Quantitative strategies commonly used in proteomics. A few non-comprehensive examples of quantification methods are shown. (A) Label-free quantification. Proteins are extracted from samples, enzymatically hydrolyzed into peptides, and analyzed by mass spectrometry. Extracted ion chromatograms from peptides are compared across samples that are analyzed sequentially. (B) Metabolic labeling. Stable isotope labeling by amino acids in cell culture (SILAC) is based on feeding cells stable isotope labeled amino acids (“light” or “heavy”). Samples grown with heavy or light amino acids are mixed before cell lysis. The relative intensities of the heavy and light peptides are used to compute protein differences between samples. (C) Isobaric or chemical labeling. Proteins are isolated separately from samples, enzymatically hydrolyzed into peptides, and then chemically tagged with isobaric stable isotope labels. These isobaric tags yield unique reporter mass-to-charge (m/z) signals upon fragmentation with MS/MS. Peptide fragment ions are used to identify peptides, and the relative reporter ion signals are used for quantification.

**Figure 4**
Chemical structure of isobaric tags. This shows the TMT 6-plex from ThermoFisher, which is an example of an isobaric tag. The structure has three elements: the reactive group (in this case N-hydroxysuccinimide), the balancer, and the reporter. The reactive group enables quick covalent conjugation to nucleophilic amines found at the peptide N-terminus and lysine side chains. The balancer and reporter groups together contain a total of six heavy isotopes. The stars in the structures indicate the positions of all six heavy atoms for each TMT form. For this reason, a sample labeled with any version will have the same precursor mass. However, upon fragmentation, the balancer group is lost and the reporter retains a charge. The reporter group is measured in the low mass region, and its signal is proportional to the starting amount of each sample before mixing. This ratio of reporter signals enables relative quantification.
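
The step from reporter ion signals to relative quantities can be sketched as below. This is a minimal illustration under stated assumptions: the channel labels are the nominal reporter masses of a 6-plex tag, the intensity values are invented, and the function names are hypothetical. Real workflows also correct for isotopic impurities and co-isolation interference, which this sketch omits.

```python
def reporter_fractions(intensities):
    """Express each channel as a fraction of the summed reporter signal."""
    total = sum(intensities.values())
    if total <= 0:
        raise ValueError("summed reporter intensity must be positive")
    return {channel: value / total for channel, value in intensities.items()}


def ratios_to_reference(intensities, reference):
    """Express each channel relative to one chosen reference channel."""
    ref_value = intensities[reference]
    if ref_value <= 0:
        raise ValueError("reference channel intensity must be positive")
    return {channel: value / ref_value for channel, value in intensities.items()}


if __name__ == "__main__":
    # Invented reporter intensities for one MS/MS spectrum (arbitrary units).
    spectrum = {
        "126": 100.0,
        "127": 200.0,
        "128": 100.0,
        "129": 300.0,
        "130": 100.0,
        "131": 200.0,
    }
    print(reporter_fractions(spectrum))
    print(ratios_to_reference(spectrum, "126"))
```

Fractions are convenient when no single channel is a natural control. Ratios to a reference channel suit designs that include a pooled or control sample.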

**Figure 5**
Solid phase extraction (SPE). SPE is a sample preparation technique that uses a solid adsorbent contained most commonly in a cartridge device to selectively adsorb certain molecules from solution. The first step is the conditioning of the cartridge which involves wetting the adsorbent to solvate its functional groups and filling the void spaces with solvent thereby removing any air in the column. This is necessary to produce a suitable environment for adsorption and thus ensure reproducible interaction with the analytes. After conditioning, the sample is loaded in the cartridge. This can be performed with the aid of positive or negative pressure to ensure a constant flow rate. In this step molecules bind the adsorbent and interferences pass through. Next, the column is washed with the mobile phase to eliminate the contaminants while ensuring the analyte remains bound. Finally, peptides are eluted in an appropriate buffer solution with polarity or charge that competes with interaction with the solid phase.

**Figure 6**
MALDI. The analyte-matrix mixture is irradiated by a laser source, leading to ablation. Desorption and proton transfer ionize the analyte molecules that can then be accelerated into a mass spectrometer.

**Figure 7**
Electrospray ionization. Charged droplets are formed; their size is reduced due to evaporation until charge repulsion leads to Coulomb fission and results in charged analyte molecules.

**Figure 8**
Diagram of typical mass spectrometer modules. Systems must have an ion source, mass analyzer, detector, vacuum system, and control system.

**Figure 9**
Schematic diagram of typical QqQ system. Three quadrupoles enable precursor selection, fragmentation, and fragment ion selection.

**Figure 10**
Schematic diagram of a typical quadrupole time-of-flight mass spectrometer. Like a QqQ, a Q-TOF will have two quadrupoles for selection and fragmentation, followed by the TOF for the final higher resolution separation and detection.

**Figure 11**
Schematic diagram of an Orbitrap. (A) Close-up of an Orbitrap. (B) General schematic of a complete Q-Orbitrap system.

**Figure 12**
Schematic of FT-ICR. (A) Typical FT-ICR cell. (B) Example of complete FT-ICR system.

**Figure 13**
Ion mobility. (A) Conceptual diagram of three types of ion mobility strategies. (B) Schematic of drift tube ion mobility spectrometry. (C) Schematic of high field asymmetric waveform ion mobility spectrometry (FAIMS). (D) Schematic of trapped ion mobility spectrometry (TIMS).

**Figure 14**
Peptide fragmentation methods. (A) Sequence-informative fragment ions are termed a/x-, b/y-, and c/z-type fragments depending on which bond along the peptide backbone breaks. Fragments that explain the intact N-terminus of the peptide are a-, b-, and c-type ions, while x-, y-, and z-type ions explain the intact C-terminus of the peptide. Other panels show common dissociation methods, including collision, electron, and photon-based fragmentation. (B) Resonant collision-induced dissociation (resCID) and beam-type CID (beamCID) both produce mainly b/y-type sequencing ions through collisions with background gases like helium and nitrogen that increase the internal energy of peptide cations. (C) Electron capture and electron transfer dissociation (ECD and ETD) generate mainly c/z-type fragments through electron-mediated, radical-driven cleavage of the peptide backbone. (D) Infrared multi-photon dissociation (IRMPD) is a slow heating method similar in dissociation mechanism to resCID, but very different in implementation due to the IR lasers required (often with lower energy 10.6 micron photons). Ultraviolet photodissociation (UVPD) can use a range of wavelengths (popular options shown) to introduce higher energy photons to peptide cations, causing vibrational and electronic excitation that can generate all major fragment ion types depending on the wavelength used.
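
For the b/y-type ions described in panel (A), theoretical m/z values follow directly from residue masses. The sketch below is a minimal illustration that assumes unmodified peptides and standard monoisotopic residue masses. The function name and the example peptide are hypothetical, and a-, c-, x-, and z-type ions are not computed.

```python
# Monoisotopic residue masses in daltons (standard values, unmodified residues).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276   # mass of a proton
WATER = 18.010565   # monoisotopic mass of H2O


def by_ions(peptide, charge=1):
    """Return theoretical m/z values of b- and y-type ions for a peptide."""
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    ions = {}
    for i in range(1, len(peptide)):
        # b ions keep the N-terminus: sum of the first i residues.
        b_neutral = sum(masses[:i])
        # y ions keep the C-terminus: last i residues plus one water.
        y_neutral = sum(masses[-i:]) + WATER
        ions[f"b{i}"] = (b_neutral + charge * PROTON) / charge
        ions[f"y{i}"] = (y_neutral + charge * PROTON) / charge
    return ions


if __name__ == "__main__":
    for name, mz in sorted(by_ions("PEPTIDE").items()):
        print(f"{name}\t{mz:.4f}")
```

A useful sanity check is that, for singly charged ions, each complementary pair (b_i and y_(n-i)) sums to the neutral peptide mass plus two proton masses.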

**Figure 15**
Types of DIA. (A) SRM/MRM. Peptides are ionized by ESI and although there are many peptides entering the mass spectrometer at any time, the first quadrupole (Q1) isolates one mass, which is then fragmented by HCD. Fragment masses from the peptide are then selected in the third quadrupole (Q3). This leads to very low noise and high sensitivity. (B) PRM. Like MRM, peptides are selected in the first quadrupole, but this analysis is done on a high-resolution instrument like an Orbitrap or TOF. Selectivity is gained by exploiting the high mass accuracy and resolution to monitor multiple fragment ions. (C) uDIA/SWATH. Like MRM and PRM, peptides are isolated with Q1, but in this case a much wider isolation window is used. This usually results in co-isolation of many peptides simultaneously. Fragments from many peptides are measured with high resolution and high mass accuracy. Special software is used to get peptide identities and quantities from the fragment ions.

**Figure 16**
Proteomics data analysis and biological interpretation. The process begins with protein identification and quantification using tools such as Proteome Discoverer, Spectronaut, Spectromine, MSFragger, MaxQuant, and Skyline. Quality control measures ensure data integrity, leading to a biological interpretation of the results. Differential expression analyses may include relative abundance charts, heat maps, and volcano plots. Functional analysis encompasses gene ontology, protein-protein interactions, and signaling pathways.

**Figure 17**
Human PeptideAtlas as of 2024. (A) Current total search space and identified elements of the 2024 human PeptideAtlas. (B) Historical cumulative plot of the identified total proteins (blue vertical bars) and the unique proteins identified per dataset (red vertical bars) over the period of 2005–2024.

**Figure 18**
Analysis of a simple network using different centrality measurements. Nodes are colored according to each metric using a yellow-to-red gradient (yellow: lowest value, red: highest value). Network visualization and analysis were performed in Cytoscape.

**Figure 19**
Types of functional enrichment methods. In the volcano plot (left), proteins with altered values are colored blue or red according to arbitrarily chosen cut-off values for significance and fold change. Black bars or thick-bordered nodes indicate members of a GO category.
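
The cut-off logic behind a volcano plot, and one common enrichment test that can follow it, can be sketched as below. This is an illustration under assumptions: the default thresholds are arbitrary, the function names are hypothetical, and over-representation analysis with a hypergeometric test is used here as one representative enrichment method, which may or may not match the methods drawn in the figure.

```python
import math


def classify(log2_fold_change, p_value, fc_cutoff=1.0, p_cutoff=0.05):
    """Label a protein as 'up', 'down', or 'ns' using two cut-offs."""
    if p_value < p_cutoff and log2_fold_change >= fc_cutoff:
        return "up"
    if p_value < p_cutoff and log2_fold_change <= -fc_cutoff:
        return "down"
    return "ns"


def volcano_point(log2_fold_change, p_value):
    """Return the (x, y) coordinates used in a volcano plot."""
    return log2_fold_change, -math.log10(p_value)


def overrepresentation_p(background, category, selected, overlap):
    """One-sided hypergeometric p-value, P(X >= overlap).

    background: number of proteins quantified in total
    category:   number of background proteins in the GO category
    selected:   number of proteins passing the cut-offs
    overlap:    number of selected proteins in the GO category
    """
    upper = min(category, selected)
    numerator = sum(
        math.comb(category, i) * math.comb(background - category, selected - i)
        for i in range(overlap, upper + 1)
    )
    return numerator / math.comb(background, selected)


if __name__ == "__main__":
    print(classify(1.8, 0.001))    # passes both cut-offs, positive change
    print(classify(-0.3, 0.001))   # significant but small change
    print(volcano_point(1.8, 0.001))
    print(overrepresentation_p(background=10, category=3, selected=3, overlap=3))
```

Because the cut-offs are arbitrary, the list of selected proteins, and therefore any over-representation result, changes with the thresholds chosen.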

Update of

Comprehensive Overview of Bottom-Up Proteomics using Mass Spectrometry.
Jiang Y, Rex DAB, Schuster D, Neely BA, Rosano GL, Volkmar N, Momenzadeh A, Peters-Clarke TM, Egbert SB, Kreimer S, Doud EH, Crook OM, Yadav AK, Vanuopadath M, Mayta ML, Duboff AG, Riley NM, Moritz RL, Meyer JG. ArXiv [Preprint]. 2023 Nov 13:arXiv:2311.07791v1. Update in: ACS Meas Sci Au. 2024 Jun 4;4(4):338-417. doi: 10.1021/acsmeasuresciau.3c00068. PMID: 38013887. Preprint.
