Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Jan 31;148(3):475-486.
doi: 10.1039/d2an01246d.

Overview and considerations in bottom-up proteomics

Affiliations
Review

Overview and considerations in bottom-up proteomics

Rachel M Miller et al. Analyst. .

Abstract

Proteins are the key biological actors within cells, driving many biological processes integral to both healthy and diseased states. Understanding the depth of complexity represented within the proteome is crucial to our scientific understanding of cellular biology and to provide disease specific insights for clinical applications. Mass spectrometry-based proteomics is the premier method for proteome analysis, with the ability to both identify and quantify proteins. Although proteomics continues to grow as a robust field of bioanalytical chemistry, advances are still necessary to enable a more comprehensive view of the proteome. In this review, we provide a broad overview of mass spectrometry-based proteomics in general, and highlight four developing areas of bottom-up proteomics: (1) protein inference, (2) alternative proteases, (3) sample-specific databases and (4) post-translational modification discovery.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Sources of proteome complexity. Proteoforms provide a depth of complexity to the proteome which would not be possible if a gene only led to the production of a single protein product. Instead, mutations at the gene-level, variants or alternative splicing at the transcript-level, and post-translational modifications or cleavage events at the protein-level contribute to a still undefined number of proteoforms, which are the functional units of the proteome.
Figure 2:
Figure 2:
Experimental workflows for bottom-up and top-down proteomic approaches.
Figure 3:
Figure 3:
Comparison of the theoretical and experimental length distribution of tryptic peptides. The length distribution of in silico digested tryptic peptides (grey), as determined by ProteaseGuru, is compared to the length distribution of peptides experimentally identified from MetaMorpheus analysis of the tryptic data Miller et. al. (green). Most experimentally identified peptides are between 7–35 amino acids in length, whereas the theoretical tryptic digest favors the generation of shorter peptides.
Figure 4:
Figure 4:
Comparison of short- and long-read sequencing for the reconstruction of transcript isoforms. In short-read RNA-sequencing approaches, RNA fragments are generated from which full-length transcripts must be reconstructed. Depending on the coverage of alternative splice junctions, incorrect transcript inference can be achieved. In this example, based on the fragments identified, a single transcript is reconstructed. Therefore, the two additional transcript isoforms are missed. In long-read RNA-sequencing, full-length transcripts are sequenced, and no reconstruction is required. Therefore, in the provided example, all three transcript isoforms expressed in the sample are identified.

References

    1. Aebersold R; Mann M Mass Spectrometry-Based Proteomics. Nature 2003, 422 (6928), 198–207. 10.1038/nature01511. - DOI - PubMed
    1. Smith LM; Kelleher NL Proteoform: A Single Term Describing Protein Complexity. Nat. Methods 2013, 10 (3), 186–187. 10.1038/nmeth.2369. - DOI - PMC - PubMed
    1. Aebersold R; Agar JN; Amster IJ; Baker MS; Bertozzi CR; Boja ES; Costello CE; Cravatt BF; Fenselau C; Garcia BA; Ge Y; Gunawardena J; Hendrickson RC; Hergenrother PJ; Huber CG; Ivanov AR; Jensen ON; Jewett MC; Kelleher NL; Kiessling LL; Krogan NJ; Larsen MR; Loo JA; Ogorzalek Loo RR; Lundberg E; MacCoss MJ; Mallick P; Mootha VK; Mrksich M; Muir TW; Patrie SM; Pesavento JJ; Pitteri SJ; Rodriguez H; Saghatelian A; Sandoval W; Schlüter H; Sechi S; Slavoff SA; Smith LM; Snyder MP; Thomas PM; Uhlén M; Van Eyk JE; Vidal M; Walt DR; White FM; Williams ER; Wohlschlager T; Wysocki VH; Yates NA; Young NL; Zhang B How Many Human Proteoforms Are There? Nat. Chem. Biol. 2018, 14 (3), 206–214. 10.1038/nchembio.2576. - DOI - PMC - PubMed
    1. Smith LM; Kelleher NL Proteoforms as the next Proteomics Currency. Science 2018, 359 (6380), 1106–1107. 10.1126/science.aat1884. - DOI - PMC - PubMed
    1. Uhlén M; Fagerberg L; Hallström BM; Lindskog C; Oksvold P; Mardinoglu A; Sivertsson Å; Kampf C; Sjöstedt E; Asplund A; Olsson I; Edlund K; Lundberg E; Navani S; Szigyarto CA-K; Odeberg J; Djureinovic D; Takanen JO; Hober S; Alm T; Edqvist P-H; Berling H; Tegel H; Mulder J; Rockberg J; Nilsson P; Schwenk JM; Hamsten M; von Feilitzen K; Forsberg M; Persson L; Johansson F; Zwahlen M; von Heijne G; Nielsen J; Pontén F Proteomics. Tissue-Based Map of the Human Proteome. Science 2015, 347 (6220), 1260419. 10.1126/science.1260419. - DOI - PubMed

MeSH terms