Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan;409(2):607-618.
doi: 10.1007/s00216-016-9970-5. Epub 2016 Oct 12.

Use of an informed search space maximizes confidence of site-specific assignment of glycoprotein glycosylation

Affiliations

Use of an informed search space maximizes confidence of site-specific assignment of glycoprotein glycosylation

Kshitij Khatri et al. Anal Bioanal Chem. 2017 Jan.

Abstract

In order to interpret glycopeptide tandem mass spectra, it is necessary to estimate the theoretical glycan compositions and peptide sequences, known as the search space. The simplest way to do this is to build a naïve search space from sets of glycan compositions from public databases and to assume that the target glycoprotein is pure. Often, however, purified glycoproteins contain co-purified glycoprotein contaminants that have the potential to confound assignment of tandem mass spectra based on naïve assumptions. In addition, there is increasing need to characterize glycopeptides from complex biological mixtures. Fortunately, liquid chromatography-mass spectrometry (LC-MS) methods for glycomics and proteomics are now mature and accessible. We demonstrate the value of using an informed search space built from measured glycomes and proteomes to define the search space for interpretation of glycoproteomics data. We show this using α-1-acid glycoprotein (AGP) mixed into a set of increasingly complex matrices. As the mixture complexity increases, the naïve search space balloons and the ability to assign glycopeptides with acceptable confidence diminishes. In addition, it is not possible to identify glycopeptides not foreseen as part of the naïve search space. A search space built from released glycan glycomics and proteomics data is smaller than its naïve counterpart while including the full range of proteins detected in the mixture. This maximizes the ability to assign glycopeptide tandem mass spectra with confidence. As the mixture complexity increases, the number of tandem mass spectra per glycopeptide precursor ion decreases, resulting in lower overall scores and reduced depth of coverage for the target glycoprotein. We suggest use of α-1-acid glycoprotein as a standard to gauge effectiveness of analytical methods and bioinformatics search parameters for glycoproteomics studies. Graphical Abstract Assignment of site specific glycosylation from LC-tandemMS data.

Keywords: Alpha-1-acid glycoprotein; Glycoinformatics; Glycomics; Glycoproteomics; Glycosylation; Integrated-omics; Mass spectrometry.

PubMed Disclaimer

Conflict of interest statement

Conflict of interests The authors have no conflicts of interest.

Figures

Fig. 1
Fig. 1
a Venn diagram showing number and overlap of proteins identified from proteomics analyses of individual serum glycoproteins and albumin and IgG depleted serum. Only proteins identified at a 1 % FDR with two or more unique peptides were included. Comparison of a naïve search space (b) with proteomics informed (c) and proteomics + glycomics informed (d) search spaces for glycoprotein mixture 3. x-axes show the Swissprot identifiers for glycoproteins included in the hypotheses and y-axis shows number of glycopeptides per glycoprotein considered. Glycoprotein standards AGP (A1AG), fetuin (FETUA), and transferrin (TRFE) are shown on the left-most side of the x-axes
Fig. 2
Fig. 2
Base peak chromatogram of enriched transferrin glycopeptides stacked over extracted ion chromatograms for saccharide oxonium ions, generated by MS/MS, indicating the presence of glycopeptides in the eluting peaks
Fig. 3
Fig. 3
Comparison of the performance of naïve versus informed search spaces for an AGP glycoproteomics result. a Number of glycopeptide matches over a range of confidence thresholds (q-value) based on glycoproteomics searches followed by FDR calculation by decoy database searches. Colored lines represent different hypotheses as indicated in the legend. b A comparison of number of glycopeptides and decoys matches for naïve and informed search spaces at different MS2 score thresholds
Fig. 4
Fig. 4
Effects of increasing sample complexity on analytical methods. a Base peak chromatograms (MS only) for samples with different levels of complexity. b Extracted ion chromatograms showing the abundance of an AGP glycopeptide (LVPVPITNATLDR-Hex6, HexNAc5, NeuAc3) in the different complexity levels. c The average number of tandem mass spectra acquired per precursor across samples of different complexity. d Distributions of precursors selected for tandem MS across a LC-tandem MS experiment for samples with scaling complexity levels
Fig. 5
Fig. 5
Grouped bar plots showing identified site-specific AGP glycoforms in samples of different complexity levels and depleted serum. The different glycoforms for each glycosylation site are listed on the x-axes while the y-axes indicate summed abundances of all fragment ions in the matched glycopeptide spectra. Different colored bars are specific for pure AGP, glycoprotein mixtures, or depleted serum as indicated in the legend

References

    1. Leymarie N, Griffin PJ, Jonscher K, Kolarich D, Orlando R, McComb M, Zaia J, Aguilan J, Alley WR, Altmann F, Ball LE, Basumallick L, Bazemore-Walker CR, Behnken H, Blank MA, Brown KJ, Bunz S-C, Cairo CW, Cipollo JF, Daneshfar R, Desaire H, Drake RR, Go EP, Goldman R, Gruber C, Halim A, Hathout Y, Hensbergen PJ, Horn DM, Hurum D, Jabs W, Larson G, Ly M, Mann BF, Marx K, Mechref Y, Meyer B, Möginger U, Neusüss C, Nilsson J, Novotny MV, Nyalwidhe JO, Packer NH, Pompach P, Reiz B, Resemann A, Rohrer JS, Ruthenbeck A, Sanda M, Schulz JM, Schweiger-Hufnagel U, Sihlbom C, Song E, Staples GO, Suckau D, Tang H, Thaysen-Andersen M, Viner RI, An Y, Valmu L, Wada Y, Watson M, Windwarder M, Whittal R, Wuhrer M, Zhu Y, Zou C. Interlaboratory study on differential analysis of protein glycosylation by mass spectrometry: the ABRF glycoprotein research multi-institutional study 2012. Mol Cell Proteomics. 2013 doi: 10.1074/mcp.M113.03064. - DOI - PMC - PubMed
    1. Desaire H, Hua D. When can glycopeptides be assigned based solely on high-resolution mass spectrometry data? Int J Mass Spectrom. 2009;287:21–6. doi: 10.1016/j.ijms.2008.12.001. - DOI
    1. Mayampurath AM, Wu Y, Segu ZM, Mechref Y, Tang H. Improving confidence in detection and characterization of protein N-glycosylation sites and microheterogeneity. Rapid Commun Mass Spectrom. 2011;25:2007–19. doi: 10.1002/rcm.5059. - DOI - PubMed
    1. Wu Y, Mechref Y, Klouckova I, Mayampurath A, Novotny MV, Tang H. Mapping site-specific protein N-glycosylations through liquid chromatography/mass spectrometry and targeted tandem mass spectrometry. Rapid Commun Mass Spectrom. 2010;24:965–72. doi: 10.1002/rcm.447. - DOI - PubMed
    1. Wang WT, LeDonne NC, Ackerman B, Sweeley CC. Structural characterization of oligosaccharides by high-performance liquid chromatography, fast-atom bombardment-mass spectrometry, and exoglycosidase digestion. Anal Biochem. 1984;141:366–81. doi: 10.1016/0003-2697(84)90057-5. - DOI - PubMed

LinkOut - more resources