Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020;13(4):465-474.
doi: 10.4310/sii.2020.v13.n4.a4. Epub 2020 Jul 31.

Meta-analysis of peptides to detect protein significance

Affiliations

Meta-analysis of peptides to detect protein significance

Yuping Zhang et al. Stat Interface. 2020.

Abstract

Shotgun assays are widely used in biotechnologies to characterize large molecules, which are hard to be measured as a whole directly. For instance, in Liquid Chromatography - Mass Spectrometry (LC-MS) shotgun experiments, proteins in biological samples are digested into peptides, and then peptides are separated and measured. However, in proteomics study, investigators are usually interested in the performance of the whole proteins instead of those peptide fragments. In light of meta-analysis, we propose an adaptive thresholding method to select informative peptides, and combine peptide-level models to protein-level analysis. The meta-analysis procedure and modeling rationale can be adapted to data analysis of other types of shotgun assays.

Keywords: Adaptive thresholding; Meta-analysis; Shotgun technology.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Intensities of peptides from Adolase A in spike-in data. The x-axis indicates samples. There are 18 samples from 6 conditions in total. Each condition has three samples. The samples are ordered by their conditions. Different condition has different spike-in protein concentrations, which are 25, 50, 100, 200, 400, and 800 (fmol), ordered from left to right in the figure. The y-axis indicates the log2 scaled peptide intensities. Different lines with different colors and types indicate different peptides.
Figure 2.
Figure 2.
Random missing probability estimation. The x-axis indicates the average intensity of each peptide. The y-axis indicates the missing rate of each peptide. The solid curve is fitted by the cubic spline regression. The value on the y-axis for the dotted line indicates the estimated completely random missing probability π^.
Figure 3.
Figure 3.
The Receiver Operating Characteristic (ROC) plots for six simulation studies. The x-axes indicate false positive rate, and the y-axes indicate true positive rate. The curves with different colors and line types show the average true positive rates across 10 times replicates for each method respectively. The corresponding shadows show the standard errors. Black solid lines indicate PEAT results; Red dashed lines indicate SFPQ results; Orange longdash lines indicate RRollup results; Blue dotdash lines indicate ZRollup results; Green twodash lines indicate MSstats results.
Figure 4.
Figure 4.
Heatmap of the estimated protein expression for the burn injury study. Each row indicates one protein, each column indicates one sample. The samples are from three categories – healthy controls, burn patients from the first time point, burn patients from the second time point. We add a white line among different biological conditions. The white cells in the heatmap indicate that no protein expression values were estimated due to peptide observations were missing for the entire group.

Similar articles

References

    1. Aebersold R and Mann M (2003). Mass spectrometry-based proteomics. Nature 422 198–207. - PubMed
    1. Belczacka I, Latosinska A, Metzger J, Marx D, Vlahou A, Mischak H and Frantzi M (2019). Proteomics biomarkers for solid tumors: Current status and future prospects. Mass spectrometry reviews 38 49–78. - PubMed
    1. Bellew M, Coram M, Fitzgibbon M, Igra M, Randolph T, Wang P, May D, Eng J, Fang R, Lin C, Chen J, Goodlett D, Whiteaker J, Paulovich A and McIntosh M (2006). A suite of algorithms for the comprehensive analysis of complex protein mixtures using high-resolution LC-MS. Bioinformatics 22 1902–9. - PubMed
    1. Birnbaum A (1954). Combining independent tests of significance. Journal of the American Statistical Association 559–74. MR0065101
    1. Clough T, Key M, Ott I, Ragg S, Schadow G and Vitek O (2009). Protein quantification in label-free LC-MS experiments. Journal of proteome research 8 5275–84. - PubMed

LinkOut - more resources