Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 11;3(7):100521.
doi: 10.1016/j.crmeth.2023.100521. eCollection 2023 Jul 24.

Targeted proteomics data interpretation with DeepMRM

Affiliations

Targeted proteomics data interpretation with DeepMRM

Jungkap Park et al. Cell Rep Methods. .

Abstract

Targeted proteomics is widely utilized in clinical proteomics; however, researchers often devote substantial time to manual data interpretation, which hinders the transferability, reproducibility, and scalability of this approach. We introduce DeepMRM, a software package based on deep learning algorithms for object detection developed to minimize manual intervention in targeted proteomics data analysis. DeepMRM was evaluated on internal and public datasets, demonstrating superior accuracy compared with the community standard tool Skyline. To promote widespread adoption, we have incorporated a stand-alone graphical user interface for DeepMRM and integrated its algorithm into the Skyline software package as an external tool.

Keywords: Skyline; machine learning; multiple reaction monitoring; object detection; peak detection; quality control; quantification; selected reaction monitoring; targeted proteomics.

PubMed Disclaimer

Conflict of interest statement

J.P. is an employee of Bertis, Inc., and C.W., D.A., and S.K. are employees of Bertis Bioscience, Inc., both of which are companies developing proteomics-based diagnostics solutions.

Figures

None
Graphical abstract
Figure 1
Figure 1
DeepMRM workflow for detecting peak groups of targeted peptides Given the target list and MRM/PRM/DIA data as input, transition chromatograms of both heavy and light peptides are fed into the model as 2-channel heatmap images. Multiscale 1D feature maps are extracted and processed by two subnetworks: a classifier for determining whether candidate peak groups are present and a regressor for detecting the boundary of candidate peak groups. Detected peak groups are then examined by a CNN-based transition classification model to select transitions unaffected by interference or noise. The abundances of target peptides are estimated based on peak areas of heavy and light peptides.
Figure 2
Figure 2
Comparison of DeepMRM and Skyline with or without mProphet Relative quantification and distribution of the abundance of heavy peptides for (A–D) noisy dataset and (E–H) complex background dataset. There is no difference between Skyline default and Skyline at FDR 5% in the complex background dataset because mProphet did not filter any results. Red dotted lines represent the theoretical values according to the heavy peptide abundance. In the boxplots, the centerline, edges, and whiskers represent the median, the first and third quartile, and 1.5× interquartile range, respectively. Outlier points outside of the whiskers are indicated by dot symbols. The table below the boxplot shows averaged correlation coefficients, arctangent absolute percentage error over 43 targeted peptides, and the number of peak groups in each quantile box (PCC, Pearson’s correlation; SPC, Spearman’s rank correlation; MAAPE, mean arctangent absolute percentage error). See also Figure S3.
Figure 3
Figure 3
DeepMRM performances on MRM, PRM, and DIA datasets (A and B) (A) Average precision (AP) and recall (RC) scores for MRM, PRM, and DIA datasets and (B) scatterplot comparing the results of light/heavy ratios calculated by peaks manually annotated and those by peaks detected by DeepMRM. The scale of axes is log2. The table below the scatterplot shows correlation coefficients (PCC, Pearson’s correlation; SPC, Spearman’s rank correlation) and mean arctangent absolute percentage error (MAAPE). See also Figure S2 and Table S2.

References

    1. Picotti P., Aebersold R. Selected reaction monitoring–based proteomics: workflows, potential, pitfalls and future directions. Nat. Methods. 2012;9:555–566. doi: 10.1038/nmeth.2015. - DOI - PubMed
    1. Doerr A. Mass spectrometry–based targeted proteomics. Nat. Methods. 2013;10:23. doi: 10.1038/nmeth.2286. - DOI - PubMed
    1. Schubert O.T., Röst H.L., Collins B.C., Rosenberger G., Aebersold R. Quantitative proteomics: challenges and opportunities in basic and applied research. Nat. Protoc. 2017:1289–1294. doi: 10.1038/nprot.2017.040. - DOI - PubMed
    1. Reiter L., Rinner O., Picotti P., Hüttenhain R., Beck M., Brusniak M.Y., Hengartner M.O., Aebersold R. mProphet: automated data processing and statistical validation for large-scale SRM experiments. Nat. Methods. 2011;8:430–435. doi: 10.1038/nmeth.1584. - DOI - PubMed
    1. Pino L.K., Searle B.C., Bollinger J.G., Nunn B., MacLean B., MacCoss M.J. The Skyline ecosystem: Informatics for quantitative mass spectrometry proteomics. Mass Spectrom. Rev. 2020;39:229–244. doi: 10.1002/MAS.21540. - DOI - PMC - PubMed

Publication types

LinkOut - more resources