Targeted proteomics data interpretation with DeepMRM

Affiliations

¹ Bertis, Inc., Seoul 06108, Republic of Korea.
² Bertis Bioscience, Inc., San Diego, CA 92121, USA.
³ Department of Chemistry, Center for Proteogenomic Research, Korea University, Seoul 02841, Republic of Korea.
⁴ Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.

PMID: 37533638
PMCID: PMC10391571
DOI: 10.1016/j.crmeth.2023.100521

Targeted proteomics data interpretation with DeepMRM

Jungkap Park et al. Cell Rep Methods. 2023.

. 2023 Jul 11;3(7):100521.

doi: 10.1016/j.crmeth.2023.100521. eCollection 2023 Jul 24.

Authors

Affiliations

¹ Bertis, Inc., Seoul 06108, Republic of Korea.
² Bertis Bioscience, Inc., San Diego, CA 92121, USA.
³ Department of Chemistry, Center for Proteogenomic Research, Korea University, Seoul 02841, Republic of Korea.
⁴ Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.

PMID: 37533638
PMCID: PMC10391571
DOI: 10.1016/j.crmeth.2023.100521

Abstract

Targeted proteomics is widely utilized in clinical proteomics; however, researchers often devote substantial time to manual data interpretation, which hinders the transferability, reproducibility, and scalability of this approach. We introduce DeepMRM, a software package based on deep learning algorithms for object detection developed to minimize manual intervention in targeted proteomics data analysis. DeepMRM was evaluated on internal and public datasets, demonstrating superior accuracy compared with the community standard tool Skyline. To promote widespread adoption, we have incorporated a stand-alone graphical user interface for DeepMRM and integrated its algorithm into the Skyline software package as an external tool.

Keywords: Skyline; machine learning; multiple reaction monitoring; object detection; peak detection; quality control; quantification; selected reaction monitoring; targeted proteomics.

PubMed Disclaimer

Conflict of interest statement

J.P. is an employee of Bertis, Inc., and C.W., D.A., and S.K. are employees of Bertis Bioscience, Inc., both of which are companies developing proteomics-based diagnostics solutions.

Figures

**Figure 1**
DeepMRM workflow for detecting peak groups of targeted peptides Given the target list and MRM/PRM/DIA data as input, transition chromatograms of both heavy and light peptides are fed into the model as 2-channel heatmap images. Multiscale 1D feature maps are extracted and processed by two subnetworks: a classifier for determining whether candidate peak groups are present and a regressor for detecting the boundary of candidate peak groups. Detected peak groups are then examined by a CNN-based transition classification model to select transitions unaffected by interference or noise. The abundances of target peptides are estimated based on peak areas of heavy and light peptides.

**Figure 2**
Comparison of DeepMRM and Skyline with or without mProphet Relative quantification and distribution of the abundance of heavy peptides for (A–D) noisy dataset and (E–H) complex background dataset. There is no difference between Skyline default and Skyline at FDR 5% in the complex background dataset because mProphet did not filter any results. Red dotted lines represent the theoretical values according to the heavy peptide abundance. In the boxplots, the centerline, edges, and whiskers represent the median, the first and third quartile, and 1.5× interquartile range, respectively. Outlier points outside of the whiskers are indicated by dot symbols. The table below the boxplot shows averaged correlation coefficients, arctangent absolute percentage error over 43 targeted peptides, and the number of peak groups in each quantile box (PCC, Pearson’s correlation; SPC, Spearman’s rank correlation; MAAPE, mean arctangent absolute percentage error). See also Figure S3.

**Figure 3**
DeepMRM performances on MRM, PRM, and DIA datasets (A and B) (A) Average precision (AP) and recall (RC) scores for MRM, PRM, and DIA datasets and (B) scatterplot comparing the results of light/heavy ratios calculated by peaks manually annotated and those by peaks detected by DeepMRM. The scale of axes is log2. The table below the scatterplot shows correlation coefficients (PCC, Pearson’s correlation; SPC, Spearman’s rank correlation) and mean arctangent absolute percentage error (MAAPE). See also Figure S2 and Table S2.

See this image and copyright information in PMC

References

1. Picotti P., Aebersold R. Selected reaction monitoring–based proteomics: workflows, potential, pitfalls and future directions. Nat. Methods. 2012;9:555–566. doi: 10.1038/nmeth.2015. - DOI - PubMed
1. Doerr A. Mass spectrometry–based targeted proteomics. Nat. Methods. 2013;10:23. doi: 10.1038/nmeth.2286. - DOI - PubMed
1. Schubert O.T., Röst H.L., Collins B.C., Rosenberger G., Aebersold R. Quantitative proteomics: challenges and opportunities in basic and applied research. Nat. Protoc. 2017:1289–1294. doi: 10.1038/nprot.2017.040. - DOI - PubMed
1. Reiter L., Rinner O., Picotti P., Hüttenhain R., Beck M., Brusniak M.Y., Hengartner M.O., Aebersold R. mProphet: automated data processing and statistical validation for large-scale SRM experiments. Nat. Methods. 2011;8:430–435. doi: 10.1038/nmeth.1584. - DOI - PubMed
1. Pino L.K., Searle B.C., Bollinger J.G., Nunn B., MacLean B., MacCoss M.J. The Skyline ecosystem: Informatics for quantitative mass spectrometry proteomics. Mass Spectrom. Rev. 2020;39:229–244. doi: 10.1002/MAS.21540. - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

Grants and funding

R24 GM141156/GM/NIGMS NIH HHS/United States

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Targeted proteomics data interpretation with DeepMRM

Affiliations

Targeted proteomics data interpretation with DeepMRM

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources