Semi-supervised learning for somatic variant calling and peptide identification in personalized cancer immunotherapy
- PMID: 33375939
- PMCID: PMC7772914
- DOI: 10.1186/s12859-020-03813-x
Semi-supervised learning for somatic variant calling and peptide identification in personalized cancer immunotherapy
Abstract
Background: Personalized cancer vaccines are emerging as one of the most promising approaches to immunotherapy of advanced cancers. However, only a small proportion of the neoepitopes generated by somatic DNA mutations in cancer cells lead to tumor rejection. Since it is impractical to experimentally assess all candidate neoepitopes prior to vaccination, developing accurate methods for predicting tumor-rejection mediating neoepitopes (TRMNs) is critical for enabling routine clinical use of cancer vaccines.
Results: In this paper we introduce Positive-unlabeled Learning using AuTOml (PLATO), a general semi-supervised approach to improving accuracy of model-based classifiers. PLATO generates a set of high confidence positive calls by applying a stringent filter to model-based predictions, then rescores remaining candidates by using positive-unlabeled learning. To achieve robust performance on clinical samples with large patient-to-patient variation, PLATO further integrates AutoML hyper-parameter tuning, classification threshold selection based on spies, and support for bootstrapping.
Conclusions: Experimental results on real datasets demonstrate that PLATO has improved performance compared to model-based approaches for two key steps in TRMN prediction, namely somatic variant calling from exome sequencing data and peptide identification from MS/MS data.
Keywords: Exome sequencing; Machine learning; Peptide identification; Positive-unlabeled learning; Somatic variant calling; Tandem mass-spectrometry.
Conflict of interest statement
ES and JF declare that they have no competing interests. IIM declares that he has a significant financial interest in Truvax Inc., a company developing personalized cancer vaccines.
Figures
References
-
- Castle JC, Kreiter S, Diekmann J, Löwer M, van de Roemer N, de Graaf J, Selmi A, Diken M, Boegel S, Paret C, Koslowski M, Kuhn AN, Britten CM, Huber C, Türeci Ö, Sahin U. Exploiting the mutanome for tumor vaccination. Cancer Res. 2012;72(5):1081–1091. doi: 10.1158/0008-5472.CAN-11-3722. - DOI - PubMed
-
- Duan F, Duitama J, Seesi SA, Ayres C, Corcelli S, Pawashe A, Blanchard T, McMahon D, Sidney J, Sette A, Baker B, Mandoiu II, Srivastava PK. Genomic and bioinformatic profiling of mutational neo-epitopes reveals new rules to predict anti-cancer immunogenicity. J Exp Med. 2014;211(11):2231–2248. doi: 10.1084/jem.20141308. - DOI - PMC - PubMed
-
- Gubin MM, Zhang X, Schuster H, Caron E, Ward JP, Noguchi T, Ivanova Y, Hundal J, Arthur CD, Krebber W-J, Mulder GE, Toebes M, Vesely MD, Lam SSK, Korman AJ, Allison JP, Freeman GJ, Sharpe AH, Pearce EL, Schumacher TN, Aebersold R, Rammensee H-G, Melief CJM, Mardis ER, Gillanders WE, Artyomov MN, Schreiber RD. Checkpoint blockade cancer immunotherapy targets tumour-specific mutant antigens. Nature. 2014;515(7528):577–81. doi: 10.1038/nature13988. - DOI - PMC - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
