Methodology for Good Machine Learning with Multi-Omics Data
- PMID: 37965805
- DOI: 10.1002/cpt.3105
Methodology for Good Machine Learning with Multi-Omics Data
Abstract
In 2020, Novartis Pharmaceuticals Corporation and the U.S. Food and Drug Administration (FDA) started a 4-year scientific collaboration to approach complex new data modalities and advanced analytics. The scientific question was to find novel radio-genomics-based prognostic and predictive factors for HR+/HER- metastatic breast cancer under a Research Collaboration Agreement. This collaboration has been providing valuable insights to help successfully implement future scientific projects, particularly using artificial intelligence and machine learning. This tutorial aims to provide tangible guidelines for a multi-omics project that includes multidisciplinary expert teams, spanning across different institutions. We cover key ideas, such as "maintaining effective communication" and "following good data science practices," followed by the four steps of exploratory projects, namely (1) plan, (2) design, (3) develop, and (4) disseminate. We break each step into smaller concepts with strategies for implementation and provide illustrations from our collaboration to further give the readers actionable guidance.
© 2023 The Authors. Clinical Pharmacology & Therapeutics © 2023 American Society for Clinical Pharmacology and Therapeutics. This article has been contributed to by U.S. Government employees and their work is in the public domain in the USA.
References
-
- Coroller, T.P. et al. Multi-omics investigation on the prognostic and predictive factors in metastatic breast cancer using data from Phase III ribociclib clinical trials: A statistical and machine learning analysis plan. medRxiv 2023-08 (2023).
-
- Diao, J.A. et al. Human-interpretable image features derived from densely mapped cancer pathology slides predict diverse molecular phenotypes. Nat. Commun. 12, 1613 (2021).
-
- Libbrecht, M.W. & Noble, W.S. Machine learning applications in genetics and genomics. Nat. Rev. Genet. 16, 321-332 (2015).
-
- Dvijotham, K. et al. Enhancing the reliability and accuracy of AI-enabled diagnosis via complementarity-driven deferral to clinicians. Nat. Med. 1-7, 1814-1820 (2023).
-
- Lambin, P. et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14, 749-762 (2017).
MeSH terms
LinkOut - more resources
Full Text Sources