Applying interpretable machine learning in computational biology-pitfalls, recommendations and opportunities for new developments
- PMID: 39122941
- PMCID: PMC11348280
- DOI: 10.1038/s41592-024-02359-7
Applying interpretable machine learning in computational biology-pitfalls, recommendations and opportunities for new developments
Abstract
Recent advances in machine learning have enabled the development of next-generation predictive models for complex computational biology problems, thereby spurring the use of interpretable machine learning (IML) to unveil biological insights. However, guidelines for using IML in computational biology are generally underdeveloped. We provide an overview of IML methods and evaluation techniques and discuss common pitfalls encountered when applying IML methods to computational biology problems. We also highlight open questions, especially in the era of large language models, and call for collaboration between IML and computational biology researchers.
© 2024. Springer Nature America, Inc.
Conflict of interest statement
A.T. received gift research grants from Meta, Morgan Stanley, and Amazon. J.M. received gift research grant from Google Research. A.T. works part-time for Amplify Partners. The other authors declare no competing interests.
Figures
References
-
- Miller T Explanation in artificial intelligence: Insights from the social sciences. Artificial Intelligence 267, 1–38 (2019).
-
- Doshi-Velez F & Kim B Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608 (2017).
-
- Azodi CB, Tang J & Shiu S-H Opening the black box: Interpretable machine learning for geneticists. Trends in Genetics 36, 442–455 (2020). - PubMed
-
- Eraslan G, Avsec Ž, Gagneur J & Theis FJ Deep learning: new computational modelling techniques for genomics. Nature Reviews Genetics 20, 389–403 (2019). This paper gives an extensive review of the application of deep learning models in genomics. - PubMed
Publication types
MeSH terms
Grants and funding
- IIS2046613/National Science Foundation (NSF)
- IIS1838017/National Science Foundation (NSF)
- UH3 CA268202/CA/NCI NIH HHS/United States
- U24 HG012070/HG/NHGRI NIH HHS/United States
- R01 HG007352/HG/NHGRI NIH HHS/United States
- R56 HG013092/HG/NHGRI NIH HHS/United States
- IIS1705121/National Science Foundation (NSF)
- R01HG007352/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- UM1HG011593/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- IIS2112471/National Science Foundation (NSF)
- R01HG012303/U.S. Department of Health & Human Services | National Institutes of Health (NIH)
- UM1 HG011593/HG/NHGRI NIH HHS/United States
- R01 HG012303/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Miscellaneous
