A general framework for developing computable clinical phenotype algorithms
- PMID: 38748991
- PMCID: PMC11258420
- DOI: 10.1093/jamia/ocae121
A general framework for developing computable clinical phenotype algorithms
Abstract
Objective: To present a general framework providing high-level guidance to developers of computable algorithms for identifying patients with specific clinical conditions (phenotypes) through a variety of approaches, including but not limited to machine learning and natural language processing methods to incorporate rich electronic health record data.
Materials and methods: Drawing on extensive prior phenotyping experiences and insights derived from 3 algorithm development projects conducted specifically for this purpose, our team with expertise in clinical medicine, statistics, informatics, pharmacoepidemiology, and healthcare data science methods conceptualized stages of development and corresponding sets of principles, strategies, and practical guidelines for improving the algorithm development process.
Results: We propose 5 stages of algorithm development and corresponding principles, strategies, and guidelines: (1) assessing fitness-for-purpose, (2) creating gold standard data, (3) feature engineering, (4) model development, and (5) model evaluation.
Discussion and conclusion: This framework is intended to provide practical guidance and serve as a basis for future elaboration and extension.
Keywords: computable algorithms; health outcomes; modeling methods; recommended practices.
© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Conflict of interest statement
R.B. is an author on US Patent 9,075,796, “Text mining for large medical text datasets and corresponding medical text classification using informative feature selection.” At present, this patent is not licensed and does not generate royalties. All other authors have no competing interests to declare.
Figures
References
MeSH terms
LinkOut - more resources
Full Text Sources
