Methods for identifying health status from routinely collected health data: An overview
- PMID: 39897572
- PMCID: PMC11786076
- DOI: 10.1016/j.imr.2024.101100
Methods for identifying health status from routinely collected health data: An overview
Abstract
Routinely collected health data (RCD) are currently accelerating publications that evaluate the effectiveness and safety of medicines and medical devices. One of the fundamental steps in using these data is developing algorithms to identify health status that can be used for observational studies. However, the process and methodologies for identifying health status from RCD remain insufficiently understood. While most current methods rely on International Classification of Diseases (ICD) codes, they may not be universally applicable. Although machine learning methods hold promise for more accurately identifying the health status, they remain underutilized in RCD studies. To address these significant methodological gaps, we outline key steps and methodological considerations for identifying health statuses in observational studies using RCD. This review has the potential to boost the credibility of findings from observational studies that use RCD.
Keywords: Health status; Machine learning algorithms; Routinely collected health data; Rule-based algorithms.
© 2025 Korea Institute of Oriental Medicine. Published by Elsevier B.V.
Conflict of interest statement
The authors declare that they have no conflicts of interest.
Figures
References
-
- Sherman RE, Anderson SA, Dal Pan GJ, Gray GW, Gross T, Hunter NL, et al. Real-world evidence - what is it and what can it tell us? New Engl J Med. 2016;375(23):2293–2297. - PubMed
-
- Deng L, Chen Z, Zhu P, Hu C, Jin T, Wang X, et al. Effects of integrated traditional Chinese and Western medicine for acute pancreatitis: A real-world study in a tertiary teaching hospital. J Evid Based Med. 2024;17(3):575–587. - PubMed
LinkOut - more resources
Full Text Sources
