Machine learning based classification of catastrophic health expenditures: a cross-sectional study of Korean low-income households
- PMID: 40775329
- PMCID: PMC12333208
- DOI: 10.1186/s12913-025-13139-0
Abstract
Background: Despite the National Health Insurance (NHI) system implemented in South Korea, concerns persist regarding access to health coverage for low-income households. To address this issue, this study aims to use machine learning-based data mining techniques to classify whether such households will face catastrophic health expenditures (CHEs).
Methods: A total of 4,031 low-income people were extracted from the 2019 data of the Korea Health Panel Survey. The classification model was developed using six machine learning algorithms: random forest, gradient boosting, decision tree, ridge regression, neural network, and AdaBoost. Ten-fold cross-validation was carried out to ensure the reliability of the analysis results. The model was evaluated based on the area under the receiver operating characteristic curve (AUROC) as well as accuracy, precision, recall, and F1 score.
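The evaluation protocol described in the Methods (ten-fold cross-validation scored by AUROC, accuracy, precision, recall, and F1) can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the data are synthetic, the "model" is a toy single-feature threshold rule standing in for the six algorithms, and all function names (`stratified_kfold`, `fit_threshold`, `binary_metrics`, `auroc`, `cross_validate`) are hypothetical. Metrics here are computed for the positive class only; the abstract does not state which averaging scheme the study used.

```python
import random


def stratified_kfold(labels, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs, keeping class ratios similar per fold."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)  # deal indices round-robin into folds
    for f in range(k):
        train = [i for g in range(k) if g != f for i in folds[g]]
        yield train, folds[f]


def fit_threshold(x, y):
    """Toy stand-in for a classifier: the cut-off on x with best training accuracy."""
    best_cut, best_acc = None, -1.0
    for cut in sorted(set(x)):
        acc = sum((xi >= cut) == (yi == 1) for xi, yi in zip(x, y)) / len(y)
        if acc > best_acc:
            best_cut, best_acc = cut, acc
    return best_cut


def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F1 for the positive class (label 1)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": (tp + tn) / len(y_true), "precision": precision,
            "recall": recall, "f1": f1}


def auroc(y_true, scores):
    """Chance that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def cross_validate(x, y, k=10):
    """Fit on k-1 folds, score on the held-out fold, and average over the k folds."""
    totals = {}
    for train, test in stratified_kfold(y, k):
        cut = fit_threshold([x[i] for i in train], [y[i] for i in train])
        x_test = [x[i] for i in test]
        y_test = [y[i] for i in test]
        fold = binary_metrics(y_test, [int(v >= cut) for v in x_test])
        fold["auroc"] = auroc(y_test, x_test)
        for name, value in fold.items():
            totals[name] = totals.get(name, 0.0) + value
    return {name: total / k for name, total in totals.items()}


# Synthetic data (an assumption): about 26% positives, one noisy risk score.
rng = random.Random(42)
y = [1 if rng.random() < 0.262 else 0 for _ in range(500)]
x = [rng.gauss(1.5 if label else 0.0, 1.0) for label in y]

results = cross_validate(x, y, k=10)
for name, value in sorted(results.items()):
    print(f"{name}: {value:.3f}")
```

Stratifying the folds keeps the share of CHE-positive cases roughly constant across folds, which matters when the positive class is the minority, as it is here.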
Results: The incidence of CHE was 26.2% in low-income households. The AdaBoost model had the highest classification performance, with an AUROC of 89.8%, accuracy of 83.1%, precision of 82.4%, recall of 83.1%, and F1 score of 82.1%. Economic activity, chronic disease, and age were significant factors associated with CHEs. Accordingly, individuals who were over 65, had chronic conditions, and were unemployed had the highest likelihood of experiencing CHE.
Conclusion: It is essential to identify low-income households at risk of CHEs before they face the economic burden. This research is expected to provide fundamental data that can aid in developing an integrated support program to prevent and manage CHEs more effectively.
Keywords: CHE; Catastrophic health expenditure; Health policy; Machine learning; Population health.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: This study was approved by the Korea University Institutional Review Board (IRB No. 2023-0043). The IRB of Korea University waived informed consent because this study was retrospective and personal information in the data was blinded. The data are publicly accessible, and written informed consent was obtained from all participants before they took part in the survey. Respondents’ information was completely anonymized for research use and de-identified prior to analysis. The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2000. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.