Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 7;25(1):1040.
doi: 10.1186/s12913-025-13139-0.

Machine learning based classification of catastrophic health expenditures: a cross-sectional study of Korean low-income households

Affiliations

Machine learning based classification of catastrophic health expenditures: a cross-sectional study of Korean low-income households

Seok Min Ji et al. BMC Health Serv Res. .

Abstract

Background: Despite the National Health Insurance (NHI) system implemented in South Korea, concerns persist regarding access to health coverage for low-income households. To address this issue, this study aims to use machine learning-based data mining techniques to classify whether such households will face catastrophic health expenditures (CHEs).

Methods: A total of 4,031 low-income people were extracted using 2019 data from the Korea Health Panel Survey. The classification model was developed using four machine learning algorithms: Random Forest, Gradient boosting, Decision tree, Ridge regression, Neural network, and AdaBoost. Ten-fold cross validation was carried out to ensure the reliability of the analysis results. The model was evaluated based on the Area Under Receiver Operating Characteristics (AUROC) as well as accuracy, precision, recall, and F-1 score.

Results: The study's findings revealed that the incidence of CHE was 26.2% in low-income households. The AdaBoost model had the highest classifiable power. It showed AUROC of 89.8%, accuracy of 83.1%, precision of 82.4%, recall of 83.1, and F1 score of 82.1%. The study found that economic activity, chronic disease, and age were significant factors that could lead to CHEs. Therefore, individuals over 65, with chronic conditions, and unemployed had the highest likelihood of developing CHE.

Conclusion: It is essential to identify low-income households that are at risk of CHEs in advance before facing the economic burden. This research is expected to provide fundamental data that can aid in developing an integrated support program to prevent and manage CHEs more effectively.

Keywords: CHE; Catastrophic health expenditure; Health policy; Machine learning; Population health.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: This study was approved by the Korea University Institutional Review Board (IRB No. 2023-0043). The IRB of Korea University waived informed consent since this study was retrospective and blinding of the personal information in the data was performed. This data is publicly accessible and written informed consent is obtained from all the participants before participating in the survey. Respondents’ information was completely anonymized for use for research purposes and unidentified prior to analysis. The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2000. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flow diagram of the participants selection
Fig. 2
Fig. 2
The AUROC of CHE classification models using various machine learning algorithms

Similar articles

References

    1. Böhm K, Schmid A, Götze R, Landwehr C, Rothgang H. Five types of OECD healthcare systems: empirical results of a deductive classification. Health Policy. 2013;113(3):258–69. 10.1016/j.healthpol.2013.09.003. - PubMed
    1. Shin SM. Household catastrophic health expenditure related to pain in Korea. Korean J Pain. 2023;36(3):347–57. 10.3344/kjp.23041. - PMC - PubMed
    1. Koo JH, Jung HW. Which indicator should be used? A comparison between the incidence and intensity of catastrophic health expenditure: using difference-in-difference analysis. Health Econ Rev. 2022;12(1):58. 10.1186/s13561-022-00403-w. - PMC - PubMed
    1. Azzani M, Roslani AC, Su TT. Determinants of household catastrophic health expenditure: A systematic review. Malaysian J Med Sciences: MJMS. 2019;26(1):15–43. 10.21315/mjms2019.26.1.3. - PMC - PubMed
    1. Lee HY, Oh J, Kawachi I. Changes in catastrophic health expenditures for major diseases after A 2013 health insurance expansion in South Korea. Health Aff. 2022;41(5):722–31. 10.1377/hlthaff.2021.01320. - PubMed

LinkOut - more resources