Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep:163:107146.
doi: 10.1016/j.compbiomed.2023.107146. Epub 2023 Jun 15.

Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU

Collaborators, Affiliations

Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU

I Vagliano et al. Comput Biol Med. 2023 Sep.

Abstract

Background: - Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods.

Method: - We applied the improved Subgroup Set Discovery (SSD++), Patient Rule Induction Method (PRIM) and APRIORI - Subgroup Discovery (APRIORI-SD) algorithms to obtain patient subgroups on observational data of 14,548 COVID-19 patients admitted to 73 Dutch intensive care units. Hospital mortality was the clinical outcome. Numerical significance of the subgroups was assessed with information-theoretic measures. Clinical significance of the subgroups was assessed by comparing variable importance on population and subgroup levels and by expert evaluation.

Results: - The tested algorithms varied widely in the total number of discovered subgroups (5-62), the number of selected variables, and the predictive value of the subgroups. Qualitative assessment showed that the found subgroups make clinical sense. SSD++ found most subgroups (n = 62), which added predictive value and generally showed high potential for clinical use. APRIORI-SD and PRIM found fewer subgroups (n = 5 and 6), which did not add predictive value and were clinically less relevant.

Conclusion: - Automated SGD methods find clinical subgroups that are relevant when assessed quantitatively (yield added predictive value) and qualitatively (intensivists consider the subgroups significant). Different methods yield different subgroups with varying degrees of predictive performance and clinical quality. External validation is needed to generalize the results to other populations and future research should explore which algorithm performs best in other settings.

Keywords: COVID-19; Data registry; In-hospital mortality; Intensive care; Machine learning; Subgroup discovery.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no conflict of interests.

Figures

Image 1
Graphical abstract
Fig. 1
Fig. 1
Expert evaluation on clinical relevance of the obtained subgroups.

References

    1. Higgins J.P.T., Thomas J., Chandler J., Cumpston M., Li T., Page M.J., Welch V.A., editors. Cochrane Handbook for Systematic Reviews of Interventions Version 6.3 (Updated February 2022). Cochrane. 2022. www.training.cochrane.org/handbook Available from. :
    1. Ventura S., Luna J.M. Supervised Descriptive Pattern Mining. Springer; Cham: 2018. Subgroup discovery. - DOI
    1. Helal S. Subgroup discovery algorithms: a survey and empirical evaluation. J. Comput. Sci. Technol. 2016;31:561–576. doi: 10.1007/s11390-016-1647-1. - DOI
    1. Proença H.M., Grünwald P., Bäck T., Leeuwen M v. In: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2020. Hutter F., Kersting K., Lijffijt J., Valera I., editors. Vol. 12457. Springer; Cham: 2021. Discovering outstanding subgroup lists for numeric targets using MDL. (Lecture Notes in Computer Science). - DOI
    1. Esnault C., Gadonna M.-L., Queyrel M., Templier A., Zucker J.-D. Q-finder: an algorithm for credible subgroup discovery in clinical data analysis — an application to the international diabetes management practice study. Front. Artif. Intell. 2020;3 doi: 10.3389/frai.2020.559927. - DOI - PMC - PubMed

Publication types