Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 10;12(1):16990.
doi: 10.1038/s41598-022-21390-2.

Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery

Affiliations

Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery

Kostas Stoitsas et al. Sci Rep. .

Abstract

Predicting recovery after trauma is important to provide patients a perspective on their estimated future health, to engage in shared decision making and target interventions to relevant patient groups. In the present study, several unsupervised techniques are employed to cluster patients based on longitudinal recovery profiles. Subsequently, these data-driven clusters were assessed on clinical validity by experts and used as targets in supervised machine learning models. We present a formalised analysis of the obtained clusters that incorporates evaluation of (i) statistical and machine learning metrics, (ii) clusters clinical validity with descriptive statistics and medical expertise. Clusters quality assessment revealed that clusters obtained through a Bayesian method (High Dimensional Supervised Classification and Clustering) and a Deep Gaussian Mixture model, in combination with oversampling and a Random Forest for supervised learning of the cluster assignments provided among the most clinically sensible partitioning of patients. Other methods that obtained higher classification accuracy suffered from cluster solutions with large majority classes or clinically less sensible classes. Models that used just physical or a mix of physical and psychological outcomes proved to be among the most sensible, suggesting that clustering on psychological outcomes alone yields recovery profiles that do not conform to known risk factors.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
PCA biplot for the set of General Health variables with indication of the ten clusters obtained with kml3d.
Figure 2
Figure 2
Optimum number of clusters with kml3d for the four different cases of variables and k-means.
Figure 3
Figure 3
The two graphs at the top present recovery based on EQ-VAS and EQ-5D for the case of General Health with HDclassif. The two graphs at the bottom depict psychological condition (high values indicate high stress and anxiety) of various clusters after the injury.

References

    1. Alonzo, T. A. Clinical prediction models: A practical approach to development, validation, and updating: by Ewout W. Steyerberg (2009).
    1. Moons KG, Royston P, Vergouwe Y, Grobbee DE, Altman DG. Prognosis and prognostic research: what, why, and how? BMJ. 2009;338:b606. doi: 10.1136/bmj.b606. - DOI - PubMed
    1. Organization, W. H. et al. Injuries and violence: the facts 2014 (2014).
    1. LNAZ. Traumazorg in Beeld; annual report of the Dutch trauma registry 2015–2019 (2020).
    1. Lee Y-H, Bang H, Kim DJ. How to establish clinical prediction models. Endocrinol. Metab. 2016;31:38–44. doi: 10.3803/EnM.2016.31.1.38. - DOI - PMC - PubMed

Publication types