. 2020 Jun 15;37(12):1431-1444.

doi: 10.1089/neu.2019.6705. Epub 2020 Mar 11.

Unsupervised Machine Learning Reveals Novel Traumatic Brain Injury Patient Phenotypes with Distinct Acute Injury Profiles and Long-Term Outcomes

Kaitlin A Folweiler^{1

2}, Danielle K Sandsmark³, Ramon Diaz-Arrastia³, Akiva S Cohen^{1

4}, Aaron J Masino^{1

2

4}

Affiliations

¹ Department of Anesthesiology and Critical Care Medicine, and Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.
² Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.
³ Department of Neurology and University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA.
⁴ Department of Anesthesiology and Critical Care Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA.

PMID: 32008422
PMCID: PMC7249479
DOI: 10.1089/neu.2019.6705

Unsupervised Machine Learning Reveals Novel Traumatic Brain Injury Patient Phenotypes with Distinct Acute Injury Profiles and Long-Term Outcomes

Kaitlin A Folweiler et al. J Neurotrauma. 2020.

. 2020 Jun 15;37(12):1431-1444.

doi: 10.1089/neu.2019.6705. Epub 2020 Mar 11.

Authors

Kaitlin A Folweiler^{1

2}, Danielle K Sandsmark³, Ramon Diaz-Arrastia³, Akiva S Cohen^{1

4}, Aaron J Masino^{1

2

4}

Affiliations

¹ Department of Anesthesiology and Critical Care Medicine, and Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.
² Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.
³ Department of Neurology and University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA.
⁴ Department of Anesthesiology and Critical Care Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA.

PMID: 32008422
PMCID: PMC7249479
DOI: 10.1089/neu.2019.6705

Abstract

The heterogeneity of traumatic brain injury (TBI) remains a core challenge for the success of interventional clinical trials. Data-driven approaches for patient stratification may help to identify TBI patient phenotypes during the acute injury period as well as facilitate targeted trial patient enrollment and analysis of treatment efficacy. In this study, we implemented an unsupervised machine learning approach to identify TBI subpopulations at injury baseline using data from 1213 TBI patients who participated in the Citicoline Brain Injury Treatment Trial (COBRIT) Trial. A wrapper framework utilizing generalized low-rank models automatically selected relevant clinical features that were subsequently used to cluster patients using a partitioning around medoids clustering algorithm. Using this approach, we identified three patient phenotypes with unique clinical injury profiles based on a subset of acute injury features. Phenotype-specific differences in long-term functional outcome trajectories were respectively observed at 3 and 6 months after injury. In comparison, when patients were grouped by baseline Glasgow Coma Scale (GCS), no differences in baseline clinical feature profiles or long-term outcomes were observed. To test phenotype reproducibility in an external validation data set, we used a K-nearest neighbors algorithm to classify subjects in the Transforming Research and Clinical Knowledge in Traumatic Brain Injury (TRACK-TBI) Pilot data set into corresponding phenotypes, then measured the Gower's dissimilarities between TRACK-TBI and COBRIT subjects in each phenotype. No significant differences were found between trial subjects within two phenotypes, suggesting that these phenotypes may be generalizable within a broad range of TBI severity. Further, Extended Glasgow Outcome Scale (GOS-E) outcomes in the TRACK-TBI data set similarly demonstrated phenotype-specific differences in long-term outcomes. Our results suggest that unsupervised machine learning is a promising and effective approach for discovery of novel injury subpopulations over the conventional GCS-based method, and may improve patient selection in future TBI clinical trials.

Keywords: GCS; TBI; clinical trial; machine learning; unsupervised clustering.

PubMed Disclaimer

Conflict of interest statement

No competing financial interests exist.

Figures

**FIG. 1.**
Diagram of the hybrid generalized low-rank model and clustering approach implemented for unsupervised learning. The full feature set with n features and m observations (i.e., traumatic brain injury [TBI] patients) is decomposed into two matrices of lower rank (i.e., dimensions), k. An L1-regularization parameter, γ, is applied to the second low-rank matrix to create a feature subset n’, of the original matrix. The n’ x m feature subset is used to calculate an m x m dissimilarity matrix of the observations and clustered using the partitioning around medoids (PAM) algorithm. The average silhouette width of the clusters is calculated for a range of 3–10 clusters in PAM. The γ parameter is increased if the average silhouette width is higher than the previous iteration and stopped when n’ is zero. The final feature subset n’ and clustering schema is selected using the γ value that yields the highest cluster silhouette width.

**FIG. 2.**
Determining the necessity of each feature in contributing to the final cluster assignment. Feature necessity: Each feature was individually replaced with a null distribution of randomly shuffled values. The remaining features plus the null feature were then clustered upon and the similarity of the clustering result was compared with the original feature set clustering solution using two different measures: (A) the Jaccard similarity coefficient and (B) the pairwise similarity index. Any feature with a Jaccard similarity coefficient >0.75 and pairwise similarity index >90% (dotted lines) when nullified was considered unnecessary. Color image is available online.

**FIG. 3.**
Partitional clustering reveals distinct traumatic brain injury phenotypes. **(A)** T-distributed stochastic neighbor embedding (T-SNE) projection of 1213 traumatic brain injury (TBI) patients from the Citicoline Brain Injury Treatment Trial (COBRIT) study, each dot representing one patient. The partitioning around medoids (PAM) clustering solution, which yielded the maximum average silhouette width, resulted in three clusters labeled phenotype A (teal, n = 420), phenotype B (red, n = 446), and phenotype C (purple, n = 347). X and Y axes denote two-dimensional (2-D) representation of six-dimensional feature space. Novel TBI phenotypes have different recovery outcome trajectories based on the Extended Glasgow Outcome Scale (GOS-E) scores at **(B)** 90 days and **(C)** 180 days post-injury. Statistical significance was computed using the Kruskal–Wallis test with Holm's correction for multiple comparisons (asterisks represent p values: ****p < 0.0001, p > 0.05 n.s.). Color image is available online.

**FIG. 4.**
Baseline Glasgow Coma Scale (GCS) scores do not overlap with traumatic brain injury patient phenotypes and do not correlate with long-term outcome. (A) T-distributed stochastic neighbor embedding (T-SNE) projection of patients within a reduced feature space (same as Fig. 3) labeled by injury severity based on patients' acute GCS score. Injury severity was classified as severe (GCS <8, n = 834; dark green), moderate (GCS 9–12, n = 304; orange), and mild (defined as GCS 13–15 with an abnormal computed tomography [CT] scan, n = 75; blue). Extended Glasgow Outcome Scale (GOS-E) scores at (B) 90 days and **(C)** 180 days post-injury by injury severity. Statistical significance was computed using the Kruskal–Wallis test with Holm's correction for multiple comparisons (asterisks represent p values: *p < 0.05, p > 0.05 n.s.). Color image is available online.

**FIG. 5.**
Transforming Research and Clinical Knowledge in Traumatic Brain Injury (TRACK-TBI) Pilot subjects classified into phenotypes demonstrate similar injury profiles and Extended Glasgow Outcome Scale (GOS-E) outcomes as Citicoline Brain Injury Treatment Trial (COBRIT) phenotype subjects. (A) T-distributed stochastic neighbor embedding (T-SNE) projection of the original COBRIT subject phenotypes (COBRIT Phen. A, Phen. B, Phen. C) with the addition of TRACK-TBI Pilot subjects given phenotype assignments by a K-nearest neighbors (K-NN) classifier (TRACK-TBI Phen. A, Phen. B, Phen. C). TRACK-TBI phenotype extended GOS-E scores at (B) 90 days and **(C)** 180 days post-injury significance was computed using the Kruskal–Wallis test with Holm's correction for multiple comparisons (asterisks represent p values: **p < 0.01, p > 0.05 n.s.). Color image is available online.

See this image and copyright information in PMC

Comment in

Data-driven approaches to reveal the pathobiological heterogeneity in patients with traumatic brain injury.
Åkerlund C, Ercole A. Åkerlund C, et al. Intensive Care Med. 2023 Sep;49(9):1107-1109. doi: 10.1007/s00134-023-07156-y. Epub 2023 Jul 20. Intensive Care Med. 2023. PMID: 37470833 Free PMC article. No abstract available.
Response to Folweiler KA et al., Unsupervised Machine Learning Reveals Novel Traumatic Brain Injury Patient Phenotypes With Distinct Acute Injury Profiles and Long-Term Outcomes (DOI: 10.1089/neu.2019.6705).
Wang CY, Wu JC, Kuo YH. Wang CY, et al. J Neurotrauma. 2024 Jan;41(1-2):292-293. doi: 10.1089/neu.2023.0396. Epub 2023 Oct 18. J Neurotrauma. 2024. PMID: 37756375 No abstract available.
Response to Wang et al., Unsupervised machine learning reveals novel traumatic brain injury patient phenotypes with distinct acute injury profiles and long-term outcomes (doi: 10.1089/neu.2019.6705).
Hood KF, Cohen AS. Hood KF, et al. J Neurotrauma. 2024 Feb;41(3-4):539. doi: 10.1089/neu.2023.0478. Epub 2023 Dec 20. J Neurotrauma. 2024. PMID: 37776180 No abstract available.

References

1. Taylor C.A., Bell J.M., Breiding M.J., and Xu L. (2017). Traumatic brain injury–related emergency department visits, hospitalizations, and deaths — United States, 2007 and 2013. MMWR Surveill. Summ. 66, 1–16 - PMC - PubMed
1. Maas A.I.R., Steyerberg E.W., Murray G.D., Bullock R., Baethmann A., Marshall L.F., and Teasdale G.M. (1999). Why have recent trials of neuroprotective agents in head injury failed to show convincing efficacy? A pragmatic analysis and theoretical considerations. Neurosurgery 44, 1286–1298 - PubMed
1. Maas A.I.R., Roozenbeek B., and Manley G.T. (2010). Clinical trials in traumatic brain injury: past experience and current developments. Neurotherapeutics 7, 115–26 - PMC - PubMed
1. Narayan R.K., Michel M.E., Ansell B., Baethmann A., Biegon A., Bracken M.B., Bullock M.R., Choi S.C., Clifton G.L., Contant C.F., Coplin W.M., Dietrich W.D., Ghajar J., Grady S.M., Grossman R.G., Hall E.D., Heetderks W., Hovda D.A., Jallo J., Katz R.L., Knoller N., Kochanek P.M., Maas A.I., Majde J., Marion D.W., Marmarou A., Marshall L.F., McIntosh T.K., Miller E., Mohberg N., Muizelaar J.P., Pitts L.H., Quinn P., Riesenfeld G., Robertson C.S., Strauss K.I., Teasdale G., Temkin N., Tuma R., Wade C., Walker M.D., Weinrich M., Whyte J., Wilberger J., Young A.B., and Yurkewicz L. (2002). Clinical trials in head injury. J. Neurotrauma 19, 503–557 - PMC - PubMed
1. Marshall L.F. (2000). Head injury: recent past, present, and future. Neurosurgery 47, 546–61 - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Unsupervised Machine Learning Reveals Novel Traumatic Brain Injury Patient Phenotypes with Distinct Acute Injury Profiles and Long-Term Outcomes

Affiliations

Unsupervised Machine Learning Reveals Novel Traumatic Brain Injury Patient Phenotypes with Distinct Acute Injury Profiles and Long-Term Outcomes

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Comment in

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical