Ensemble machine learning prediction of posttraumatic stress disorder screening status after emergency room hospitalization

Santiago Papini¹, Derek Pisner², Jason Shumake², Mark B Powers³, Christopher G Beevers², Evan E Rainey⁴, Jasper A J Smits², Ann Marie Warren⁴

Affiliations

¹ Department of Psychology and Institute for Mental Health Research, The University of Texas at Austin, United States. Electronic address: spapini@utexas.edu.
² Department of Psychology and Institute for Mental Health Research, The University of Texas at Austin, United States.
³ Department of Psychology and Institute for Mental Health Research, The University of Texas at Austin, United States; Baylor University Medical Center, United States.
⁴ Baylor University Medical Center, United States.

PMID: 30419537
PMCID: PMC6777842
DOI: 10.1016/j.janxdis.2018.10.004

Ensemble machine learning prediction of posttraumatic stress disorder screening status after emergency room hospitalization

Santiago Papini et al. J Anxiety Disord. 2018 Dec.

. 2018 Dec:60:35-42.

doi: 10.1016/j.janxdis.2018.10.004. Epub 2018 Oct 30.

Authors

Santiago Papini¹, Derek Pisner², Jason Shumake², Mark B Powers³, Christopher G Beevers², Evan E Rainey⁴, Jasper A J Smits², Ann Marie Warren⁴

Affiliations

¹ Department of Psychology and Institute for Mental Health Research, The University of Texas at Austin, United States. Electronic address: spapini@utexas.edu.
² Department of Psychology and Institute for Mental Health Research, The University of Texas at Austin, United States.
³ Department of Psychology and Institute for Mental Health Research, The University of Texas at Austin, United States; Baylor University Medical Center, United States.
⁴ Baylor University Medical Center, United States.

PMID: 30419537
PMCID: PMC6777842
DOI: 10.1016/j.janxdis.2018.10.004

Abstract

Posttraumatic stress disorder (PTSD) develops in a substantial minority of emergency room admits. Inexpensive and accurate person-level assessment of PTSD risk after trauma exposure is a critical precursor to large-scale deployment of early interventions that may reduce individual suffering and societal costs. Toward this aim, we applied ensemble machine learning to predict PTSD screening status three months after severe injury using cost-effective and minimally invasive data. Participants (N = 271) were recruited at a Level 1 Trauma Center where they provided variables routinely collected at the hospital, including pulse, injury severity, and demographics, as well as psychological variables, including self-reported current depression, psychiatric history, and social support. Participant zip codes were used to extract contextual variables including population total and density, average annual income, and health insurance coverage rates from publicly available U.S. Census data. Machine learning yielded good prediction of PTSD screening status 3 months post-hospitalization, AUC = 0.85 95% CI [0.83, 0.86], and significantly outperformed all benchmark comparison models in a cross-validation procedure designed to yield an unbiased estimate of performance. These results demonstrate that good prediction can be attained from variables that individually have relatively weak predictive value, pointing to the promise of ensemble machine learning approaches that do not rely on strong isolated risk factors.

Keywords: Computational psychiatry; Emergency room; Machine learning; PTSD; Personalized prognosis; Precision medicine; Prevention; Trauma.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: The authors have declared that no competing interests exist.

Figures

**Figure 1.**
Average gain across cross-validated tests of the full model indicates the proportion that a feature contributed toward the ensemble’s prediction of PTSD screening status. Shading indicates the source of each feature, and the error bars show 95% CI. Hospital features were extracted from the hospital intake. Census features were extracted from publicly available census data based on participant zip codes. Psychological features were extracted from the following: patients self-reported history of mood and anxiety diagnoses, current depression was assessed with the Patient Health Questionnaire-8, physical and mental health functioning was assessed with the Veterans RAND 12-item Health survey, social support was assessed with the Social Provisions Scale, resilience was assessed with the Connor Davidson Resilience Scale, and alcohol use was assessed with the Alcohol Use Disorder Identification Test-Consumption.

**Figure 2.**
Nine features with gain > 0.05. Locally weighted smoothing (loess) curves show how the probability of PTSD+ prediction (y-axis) varies as a function of a feature (x-axis) after controlling for all other features. Note the y-axis range, which was selected to illustrate the narrow range of influence that single features have on prediction.

**Figure 3.**
A) Sensitivity as a function of 1-specificity for the full model (all features) and comparison models including logistic regression with PTSD severity at the hospital as the only predictor, and machine learning with only features routinely collected at the hospital. AUC statistics refer to area under these curves, which is significantly higher for the full model. The non-overlapping shaded areas represent improvement in prediction. The dashed line corresponds to a no-information model, which yields AUC = 0.50. B) Distribution of predicted probabilities of developing PTSD from the full model. The x-axis refers to mean test-sample prediction for each patient, and frequency bars are shaded by actual PTSD screening status at 3 months. The gray bars to the left of the 50% threshold represent the number of accurate PTSD− predictions, and the black bars to the right represent accurate PTSD+ predictions. The models yield a wide arrange of personalized predictions that are highly accurate at the extreme ends but less accurate near the 50% threshold, which should be considered for guiding treatment decisions.

See this image and copyright information in PMC

References

1. Admon R, Milad MR, & Hendler T (2013). A causal model of post-traumatic stress disorder: disentangling predisposed from acquired neural abnormalities. Trends in Cognitive Sciences, 17(7), 337–347. - PubMed
1. American Psychiatric Association. (2000). Diagnostic and statistical manual-text revision (DSM-IV-TRim, 2000).
1. American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5®). American Psychiatric Pub. Retrieved from https://books.google.com/books?hl=en&lr=&id=-JivBAAAQBAJ&oi=fnd&pg=PT18&...
1. Babyak MA (2004). What you see may not be what you get: a brief, nontechnical introduction to overfitting in regression-type models. Psychosomatic Medicine, 66(3), 411–421. - PubMed
1. Baker DG, Nievergelt CM, & O’Connor DT (2012). Biomarkers of PTSD: Neuropeptides and immune signaling. Neuropharmacology, 62(2), 663–673. 10.1016/j.neuropharm.2011.02.027 - DOI - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Ensemble machine learning prediction of posttraumatic stress disorder screening status after emergency room hospitalization

Affiliations

Ensemble machine learning prediction of posttraumatic stress disorder screening status after emergency room hospitalization

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical