Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec:60:35-42.
doi: 10.1016/j.janxdis.2018.10.004. Epub 2018 Oct 30.

Ensemble machine learning prediction of posttraumatic stress disorder screening status after emergency room hospitalization

Affiliations

Ensemble machine learning prediction of posttraumatic stress disorder screening status after emergency room hospitalization

Santiago Papini et al. J Anxiety Disord. 2018 Dec.

Abstract

Posttraumatic stress disorder (PTSD) develops in a substantial minority of emergency room admits. Inexpensive and accurate person-level assessment of PTSD risk after trauma exposure is a critical precursor to large-scale deployment of early interventions that may reduce individual suffering and societal costs. Toward this aim, we applied ensemble machine learning to predict PTSD screening status three months after severe injury using cost-effective and minimally invasive data. Participants (N = 271) were recruited at a Level 1 Trauma Center where they provided variables routinely collected at the hospital, including pulse, injury severity, and demographics, as well as psychological variables, including self-reported current depression, psychiatric history, and social support. Participant zip codes were used to extract contextual variables including population total and density, average annual income, and health insurance coverage rates from publicly available U.S. Census data. Machine learning yielded good prediction of PTSD screening status 3 months post-hospitalization, AUC = 0.85 95% CI [0.83, 0.86], and significantly outperformed all benchmark comparison models in a cross-validation procedure designed to yield an unbiased estimate of performance. These results demonstrate that good prediction can be attained from variables that individually have relatively weak predictive value, pointing to the promise of ensemble machine learning approaches that do not rely on strong isolated risk factors.

Keywords: Computational psychiatry; Emergency room; Machine learning; PTSD; Personalized prognosis; Precision medicine; Prevention; Trauma.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: The authors have declared that no competing interests exist.

Figures

Figure 1.
Figure 1.
Average gain across cross-validated tests of the full model indicates the proportion that a feature contributed toward the ensemble’s prediction of PTSD screening status. Shading indicates the source of each feature, and the error bars show 95% CI. Hospital features were extracted from the hospital intake. Census features were extracted from publicly available census data based on participant zip codes. Psychological features were extracted from the following: patients self-reported history of mood and anxiety diagnoses, current depression was assessed with the Patient Health Questionnaire-8, physical and mental health functioning was assessed with the Veterans RAND 12-item Health survey, social support was assessed with the Social Provisions Scale, resilience was assessed with the Connor Davidson Resilience Scale, and alcohol use was assessed with the Alcohol Use Disorder Identification Test-Consumption.
Figure 2.
Figure 2.
Nine features with gain > 0.05. Locally weighted smoothing (loess) curves show how the probability of PTSD+ prediction (y-axis) varies as a function of a feature (x-axis) after controlling for all other features. Note the y-axis range, which was selected to illustrate the narrow range of influence that single features have on prediction.
Figure 3.
Figure 3.
A) Sensitivity as a function of 1-specificity for the full model (all features) and comparison models including logistic regression with PTSD severity at the hospital as the only predictor, and machine learning with only features routinely collected at the hospital. AUC statistics refer to area under these curves, which is significantly higher for the full model. The non-overlapping shaded areas represent improvement in prediction. The dashed line corresponds to a no-information model, which yields AUC = 0.50. B) Distribution of predicted probabilities of developing PTSD from the full model. The x-axis refers to mean test-sample prediction for each patient, and frequency bars are shaded by actual PTSD screening status at 3 months. The gray bars to the left of the 50% threshold represent the number of accurate PTSD− predictions, and the black bars to the right represent accurate PTSD+ predictions. The models yield a wide arrange of personalized predictions that are highly accurate at the extreme ends but less accurate near the 50% threshold, which should be considered for guiding treatment decisions.

References

    1. Admon R, Milad MR, & Hendler T (2013). A causal model of post-traumatic stress disorder: disentangling predisposed from acquired neural abnormalities. Trends in Cognitive Sciences, 17(7), 337–347. - PubMed
    1. American Psychiatric Association. (2000). Diagnostic and statistical manual-text revision (DSM-IV-TRim, 2000).
    1. American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5®). American Psychiatric Pub. Retrieved from https://books.google.com/books?hl=en&lr=&id=-JivBAAAQBAJ&oi=fnd&pg=PT18&...
    1. Babyak MA (2004). What you see may not be what you get: a brief, nontechnical introduction to overfitting in regression-type models. Psychosomatic Medicine, 66(3), 411–421. - PubMed
    1. Baker DG, Nievergelt CM, & O’Connor DT (2012). Biomarkers of PTSD: Neuropeptides and immune signaling. Neuropharmacology, 62(2), 663–673. 10.1016/j.neuropharm.2011.02.027 - DOI - PubMed

Publication types