Developing a practical suicide risk prediction model for targeting high-risk patients in the Veterans health Administration
- PMID: 28675617
- PMCID: PMC5614864
- DOI: 10.1002/mpr.1575
Developing a practical suicide risk prediction model for targeting high-risk patients in the Veterans health Administration
Abstract
Objectives: The US Veterans Health Administration (VHA) has begun using predictive modeling to identify Veterans at high suicide risk to target care. Initial analyses are reported here.
Methods: A penalized logistic regression model was compared with an earlier proof-of-concept logistic model. Exploratory analyses then considered commonly-used machine learning algorithms. Analyses were based on electronic medical records for all 6,360 individuals classified in the National Death Index as having died by suicide in fiscal years 2009-2011 who used VHA services the year of their death or prior year and a 1% probability sample of time-matched VHA service users alive at the index date (n = 2,112,008).
Results: A penalized logistic model with 61 predictors had sensitivity comparable to the proof-of-concept model (which had 381 predictors) at target thresholds. The machine learning algorithms had relatively similar sensitivities, the highest being for Bayesian additive regression trees, with 10.7% of suicides occurred among the 1.0% of Veterans with highest predicted risk and 28.1% among the 5.0% of with highest predicted risk.
Conclusions: Based on these results, VHA is using penalized logistic regression in initial intervention implementation. The paper concludes with a discussion of other practical issues that might be explored to increase model performance.
Keywords: assessment/diagnosis; clinical decision support; epidemiology; machine learning; predictive modeling; suicide/self harm.
Copyright © 2017 John Wiley & Sons, Ltd.
References
-
- Alam, M. G. R. , Cho, E. J. , Huh, E. N. , & Hong, C. S. (2014). Cloud based mental state monitoring system for suicide risk reconnaissance using wearable bio‐sensors In Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication (Article No. 56). Siem Reap, Cambodia: ACM.
-
- Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. 10.1023/A:1010933404324 - DOI
-
- Centers for Disease Control and Prevention & Department of Health and Human Services . (2015). National Death Index. https://www.healthdata.gov/dataset/national-death-index [1 August 2016]
-
- Chawla, N. V. (2010). Data mining for imbalanced datasets: An overview In Maimon O., & Rokach L. (Eds.), Data Mining and Knowledge Discovery Handbook (2nd ed.) (pp. 875–886). Berlin/Heidelberg: Springer.
-
- Chipman, H. A. , George, E. I. , & McCulloch, R. E. (2010). BART: Bayesian additive regression trees. The Annals of Applied Statistics, 4(1), 266–298. 10.1214/09-AOAS285 - DOI
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
