Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Sep 20:2024.04.29.24306260.
doi: 10.1101/2024.04.29.24306260.

Importance of variables from different time frames for predicting self-harm using health system data

Affiliations

Importance of variables from different time frames for predicting self-harm using health system data

Charles J Wolock et al. medRxiv. .

Update in

Abstract

Objective: Self-harm risk prediction models developed using health system data (electronic health records and insurance claims information) often use patient information from up to several years prior to the index visit when the prediction is made. Measurements from some time periods may not be available for all patients. Using the framework of algorithm-agnostic variable importance, we study the predictive potential of variables corresponding to different time horizons prior to the index visit and demonstrate the application of variable importance techniques in the biomedical informatics setting.

Materials and methods: We use variable importance to quantify the potential of recent (up to three months before the index visit) and distant (more than one year before the index visit) patient mental health information for predicting self-harm risk using data from seven health systems. We quantify importance as the decrease in predictiveness when the variable set of interest is excluded from the prediction task. We define predictiveness using discriminative metrics: area under the receiver operating characteristic curve (AUC), sensitivity, and positive predictive value.

Results: Mental health predictors corresponding to the three months prior to the index visit show strong signal of importance; in one setting, excluding these variables decreased AUC from 0.85 to 0.77. Predictors corresponding to more distant information were less important.

Discussion: Predictors from the months immediately preceding the index visit are highly important. Implementation of self-harm prediction models may be challenging in settings where recent data are not completely available (e.g., due to lags in insurance claims processing) at the time a prediction is made.

Conclusion: Clinically derived variables from different time frames exhibit varying levels of importance for predicting self-harm. Variable importance analyses can inform whether and how to implement risk prediction models into clinical practice given real-world data limitations. These analyses be applied more broadly in biomedical informatics research to provide insight into general clinical risk prediction tasks.

Keywords: clinical prediction models; feature importance; insurance claims data; predictive analytics; suicide.

PubMed Disclaimer

Conflict of interest statement

CONFLICTS OF INTEREST K.J.C. has worked on grants awarded to Kaiser Permanente Southern California by Janssen Pharmaceuticals. S.M.S. has worked on grants awarded to Kaiser Permanente Washington Health Research Institute (KPWHRI) by Bristol Meyers Squibb and by Pfizer. She was also a co-investigator on grants awarded to KPWHRI from Syneos Health, who represented a consortium of pharmaceutical companies carrying out FDA-mandated studies regarding the safety of extended-release opioids. The other authors report there are no competing interests to declare.

Figures

Figure 1:
Figure 1:
Schematic of temporal predictor groups in the variable importance analysis. (a) Predictors were categorized into four groups: base predictors, including demographics and comorbidities, that were included in all prediction models (gray), and mental health–specific predictors covering the 0–3 months (dark blue), 4–12 months (medium blue), and 13–60 months (light blue) prior to the prediction instance (vertical black line). The outcome window spanned 90 days from the prediction instance (date of the visit). Note that the timeline is not drawn to scale. (b) We made four comparisons to assess variable importance. In each case, the larger model used base predictors plus some subset of temporal predictors. The reduced model was constructed by removing a temporal predictor group.
Figure 2:
Figure 2:
Estimated variable importance for temporal predictor groups in terms of AUC. Note the different x-axis scales for each outcome-setting pair, which are based on the estimated maximum possible variable importance (see Supplementary Material for details).
Figure 3:
Figure 3:
Estimated variable importance for temporal predictor groups in terms of sensitivity at the 95th percentile of risk scores. Note the different x-axis scales for each outcome-setting pair, which are based on the estimated maximum possible variable importance (see Supplementary Material for details).
Figure 4:
Figure 4:
Estimated variable importance for temporal predictor groups in terms of PPV at the 95th percentile of risk scores. Note the different x-axis scales for each outcome-setting pair, which are based on the estimated maximum possible variable importance (see Supplementary Material for details).

References

    1. National Institute of Mental Health. Suicide. https://www.nimh.nih.gov/health/statistics/suicide#:~:text=The%20total%2..., 2023. Accessed November 10, 2023.
    1. Centers for Disease Control and Prevention. Facts about suicide. https://www.cdc.gov/suicide/facts/index.html#:~:text=Suicide%20was%20res...., 2023. Accessed March 25, 2024.
    1. Kroenke Kurt, Spitzer Robert L, and Williams Janet BW. The PHQ-9: validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9):606–613, 2001. - PMC - PubMed
    1. Jacobs Douglas G, Baldessarini Ross J, Conwell Yeates, Fawcett Jan A, Horton Leslie, Meltzer Herbert, Pfeffer Cynthia R, and Simon Robert I. Assessment and treatment of patients with suicidal behaviors. APA Practice Guidelines, 1:183, 2010.
    1. Barak-Corren Yuval, Castro Victor M, Javitt Solomon, Hoffnagle Alison G, Dai Yael, Perlis Roy H, Nock Matthew K, Smoller Jordan W, and Reis Ben Y. Predicting suicidal behavior from longitudinal electronic health records. American Journal of Psychiatry, 174(2):154–162, 2017. - PubMed

Publication types

LinkOut - more resources