Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Aug 27:21:100337.
doi: 10.1016/j.invent.2020.100337. eCollection 2020 Sep.

Applying machine learning on health record data from general practitioners to predict suicidality

Affiliations

Applying machine learning on health record data from general practitioners to predict suicidality

Kasper van Mens et al. Internet Interv. .

Abstract

Background: Suicidal behaviour is difficult to detect in the general practice. Machine learning (ML) algorithms using routinely collected data might support General Practitioners (GPs) in the detection of suicidal behaviour. In this paper, we applied machine learning techniques to support GPs recognizing suicidal behaviour in primary care patients using routinely collected general practice data.

Methods: This case-control study used data from a national representative primary care database including over 1.5 million patients (Nivel Primary Care Database). Patients with a suicide (attempt) in 2017 were selected as cases (N = 574) and an at risk control group (N = 207,308) was selected from patients with psychological vulnerability but without a suicide attempt in 2017. RandomForest was trained on a small subsample of the data (training set), and evaluated on unseen data (test set).

Results: Almost two-third (65%) of the cases visited their GP within the last 30 days before the suicide (attempt). RandomForest showed a positive predictive value (PPV) of 0.05 (0.04-0.06), with a sensitivity of 0.39 (0.32-0.47) and area under the curve (AUC) of 0.85 (0.81-0.88). Almost all controls were accurately labeled as controls (specificity = 0.98 (0.97-0.98)). Among a sample of 650 at-risk primary care patients, the algorithm would label 20 patients as high-risk. Of those, one would be an actual case and additionally, one case would be missed.

Conclusion: In this study, we applied machine learning to predict suicidal behaviour using general practice data. Our results showed that these techniques can be used as a complementary step in the identification and stratification of patients at risk of suicidal behaviour. The results are encouraging and provide a first step to use automated screening directly in clinical practice. Additional data from different social domains, such as employment and education, might improve accuracy.

Keywords: Electronic health records; General practice; Machine learning; Suicide.

PubMed Disclaimer

Conflict of interest statement

All authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Number of cases with a registration in their GP file prior to a suicide (attempt) (N = 574).

Similar articles

Cited by

References

    1. Barak-Corren Y. Predicting suicidal behavior from longitudinal electronic health records. Am. J. Psychiatr. 2017;174(2):154–162. - PubMed
    1. Belsher B.E. Prediction models for suicide attempts and deaths: a systematic review and simulation. JAMA Psychiatry. 2019;76(6):642–651. - PubMed
    1. de Beurs D.P. Trends in suicidal behaviour in Dutch general practice 1983–2013: a retrospective observational study. BMJ Open. 2016;6(5) - PMC - PubMed
    1. Breiman L. 2015. The randomForest Package.
    1. Elzinga E. Discussing suicidality with depressed patients: an observational study in Dutch sentinel general practices. BMJ Open. 2019;9(4) - PMC - PubMed

LinkOut - more resources