Utilizing Text Mining, Data Linkage and Deep Learning in Police and Health Records to Predict Future Offenses in Family and Domestic Violence
- PMID: 34713088
- PMCID: PMC8521947
- DOI: 10.3389/fdgth.2021.602683
Utilizing Text Mining, Data Linkage and Deep Learning in Police and Health Records to Predict Future Offenses in Family and Domestic Violence
Abstract
Family and Domestic violence (FDV) is a global problem with significant social, economic, and health consequences for victims including increased health care costs, mental trauma, and social stigmatization. In Australia, the estimated annual cost of FDV is $22 billion, with one woman being murdered by a current or former partner every week. Despite this, tools that can predict future FDV based on the features of the person of interest (POI) and victim are lacking. The New South Wales Police Force attends thousands of FDV events each year and records details as fixed fields (e.g., demographic information for individuals involved in the event) and as text narratives which describe abuse types, victim injuries, threats, including the mental health status for POIs and victims. This information within the narratives is mostly untapped for research and reporting purposes. After applying a text mining methodology to extract information from 492,393 FDV event narratives (abuse types, victim injuries, mental illness mentions), we linked these characteristics with the respective fixed fields and with actual mental health diagnoses obtained from the NSW Ministry of Health for the same cohort to form a comprehensive FDV dataset. These data were input into five deep learning models (MLP, LSTM, Bi-LSTM, Bi-GRU, BERT) to predict three FDV offense types ("hands-on," "hands-off," "Apprehended Domestic Violence Order (ADVO) breach"). The transformer model with BERT embeddings returned the best performance (69.00% accuracy; 66.76% ROC) for "ADVO breach" in a multilabel classification setup while the binary classification setup generated similar results. "Hands-off" offenses proved the hardest offense type to predict (60.72% accuracy; 57.86% ROC using BERT) but showed potential to improve with fine-tuning of binary classification setups. "Hands-on" offenses benefitted least from the contextual information gained through BERT embeddings in which MLP with categorical embeddings outperformed it in three out of four metrics (65.95% accuracy; 78.03% F1-score; 70.00% precision). The encouraging results indicate that future FDV offenses can be predicted using deep learning on a large corpus of police and health data. Incorporating additional data sources will likely increase the performance which can assist those working on FDV and law enforcement to improve outcomes and better manage FDV events.
Keywords: big data; data linkage; deep learning; family and domestic violence; health records; predictive analytics; text mining.
Copyright © 2021 Karystianis, Cabral, Han, Poon and Butler.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures








Similar articles
-
A Systematic Literature Review of the Use of Computational Text Analysis Methods in Intimate Partner Violence Research.J Fam Violence. 2023 Mar 21:1-20. doi: 10.1007/s10896-023-00517-7. Online ahead of print. J Fam Violence. 2023. PMID: 37358974 Free PMC article. Review.
-
Mental Illness Concordance Between Hospital Clinical Records and Mentions in Domestic Violence Police Narratives: Data Linkage Study.JMIR Form Res. 2022 Oct 20;6(10):e39373. doi: 10.2196/39373. JMIR Form Res. 2022. PMID: 36264613 Free PMC article.
-
Surveillance of Domestic Violence Using Text Mining Outputs From Australian Police Records.Front Psychiatry. 2022 Feb 9;12:787792. doi: 10.3389/fpsyt.2021.787792. eCollection 2021. Front Psychiatry. 2022. PMID: 35222105 Free PMC article.
-
Automated Analysis of Domestic Violence Police Reports to Explore Abuse Types and Victim Injuries: Text Mining Study.J Med Internet Res. 2019 Mar 12;21(3):e13067. doi: 10.2196/13067. J Med Internet Res. 2019. PMID: 30860490 Free PMC article.
-
[Domestic violence: any progress?].Bull Acad Natl Med. 2014 Apr-May;198(4-5):893-903. Bull Acad Natl Med. 2014. PMID: 26753414 Review. French.
Cited by
-
A Systematic Literature Review of the Use of Computational Text Analysis Methods in Intimate Partner Violence Research.J Fam Violence. 2023 Mar 21:1-20. doi: 10.1007/s10896-023-00517-7. Online ahead of print. J Fam Violence. 2023. PMID: 37358974 Free PMC article. Review.
-
Mental Illness Concordance Between Hospital Clinical Records and Mentions in Domestic Violence Police Narratives: Data Linkage Study.JMIR Form Res. 2022 Oct 20;6(10):e39373. doi: 10.2196/39373. JMIR Form Res. 2022. PMID: 36264613 Free PMC article.
References
-
- World Health Organisation . Violence Against Women. (2017). Available online at: https://www.who.int/news-room/fact-sheets/detail/violence-against-women.
-
- VicHealth . The Health Costs of Violence. Measuring the Burden of Diseases Caused by Intimate Partner Violence. Melbourne: (2005).
-
- Australian Institute of Health and Welfare . Family, Domestic and Sexual Violence in Australia. (2018). Available online at: https://www.aihw.gov.au/reports/domestic-violence/family-domestic-sexual....
-
- Campo M. Children's exposure to domestic and family violence: key issues and responses. J Home Econ Inst Aust. (2015) 22:33.
LinkOut - more resources
Full Text Sources