Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr:138:106090.
doi: 10.1016/j.chiabu.2023.106090. Epub 2023 Feb 8.

Using natural language processing to identify child maltreatment in health systems

Affiliations

Using natural language processing to identify child maltreatment in health systems

Sonya Negriff et al. Child Abuse Negl. 2023 Apr.

Abstract

Background: Rates of child maltreatment (CM) obtained from electronic health records are much lower than national child welfare prevalence rates indicate. There is a need to understand how CM is documented to improve reporting and surveillance.

Objectives: To examine whether using natural language processing (NLP) in outpatient chart notes can identify cases of CM not documented by ICD diagnosis code, the overlap between the coding of child maltreatment by ICD and NLP, and any differences by age, gender, or race/ethnicity.

Methods: Outpatient chart notes of children age 0-18 years old within Kaiser Permanente Washington (KPWA) 2018-2020 were used to examine a selected set of maltreatment-related terms categorized into concept unique identifiers (CUI). Manual review of text snippets for each CUI was completed to flag for validated cases and retrain the NLP algorithm.

Results: The NLP results indicated a crude rate of 1.55 % to 2.36 % (2018-2020) of notes with reference to CM. The rate of CM identified by ICD code was 3.32 per 1000 children, whereas the rate identified by NLP was 37.38 per 1000 children. The groups that increased the most in identification of maltreatment from ICD to NLP were adolescents (13-18 yrs. old), females, Native American children, and those on Medicaid. Of note, all subgroups had substantially higher rates of maltreatment when using NLP.

Conclusions: Use of NLP substantially increased the estimated number of children who have been impacted by CM. Accurately capturing this population will improve identification of vulnerable youth at high risk for mental health symptoms.

Keywords: Child abuse; Child maltreatment; Electronic health records; Natural language processing.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors have no conflicts of interest relevant to this article to disclose.

References

    1. Annapragada AV, Donaruma-Kwoh MM, Annapragada AV, & Starosolski ZA (2021). A natural language processing and deep learning approach to identify child abuse from pediatric electronic medical records. PLoS One, 16(2), e0247404. - PMC - PubMed
    1. Brown DW, Anda RF, Felitti VJ, Edwards VJ, Malarcher AM, Croft JB, & Giles WH (2010). Adverse childhood experiences are associated with the risk of lung cancer: a prospective cohort study. BMC Public Health, 20(10), 1–12. - PMC - PubMed
    1. Cohen S, Doyle WJ, Turner RB, Alper CM, & Skoner DP (2004). Childhood socioeconomic status and host resistance to infectious illness in adulthood. Psychosomatic Medicine, 66(4), 553–558. doi:10.1097/01.psy.0000126200.05189.d3 - DOI - PubMed
    1. Danese A, & Tan M (2014). Childhood maltreatment and obesity: systematic review and meta-analysis. Molecular Psychiatry, 19(5), 544–554. doi:10.1038/mp.2013.54 - DOI - PubMed
    1. Danese A, & Widom CS (2020). Objective and subjective experiences of child maltreatment and their relationships with psychopathology. Nature Human Behavior, 4(8), 811–818. - PubMed

Publication types