Developing Machine Learning Models for Behavioral Coding
- PMID: 30698755
- PMCID: PMC6415657
- DOI: 10.1093/jpepsy/jsy113
Developing Machine Learning Models for Behavioral Coding
Abstract
Objective: The goal of this research is to develop a machine learning supervised classification model to automatically code clinical encounter transcripts using a behavioral code scheme.
Methods: We first evaluated the efficacy of eight state-of-the-art machine learning classification models to recognize patient-provider communication behaviors operationalized by the motivational interviewing framework. Data were collected during the course of a single weight loss intervention session with 37 African American adolescents and their caregivers. We then tested the transferability of the model to a novel treatment context, 80 patient-provider interactions during routine human immunodeficiency virus (HIV) clinic visits.
Results: Of the eight models tested, the support vector machine model demonstrated the best performance, achieving a .680 F1-score (a function of model precision and recall) in adolescent and .639 in caregiver sessions. Adding semantic and contextual features improved accuracy with 75.1% of utterances in adolescent and 73.8% in caregiver sessions correctly coded. With no modification, the model correctly classified 72.0% of patient-provider utterances in HIV clinical encounters with reliability comparable to human coders (k = .639).
Conclusions: The development of a validated approach for automatic behavioral coding offers an efficient alternative to traditional, resource-intensive methods with the potential to dramatically accelerate the pace of outcomes-oriented behavioral research. The knowledge gained from computer-driven behavioral research can inform clinical practice by providing clinicians with empirically supported communication strategies to tailor their conversations with patients. Lastly, automatic behavioral coding is a critical first step toward fully automated eHealth/mHealth (electronic/mobile Health) behavioral interventions.
Keywords: machine learning; motivational interviewing; qualitative research.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society of Pediatric Psychology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
References
-
- Adamou M., Antoniou G., Greasidou E., Lagani V., Charonyktakis P., Tsamardinos I. (2018). Mining free-text medical notes for suicide risk assessment. Paper presented at the Proceedings of the 10th Hellenic Conference on Artificial Intelligence, Patras, Greece.
-
- Armstrong R., Symons M., Scott J., Arnott W., Copland D., McMahon K., Whitehouse A. (2018). Predicting language difficulties in middle childhood from early developmental milestones: A comparison of traditional regression and machine learning techniques. Journal of Speech, Language, and Hearing Research: JSLHR, 61, 1926–1944. - PubMed
-
- Bakeman R., Quera V. (1997). Observing interaction: An introduction to sequential analysis (2nd edn). New York, NY: Cambridge University Press.
-
- Bakeman R., Quera V. (2011). Sequential analysis and observational methods for the behavioral sciences. New York, NY: Cambridge University Press.