Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug 23:7:1130.
doi: 10.3389/fpsyg.2016.01130. eCollection 2016.

Automated Video Analysis of Non-verbal Communication in a Medical Setting

Affiliations

Automated Video Analysis of Non-verbal Communication in a Medical Setting

Yuval Hart et al. Front Psychol. .

Abstract

Non-verbal communication plays a significant role in establishing good rapport between physicians and patients and may influence aspects of patient health outcomes. It is therefore important to analyze non-verbal communication in medical settings. Current approaches to measure non-verbal interactions in medicine employ coding by human raters. Such tools are labor intensive and hence limit the scale of possible studies. Here, we present an automated video analysis tool for non-verbal interactions in a medical setting. We test the tool using videos of subjects that interact with an actor portraying a doctor. The actor interviews the subjects performing one of two scripted scenarios of interviewing the subjects: in one scenario the actor showed minimal engagement with the subject. The second scenario included active listening by the doctor and attentiveness to the subject. We analyze the cross correlation in total kinetic energy of the two people in the dyad, and also characterize the frequency spectrum of their motion. We find large differences in interpersonal motion synchrony and entrainment between the two performance scenarios. The active listening scenario shows more synchrony and more symmetric followership than the other scenario. Moreover, the active listening scenario shows more high-frequency motion termed jitter that has been recently suggested to be a marker of followership. The present approach may be useful for analyzing physician-patient interactions in terms of synchrony and dominance in a range of medical settings.

Keywords: doctor-patient interactions; entrainment; non-verbal communication; performance; synchronization; video analysis.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Examples of performance A and B in the dyadic actor-subject interaction. In performance A, the actor mainly types, and asks a few closed questions. In performance B, the actor actively listens to the subject using open questions and reflections, and explains the mechanism and effect of the drug.
Figure 2
Figure 2
Cross-correlation of subject and actor motion kinetic energy shows higher synchrony and symmetric followership in performance B (blue) compared with performance A (red). Height at zero delay means synchrony of motion, height at positive delay means subject's entrainment by the actor, and height at negative delay means actor's entrainment by the subject. x-axis: time delay [sec], y-axis: Normalized cross-correlation. Inset, two examples of specific dyadic cross-correlation of performance A (red) and performance B (blue).
Figure 3
Figure 3
Synchrony and entrainment of dyadic interaction differentiates between performance A and B in the video analysis. Performance B (blue circles) has higher synchrony values and more equal entrainment between the actor and subject compared with performance A (red circles). A logistic regression classifier separates the two performances with a 72% accuracy (black dashed line). The 70% probability function lines for performance B (green dashed line) and for performance A (purple dashed line) are shown. The classifier probability function can be described as: P(1) = 1/(1 + aebx+cy). The parameters of the logistic regression classifier are: a = 5 ± 1, b = −23 ± 5, c = 0.6 ± 0.2, mean ± std, as determined by bootstrapping with 1000 repeats.
Figure 4
Figure 4
Jitter of subjects and actor differs between performance A and performance B. The jitter motion (Fourier Power at frequencies 1.5–5 Hz) of both subjects (A) and actor (B) is higher at performance B scenario, suggesting more followership motion (Noy et al., 2011) of both interlocutors. *p < 0.05, ***p < 0.001. Subjects: Mann–Whitney test, p < 0.03, rank biserial correlation, r = 0.39. Actor: Mann–Whitney test, p < 0.001, rank biserial correlation, r = 0.8.

Similar articles

Cited by

References

    1. Alexiadis D. S., Kelly P., Daras P., O'Connor N. E., Boubekeur T., Moussa M. B. (2011). Evaluating a dancer's performance using kinect-based skeleton tracking, in Proceedings of the 19th ACM International Conference on Multimedia MM '11 (New York, NY: ACM; ), 659–662.
    1. Aruguete M. S., Roberts C. A. (2002). Participants' ratings of male physicians who vary in race and communication style. Psychol. Rep. 91, 793–806. 10.2466/pr0.2002.91.3.793 - DOI - PubMed
    1. Baaren R. B., van Holland R. W., Kawakami K., van Knippenberg A. (2004). Mimicry and prosocial behavior. Psychol. Sci. 15, 71–74. 10.1111/j.0963-7214.2004.01501012.x - DOI - PubMed
    1. Bensing J. M., Kerssens J. J., van der Pasch M. (1995). Patient-directed gaze as a tool for discovering and handling psychosocial problems in general practice. J. Nonverbal Behav. 19, 223–242. 10.1007/BF02173082 - DOI
    1. Bensing J. M., Verheul W. (2010). The silent healer: the role of communication in placebo effects. Patient Educ. Couns. 80, 293–299. 10.1016/j.pec.2010.05.033 - DOI - PubMed