Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 13;7(6):471-477.
doi: 10.1136/bmjstel-2020-000727. eCollection 2021.

Acoustic analysis of surgeons' voices to assess change in the stress response during surgical in situ simulation

Affiliations

Acoustic analysis of surgeons' voices to assess change in the stress response during surgical in situ simulation

Andrew Hall et al. BMJ Simul Technol Enhanc Learn. .

Abstract

Introduction: Stress may serve as an adjunct (challenge) or hindrance (threat) to the learning process. Determining the effect of an individual's response to situational demands in either a real or simulated situation may enable optimisation of the learning environment. Studies of acoustic analysis suggest that mean fundamental frequency and formant frequencies of voice vary with an individual's response during stressful events. This hypothesis is reviewed within the otolaryngology (ORL) simulation environment to assess whether acoustic analysis could be used as a tool to determine participants' stress response and cognitive load in medical simulation. Such an assessment could lead to optimisation of the learning environment.

Methodology: ORL simulation scenarios were performed to teach the participants teamwork and refine clinical skills. Each was performed in an actual operating room (OR) environment (in situ) with a multidisciplinary team consisting of ORL surgeons, OR nurses and anaesthesiologists. Ten of the scenarios were led by an ORL attending and ten were led by an ORL fellow. The vocal communication of each of the 20 individual leaders was analysed using a long-term pitch analysis PRAAT software (autocorrelation method) to obtain mean fundamental frequency (F0) and first four formant frequencies (F1, F2, F3 and F4). In reviewing individual scenarios, each leader's voice was analysed during a non-stressful environment (WHO sign-out procedure) and compared with their voice during a stressful portion of the scenario (responding to deteriorating oxygen saturations in the manikin).

Results: The mean unstressed F0 for the male voice was 161.4 Hz and for the female voice was 217.9 Hz. The mean fundamental frequency of speech in the ORL fellow (lead surgeon) group increased by 34.5 Hz between the scenario's baseline and stressful portions. This was significantly different to the mean change of -0.5 Hz noted in the attending group (p=0.01). No changes were seen in F1, F2, F3 or F4.

Conclusions: This study demonstrates a method of acoustic analysis of the voices of participants taking part in medical simulations. It suggests acoustic analysis of participants may offer a simple, non-invasive, non-intrusive adjunct in evaluating and titrating the stress response during simulation.

Keywords: In Situ; cognitive load; communication; education; medical; otolaryngology.

PubMed Disclaimer

Conflict of interest statement

Competing interests: None declared.

Figures

Figure 1
Figure 1
PRAAT software interface demonstrating (A) voice waveform displaying changes in amplitude and (B) ‘heat map’ spectrogram visually representing the overall spectrum of frequencies. Fundamental frequency (F0) shown (*) calculated over 10.45 s (total duration).
Figure 2
Figure 2
Box-and-whisker diagram showing acoustic analysis of non-stress voice samples showing a baseline difference in fundamental frequency between the female and male voice.
Figure 3
Figure 3
Box-and-whisker diagram demonstrating the difference in change of fundamental frequency of voice between non-stressed and stressed scenario by attendings and fellows (p=0.0106).
Figure 4
Figure 4
Graphical depiction of change of fundamental frequency of voice from the non-stressed and stressed scenario shown by attending group (A) and fellows group (B).
Figure 5
Figure 5
Yerkes-Dodson curve. Initially, as the amount of arousal increases, the individual will show the physiological signs of a challenge response with improved performance. As the arousal continues to increase, the physiological response changes to a threat response (better known as stress) and performance declines. Adapted from Teigen and Dodson.

References

    1. Yerkes RM, Dodson JD. The relation of strength of stimulus to rapidity of habit-formation. J Comp Neurol Psychol 1908;18:459–82. 10.1002/cne.920180503 - DOI
    1. Teigen KH, Dodson Y. ‘A law for all seasons’. Theory & Psychology 1994;4:525–47.
    1. Saltsman TL, Seery MD, Ward DE, et al. . Is satisficing really satisfying? Satisficers exhibit greater threat than maximizers during choice overload. Psychophysiology 2021;58:e13705. 10.1111/psyp.13705 - DOI - PubMed
    1. Jones KI, Amawi F, Bhalla A, et al. . Assessing surgeon stress when operating using heart rate variability and the state trait anxiety inventory: will surgery be the death of us? Colorectal Dis 2015;17:335–41. 10.1111/codi.12844 - DOI - PubMed
    1. Wetzel CM, Kneebone RL, Woloshynowych M, et al. . The effects of stress on surgical performance. The American Journal of Surgery 2006;191:5–10. 10.1016/j.amjsurg.2005.08.034 - DOI - PubMed

LinkOut - more resources