Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Aug;114(2):1048-54.
doi: 10.1121/1.1592161.

Audio-vocal responses to repetitive pitch-shift stimulation during a sustained vocalization: improvements in methodology for the pitch-shifting technique

Affiliations

Audio-vocal responses to repetitive pitch-shift stimulation during a sustained vocalization: improvements in methodology for the pitch-shifting technique

Jay J Bauer et al. J Acoust Soc Am. 2003 Aug.

Abstract

The pitch-shift reflex is a sophisticated system that produces a "compensatory" response in voice F0 that is opposite in direction to a change in voice pitch feedback (pitch-shift stimulus), thus correcting for the discrepancy between the intended voice F0 and the feedback pitch. In order to more fully exploit the pitch-shift reflex as a tool for studying the influence of sensory feedback mechanisms underlying voice control, the optimal characteristics of the pitch-shift stimulus must be understood. The present study was undertaken to assess the effects of altering the duration of the interstimulus interval (ISI) and the number of trials comprising an average on measures of the pitch-shift reflex. Pitch-shift stimuli were presented to vocalizing subjects with ISI of 5.0, 2.5, 1.0, and 0.5 s to determine if an increase in ISI altered response properties. With each ISI, measures of event-related averages of the first 10, 15, 20, or 30 pitch-shift reflex responses were compared to see if increases in the number of responses comprising an event-related average altered response properties. Measures of response latency, peak time, magnitude, and prevalence were obtained for all ISI and average conditions. While quantitative measures were similar across ISI and averaging conditions, we observed more instances of "non-responses" with averages of ten trials as well as at an ISI of 0.5 s. These findings suggest an ISI of 1.0 s and an average consisting of at least 15 trials produce optimal results. Future studies using these stimulus parameters may produce more reliable data due to the fivefold decrease in subject participation time and a concomitant decrease in fatigue, boredom, and inattention.

PubMed Disclaimer

Figures

FIG. 1
FIG. 1
Event-related averages of voice F0 for a representative subject tested with ISI of 2.5, 1.0, and 0.5 s. Under each rate, averages of 10, 15, 20, and 30 responses were calculated (vertically stacked traces). Black lines represent the overall average response. Gray bars represent standard error of the mean for all responses comprising an average on a point-by-point basis. Shaded boxes represent pitch-shift stimulus onset and duration (0.1 s) beginning at time 0 s. Vertical line scales=20 cents.
FIG. 2
FIG. 2
(a) Median box-plots of response latency (s) by ISI and number of averages (N=31). (b) Median box-plots of responses peak time (s). (c) Median box-plots of response magnitude (cents). Box-plot definitions: The horizontal line through a box is the median. The shaded region surrounding the median is the 95% confidence interval. The upper and lower limits of the box represent the 75th and 25th percentiles, respectively. Whiskers extend to upper and lower limits of the main body of data. Points depicted by a circle are considered to be extreme data values, while very extreme values are plotted as asterisks.
FIG. 3
FIG. 3
Event-related averages of voice F0 for a representative subject tested with ISI of 5.0, 2.5, 1.0, and 0.5 s. Under each rate, averages of 10, 15, 20, and 30 responses were calculated (vertically stacked traces). Black lines represent the overall average response. Gray bars represent standard error of the mean for all responses comprising an average on a point-by-point basis. Shaded boxes represent pitch-shift stimulus onset and duration (0.1 s) beginning at time 0 s. Vertical line scales=20 cents.
FIG. 4
FIG. 4
Event-related average for a representative subject displaying an overlap of successive pitch-shift reflex responses due to the short ISI at 0.5 s. Black lines represent the overall average voice F0 and arrows denote response onset. Duration of the post-stimulus averaging window was extended to 1.0 s. Shaded boxes represent pitch-shift stimulus onset and duration (0.1 s) beginning at time 0.0 and 0.6 s.

References

    1. Abbas PJ, Brown CJ. Electrically evoked auditory brainstem response: Refractory properties and strength-duration functions. Hear Res. 1991;51:139–147. - PubMed
    1. Abbs JH, Gracco VL. Control of complex motor gestures: Orofacial muscle responses to load perturbations of lip during speech. J Neurophysiol. 1984;51:705–723. - PubMed
    1. Barlow SM, Bradford PT. Comparison of perioral reflex modulation in the upper and lower lip. J Speech Hear Res. 1996;39:55–75. - PubMed
    1. Burnett TA, Larson CR. Early pitch shift response is active in both steady and dynamic voice pitch control. J Acoust Soc Am. 2002;112:1058–1063. - PubMed
    1. Burnett TA, Freedland MB, Larson CR, Hain TC. Voice F0 responses to manipulations in pitch feedback. J Acoust Soc Am. 1998;103:3153–3161. - PubMed

Publication types