Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov/Dec;40(6):1404-1417.
doi: 10.1097/AUD.0000000000000718.

Interactions Between Item Set and Vocoding in Serial Recall

Interactions Between Item Set and Vocoding in Serial Recall

Adam K Bosen et al. Ear Hear. 2019 Nov/Dec.

Abstract

Objectives: Serial recall of digits is frequently used to measure short-term memory span in various listening conditions. However, the use of digits may mask the effect of low quality auditory input. Digits have high frequency and are phonologically distinct relative to one another, so they should be easy to identify even with low quality auditory input. In contrast, larger item sets reduce listener ability to strategically constrain their expectations, which should reduce identification accuracy and increase the time and/or cognitive resources needed for identification when auditory quality is low. This diminished accuracy and increased cognitive load should interfere with memory for sequences of items drawn from large sets. The goal of this work was to determine whether this predicted interaction between auditory quality and stimulus set in short-term memory exists, and if so, whether this interaction is associated with processing speed, vocabulary, or attention.

Design: We compared immediate serial recall within young adults with normal hearing across unprocessed and vocoded listening conditions for multiple stimulus sets. Stimulus sets were lists of digits (1 to 9), consonant-vowel-consonant (CVC) words (chosen from a list of 60 words), and CVC nonwords (chosen from a list of 50 nonwords). Stimuli were unprocessed or vocoded with an eight-channel noise vocoder. To support interpretation of responses, words and nonwords were selected to minimize inclusion of multiple phonemes from within a confusion cluster. We also measured receptive vocabulary (Peabody Picture Vocabulary Test [PPVT-4]), sustained attention (test of variables of attention [TOVA]), and repetition speed for individual items from each stimulus set under both listening conditions.

Results: Vocoding stimuli had no impact on serial recall of digits, but reduced memory span for words and nonwords. This reduction in memory span was attributed to an increase in phonological confusions for nonwords. However, memory span for vocoded word lists remained reduced even after accounting for common phonetic confusions, indicating that lexical status played an additional role across listening conditions. Principal components analysis found two components that explained 84% of the variance in memory span across conditions. Component one had similar load across all conditions, indicating that participants had an underlying memory capacity, which was common to all conditions. Component two was loaded by performance in the vocoded word and nonword conditions, representing the sensitivity of memory span to vocoding of these stimuli. The order in which participants completed listening conditions had a small effect on memory span that could not account for the effect of listening condition. Repetition speed was fastest for digits, slower for words, and slowest for nonwords. On average, vocoding slowed repetition speed for all stimuli, but repetition speed was not predictive of individual memory span. Vocabulary and attention showed no correlation with memory span.

Conclusions: Our results replicated previous findings that low quality auditory input can impair short-term memory, and demonstrated that this impairment is sensitive to stimulus set. Using multiple stimulus sets in degraded listening conditions can isolate memory capacity (in digit span) from impaired item identification (in word and nonword span), which may help characterize the relationship between memory and speech recognition in difficult listening conditions.

PubMed Disclaimer

Figures

Fig 1.
Fig 1.
Group list recall accuracy as a function of list length. Points represent proportion of lists correct across all listeners for each list length, with unprocessed lists represented by solid lines and filled points and vocoded lists represented by broken lines and hollow points. Shaded regions represent 95% confidence intervals, calculated with the Clopper-Pearson method.
Fig 2.
Fig 2.
Individual variability in span (50% correct point) across participants. Filled circles indicate individual performance in the unprocessed condition, and hollow circles indicate performance in the vocoded condition. Horizontal lines indicate group medians.
Fig 3.
Fig 3.
Group item recall accuracy as a function of serial position. Points represent the proportion of items correctly recalled at each position within a list, with each line representing a different list length. For words and non-words, solid lines indicate performance when common phonetic confusions (see Table IV) were labeled incorrect, and dashed lines indicate performance when the same confusions were labeled correct.
Fig 4.
Fig 4.
Phonological edit distance between target items and responses for words and non-words. Color bars indicate edit distance probabilities for unprocessed listening conditions, and hollow bars indicate edit distance probabilities for vocoded listening conditions. NR indicates responses that were either absent at the end of a list or were a placeholder (typically the participant saying “something”). Responses with an edit distance of zero were correct, and the scale cuts off those responses to emphasize the pattern of incorrect responses.
Fig 5.
Fig 5.
Group list recall accuracy as a function of list length and listening condition order, as in Fig 1. Light colored lines and large points indicate data from participants who completed the unprocessed conditions first, and dark colored lines and small points indicate participants who completed the vocoded conditions first. Confidence intervals were omitted for clarity.
Fig 6.
Fig 6.
Individual variability in median repetition time for correct responses. Filled circles indicate individual median repetition times in the unprocessed condition, and hollow circles indicate individual median repetition times in the vocoded condition. Horizontal lines indicate group medians.
Fig 7.
Fig 7.
Relationship between memory span and repetition speed across individuals. The top panel shows this relationship for each listening condition. Each point represents an individual’s performance in both tasks with matched stimulus (digits are shown in green, words in red, and non-words in blue) and listening conditions (filled circles for the unprocessed condition, hollow circles for the vocoded condition). The bottom panel shows the difference in span and speed between the unprocessed and vocoded listening conditions. Points represent the difference between listening conditions for one individual, and lines indicate group averages for each condition.

References

    1. Allen B, Becker M (2015). Learning alternations from surface forms with sublexical phonology. Lingbuzz, Retrieved from http://ling.auf.net/lingbuzz/002503
    1. Aubuchon AM, Pisoni DB Kronenberger WG (2015). Short-Term and Working Memory Impairments in Early-Implanted, Long-Term Cochlear Implant Users Are Independent of Audibility and Speech Production. Ear Hear, 36, 733–37. - PMC - PubMed
    1. AuBuchon AM, Pisoni DB Kronenberger WG (2015a). Verbal Processing Speed and Executive Functioning in Long-Term Cochlear Implant Users. J Speech, Lang, Hear R, 58, 151–62. - PMC - PubMed
    1. Ayasse ND, Lash A, Wingfield A (2017). Effort Not Speed Characterizes Comprehension of Spoken Sentences by Older Adults with Mild Hearing Impairment. Front Aging Neurosci, January 10;8, 329. - PMC - PubMed
    1. Barrouillet P, Bernardin S, Camos V (2004). Time Constraints and Resource Sharing in Adults’ Working Memory Spans. J Exp Psychol Gen, 133, 83–100. - PubMed

Publication types