Review

Lang Speech. 1992 Oct-Dec;35(Pt 4):351-89.

doi: 10.1177/002383099203500401.

Comprehension of synthetic speech produced by rule: a review and theoretical interpretation

S A Duffy¹, D B Pisoni

PMID: 1339919
PMCID: PMC3507427
DOI: 10.1177/002383099203500401

S A Duffy et al. Lang Speech. 1992 Oct-Dec.

Affiliation

¹ Department of Psychology, Amherst College, MA 01002.

Abstract

In this paper, we review research on the perception and comprehension of synthetic speech produced by rule. We discuss the difficulties that synthetic speech causes for the listener and the evidence that the immediate result of those difficulties is a delay in the point at which words are recognized. We then argue that this delay in processing affects not only lexical access but also comprehension processes. We consider the mechanisms by which the comprehension system adjusts to this delay, the resulting costs to higher level comprehension processes, and the changes that occur in the language processing system as its familiarity with synthetic speech increases. Based on the framework we have developed, we suggest several directions for future research on the comprehension of synthetic speech.

Figures

**Fig. 1**
Error rates (in percent) for various synthesis systems tested in both the closed- and open-response format MRT (from Logan, Greene, and Pisoni, 1989).

**Fig. 2**
Response times (in msec) and percent correct for natural and synthetic words and non-words in a lexical decision task (from Pisoni, 1981).

**Fig. 3**
Mean number of natural and synthetic words recalled as a function of memory preload (from Luce, Feustel, and Pisoni, 1983).

**Fig. 4**
Number of subjects correctly recalling all of the digits as a function of memory preload (from Luce *et al.*, 1983).

**Fig. 5**
Probability of recall at each serial position for natural and synthetic word lists (from Luce *et al.*, 1983).

**Fig. 6**
Percent correct for different categories of information presented in natural and synthetic speech (from Luce, 1981).

**Fig. 7**
Probability of a correct response for two kinds of information presented in natural and synthetic speech (from Ralston, Pisoni, Lively, Greene, and Mullennix, 1991).

**Fig. 8**
Sentence verification response times (in msec) for True and False responses to three- and six-word sentences presented in seven voices (from Manous, Pisoni, Dedina, and Nusbaum, 1985). Means are based on only those trials on which the subject verified and transcribed the sentence correctly.

**Fig. 9**
Mean sentence verification times (in msec) for True and False responses to three- and six-word sentences presented in natural and synthetic speech (from Pisoni, Manous, and Dedina, 1987). High-predictability sentences are displayed with open bars; low-predictability sentences are displayed with striped bars. Means are based on only those trials on which the subject verified and transcribed the sentence correctly.

**Fig. 10**
Sentence-by-sentence listening times as a function of voice and text (from Ralston *et al.*, 1991). Open bars represent natural speech; striped bars represent synthetic speech. Error bars represent one standard error of the sample mean.

Cited by

Influence of Turn-Taking in Musical and Spoken Activities on Empathy and Self-Esteem of Socially Vulnerable Young Teenagers.
Hawkins S, Farrant C. Front Psychol. 2022 Feb 7;12:801574. doi: 10.3389/fpsyg.2021.801574. eCollection 2021. PMID: 35197885 Free PMC article.
Three challenges for future research on cochlear implants.
Pisoni DB, Kronenberger WG, Harris MS, Moberly AC. World J Otorhinolaryngol Head Neck Surg. 2018 Jan 2;3(4):240-254. doi: 10.1016/j.wjorl.2017.12.010. eCollection 2017 Dec. PMID: 29780970 Free PMC article. Review.
Interpreting chicken-scratch: lexical access for handwritten words.
Barnhart AS, Goldinger SD. J Exp Psychol Hum Percept Perform. 2010 Aug;36(4):906-23. doi: 10.1037/a0019258. PMID: 20695708 Free PMC article.
The acceptability and validity of AI-generated psycholinguistic stimuli.
Alzahrani A. Heliyon. 2025 Jan 17;11(2):e42083. doi: 10.1016/j.heliyon.2025.e42083. eCollection 2025 Jan 30. PMID: 39906842 Free PMC article.
Effect of speaking rate on recognition of synthetic and natural speech by normal-hearing and cochlear implant listeners.
Ji C, Galvin JJ 3rd, Xu A, Fu QJ. Ear Hear. 2013 May-Jun;34(3):313-23. doi: 10.1097/AUD.0b013e31826fe79e. PMID: 23238527 Free PMC article.

Grants and funding

T32 DC000012/DC/NIDCD NIH HHS/United States
