Linguistic aspects of speech synthesis

J Allen¹

Affiliations

PMID: 7479807
PMCID: PMC40716
DOI: 10.1073/pnas.92.22.9946

Linguistic aspects of speech synthesis

J Allen. Proc Natl Acad Sci U S A. 1995.

. 1995 Oct 24;92(22):9946-52.

doi: 10.1073/pnas.92.22.9946.

Author

J Allen¹

Affiliation

¹ Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge 02139-4307, USA.

PMID: 7479807
PMCID: PMC40716
DOI: 10.1073/pnas.92.22.9946

Abstract

The conversion of text to speech is seen as an analysis of the input text to obtain a common underlying linguistic description, followed by a synthesis of the output speech waveform from this fundamental specification. Hence, the comprehensive linguistic structure serving as the substrate for an utterance must be discovered by analysis from the text. The pronunciation of individual words in unrestricted text is determined by morphological analysis or letter-to-sound conversion, followed by specification of the word-level stress contour. In addition, many text character strings, such as titles, numbers, and acronyms, are abbreviations for normal words, which must be derived. To further refine these pronunciations and to discover the prosodic structure of the utterance, word part of speech must be computed, followed by a phrase-level parsing. From this structure the prosodic structure of the utterance can be determined, which is needed in order to specify the durational framework and fundamental frequency contour of the utterance. In discourse contexts, several factors such as the specification of new and old information, contrast, and pronominal reference can be used to further modify the prosodic specification. When the prosodic correlates have been computed and the segmental sequence is assembled, a complete input suitable for speech synthesis has been determined. Lastly, multilingual systems utilizing rule frameworks are mentioned, and future directions are characterized.

PubMed Disclaimer

References

1. J Acoust Soc Am. 1983 Oct;74(4):1155-71 - PubMed
1. J Acoust Soc Am. 1992 Mar;91(3):1707-17 - PubMed
1. J Acoust Soc Am. 1991 Dec;90(6):2956-70 - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Linguistic aspects of speech synthesis

Affiliation

Linguistic aspects of speech synthesis

Author

Affiliation

Abstract

References

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources