Speech Technology Progress Based on New Machine Learning Paradigm
- PMID: 31341467
- PMCID: PMC6614991
- DOI: 10.1155/2019/4368036
Speech Technology Progress Based on New Machine Learning Paradigm
Abstract
Speech technologies have been developed for decades as a typical signal processing area, while the last decade has brought a huge progress based on new machine learning paradigms. Owing not only to their intrinsic complexity but also to their relation with cognitive sciences, speech technologies are now viewed as a prime example of interdisciplinary knowledge area. This review article on speech signal analysis and processing, corresponding machine learning algorithms, and applied computational intelligence aims to give an insight into several fields, covering speech production and auditory perception, cognitive aspects of speech communication and language understanding, both speech recognition and text-to-speech synthesis in more details, and consequently the main directions in development of spoken dialogue systems. Additionally, the article discusses the concepts and recent advances in speech signal compression, coding, and transmission, including cognitive speech coding. To conclude, the main intention of this article is to highlight recent achievements and challenges based on new machine learning paradigms that, over the last decade, had an immense impact in the field of speech signal processing.
Figures
References
-
- Kuhn T. S. The Structure of Scientific Revolutions-50th Anniversary Edition. 4th. Vol. 3. Chicago, IL, USA: The University of Chicago Press; 2012.
-
- Moore R. K. Cognitive informatics: the future of spoken language processing?. Proceedings of the 10th International Conference on Speech and Computer (SPECOM); October 2005; Patras, Greece.
-
- Paul J. D. Re-creating the sigsaly quantizer: this 1943 analog-to-digital converter gave the allies an unbreakable scrambler-(resources) IEEE Spectrum. 2019;56(2):16–17. doi: 10.1109/mspec.2019.8635806. - DOI
-
- Jayant N. S., Noll P. Digital coding of waveforms. Principles and applications to speech and video. Signal Processing. 1985;9(2):139–140. doi: 10.1016/0165-1684(85)90053-2. - DOI
-
- Chu W. C. Speech Coding Algorithms: Foundation and Evolution of Standardized Coders. Hoboken, NJ, USA: John Wiley & Sons; 2003.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
