DocumentCode
352323
Title
Concatenating syllables for response generation in spoken language applications
Author
Fung, Tien Ying ; Meng, Helen M.
Author_Institution
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, China
Volume
2
fYear
2000
fDate
2000
Abstract
We describe our approach in developing a speech synthesis technique for response generation in domain-specific spoken language applications. Our approach handles two Chinese dialects-Cantonese and Putonghua. We chose the foreign exchange domain, and worked with its constrained vocabulary and response expressions. The syllable is selected to be our basic unit for concatenation. Each unit label includes a two-digit appendix to encode the distinctive features of the left and right coarticulatory context. Our approach attempts to maximize intelligibility and naturalness of the responses within the application domain. Hence the synthesized outputs compare favorably with a domain-independent TD-PSOLA synthesizer
Keywords
natural languages; speech intelligibility; speech synthesis; Cantonese; Chinese dialects; Putonghua; TD-PSOLA synthesizer; constrained vocabulary; corpus development; domain-specific applications; foreign exchange domain; intelligibility maximization; left coarticulatory context; naturalness maximization; response generation; right coarticulatory context; speech synthesis technique; spoken language applications; syllables concatenation; two-digit appendix; unit label; Application software; Feeds; Laboratories; Natural languages; Research and development management; Satellites; Speech synthesis; Synthesizers; Systems engineering and theory; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.859114
Filename
859114
Link To Document