DocumentCode :
1798898
Title :
Low latency parameter generation for real-time speech synthesis system
Author :
Xingyu Na ; Xiang Xie ; Jingming Kuang
Author_Institution :
Beijing Inst. of Technol., Beijing, China
fYear :
2014
fDate :
14-18 July 2014
Firstpage :
1
Lastpage :
6
Abstract :
Speech synthesizer is commonly used in human-computer interaction. In many applicational cases, the computing resource is limited while real-time synthesis is demanded. The HMM-based speech synthesis technique allows creating a natural voice quality with small footprint, but current synthesizers require the concatenation of sentence level acoustic units, which is not applicable in real-time mode. In this paper, we propose a blocked parameter generation algorithm for low latency speech synthesis which can work real-time in resource limited applications. Phonetic units at various time spans are used as blocks. The objective and subjective evaluations suggest that the proposed system produce promising voice quality with a low demand for the computing resource.
Keywords :
human computer interaction; parameter estimation; real-time systems; speech processing; speech synthesis; voice equipment; HMM-based speech synthesis technique; blocked parameter generation algorithm; human-computer interaction; low latency parameter generation; natural voice quality; phonetic units; real-time speech synthesis system; real-time text-to-speech; sentence level acoustic units; speech synthesizer; Acoustics; Equations; Hidden Markov models; Mathematical model; Real-time systems; Speech; Speech synthesis; human computer interaction; real-time text-to-speech; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2014 IEEE International Conference on
Conference_Location :
Chengdu
Type :
conf
DOI :
10.1109/ICME.2014.6890197
Filename :
6890197
Link To Document :
بازگشت