Title :
Low latency parameter generation for real-time speech synthesis system
Author :
Xingyu Na ; Xiang Xie ; Jingming Kuang
Author_Institution :
Beijing Inst. of Technol., Beijing, China
Abstract :
Speech synthesizers are commonly used in human-computer interaction. In many applications, computing resources are limited while real-time synthesis is required. The HMM-based speech synthesis technique can produce natural voice quality with a small footprint, but current synthesizers require the concatenation of sentence-level acoustic units, which is not feasible in real-time mode. In this paper, we propose a blocked parameter generation algorithm for low-latency speech synthesis that can run in real time in resource-limited applications. Phonetic units at various time spans are used as blocks. Objective and subjective evaluations suggest that the proposed system produces promising voice quality with low demand for computing resources.
Keywords :
human computer interaction; parameter estimation; real-time systems; speech processing; speech synthesis; voice equipment; HMM-based speech synthesis technique; blocked parameter generation algorithm; human-computer interaction; low latency parameter generation; natural voice quality; phonetic units; real-time speech synthesis system; real-time text-to-speech; sentence level acoustic units; speech synthesizer; Acoustics; Equations; Hidden Markov models; Mathematical model; Real-time systems; Speech
Conference_Title :
Multimedia and Expo (ICME), 2014 IEEE International Conference on
Conference_Location :
Chengdu
DOI :
10.1109/ICME.2014.6890197