DocumentCode :
2147953
Title :
An HNM-Based Speaker-Nonspecific Timbre Transformation Scheme for Speech Synthesis
Author :
Gu, Hung-Yan ; Cai, Chen-Lin ; Cai, Song-Fong
Author_Institution :
Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
fYear :
2009
fDate :
17-19 Oct. 2009
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, the previously studied speech signal synthesis scheme based on the harmonic-plus-noise model (HNM) is extended to provide speaker-nonspecific timbre transformation. To transform the timbre of synthetic speech, we have developed a formant-based frequency mapping method called piecewise linear frequency mapping (PLFM). A commonly adopted alternative method is frequency axis scaling (FAS). Both methods have been integrated into our HNM speech synthesis scheme, and a real-time synthesis system has been implemented according to this scheme. The perception test results show that the proposed scheme can indeed transform the source timbre of a female adult into the timbre of a male adult, a boy, or a girl. In addition, PLFM is shown to be better than FAS at producing a more masculine timbre.
Keywords :
piecewise linear techniques; speech synthesis; HNM-based speaker-nonspecific timbre transformation; frequency axis scaling; harmonic-plus-noise model; piecewise linear frequency mapping; speech signal synthesis; Birth disorders; Electronic mail; Frequency conversion; Hidden Markov models; Piecewise linear techniques; Signal synthesis; Signal to noise ratio; Speech synthesis; Timbre; Vocoders
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Image and Signal Processing, 2009. CISP '09. 2nd International Congress on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4244-4129-7
Electronic_ISBN :
978-1-4244-4131-0
Type :
conf
DOI :
10.1109/CISP.2009.5303818
Filename :
5303818