DocumentCode :
2311201
Title :
High Quality Sinusoidal Modeling of Wideband Speech for the Purposes of Speech Synthesis and Modification
Author :
Chazan, Dan ; Hoory, Ron ; Sagi, Ariel ; Shechtman, Slava ; Sorin, Alex ; Shuang, Zhi Wei ; Bakis, Raimo
Author_Institution :
IBM Research Laboratory in Haifa, Israel.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
This paper describes an efficient sinusoidal modeling framework for high quality wide band (WB) speech synthesis and modification. This technique may serve as a basis for speech compression in the context of small footprint concatenative Text to Speech systems. In addition, it is a useful representation for voice transformation and morphing purposes, e.g., simultaneous pitch modification and spectral envelope warping. The conventional sinusoidal modeling is enhanced with an adaptive frequency dithering mechanism, based on a degree of voicing analysis. Considerable reduction of the amount of model parameters is achieved by high band phase extension. The proposed model is evaluated and compared to the alternative STRAIGHT framework [1]. Being simpler and considerably more efficient than STRAIGHT, it outperforms it in speech quality for both speech reconstruction and transformation.
Keywords :
Acoustic waves; Frequency; Laboratories; Power harmonic filters; Signal synthesis; Speech analysis; Speech coding; Speech enhancement; Speech synthesis; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660161
Filename :
1660161
Link To Document :
بازگشت