DocumentCode :
1936772
Title :
Improving quality of MBROLA synthesis for non-uniform units synthesis
Author :
Bozkurt, Baris ; Dutoit, Thierry ; Prudon, Romain ; d'Alessandro, C. ; Pagel, Vincent
Author_Institution :
MULTITEL ASBL, Mons, Belgium
fYear :
2002
fDate :
11-13 Sept. 2002
Firstpage :
7
Lastpage :
10
Abstract :
The paper describes a new version of the MBROLA (multiband resynthesis overlap add) algorithm (Dutoit, T. and Leich, H., Speech Commun., vol. 13, pp. 435-40, 1993) for the synthesis of non-uniform units (NUU). This new version is called TP-MBROLA, standing for true-period MBROLA. The database preprocessing of MBROLA has been modified so that short-time speech frames are no longer systematically resynthesized at a constant pitch and constant phase envelope. This change greatly reduces the coding-decoding effect on signal quality. For spectral smoothing, only the smoothing frames are resynthesized at constant pitch and phase envelope, and MBROLA smoothing is applied to them. These operations are performed on the fly, which adds some computational load to the synthesis, although that load is restricted to smoothing frames. The new version of MBROLA is tested on non-uniform units synthesis by synthesizing speech with units provided by LIMSI's unit selection system (Prudon, R. and d'Alessandro, C., Proc. 4th ISCA Speech Synthesis Workshop, pp. 137-42, 2001). Formal listening tests have shown that TP-MBROLA synthesis quality is preferred over both MBROLA and raw concatenation synthesis.
Keywords :
smoothing methods; speech processing; speech synthesis; MBROLA synthesis; coding-decoding effect; concatenation synthesis; constant phase envelope; constant pitch; multiband resynthesis overlap add algorithm; nonuniform units synthesis; smoothing frames; spectral smoothing; speech frames; true-period MBROLA; Degradation; Humans; Signal processing algorithms; Signal synthesis; Smoothing methods; Spatial databases; Speech processing; Speech synthesis; Synthesizers; System testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on
Print_ISBN :
0-7803-7395-2
Type :
conf
DOI :
10.1109/WSS.2002.1224360
Filename :
1224360