Title :
A frequency-weighted post-filtering transform for compensation of the over-smoothing effect in HMM-based speech synthesis
Author :
Eyben, Florian ; Agiomyrgiannakis, Yannis
Abstract :
Over-smoothing is one of the major sources of quality degradation in statistical parametric speech synthesis. Many methods have been proposed to compensate over-smoothing with the speech parameter generation algorithm considering Global Variance (GV) being one of the most successfull. This paper models over-smoothing as a radial relocation of poles and zeros of the spectral envelope towards the origin of the z-plane and uses radial scaling to enhance spectral peaks and to deepen spectral valeys. The radial scaling technique is improved by introducing over-emphasis, spectral-tilt compensation and frequency weighting. Listening test results indicate that the proposed method is 11%-13% more preferable than GV while it has less algorithmic delay (only 5 ms) and computational complexity.
Keywords :
cepstral analysis; hidden Markov models; poles and zeros; smoothing methods; speech synthesis; GV; HMM-based speech synthesis; algorithmic delay; computational complexity; frequency-weighted post-filtering transform; global variance; listening test; over-smoothing effect; poles and zeros; quality degradation; radial relocation; radial scaling technique; spectral envelope; spectral valeys; spectral-tilt compensation; speech parameter generation algorithm; statistical parametric speech synthesis; z-plane; Hidden Markov models; Poles and zeros; Pragmatics; Speech; Speech synthesis; Transforms; Vectors; global variance; hidden Markov model; mel-frequency cepstrum; post-filtering; radial scaling; speech synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853601