DocumentCode
417228
Title
A low-band spectrum envelope modeling for high quality pitch modification
Author
Mochizuki, Ryo ; Kobayashi, Tetsunori
Author_Institution
Dept. of Comput. Sci., Waseda Univ., Tokyo, Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
A low-band spectrum envelope reconstruction method was tested to determine whether it could improve the sound quality of speech modified by the PSOLA (pitch-synchronous overlap-add) method. In the conventional TD-PSOLA (time-domain PSOLA) method, the spectrum envelope extracted using a Hanning window two pitch periods long carried no reliable information in the frequency band below the original F0. This lack of information degrades the sound quality of F0-modified speech. In the proposed method, the low-band spectrum envelope was modified according to the F0 modification rate, and the amplitudes of the F0 harmonic components in the low band were reproduced based on the spectral tilt of the spectrum envelope. Subjective listening test results suggest that the proposed method yields better sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.
Keywords
frequency estimation; signal reconstruction; spectral analysis; speech synthesis; F0 modification rate; PSOLA; high quality pitch modification; low-band spectrum envelope modeling; pitch synchronous overlap add method; sound quality; spectral tilt; spectrum envelope reconstruction; Acoustic testing; Amplitude estimation; Computer industry; Computer science; Data mining; Degradation; Frequency; Interference; Reconstruction algorithms
fLanguage
English
Publisher
IEEE
Conference_Title
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326068
Filename
1326068