DocumentCode :
393957
Title :
Joint pitch and voicing estimation for multiband excitation and sinusoidal speech coders
Author :
Jia, Wenhui ; Chan, Wui-Yip
Author_Institution :
Brooktrout Technol., Los Gatos, CA, USA
Volume :
1
fYear :
2002
fDate :
3-6 Nov. 2002
Firstpage :
210
Abstract :
In conventional multi-band excitation (MBE) speech encoding, pitch is estimated first from the speech signal. Using the estimated pitch, voicing decisions are made for pitch-spaced spectral bands. As the method invariably includes unvoiced components in the speech signal to estimate the pitch, the accuracy of the estimated pitch and voicing decisions are degraded. A novel pitch and voicing estimation scheme is presented, wherein the spectrum of the speech signal is segmented into voiced and unvoiced regions without knowledge of the pitch. Pitch is then estimated only from the voice regions. Experimental results show that the new scheme improves the accuracy of the estimated pitch and voicing decisions, and offers better speech quality.
Keywords :
frequency estimation; spectral analysis; speech coding; vocoders; MBE; estimated pitch; joint pitch estimation; multiband excitation; pitch-spaced spectral bands; sinusoidal speech coders; speech signal spectrum; unvoiced components; unvoiced regions; voiced regions; voicing decisions; voicing estimation; Computer errors; Degradation; Discrete Fourier transforms; Frequency conversion; Frequency estimation; Frequency synthesizers; Harmonic analysis; Speech analysis; Speech coding; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 2002. Conference Record of the Thirty-Sixth Asilomar Conference on
Conference_Location :
Pacific Grove, CA, USA
ISSN :
1058-6393
Print_ISBN :
0-7803-7576-9
Type :
conf
DOI :
10.1109/ACSSC.2002.1197178
Filename :
1197178
Link To Document :
بازگشت