DocumentCode
393957
Title
Joint pitch and voicing estimation for multiband excitation and sinusoidal speech coders
Author
Jia, Wenhui ; Chan, Wui-Yip
Author_Institution
Brooktrout Technol., Los Gatos, CA, USA
Volume
1
fYear
2002
fDate
3-6 Nov. 2002
Firstpage
210
Abstract
In conventional multi-band excitation (MBE) speech encoding, pitch is estimated first from the speech signal. Using the estimated pitch, voicing decisions are made for pitch-spaced spectral bands. As the method invariably includes unvoiced components in the speech signal to estimate the pitch, the accuracy of the estimated pitch and voicing decisions are degraded. A novel pitch and voicing estimation scheme is presented, wherein the spectrum of the speech signal is segmented into voiced and unvoiced regions without knowledge of the pitch. Pitch is then estimated only from the voice regions. Experimental results show that the new scheme improves the accuracy of the estimated pitch and voicing decisions, and offers better speech quality.
Keywords
frequency estimation; spectral analysis; speech coding; vocoders; MBE; estimated pitch; joint pitch estimation; multiband excitation; pitch-spaced spectral bands; sinusoidal speech coders; speech signal spectrum; unvoiced components; unvoiced regions; voiced regions; voicing decisions; voicing estimation; Computer errors; Degradation; Discrete Fourier transforms; Frequency conversion; Frequency estimation; Frequency synthesizers; Harmonic analysis; Speech analysis; Speech coding; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers, 2002. Conference Record of the Thirty-Sixth Asilomar Conference on
Conference_Location
Pacific Grove, CA, USA
ISSN
1058-6393
Print_ISBN
0-7803-7576-9
Type
conf
DOI
10.1109/ACSSC.2002.1197178
Filename
1197178
Link To Document