DocumentCode
2268231
Title
Speech bandwidth extension based on speech phonetic content and speaker vocal tract shape estimation
Author
Katsir, Itai ; Cohen, Israel ; Malah, David
Author_Institution
Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
fYear
2011
fDate
Aug. 29 2011-Sept. 2 2011
Firstpage
461
Lastpage
465
Abstract
In this paper, we introduce a new speech bandwidth extension (BWE) algorithm that performs phonetic- and speaker-dependent estimation of the high-band part of the spectral envelope. Speech phoneme information is extracted using a hidden Markov model. Speaker vocal tract shape information corresponding to the wideband signal is extracted by a codebook search. The proposed method allows better estimation of high-band formant frequencies, especially for voiced sounds, and better estimation of spectral envelope gain, especially for unvoiced sounds. Postprocessing of the estimated vocal tract shape reduces artifacts in cases of erroneous estimation of the speech phoneme or vocal tract shape. We present experimental results that demonstrate improved wideband quality for different speech sounds in comparison to other BWE methods.
Keywords
hidden Markov models; speech; BWE methods; artifact reduction; hidden Markov model; high-band formant frequencies; phonetic dependent estimation; speaker dependent estimation; speaker vocal tract shape estimation; speaker vocal tract shape information; spectral envelope; spectral envelope gain; speech bandwidth extension algorithm; speech phoneme; speech phonetic content; unvoiced sounds; wideband quality; wideband signal; Bandwidth; Estimation; Feature extraction; Shape; Speech
fLanguage
English
Publisher
IEEE
Conference_Title
Signal Processing Conference, 2011 19th European
Conference_Location
Barcelona
ISSN
2076-1465
Type
conf
Filename
7074052