Title :
Speech bandwidth extension based on speech phonetic content and speaker vocal tract shape estimation
Author :
Katsir, Itai ; Cohen, Israel ; Malah, David
Author_Institution :
Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
fDate :
Aug. 29 2011-Sept. 2 2011
Abstract :
In this paper, we introduce a new speech bandwidth extension (BWE) algorithm that performs phonetic- and speaker-dependent estimation of the high-band part of the spectral envelope. Speech phoneme information is extracted using a hidden Markov model. Speaker vocal tract shape information corresponding to the wideband signal is extracted by a codebook search. The proposed method allows better estimation of high-band formant frequencies, especially for voiced sounds, and better estimation of spectral envelope gain, especially for unvoiced sounds. Postprocessing of the estimated vocal tract shape reduces artifacts in cases of erroneous estimation of the speech phoneme or vocal tract shape. We present experimental results that demonstrate improved wideband quality for different speech sounds in comparison to other BWE methods.
Keywords :
hidden Markov models; speech; BWE methods; artifact reduction; hidden Markov model; high-band formant frequencies; phonetic dependent estimation; speaker dependent estimation; speaker vocal tract shape estimation; speaker vocal tract shape information; spectral envelope; spectral envelope gain; speech bandwidth extension algorithm; speech phoneme; speech phonetic content; unvoiced sounds; wideband quality; wideband signal; Bandwidth; Estimation; Feature extraction; Shape
Conference_Titel :
Signal Processing Conference, 2011 19th European
Conference_Location :
Barcelona