Title :
Speech Modelingwith Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement
Author :
Subramanya, Amarnag ; Zhang, Zhengyou ; Liu, Zicheng ; Acero, Alex
Author_Institution :
SSLI Lab., Washington Univ., Seattle, WA
Abstract :
A good speech model is essential for speech enhancement, but it is very difficult to build because of huge intra- and extra-speaker variation. We present a new speech model for speech enhancement, which is based on statistical models of magnitude-normalized complex spectra of speech signals. Most popular speech enhancement techniques work in the spectrum space, but the large variation of speech strength, even from the same speaker, makes accurate speech modeling very difficult because the magnitude is correlated across all frequency bins. By performing magnitude normalization for each speech frame, we are able to get rid of the magnitude variation and to build a much better speech model with only a small number of Gaussian components. This new speech model is applied to speech enhancement for our previously developed microphone headsets that combine a conventional air microphone with a bone sensor. Much improved results have been obtained
Keywords :
Gaussian processes; headphones; microphones; spectral analysis; speech enhancement; Gaussian component; bone sensor; magnitude-normalized complex spectra; microphone headset; multisensory speech enhancement; speaker variation; Acoustic noise; Bones; Frequency; Hidden Markov models; Microphones; Signal processing; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
DOI :
10.1109/ICME.2006.262741