DocumentCode :
2955972
Title :
Speech Modelingwith Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement
Author :
Subramanya, Amarnag ; Zhang, Zhengyou ; Liu, Zicheng ; Acero, Alex
Author_Institution :
SSLI Lab., Washington Univ., Seattle, WA
fYear :
2006
fDate :
9-12 July 2006
Firstpage :
1157
Lastpage :
1160
Abstract :
A good speech model is essential for speech enhancement, but it is very difficult to build because of huge intra- and extra-speaker variation. We present a new speech model for speech enhancement, which is based on statistical models of magnitude-normalized complex spectra of speech signals. Most popular speech enhancement techniques work in the spectrum space, but the large variation of speech strength, even from the same speaker, makes accurate speech modeling very difficult because the magnitude is correlated across all frequency bins. By performing magnitude normalization for each speech frame, we are able to get rid of the magnitude variation and to build a much better speech model with only a small number of Gaussian components. This new speech model is applied to speech enhancement for our previously developed microphone headsets that combine a conventional air microphone with a bone sensor. Much improved results have been obtained
Keywords :
Gaussian processes; headphones; microphones; spectral analysis; speech enhancement; Gaussian component; bone sensor; magnitude-normalized complex spectra; microphone headset; multisensory speech enhancement; speaker variation; Acoustic noise; Bones; Frequency; Hidden Markov models; Microphones; Signal processing; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
Type :
conf
DOI :
10.1109/ICME.2006.262741
Filename :
4036810
Link To Document :
بازگشت