DocumentCode :
2175548
Title :
Spectral magnitude minimum mean-square error binary masks for DFT based speech enhancement
Author :
Jensen, Jesper ; Hendriks, Richard C.
Author_Institution :
Oticon A/S, Denmark
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4736
Lastpage :
4739
Abstract :
Originally, ideal binary mask (idbm) techniques have been used as a tool for studying aspects of the auditory system. More recently, idbm techniques have been adapted to the practical problem of retrieving a target speech signal from a noisy observation. In this practical setting, the biliary mask techniques show similarities with existing DFT based speech enhancement techniques. In this context, we derive single-channel, binary mask estimators which minimize the spectral magnitude mean-square error. We show in simulation experiments with natural speech and noise signals that the proposed estimators perform significantly better than existing binary mask estimators. However, even the best of the proposed estimators is clearly out performed by non-binary estimators, both in terms of speech quality and intelligibility.
Keywords :
discrete Fourier transforms; masks; mean square error methods; speech enhancement; DFT based speech enhancement; IDBM technique; auditory system; spectral magnitude minimum mean-square error ideal binary masks; speech quality; Discrete Fourier transforms; Gain; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Speech enhancement; binary masks; intelligibility; minimum mean-square error;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947413
Filename :
5947413
Link To Document :
بازگشت