DocumentCode
2175548
Title
Spectral magnitude minimum mean-square error binary masks for DFT based speech enhancement
Author
Jensen, Jesper ; Hendriks, Richard C.
Author_Institution
Oticon A/S, Denmark
fYear
2011
fDate
22-27 May 2011
Firstpage
4736
Lastpage
4739
Abstract
Originally, ideal binary mask (idbm) techniques have been used as a tool for studying aspects of the auditory system. More recently, idbm techniques have been adapted to the practical problem of retrieving a target speech signal from a noisy observation. In this practical setting, the biliary mask techniques show similarities with existing DFT based speech enhancement techniques. In this context, we derive single-channel, binary mask estimators which minimize the spectral magnitude mean-square error. We show in simulation experiments with natural speech and noise signals that the proposed estimators perform significantly better than existing binary mask estimators. However, even the best of the proposed estimators is clearly out performed by non-binary estimators, both in terms of speech quality and intelligibility.
Keywords
discrete Fourier transforms; masks; mean square error methods; speech enhancement; DFT based speech enhancement; IDBM technique; auditory system; spectral magnitude minimum mean-square error ideal binary masks; speech quality; Discrete Fourier transforms; Gain; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Speech enhancement; binary masks; intelligibility; minimum mean-square error;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947413
Filename
5947413
Link To Document