• DocumentCode
    2175548
  • Title

    Spectral magnitude minimum mean-square error binary masks for DFT based speech enhancement

  • Author

    Jensen, Jesper ; Hendriks, Richard C.

  • Author_Institution
    Oticon A/S, Denmark
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4736
  • Lastpage
    4739
  • Abstract
    Originally, ideal binary mask (idbm) techniques have been used as a tool for studying aspects of the auditory system. More recently, idbm techniques have been adapted to the practical problem of retrieving a target speech signal from a noisy observation. In this practical setting, the biliary mask techniques show similarities with existing DFT based speech enhancement techniques. In this context, we derive single-channel, binary mask estimators which minimize the spectral magnitude mean-square error. We show in simulation experiments with natural speech and noise signals that the proposed estimators perform significantly better than existing binary mask estimators. However, even the best of the proposed estimators is clearly out performed by non-binary estimators, both in terms of speech quality and intelligibility.
  • Keywords
    discrete Fourier transforms; masks; mean square error methods; speech enhancement; DFT based speech enhancement; IDBM technique; auditory system; spectral magnitude minimum mean-square error ideal binary masks; speech quality; Discrete Fourier transforms; Gain; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Speech enhancement; binary masks; intelligibility; minimum mean-square error;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947413
  • Filename
    5947413