• DocumentCode
    667542
  • Title

    Ensemble learning for speech enhancement

  • Author

    Le Roux, Jonathan ; Watanabe, Shigetaka ; Hershey, John R.

  • Author_Institution
    Mitsubishi Electr. Res. Labs. (MERL), Cambridge, MA, USA
  • fYear
    2013
  • fDate
    20-23 Oct. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Over the years, countless algorithms have been proposed to solve the problem of speech enhancement from a noisy mixture. Many have succeeded in improving at least parts of the signal, while often deteriorating others. Based on the assumption that different algorithms are likely to enjoy different qualities and suffer from different flaws, we investigate the possibility of combining the strengths of multiple speech enhancement algorithms, formulating the problem in an ensemble learning framework. As a first example of such a system, we consider the prediction of a time-frequency mask obtained from the clean speech, based on the outputs of various algorithms applied on the noisy mixture. We consider several approaches involving various notions of context and various machine learning algorithms for classification, in the case of binary masks, and regression, in the case of continuous masks. We show that combining several algorithms in this way can lead to an improvement in enhancement performance, while simple averaging or voting techniques fail to do so.
  • Keywords
    learning (artificial intelligence); masks; speech enhancement; time-frequency analysis; binary masks; clean speech; continuous masks; ensemble learning; learning framework; machine learning; multiple speech enhancement; noisy mixture; time-frequency mask; Context; Noise measurement; Signal processing algorithms; Speech; Speech enhancement; Support vector machines; Time-frequency analysis; Classification; Ensemble learning; Speech enhancement; Stacking; Time-frequency mask;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
  • Conference_Location
    New Paltz, NY
  • ISSN
    1931-1168
  • Type

    conf

  • DOI
    10.1109/WASPAA.2013.6701888
  • Filename
    6701888