• DocumentCode
    1294701
  • Title

    Evaluating Source Separation Algorithms With Reverberant Speech

  • Author

    Mandel, Michael I. ; Bressler, Scott ; Shinn-Cunningham, Barbara ; Ellis, Daniel P W

  • Author_Institution
    Dept. d´´Inf. et de Rech. Operationnelle, Univ. de Montreal, Montreal, QC, Canada
  • Volume
    18
  • Issue
    7
  • fYear
    2010
  • Firstpage
    1872
  • Lastpage
    1883
  • Abstract
    This paper examines the performance of several source separation systems on a speech separation task for which human intelligibility has previously been measured. For anechoic mixtures, automatic speech recognition (ASR) performance on the separated signals is quite similar to human performance. In reverberation, however, while signal separation has some benefit for ASR, the results are still far below those of human listeners facing the same task. Performing this same experiment with a number of oracle masks created with a priori knowledge of the separated sources motivates a new objective measure of separation performance, the Direct-path, Early echo, and Reverberation, of the Target and Masker (DERTM), which is closely related to the ASR results. This measure indicates that while the non-oracle algorithms successfully reject the direct-path signal from the masking source, they reject less of its reverberation, explaining the disappointing ASR performance.
  • Keywords
    audio signal processing; reverberation; source separation; speech processing; anechoic mixtures; human intelligibility; reverberant speech; source separation algorithms; speech separation task; Humans; Measurement; Particle separators; Reverberation; Source separation; Speech; Speech recognition; Intelligibility; objective evaluation; reverberation; speech enhancement; time–frequency masking; underdetermined source separation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2052252
  • Filename
    5547555