• DocumentCode
    2175938
  • Title

    Investigations into the incorporation of the Ideal Binary Mask in ASR

  • Author

    Hartmann, William ; Fosler-Lussier, Eric

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4804
  • Lastpage
    4807
  • Abstract
    While much work has been dedicated to exploring how best to incorporate the Ideal Binary Mask (IBM) in automatic speech recognition (ASR) for noisy signals, we demonstrate that the simple use of masked speech can outperform standard spectral reconstruction methods. We explore the effects of both the accuracy of the mask estimation and the strength of the language model on our results. The relative performance of these techniques is directly tied to the accuracy of the estimated mask. Although the use of masked speech fails when significant numbers of errors are present, the maximum performance for spectral reconstruction techniques also drops significantly. This implies improvements in mask estimation can provide greater gains in ASR performance than improvements in the incorporation of the IBM in ASR. Previous work may have ignored the direct use of masked speech due to its poor performance on tasks without a strong language model.
  • Keywords
    masks; speech recognition; ASR; IBM; automatic speech recognition; ideal binary mask; language model; noisy signals; spectral reconstruction techniques; Accuracy; Cepstral analysis; Noise measurement; Signal to noise ratio; Speech; Speech recognition; ideal binary mask; robust automatic speech recognition; spectral reconstruction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947430
  • Filename
    5947430