• DocumentCode
    2818459
  • Title
    Binary spectral masking for speech recognition systems
  • Author
    de Souza Siqueira Versiani, T. ; Rodrigues, Gustavo Fernandes ; de Souza, A.C.S. ; de Matos Moreira, J. ; Yehia, Hani Camille
  • Author_Institution
    Fed. Univ. of Sao Joao del-Rey, Ouro Branco, Brazil
  • fYear
    2012
  • fDate
    3-4 July 2012
  • Firstpage
    432
  • Lastpage
    436
  • Abstract
    The purpose of this paper is to examine the use of spectral masking techniques as a preprocessing step in speech recognition systems. The limits of these masking techniques for different levels of the signal-to-noise ratio are discussed. In general, speech recognition systems perform poorly in noisy environments or in the presence of other speech signals. This work presents a blind source separation system based on ideal binary masks to deal with real situations in which speech signals are corrupted by noise, including other speech signals. The main contribution of this work is to analyze the performance limits of recognition systems using spectral masking. We obtain an increase of 18% in the speech hit rate and an average gain of 10 dB in noise level attenuation when the speech signals are corrupted by other voice signals at signal-to-noise ratios of approximately 1, 10 and 20 dB.
  • Keywords
    blind source separation; speech recognition; binary masks; binary spectral masking; blind source separation system; noise level attenuation; signal-to-noise ratio; spectral masking techniques; speech recognition systems; speech signals; Signal to noise ratio; Speech; Speech processing; Speech recognition; Time frequency analysis; White noise; Blind source separation; Independent component analysis; Neural networks; Spectral masking
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Telecommunications and Signal Processing (TSP), 2012 35th International Conference on
  • Conference_Location
    Prague
  • Print_ISBN
    978-1-4673-1117-5
  • Type
    conf
  • DOI
    10.1109/TSP.2012.6256330
  • Filename
    6256330