• DocumentCode
    117944
  • Title

    Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features

  • Author

    Kitaoka, Norihide ; Hayashi, Tomoki ; Takeda, Kazuya

  • Author_Institution
    Nagoya Unviersity, Nagoya, Japan
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this study, we investigate the effect of blind spatial subtraction arrays (BSSA) on speech recognition systems by comparing the performance of a method using Mel-Frequency Cepstral Coefficients (MFCCs) with a method using Deep Bottleneck Features (DBNF) based on Deep Neural Networks (DNN). Performance is evaluated under various conditions, including noisy, in-vehicle conditions. Although performance of the DBNF-based system was much more degraded by noise than the MFCC-based system, BSSA improved the performance of both methods greatly, especially when matched condition training of acoustic models was employed. These results show the effectiveness of BSSA for speech recognition.
  • Keywords
    acoustic signal processing; cepstral analysis; feature extraction; neural nets; speech recognition; BSSA; DBNF; DNN; MFCC; acoustic model training; blind spatial subtraction array technique; deep bottleneck feature; deep neural network; in-vehicle condition; mel-frequency cepstral coefficient; noisy speech recognition; Feature extraction; Hidden Markov models; Noise; Speech; Speech enhancement; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
  • Conference_Location
    Siem Reap
  • Type

    conf

  • DOI
    10.1109/APSIPA.2014.7041556
  • Filename
    7041556