• DocumentCode
    1513852
  • Title

    Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood

  • Author

    Gomez, Randy ; Kawahara, Tatsuya

  • Author_Institution
    ACCMS, Kyoto Univ., Kyoto, Japan
  • Volume
    18
  • Issue
    7
  • fYear
    2010
  • Firstpage
    1708
  • Lastpage
    1716
  • Abstract
    Automatic speech recognition (ASR) in reverberant environments is a challenging task. Most dereverberation techniques address this problem through signal processing and enhances the reverberant waveform independent from the speech recognizer. In this paper, we propose a novel scheme to perform dereverberation in relation with the likelihood of the back-end ASR system. Our proposed approach effectively selects the dereverberation parameters, in the form of multiband scale factors, so that they improve the likelihood of the acoustic model. Then, the acoustic model is retrained using the optimal parameters. During the recognition phase, we implement additional optimization of the parameters. By using Gaussian mixture model (GMM), the process for selecting the scale factors become efficient. Moreover, we remove the dependency of the adopted dereverberation technique on the room impulse response (RIR) measurement, by using an artificial RIR generator and selecting based on the acoustic likelihood. Experimental results show significant improvement in recognition performance with the proposed method over the conventional approach.
  • Keywords
    Gaussian processes; acoustic signal processing; optimisation; reverberation; speech recognition; GMM; Gaussian mixture model; RIR measurement; acoustic model likelihood; artificial RIR generator; back-end ASR system; dereverberation parameter optimization; multiband scale factors; robust speech recognition; room impulse response measurement; signal processing; speech recognizer; Acoustic reflection; Acoustic signal processing; Automatic speech recognition; Deconvolution; Filtering; Filters; Radio access networks; Robustness; Speech enhancement; Speech recognition; Automatic speech recognition (ASR); dereverberation; robustness;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2052610
  • Filename
    5483169