• DocumentCode
    417290
  • Title

    Robust speech recognition in additive and channel noise environments using GMM and EM algorithm

  • Author

    Fujimoto, Masakiyo ; Riki, Y.A.

  • Author_Institution
    ATR Spoken Language Translation Res. Lab., Kyoto, Japan
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    In this paper, we evaluated the speech recognition in real driving car environments by using a GMM based speech estimation method and an EM algorithm based channel noise estimation method. The GMM based speech estimation method proposed by Segura et al (2001) was not robust for channel noise such as an acoustic transfer function, a microphone characteristic and so on. To cope with this problem, we propose a channel noise estimation method based on the EM algorithm. Furthermore, we estimate the speech signal more accurately by using a speech GMM and a silence GMM instead of the GMM trained without speech/silence discrimination. Our proposed method has been evaluated on the AURORA3 tasks. In the evaluation results, the proposed method showed the significant improvement in the high-mismatched condition test of AURORA3 tasks.
  • Keywords
    Gaussian distribution; channel estimation; parameter estimation; speech recognition; AURORA3 tasks; EM algorithm; GMM; Gaussian mixture models; additive noise; car environments; channel noise estimation; robust speech recognition; speech estimation; Acoustic noise; Additive noise; Microphones; Noise robustness; Speech analysis; Speech enhancement; Speech recognition; Testing; Transfer functions; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326142
  • Filename
    1326142