• DocumentCode
    2875593
  • Title
    Hands-free speech recognition and communication on PDAs using microphone array technology
  • Author
    Herbordt, W. ; Horiuchi, T. ; Fujimoto, M. ; Jitsuhiro, T. ; Nakamura, S.
  • Author_Institution
    ATR Spoken Language Commun. Res. Lab., Kyoto
  • fYear
    2005
  • fDate
    27 Nov. 2005
  • Firstpage
    302
  • Lastpage
    307
  • Abstract
    This paper presents a personal digital assistant (PDA) for hands-free speech recognition and communication that uses a microphone array mounted on the PDA. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients, which uses a Gaussian mixture model (GMM) for clean speech, are implemented in real time and evaluated for speech recognition on a small experimental multichannel database. The joint system of beamformer and single-channel noise suppression is shown to improve the noise robustness of a large-vocabulary speech recognizer to the point that more than 91% word accuracy is obtained down to SNR = 5 dB.
  • Keywords
    Gaussian processes; array signal processing; interference suppression; least mean squares methods; microphone arrays; mobile communication; notebook computers; speech recognition; Gaussian mixture model; PDA; hands-free speech recognition; log Mel-spectral energy; microphone array technology; minimum mean-squared error estimation; multichannel database; personal digital assistant; robust generalized sidelobe canceller; single-channel noise suppression; Acoustic noise; Automatic speech recognition; Microphone arrays; Noise cancellation; Noise reduction; Noise robustness; Personal digital assistants; Sensor arrays; Speech recognition; Universal Serial Bus;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type
    conf
  • DOI
    10.1109/ASRU.2005.1566509
  • Filename
    1566509