DocumentCode
2875593
Title
Hands-free speech recognition and communication on PDAs using microphone array technology
Author
Herbordt, W. ; Horiuchi, T. ; Fujimoto, M. ; Jitsuhiro, T. ; Nakamura, S.
Author_Institution
ATR Spoken Language Commun. Res. Lab., Kyoto
fYear
2005
fDate
27-27 Nov. 2005
Firstpage
302
Lastpage
307
Abstract
In this paper, a personal digital assistant (PDA) for hands-free speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated for speech recognition based on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression highly improves the noise-robustness of a large-vocabulary speech recognizer so that down to SNR = 5 dB more than 91% word accuracy is obtained
Keywords
Gaussian processes; array signal processing; interference suppression; least mean squares methods; microphone arrays; mobile communication; notebook computers; speech recognition; Gaussian mixture model; PDA; hands-free speech recognition; log Mel-spectral energy; microphone array technology; minimum mean-squared error estimation; multichannel database; personal digital assistant; robust generalized sidelobe canceller; single-channel noise suppression; Acoustic noise; Automatic speech recognition; Microphone arrays; Noise cancellation; Noise reduction; Noise robustness; Personal digital assistants; Sensor arrays; Speech recognition; Universal Serial Bus;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location
San Juan
Print_ISBN
0-7803-9478-X
Electronic_ISBN
0-7803-9479-8
Type
conf
DOI
10.1109/ASRU.2005.1566509
Filename
1566509
Link To Document