Title :
An information filter for voice prompt suppression
Author :
McDonough, John ; Chu, Wei ; Kumatani, Kenichi ; Raj, Bhiksha ; Lehman, Jill Fain
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Modern speech enabled applications provide for dialog between a machine and one or more human users. The machine prompts the user with queries that are either prerecorded or synthesized on the fly. The human users respond with their own voices, and their speech is then recognized and understood by a human language understanding module. In order to achieve as natural an interaction as possible, the human user(s) must be allowed to interrupt the machine during a voice prompt. In this work, we compare two techniques for such voice prompt suppression. The first is a straightforward adaptation of a conventional Kalman filter, which has certain advantages over the normalized least squares algrithm in terms of robustness and speed of convergence. The second algorithm, which is novel in this work, is also based on a Kalman filter, but differs from the first in that the update or correction step is performed in information space and hence allows for the use of diagonal loading in order to control the growth of the subband filter coefficients, and thereby add robustness to the VPS.
Keywords :
Kalman filters; interactive systems; speech recognition; speech-based user interfaces; Kalman filter; convergence speed; diagonal loading; human language understanding module; human-machine dialog; information filter; information space; modern speech; speech recognition; subband filter coefficient; voice prompt suppression; Acoustics; Covariance matrix; Humans; Kalman filters; Robustness; Speech; Speech recognition; acoustic echo cancellation; speech recognition;
Conference_Titel :
Signals, Systems and Computers (ASILOMAR), 2011 Conference Record of the Forty Fifth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
978-1-4673-0321-7
DOI :
10.1109/ACSSC.2011.6190011