Title :
Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks
Author :
Prabhavalkar, Rohit ; Alvarez, Raziel ; Parada, Carolina ; Nakkiran, Preetum ; Sainath, Tara N.
Author_Institution :
Google Inc., Mountain View, CA, USA
Abstract :
We explore techniques to improve the robustness of small-footprint keyword spotting models based on deep neural networks (DNNs) in the presence of background noise and in far-field conditions. We find that system performance can be improved significantly, with relative improvements up to 75% in far-field conditions, by employing a combination of multi-style training and a proposed novel formulation of automatic gain control (AGC) that estimates the levels of both speech and background noise. Further, we find that these techniques allow us to achieve competitive performance, even when applied to DNNs with an order of magnitude fewer parameters than our base-line.
Keywords :
automatic gain control; neural nets; speech processing; DNN; automatic gain control; background noise; base line; deep neural network; multistyle training combination; robust small-footprint keyword spotting; speech estimation; Gain control; Mathematical model; Noise; Noise measurement; Speech; Speech recognition; Training; automatic gain control; keyword spotting; multi-style training; small-footprint models;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178863