Title :
Feature enhancement with a Reservoir-based Denoising Auto Encoder
Author :
Jalalvand, Azarakhsh ; Demuynck, Kris ; Martens, Jean-Pierre
Author_Institution :
iMinds, Multimedia Lab., Ghent Univ., Ghent, Belgium
Abstract :
Recently, automatic speech recognition has advanced significantly by the introduction of deep neural networks for acoustic modeling. However, there is no clear evidence yet that this does not come at the price of less generalization to conditions that were not present during training. On the other hand, acoustic modeling with Reservoir Computing (RC) did not offer improved clean speech recognition but it leads to good robustness against noise and channel distortions. In this paper, the aim is to establish whether adding feature denoising in the front-end can further improve the robustness of an RC-based recognizer, and if so, whether one can devise an RC-based Denoising Auto Encoder that outperforms a traditional denoiser like the ETSI Advanced Front-End. In order to answer these questions, experiments are conducted on the Aurora-2 benchmark.
Keywords :
hidden Markov models; neural nets; signal denoising; speech enhancement; speech recognition; ETSI advanced front-end; RC-based recognizer; acoustic modeling; automatic speech recognition; channel distortions; deep neural networks; feature denoising; feature enhancement; noise distortions; reservoir computing; reservoir-based denoising auto encoder; Hidden Markov models; Neurons; Noise; Noise reduction; Reservoirs; Training; Vectors; denoising auto encoder; recurrent neural networks; reservoir computing; robust speech recognition;
Conference_Titel :
Signal Processing and Information Technology(ISSPIT), 2013 IEEE International Symposium on
Conference_Location :
Athens
DOI :
10.1109/ISSPIT.2013.6781884