DocumentCode :
669808
Title :
Noise robust continuous digit recognition with reservoir-based acoustic models
Author :
Jalalvand, Azarakhsh ; Demuynck, Kris ; Martens, Jean-Pierre
Author_Institution :
Multimedia Lab., iMinds, Ghent Univ., Ghent, Belgium
fYear :
2013
fDate :
12-15 Nov. 2013
Firstpage :
204
Lastpage :
209
Abstract :
Notwithstanding the many years of research, more work is needed to create automatic speech recognition (ASR) systems with a close-to-human robustness against confounding factors such as ambient noise, channel distortion, etc. Whilst most work thus far focused on the improvement of ASR systems embedding Gaussian Mixture Models (GMM)s to compute the acoustic likelihoods in the states of a Hidden Markov Model (HMM), the present work focuses on the noise robustness of systems employing Reservoir Computing (RC) as an alternative acoustic modeling technique. Previous work already demonstrated good noise robustness for continuous digit recognition (CDR). The present paper investigates whether further progress can be achieved by driving reservoirs with noise-robust inputs that have been shown to raise the robustness of GMM-based systems, by introducing bi-directional reservoirs and by combining reservoirs with GMMs in a single system. Experiments on Aurora-2 demonstrate that it is indeed possible to raise the noise robustness without significantly increasing the system complexity.
Keywords :
Gaussian processes; hidden Markov models; mixture models; recurrent neural nets; speech recognition; Aurora-2; acoustic likelihoods; alternative acoustic modeling technique; automatic speech recognition system; bidirectional reservoir; close-to-human robustness; hidden Markov model; noise robust continuous digit recognition; noise robustness; reservoir based acoustic models; reservoir computing; systems embedding Gaussian mixture models; Acoustics; Feature extraction; Hidden Markov models; Neurons; Reservoirs; Speech; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Signal Processing and Communications Systems (ISPACS), 2013 International Symposium on
Conference_Location :
Naha
Print_ISBN :
978-1-4673-6360-0
Type :
conf
DOI :
10.1109/ISPACS.2013.6704547
Filename :
6704547
Link To Document :
بازگشت