Title :
On improving discrimination capability of an RNN based recognizer
Author :
Lee, Tan ; Ching, P.C.
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Abstract :
This paper presents a set of effective and efficient techniques to improve the discrimination capability of a recurrent neural network (RNN) based isolated word recognizer. The recognizer contains a set of individually trained RNN speech models (RSMs). Each of them represents a different word in the vocabulary. Speech recognition is performed by selecting the RSM that best matches the input utterance. For temporal supervised training of the RSMs, a new error function is introduced, in which the contributions of all phonetic components are equalized regardless of their difference in duration. The learning rate for recurrent connections is amplified. This is aimed at strengthening temporal dependency in the RSMs to capture dynamic characteristics of speech signals. Furthermore, a hierarchical training strategy is employed to facilitate more efficient discriminative training among the RSMs. A series of speaker-dependent recognition experiments are performed to evaluate the effectiveness of the proposed techniques
Keywords :
errors; learning (artificial intelligence); recurrent neural nets; speech recognition; vocabulary; discrimination capability; discriminative training; error function; hierarchical training strategy; input utterance; isolated word recognizer; learning; phonetic components; recurrent neural network; speaker-dependent recognition experiments; speech models; speech recognition; speech signals; temporal dependency; temporal supervised training; vocabulary; Delay effects; Electronic mail; Impedance matching; Neural networks; Neurofeedback; Neurons; Performance evaluation; Recurrent neural networks; Speech recognition; Vocabulary;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607170