Title :
Framewise phoneme classification with bidirectional LSTM networks
Author :
Graves, Alex ; Schmidhuber, Jurgen
Author_Institution :
IDSIA, Switzerland
fDate :
July 31 2005-Aug. 4 2005
Abstract :
In this paper, we apply bidirectional training to a long short term memory (LSTM) network for the first time. We also present a modified, full gradient version of the LSTM learning algorithm. We discuss the significance of framewise phoneme classification to continuous speech recognition, and the validity of using bidirectional networks for online causal tasks. On the TIMIT speech database, we measure the framewise phoneme classification scores of bidirectional and unidirectional variants of both LSTM and conventional recurrent neural networks (RNNs). We find that bidirectional LSTM outperforms both RNNs and unidirectional LSTM.
Keywords :
recurrent neural nets; signal classification; speech recognition; bidirectional training; continuous speech recognition; framewise phoneme classification; long short term memory network; recurrent neural networks; speech database; Acoustic measurements; Data analysis; Databases; Electronic mail; Error correction; Hidden Markov models; Memory architecture; Neural networks; Recurrent neural networks; Speech recognition;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1556215