DocumentCode :
730843
Title :
Recurrent neural network language model with structured word embeddings for speech recognition
Author :
Tianxing He ; Xu Xiang ; Yanmin Qian ; Kai Yu
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5396
Lastpage :
5400
Abstract :
Due to effective word context encoding and long-term context preserving, recurrent neural network language model (RNNLM) has attracted great interest by showing better performance over back-off n-gram models and feed-forward neural network language models (FNNLM). However, it still has the difficulty of modelling words of very low frequency in training data. To address this issue, a new framework of structured word embedding is introduced to RNNLM, where both input and target word embeddings are factorized into weighted sum of the corresponding sub-word embeddings. The framework is instantiated for Chinese, where characters can be naturally used as the sub-word units. Experiments on a Chinese twitter LVCSR task showed that the proposed approach effectively outperformed the standard RNNLM, yielding a relative PPL improvement of 8:8% and an absolute 0:59% CER improvement in N-Best re-scoring.
Keywords :
natural language processing; recurrent neural nets; speech recognition; Chinese; N-best rescoring; RNNLM; effective word context encoding; input word embeddings; long-term context preserving; recurrent neural network language model; relative PPL improvement; speech recognition; structured word embeddings; subword embeddings; subword units; target word embeddings; Computational modeling; Context; Context modeling; Recurrent neural networks; Standards; Training; Language Model; Recurrent Neural Network; Speech Recognition; Word Embeddings;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7179002
Filename :
7179002
Link To Document :
بازگشت