• DocumentCode
    730843
  • Title

    Recurrent neural network language model with structured word embeddings for speech recognition

  • Author

    Tianxing He ; Xu Xiang ; Yanmin Qian ; Kai Yu

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5396
  • Lastpage
    5400
  • Abstract
    Due to effective word context encoding and long-term context preserving, recurrent neural network language model (RNNLM) has attracted great interest by showing better performance over back-off n-gram models and feed-forward neural network language models (FNNLM). However, it still has the difficulty of modelling words of very low frequency in training data. To address this issue, a new framework of structured word embedding is introduced to RNNLM, where both input and target word embeddings are factorized into weighted sum of the corresponding sub-word embeddings. The framework is instantiated for Chinese, where characters can be naturally used as the sub-word units. Experiments on a Chinese twitter LVCSR task showed that the proposed approach effectively outperformed the standard RNNLM, yielding a relative PPL improvement of 8:8% and an absolute 0:59% CER improvement in N-Best re-scoring.
  • Keywords
    natural language processing; recurrent neural nets; speech recognition; Chinese; N-best rescoring; RNNLM; effective word context encoding; input word embeddings; long-term context preserving; recurrent neural network language model; relative PPL improvement; speech recognition; structured word embeddings; subword embeddings; subword units; target word embeddings; Computational modeling; Context; Context modeling; Recurrent neural networks; Standards; Training; Language Model; Recurrent Neural Network; Speech Recognition; Word Embeddings;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7179002
  • Filename
    7179002