  • DocumentCode
    730845
  • Title
    Paraphrastic recurrent neural network language models
  • Author
    Liu, X.; Chen, X.; Gales, M.J.F.; Woodland, P.C.
  • Author_Institution
    Eng. Dept., Cambridge Univ., Cambridge, UK
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5406
  • Lastpage
    5410
  • Abstract
    Recurrent neural network language models (RNNLMs) have become an increasingly popular choice for state-of-the-art speech recognition systems. Linguistic factors influencing the realization of surface word sequences, for example expressive richness, are only implicitly learned by RNNLMs. Observed sentences and their associated alternative paraphrases representing the same meaning are not explicitly related during training. In order to improve context coverage and generalization, paraphrastic RNNLMs are investigated in this paper. Multiple paraphrase variants were automatically generated and used in paraphrastic RNNLM training. Using a paraphrastic multi-level RNNLM modelling both word and phrase sequences, a significant error rate reduction of 0.6% absolute and a perplexity reduction of 10% relative were obtained over the baseline RNNLM on a large vocabulary conversational telephone speech recognition system trained on 2000 hours of audio and 545 million words of text. The overall improvement over the baseline n-gram LM increased from 8.4% to 11.6% relative.
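    As a rough illustration of the training scheme summarized above, the sketch below (PyTorch; not the authors' implementation) trains an RNNLM on each observed sentence together with its automatically generated paraphrase variants, interpolating the two losses so that alternative surface realizations of the same meaning are explicitly related during training. The toy corpus, the paraphrase lists, the vocabulary size, and the interpolation weight alpha are all assumed for illustration; the paper's multi-level word/phrase modelling is omitted.

        # Minimal paraphrastic RNNLM training sketch (assumed details marked below).
        import torch
        import torch.nn as nn

        class RNNLM(nn.Module):
            def __init__(self, vocab_size, emb_dim=64, hidden_dim=128):
                super().__init__()
                self.emb = nn.Embedding(vocab_size, emb_dim)
                self.rnn = nn.RNN(emb_dim, hidden_dim, batch_first=True)
                self.out = nn.Linear(hidden_dim, vocab_size)

            def forward(self, ids):
                h, _ = self.rnn(self.emb(ids))
                return self.out(h)

        def sentence_nll(model, sent, loss_fn):
            # Predict each word from its history: inputs sent[:-1], targets sent[1:].
            ids = torch.tensor(sent).unsqueeze(0)
            logits = model(ids[:, :-1])
            return loss_fn(logits.squeeze(0), ids[0, 1:])

        model = RNNLM(vocab_size=1000)  # assumed vocabulary size
        opt = torch.optim.SGD(model.parameters(), lr=0.1)
        loss_fn = nn.CrossEntropyLoss()

        # Hypothetical data: (observed sentence, automatically generated
        # paraphrase variants), already mapped to word ids.
        corpus = [([1, 5, 9, 2], [[1, 6, 9, 2], [1, 5, 10, 2]])]
        alpha = 0.5  # assumed interpolation weight on the paraphrase loss

        for sent, paraphrases in corpus:
            loss = sentence_nll(model, sent, loss_fn)
            if paraphrases:
                # Average the paraphrase losses, then interpolate with the
                # loss on the observed sentence.
                para = sum(sentence_nll(model, p, loss_fn)
                           for p in paraphrases) / len(paraphrases)
                loss = (1 - alpha) * loss + alpha * para
            opt.zero_grad()
            loss.backward()
            opt.step()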
  • Keywords
    learning (artificial intelligence); linguistics; recurrent neural nets; sequences; speech recognition; vocabulary; baseline n-gram LM; context coverage improvement; context generalization improvement; implicit learning; large vocabulary conversational telephone speech recognition system; linguistic factor; paraphrastic multilevel RNNLM training model; paraphrastic recurrent neural network language model; phrase sequence; significant error rate reduction; state-of-the-art speech recognition system; surface word sequence; Artificial neural networks; Computational modeling; Context; Lattices; Recurrent neural networks; Speech recognition; Training; language model; paraphrase; recurrent neural network; speech recognition
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type
    conf
  • DOI
    10.1109/ICASSP.2015.7179004
  • Filename
    7179004