• DocumentCode
    3744901
  • Title

    Acoustic modelling with CD-CTC-SMBR LSTM RNNS

  • Author

    Andrew;Ha?im Sak;F?lix de Chaumont Quitry;Tara Sainath;Kanishka Rao

  • Author_Institution
    Google
  • fYear
    2015
  • Firstpage
    604
  • Lastpage
    609
  • Abstract
    This paper describes a series of experiments to extend the application of Context-Dependent (CD) long short-term memory (LSTM) recurrent neural networks (RNNs) trained with Connectionist Temporal Classification (CTC) and sMBR loss. Our experiments, on a noisy, reverberant voice search task, include training with alternative pronunciations and the application to child speech recognition; combination of multiple models, and convolutional input layers. We also investigate the latency of CTC models and show that constraining forward-backward alignment in training can reduce the delay for a real-time streaming speech recognition system. Finally we investigate transferring knowledge from one network to another through alignments.
  • Keywords
    "Hidden Markov models","Context modeling","Delays","Training","Speech recognition","Speech","Recurrent neural networks"
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on
  • Type

    conf

  • DOI
    10.1109/ASRU.2015.7404851
  • Filename
    7404851