• DocumentCode
    69544
  • Title

    Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR

  • Author

    Rybach, David ; Ney, Hermann ; Schluter, Ralf

  • Author_Institution
    Google Inc., New York, NY, USA
  • Volume
    21
  • Issue
    6
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    1295
  • Lastpage
    1307
  • Abstract
    Dynamic network decoders have the advantage of significantly lower memory consumption compared to static network decoders, especially when huge vocabularies and complex language models are required. This paper compares the properties of two well-known search strategies for dynamic network decoding, namely history conditioned lexical tree search and weighted finite-state transducer-based search using on-the-fly transducer composition. The two search strategies share many common principles like the use of dynamic programming, beam search, and many more. We point out the similarities of both approaches and investigate the implications of their differing features, both formally and experimentally, with a focus on implementation independent properties. Therefore, experimental results are obtained with a single decoder by representing the history conditioned lexical tree search strategy in the transducer framework. The properties analyzed cover structure and size of the search space, differences in hypotheses recombination, language model look-ahead techniques, and lattice generation.
  • Keywords
    decoding; dynamic programming; finite state machines; search problems; speech recognition; trees (mathematics); ASR; LVCSR; WFST; automatic speech recognition; beam search; complex language models; cover structure; dynamic network decoders; dynamic programming; dynamic search concepts; history conditioned lexical tree search strategy; hypotheses recombination; implementation independent properties; language model look-ahead techniques; lattice generation; lexical prefix tree; on-the-fly transducer composition; static network decoders; weighted finite-state transducer-based search; Context; Context modeling; Decoding; Hidden Markov models; History; Search problems; Transducers; Beam search; LVCSR; history conditioned lexical tree (HCLT) search; weighted finite-state transducer (WFST);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2248723
  • Filename
    6470661