Title :
Generation and expansion of word graphs using long span context information
Author :
Neukirchen, Christoph ; Klakow, Dietrich ; Aubert, Xavier
Author_Institution :
Philips Res. Lab., Aachen, Germany
Abstract :
An algorithm for the generation of word graphs in a cross-word decoder that uses long span m-gram language models (LMs) is presented. The generation of word hypotheses within the graph relies on word m-tuple-based boundary optimization. The graphs contain the full word history information, since the graph structure reflects all LM constraints used during the search. This results in better word boundaries and in enhanced capabilities to prune the graphs. Furthermore, the memory costs for expanding the m-gram constrained word graphs to apply very long span LMs (e.g. ten-grams that are constructed by log-linear LM combination) are considerably reduced. Experiments for lattice generation and rescoring have been carried out on the 5K-word WSJ task and the 64K-word NAB task.
Keywords :
decoding; dynamic programming; graph theory; hidden Markov models; search problems; speech recognition; NAB task; WSJ task; boundary optimization; cross-word decoder; dynamic programming search strategy; graph pruning; lattice generation; lexical tree; lexicon tree; long span context information; long span m-gram language models; partial search hypotheses; rescoring; search; word graphs; Acoustics; Context modeling; Costs; Decoding; Gold; Hidden Markov models; History; Laboratories; Speech recognition; Tree graphs
Conference_Title :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940762