Title :
A generalized construction of integrated speech recognition transducers
Author :
Allauzen, Cyril ; Mohri, Mehryar ; Riley, Michael ; Roark, Brian
Author_Institution :
AT&T Labs.-Res., USA
Abstract :
We showed in previous work that weighted finite-state transducers provide a common representation for many components of a speech recognition system and described general algorithms for combining these representations to build a single optimized and compact transducer integrating all these components, directly mapping from HMM states to words. This approach works well for certain well-controlled input transducers, but presents some problems related to the efficiency of composition and the applicability of determinization and weight-pushing with more general transducers. We generalize our prior construction of the integrated speech recognition transducer to work with an arbitrary number of component transducers and, to a large extent, release the constraints imposed on the type of input transducers by providing more general solutions to these problems. This generalization allowed us to deal with cases where our prior optimization did not apply. Our experiments in the AT&T HMIHY 0300 task and an AT&T VoiceTone task show the efficiency of our generalized optimization technique. We report a 1.6 recognition speed-up in the HMIHY 0300 task, 1.8 speed-up in a VoiceTone task using a word-based language model, and 1.7 using a class-based model.
Keywords :
finite automata; optimisation; speech recognition; AT&T HMIHY 0300 task; AT&T VoiceTone task; class-based model; finite-state transducers; generalized construction; generalized optimization; integrated speech recognition transducers; speed-up; word-based language model; Context modeling; Delay effects; Dictionaries; Hidden Markov models; Minimization methods; Natural languages; Optimization methods; Speech recognition; Stochastic processes; Transducers;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326097