Title :
Context-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees
Author :
Fritsch, Jürgen ; Finke, Michael ; Waibel, Alex
Author_Institution :
Interactive Syst. Lab., Karlsruhe Univ., Germany
Abstract :
This paper presents a context-dependent hybrid connectionist speech recognition system that uses a set of generalized hierarchical mixtures of experts (HME) to estimate context-dependent posterior acoustic class probabilities. The connectionist part of the system is organized in a modular fashion, allowing the distributed training of such a system on regular workstations. Context classes are based on polyphonic contexts, clustered using decision trees which we adopt from our continuous density HMM recognizer JANUS (Waibel et al., 1996). The system is evaluated on ESST, an English speaker-independent spontaneous speech database. Context dependent modeling is shown to yield significant improvements over simple context-independent modeling, requiring only small additional overhead in terms of training and decoding time
Keywords :
decision theory; expert systems; hidden Markov models; learning (artificial intelligence); neural nets; speech recognition; ESST; English speaker-independent spontaneous speech database; connectionist part; context-dependent hybrid HME/HMM speech recognition; context-dependent hybrid connectionist speech recognition system; context-dependent posterior acoustic class probabilities; continuous density HMM recognizer JANUS; decoding time; distributed training; generalized hierarchical mixtures of experts; polyphone clustering decision trees; polyphonic contexts; training; workstations; Context modeling; Decision trees; Decoding; Hidden Markov models; Interactive systems; Laboratories; Neural networks; Speech recognition; Text recognition; Workstations;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.598867