Title :
On the use of scalar quantization for fast HMM computation
Author :
Sayayama, S. ; Takahashi, Satoshi
Author_Institution :
NTT Human Interface Labs., Kanagawa, Japan
Abstract :
This paper describes an algorithm for reducing the amount of arithmetic operations in the likelihood computation of continuous mixture HMM (CMHMM) with diagonal covariance matrices while retaining high performance. The key points are the use of the scalar quantization of the input observation vector components and table look-up. These make multiplication, squaring and division operations entirely unnecessary in the whole HMM computation (i.e., output probability calculation and trellis/Viterbi computation). It is experimentally proved in an large-vocabulary isolated word recognition task that scalar quantization into no less than 16 levels does not cause significant degradation in the speech recognition performance. Scalar quantization is also utilized in the computation truncation for unlikely distributions; the total number of distribution likelihood computations can be reduced by 66% with only a slight performance degradation. This “multiplication-free” HMM algorithm has high potentiality in speech recognition applications on personal computers
Keywords :
covariance matrices; hidden Markov models; microcomputer applications; probability; quantisation (signal); speech recognition; statistical analysis; table lookup; CMHMM; arithmetic operations; computation truncation; continuous mixture HMM; diagonal covariance matrices; distribution likelihood computations; fast HMM computation; high performance; input observation vector components; large-vocabulary isolated word recognition; multiplication-free HMM algorithm; output probability; personal computers; scalar quantization; speech recognition applications; speech recognition performance; table look-up; trellis/Viterbi computation; Arithmetic; Covariance matrix; Degradation; Distributed computing; Hidden Markov models; High performance computing; Probability; Quantization; Speech recognition; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479402