On the use of scalar quantization for fast HMM computation

Author

Sayayama, S. ; Takahashi, Satoshi

Author_Institution

NTT Human Interface Labs., Kanagawa, Japan

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

213

Abstract

This paper describes an algorithm for reducing the amount of arithmetic operations in the likelihood computation of continuous mixture HMM (CMHMM) with diagonal covariance matrices while retaining high performance. The key points are the use of the scalar quantization of the input observation vector components and table look-up. These make multiplication, squaring and division operations entirely unnecessary in the whole HMM computation (i.e., output probability calculation and trellis/Viterbi computation). It is experimentally proved in an large-vocabulary isolated word recognition task that scalar quantization into no less than 16 levels does not cause significant degradation in the speech recognition performance. Scalar quantization is also utilized in the computation truncation for unlikely distributions; the total number of distribution likelihood computations can be reduced by 66% with only a slight performance degradation. This “multiplication-free” HMM algorithm has high potentiality in speech recognition applications on personal computers

Keywords

covariance matrices; hidden Markov models; microcomputer applications; probability; quantisation (signal); speech recognition; statistical analysis; table lookup; CMHMM; arithmetic operations; computation truncation; continuous mixture HMM; diagonal covariance matrices; distribution likelihood computations; fast HMM computation; high performance; input observation vector components; large-vocabulary isolated word recognition; multiplication-free HMM algorithm; output probability; personal computers; scalar quantization; speech recognition applications; speech recognition performance; table look-up; trellis/Viterbi computation; Arithmetic; Covariance matrix; Degradation; Distributed computing; Hidden Markov models; High performance computing; Probability; Quantization; Speech recognition; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479402

Filename

479402