Enhancement of discriminative capabilities of HMM based recognizer through modification of Viterbi algorithm

Author

Song, Jianming

Author_Institution

Dept. of Electr. & Comput. Eng., Wollongong Univ., NSW, Australia

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

469

Abstract

The algorithm proposed in this paper integrates the concepts of variable frame rate and discriminative analysis based on Tanimoto ratio to modify the conventional Viterbi algorithm, in such a way that the steady or stationary signal is compressed, while transitional or non-stationary signal is emphasized through the frame-by-frame searching process. The usefulness of each frame is decided entirely within the Viterbi process and needs not to be the same for different models. To evaluate this algorithm, we tested a speech database of 9 highly confusable E-set English letters. With 5 state and 6 mixture components, the conventional HMM baseline system only delivered a recognition accuracy of 73.9%. In the preliminary experiment using the algorithm proposed, the recognition accuracy was increased to 82.5%

Keywords

Viterbi decoding; hidden Markov models; maximum likelihood estimation; speech enhancement; speech processing; speech recognition; HMM based recognizer; HMM baseline system; Tanimoto ratio; Viterbi decoding; discriminative analysis; discriminative capabilities; experiment; frame-by-frame searching process; highly confusable E-set English letters; mixture components; modified Viterbi algorithm; nonstationary signal; recognition accuracy; speech database; speech enhancement; stationary signal compression; transitional signal; variable frame rate; Australia; Cepstral analysis; Databases; Hidden Markov models; Maximum likelihood estimation; Power system modeling; Probability density function; Signal processing; Speech; Speech processing; Speech recognition; Viterbi algorithm; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479630

Filename

479630