DocumentCode
294616
Title
Enhancement of discriminative capabilities of HMM based recognizer through modification of Viterbi algorithm
Author
Song, Jianming
Author_Institution
Dept. of Electr. & Comput. Eng., Wollongong Univ., NSW, Australia
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
469
Abstract
The algorithm proposed in this paper integrates the concepts of variable frame rate and discriminative analysis based on the Tanimoto ratio to modify the conventional Viterbi algorithm, so that steady or stationary signal is compressed while transitional or non-stationary signal is emphasized during the frame-by-frame search. The usefulness of each frame is decided entirely within the Viterbi process and need not be the same for different models. To evaluate this algorithm, we tested it on a speech database of 9 highly confusable E-set English letters. With 5 states and 6 mixture components, the conventional HMM baseline system delivered a recognition accuracy of only 73.9%. In a preliminary experiment using the proposed algorithm, the recognition accuracy increased to 82.5%.
Keywords
Viterbi decoding; hidden Markov models; maximum likelihood estimation; speech enhancement; speech processing; speech recognition; HMM based recognizer; HMM baseline system; Tanimoto ratio; discriminative analysis; discriminative capabilities; experiment; frame-by-frame searching process; highly confusable E-set English letters; mixture components; modified Viterbi algorithm; nonstationary signal; recognition accuracy; speech database; stationary signal compression; transitional signal; variable frame rate; Australia; Cepstral analysis; Databases; Hidden Markov models; Maximum likelihood estimation; Power system modeling; Probability density function; Signal processing; Speech; Speech processing; Speech recognition; Viterbi algorithm; Vocabulary
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479630
Filename
479630