Title :
Continuous speech recognition with modified learning vector quantization algorithm and two-level DP-matching
Author :
Makino, Shozo ; Endo, Mitsuru ; Sone, Toshio ; Kido, Ken´iti
Author_Institution :
Res. Center for Appl. Inf. Sci., Tohoku Univ., Sendai, Japan
Abstract :
The authors propose a phoneme recognition method based on the learning vector quantization (LVQ2) algorithm. They propose three kinds of modified training algorithms for the LVQ2 algorithm. In the recognition stage, the likelihood matrix is computed using the reference vectors and then the optimum phoneme sequence is computed from the matrix using the two-level dynamic programming (DP)-matching with duration constraints. The recognition score of phonemes in isolated spoken words was 89.1% for the test set. The phoneme recognition scores obtained by the modified LVQ2 algorithm were higher than those obtained by the original LVQ2 algorithm. The authors applied this method to a multi-speaker-dependent phoneme recognition task for continuous speech uttered Bunsetsu-by-Bunsetsu. The phoneme recognition score was 85.5% for the test speech samples in continuous speech
Keywords :
dynamic programming; speech recognition; vector quantisation; LVQ2 algorithm; continuous speech recognition; duration constraints; isolated spoken words; learning vector quantization; likelihood matrix; modified LVQ2 algorithm; optimum phoneme sequence; phoneme recognition method; phoneme recognition scores; recognition score; reference vectors; test speech samples; two-level DP-matching; two-level dynamic programming; Clustering algorithms; Clustering methods; Hidden Markov models; Neural networks; Speech coding; Speech recognition; Testing; Vector quantization;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225838