DocumentCode :
2022630
Title :
Utterance normalization using vowel features in a spoken word recognition system for multiple speakers
Author :
OHNO, Sumio ; Hirose, Keikichi ; Fujasaki, H.
Author_Institution :
Dept. of Electron. Eng., Tokyo Univ., Japan
Volume :
2
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
578
Abstract :
The authors propose a novel method of normalization based on linear transformation of acoustic features of input speech using only one isolated utterance each of the five vowels of Japanese by each individual speaker. Experiments on isolated word recognition combining the proposed normalization method and multiple-template DP matching showed a marked improvement in the recognition rate, especially for smaller numbers of templates per word. The proposed method gives consistently higher word recognition scores than the four-dimensional representation on the Karhunen-Loeve transformation, and also gives higher scores than the original 16-dimensional representation of filter-bank outputs, especially when the number of templates is small. Together with the fact that this method reduces the dimension of the feature vector by a factor of four, the results demonstrate the validity of the proposed method.<>
Keywords :
feature extraction; speech recognition; Japanese; linear transformation of acoustic features; multiple speakers; recognition rate; spoken word recognition system; templates; utterance normalisation; validity; vowel features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319373
Filename :
319373
Link To Document :
بازگشت