DocumentCode :
3400555
Title :
Speaker-dependent 100 word recognition using dynamic spectral features of speech and neural networks
Author :
Kitamura, Tadashi ; Nishioka, Ken ; Ito, Asanobu ; Hayahara, Etsuro
Author_Institution :
Dept. of Eng., Nagoya Inst. of Technol., Nagoya, Japan
fYear :
1991
fDate :
14-17 May 1991
Firstpage :
533
Abstract :
A spoken word recognition method using dynamic features of speech and neural networks is presented. Dynamic features of speech are obtained from a two-dimensional mel-cepstrum (TDMC). The TDMC is defined as the two-dimensional Fourier transform of mel-frequency scaled log spectra in the frequency and time domains. It has averaged spectral features, dynamic spectral features, and averaged and dynamic features of power of the two-dimensional mel-log spectra in the analyzed interval. The neural network in this study is a three-layered feedforward neural network and learns automatically using a back-propagation algorithm. Dynamic spectral features, and averaged and dynamic features of power are used as the input of a neural network. The experimental results of speaker-dependent word recognition experiments for 100 Japanese city names uttered by nine speakers show that dynamic spectral features smoothed with respect to time are effective, and a recognition accuracy of 99.1% was obtained
Keywords :
backpropagation; feedforward neural nets; learning systems; speech recognition; 100 word recognition; Japanese; averaged spectral features; back-propagation algorithm; dynamic spectral features; mel-frequency scaled log spectra; neural networks; speaker-dependent word recognition; speech; spoken word recognition method; three-layered feedforward neural network; two-dimensional Fourier transform; two-dimensional mel-cepstrum; Cities and towns; Discrete Fourier transforms; Feedforward neural networks; Fourier transforms; Frequency domain analysis; Neural networks; Speaker recognition; Speech recognition; Time domain analysis; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 1991., Proceedings of the 34th Midwest Symposium on
Conference_Location :
Monterey, CA
Print_ISBN :
0-7803-0620-1
Type :
conf
DOI :
10.1109/MWSCAS.1991.252106
Filename :
252106
Link To Document :
بازگشت