Title :
Optimal feature vector for speech recognition of unequally segmented spoken digits
Author :
Karam, J.R. ; Phillips, W.J. ; Robertson, W.
Author_Institution :
Dept. of Eng. Math., Dalhousie Univ., Halifax, NS, Canada
Abstract :
We describe a model obtained by applying the discrete wavelet transform (DWT) to unequally segmented digits. Each signal is divided with a pre-determined segmentation into a maximum of five subwords. The purpose is speaker independent single digit recognition. The parameterization of the subwords is accomplished by measuring its energy contents after decomposing it with the DWT. This model uses one coefficient per subword and produces up to a 99% recognition rate. It is superior in its class due to the high reduction in the size of the feature vector and consequently in the speed of processing. Typically the reduction is 20:1 if compared with the traditional Mel-scale model. A successful attempt to classify vowels and accurately identify digits visibly using the proposed model is undertaken. A radial basis function artificial neural network (RBF-ANN) is employed for the recognition tasks and for the comparison of the proposed model with the Fourier one. We use orthogonal wavelets from the Daubechies (1992) set. Also the performances of some biorthogonal wavelets are included
Keywords :
discrete wavelet transforms; feature extraction; optimisation; radial basis function networks; signal classification; speech recognition; DWT; Daubechies set; Fourier model; Mel-scale model; biorthogonal wavelets; digits identification; discrete wavelet transform; energy contents; feature vector size reduction; optimal feature vector; orthogonal wavelets; processing speed; radial basis function artificial neural network; recognition rate; recognition tasks; speaker independent single digit recognition; speech recognition; unequally segmented spoken digits; vowels classification; Continuous wavelet transforms; Discrete wavelet transforms; Fourier transforms; Frequency; Mathematics; Signal analysis; Speech analysis; Speech recognition; Wavelet analysis; Wavelet transforms;
Conference_Titel :
Electrical and Computer Engineering, 2000 Canadian Conference on
Conference_Location :
Halifax, NS
Print_ISBN :
0-7803-5957-7
DOI :
10.1109/CCECE.2000.849723