Title :
Shift-tolerant K-subspaces for phoneme recognition
Author :
Wu, Duanpei ; Gowdy, John N.
Author_Institution :
Dept. of Electr. & Comput. Eng., Clemson Univ., SC, USA
Abstract :
This paper presents a new high performance neural network architecture, shift-tolerant K-subspaces, for phoneme recognition. The architecture combines the time-delay design for phoneme recognition and the technique of MLP autoassociators. For each phoneme category, K time-delay linear autoassociators are constructed and trained with a proposed K-subspace clustering procedure, similar to the K-means algorithm, using speech data belonging to the phoneme category. This architecture with its non-classification training procedure provides an effective method for phoneme recognition. It avoids the drawback encountered in most conventional neural network based speech recognition systems that network output values do not represent candidate likelihoods. The architecture has obtained 87.37% recognition accuracy which is only slightly lower than 88.44% obtained with a TDNN and 88.30% with a shift-tolerant LVQ trained by classification learning procedures using the same data set
Keywords :
multilayer perceptrons; speech recognition; MLP autoassociators; TDNN; clustering procedure; high performance neural network architecture; nonclassification training; phoneme recognition; recognition accuracy; shift-tolerant K-subspaces; shift-tolerant LVQ; time-delay design; time-delay linear autoassociators; Clustering algorithms; Computer architecture; Covariance matrix; Data compression; Delay lines; Feature extraction; Neural networks; Principal component analysis; Speech recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.550602