DocumentCode
2965378
Title
Shift-tolerant K-subspaces for phoneme recognition
Author
Wu, Duanpei ; Gowdy, John N.
Author_Institution
Dept. of Electr. & Comput. Eng., Clemson Univ., SC, USA
Volume
6
fYear
1996
fDate
7-10 May 1996
Firstpage
3378
Abstract
This paper presents a new high performance neural network architecture, shift-tolerant K-subspaces, for phoneme recognition. The architecture combines the time-delay design for phoneme recognition and the technique of MLP autoassociators. For each phoneme category, K time-delay linear autoassociators are constructed and trained with a proposed K-subspace clustering procedure, similar to the K-means algorithm, using speech data belonging to the phoneme category. This architecture with its non-classification training procedure provides an effective method for phoneme recognition. It avoids the drawback encountered in most conventional neural network based speech recognition systems that network output values do not represent candidate likelihoods. The architecture has obtained 87.37% recognition accuracy which is only slightly lower than 88.44% obtained with a TDNN and 88.30% with a shift-tolerant LVQ trained by classification learning procedures using the same data set
Keywords
multilayer perceptrons; speech recognition; MLP autoassociators; TDNN; clustering procedure; high performance neural network architecture; nonclassification training; phoneme recognition; recognition accuracy; shift-tolerant K-subspaces; shift-tolerant LVQ; time-delay design; time-delay linear autoassociators; Clustering algorithms; Computer architecture; Covariance matrix; Data compression; Delay lines; Feature extraction; Neural networks; Principal component analysis; Speech recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.550602
Filename
550602
Link To Document