• DocumentCode
    2965378
  • Title

    Shift-tolerant K-subspaces for phoneme recognition

  • Author

    Wu, Duanpei ; Gowdy, John N.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Clemson Univ., SC, USA
  • Volume
    6
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    3378
  • Abstract
    This paper presents a new high performance neural network architecture, shift-tolerant K-subspaces, for phoneme recognition. The architecture combines the time-delay design for phoneme recognition and the technique of MLP autoassociators. For each phoneme category, K time-delay linear autoassociators are constructed and trained with a proposed K-subspace clustering procedure, similar to the K-means algorithm, using speech data belonging to the phoneme category. This architecture with its non-classification training procedure provides an effective method for phoneme recognition. It avoids the drawback encountered in most conventional neural network based speech recognition systems that network output values do not represent candidate likelihoods. The architecture has obtained 87.37% recognition accuracy which is only slightly lower than 88.44% obtained with a TDNN and 88.30% with a shift-tolerant LVQ trained by classification learning procedures using the same data set
  • Keywords
    multilayer perceptrons; speech recognition; MLP autoassociators; TDNN; clustering procedure; high performance neural network architecture; nonclassification training; phoneme recognition; recognition accuracy; shift-tolerant K-subspaces; shift-tolerant LVQ; time-delay design; time-delay linear autoassociators; Clustering algorithms; Computer architecture; Covariance matrix; Data compression; Delay lines; Feature extraction; Neural networks; Principal component analysis; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.550602
  • Filename
    550602