• DocumentCode
    284618
  • Title

    Prototype-based discriminative training for various speech units

  • Author

    McDermott, Erik ; Katagiri, Shigeru

  • Author_Institution
    ATR Auditory & Visual Perception Res. Lab., Kyoto, Japan
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    417
  • Abstract
    It has since been shown that learning vector quantisation (LVQ) is a special case of a more general method, generalized probabilistic descent (GPD), for gradient descent on a rigorously defined classification loss measure that closely reflects the misclassification rate. The authors to extend LVQ into a prototype-based classifier appropriate for the classification of various long speech units. For word recognition, a dynamic time warping procedure is integrated into the GPD learning procedure. The resulting minimum error classifier (MEC) is no longer a purely LVQ-like method, and it is called the prototype-based minimum error classifier (PBMEC). Results for the difficult Bell Labs E-set task as well as for speaker-dependent isolated word recognition for a vocabulary of 5240 words are presented. They reveal clear gains in performance as a result of using PBMEC
  • Keywords
    learning (artificial intelligence); neural nets; speech recognition; vector quantisation; Bell Labs E-set task; LVQ; classification loss measure; discriminative training; dynamic time warping procedure; generalized probabilistic descent; gradient descent; learning procedure; learning vector quantisation; long speech units; minimum error classifier; misclassification rate; neural networks; prototype-based classifier; speaker-dependent isolated word recognition; Bayesian methods; Laboratories; Loss measurement; Performance evaluation; Performance gain; Prototypes; Speech; Vector quantization; Visual perception; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225883
  • Filename
    225883