• DocumentCode
    2020991
  • Title

    Prototype-based MCE/GPD training for word spotting and connected word recognition

  • Author

    McDermott, Erik ; Katagiri, Shigeru

  • Author_Institution
    ATR Auditory & Visual Perception Res. Lab., Soraku-gun, Kyoto, Japan
  • Volume
    2
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    291
  • Abstract
    A straightforward application of PBMEC (prototype-based minimum error classifier) training to existing techniques for handling continuous speech is described. A novel MCE/GPD (minimum classification error/generalized probabilistic descent) loss function that can incorporate word spotting errors and other measures of symbolic distance between correct and incorrect categories is defined. Classification consists in a time-synchronous DTW (dynamic time warping) pass through a finite state machine; adaptation makes use of an A* based N-best algorithm and consists in propagating the derivative of the loss over the N best paths through the finite state machine. The key feature is that the loss function being optimized closely reflects the actual recognition performance of the system.<>
  • Keywords
    errors; finite state machines; learning (artificial intelligence); neural nets; speech recognition; N-best algorithm; connected word recognition; dynamic time warping; errors; finite state machine; generalized probabilistic descent; loss function; prototype-based minimum error classifier; recognition performance; training; word spotting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319293
  • Filename
    319293