• DocumentCode
    284603
  • Title

    Application of a generalized probabilistic descent method to dynamic time warping-based speech recognition

  • Author

    Komori, Takshi ; Katagiri, Shigeru

  • Author_Institution
    ATR Auditory & Visual Perception Res. Lab., Kyoto, Japan
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    497
  • Abstract
    Although several kinds of discriminative training methods based on artificial neural networks have been vigorously tested, the pursuit of highly capable classification of variable-duration speech patterns has been unsatisfactory. In this light, the authors evaluate a generalized probabilistic descent method (GPD) in designing a speech recognizer incorporated with the dynamic time warping methodology. The algorithm can be viewed as generalized learning vector quantization suited to the dynamic programming-based time warping. Experiments were conducted on two tasks: English syllable classification and Japanese phoneme classification. Results clearly demonstrate that GPD can be a viable candidate for a method to realize a high-performance speech recognizer
  • Keywords
    dynamic programming; learning (artificial intelligence); neural nets; speech recognition; vector quantisation; English syllable classification; GPD method; Japanese phoneme classification; artificial neural networks; discriminative training; dynamic programming-based time warping; dynamic time warping-based speech recognition; generalised learning VQ; generalized learning vector quantization; generalized probabilistic descent; high-performance speech recognizer; variable-duration speech patterns; Convergence; Cost function; Euclidean distance; Optimization methods; Particle measurements; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225863
  • Filename
    225863