  • DocumentCode
    2999364
  • Title
    Speaker-independent isolated word recognition based on emphasized spectral dynamics

  • Author
    Furui, Sadaoki

  • Author_Institution
    NTT Electrical Communication Laboratories, Tokyo, Japan
  • Volume
    11
  • fYear
    1986
  • fDate
    April 1986
  • Firstpage
    1991
  • Lastpage
    1994
  • Abstract
    A new speech analysis technique applicable to speech recognition is proposed considering the auditory mechanism of speech perception which emphasizes spectral dynamics and which compensates for the spectral undershoot associated with coarticulation. A speech wave is represented by the LPC cepstrum and logarithmic energy sequences, and the time sequences over short periods are expanded by the first- and second-order polynomial functions at every frame period. The dynamics of the cepstrum sequences are then emphasized by the linear combination of their polynomial expansion coefficients, that is, derivatives, and their instantaneous values. Speaker-independent word recognition experiments using time functions of the dynamics-emphasized cepstrum and the polynomial coefficient for energy indicate that the error rate can be largely reduced by this method.
  • Keywords
    Auditory system; Automatic speech recognition; Cepstral analysis; Cepstrum; Humans; Laboratories; Linear predictive coding; Polynomials; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type
    conf
  • DOI
    10.1109/ICASSP.1986.1168654
  • Filename
    1168654
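
The abstract describes expanding each coefficient's time sequence with a first-order polynomial over a short window and then adding that derivative term to the instantaneous value. A minimal sketch of that idea for a single cepstral coefficient track follows. The function names, the window half-width, the edge handling, and the weight are illustrative assumptions, not values or procedures taken from the paper, and the second-order term is omitted.

```python
def regression_slope(seq, half_width=2):
    """Least-squares slope of a first-order polynomial fitted around each frame.

    half_width is a hypothetical window setting; the window spans
    2 * half_width + 1 frames. Edge frames are repeated at the boundaries
    (an assumption made for this sketch).
    """
    n = len(seq)
    denom = sum(k * k for k in range(-half_width, half_width + 1))
    slopes = []
    for t in range(n):
        num = 0.0
        for k in range(-half_width, half_width + 1):
            idx = min(max(t + k, 0), n - 1)  # clamp to valid frame indices
            num += k * seq[idx]
        slopes.append(num / denom)
    return slopes


def emphasize_dynamics(seq, weight=1.0, half_width=2):
    """Linear combination of instantaneous values and their slopes.

    weight is a hypothetical emphasis factor chosen for illustration.
    """
    slopes = regression_slope(seq, half_width)
    return [c + weight * d for c, d in zip(seq, slopes)]


if __name__ == "__main__":
    track = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0]  # toy coefficient track
    print(regression_slope(track))
    print(emphasize_dynamics(track, weight=0.5))
```

In practice the same operation would be applied to every cepstral coefficient, and the slope alone would be used for the logarithmic energy, as the abstract indicates.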