• DocumentCode
    302075
  • Title

    Time-frequency representation based cepstral processing for speech recognition

  • Author

    Fineberg, Adam B. ; Yu, Kevin C.

  • Author_Institution
    Lexicus Div., Motorola Inc., Palo Alto, CA, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    25
  • Abstract
    Both linear predictive coding (LPC) and mel scale frequency cepstral coefficient (MFCC) analysis, the most common techniques for speech recognition signal processing, make the assumption that the speech signal is stationary for some analysis window and produce a representation based upon the “stationary” frequency content within the window. This work uses a technique based upon Cohen´s (1989) class of generalized time frequency representations (TFR) to produce selected frequency representations that are not based upon an assumption of stationarity. This representation is used in a speech recognition system to produce improved accuracy. The proposed approach requires a kernel design to specify the attributes of the representations. The considerations used for analyzing speech signals and the resulting attributes are discussed. Comparisons with standard analysis techniques are presented. The significant computational requirements are also discussed
  • Keywords
    cepstral analysis; linear predictive coding; signal representation; speech coding; speech processing; speech recognition; time-frequency analysis; Cohen´s class; LPC analysis; MFCC analysis; analysis techniques; analysis window; cepstral processing; computational requirements; frequency representations; generalized time-frequency representation; kernel design; linear predictive coding; mel scale frequency cepstral coefficient; signal processing; speech recognition system; speech signal; stationary frequency content; Cepstral analysis; Linear predictive coding; Mel frequency cepstral coefficient; Signal analysis; Signal processing; Speech analysis; Speech coding; Speech processing; Speech recognition; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.540281
  • Filename
    540281