DocumentCode
302075
Title
Time-frequency representation based cepstral processing for speech recognition
Author
Fineberg, Adam B. ; Yu, Kevin C.
Author_Institution
Lexicus Div., Motorola Inc., Palo Alto, CA, USA
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
25
Abstract
Both linear predictive coding (LPC) and mel scale frequency cepstral coefficient (MFCC) analysis, the most common techniques for speech recognition signal processing, make the assumption that the speech signal is stationary for some analysis window and produce a representation based upon the “stationary” frequency content within the window. This work uses a technique based upon Cohen´s (1989) class of generalized time frequency representations (TFR) to produce selected frequency representations that are not based upon an assumption of stationarity. This representation is used in a speech recognition system to produce improved accuracy. The proposed approach requires a kernel design to specify the attributes of the representations. The considerations used for analyzing speech signals and the resulting attributes are discussed. Comparisons with standard analysis techniques are presented. The significant computational requirements are also discussed
Keywords
cepstral analysis; linear predictive coding; signal representation; speech coding; speech processing; speech recognition; time-frequency analysis; Cohen´s class; LPC analysis; MFCC analysis; analysis techniques; analysis window; cepstral processing; computational requirements; frequency representations; generalized time-frequency representation; kernel design; linear predictive coding; mel scale frequency cepstral coefficient; signal processing; speech recognition system; speech signal; stationary frequency content; Cepstral analysis; Linear predictive coding; Mel frequency cepstral coefficient; Signal analysis; Signal processing; Speech analysis; Speech coding; Speech processing; Speech recognition; Time frequency analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.540281
Filename
540281
Link To Document