DocumentCode
2999364
Title
Speaker-independent isolated word recognition based on emphasized spectral dynamics
Author
Furui, Sadaoki
Author_Institution
NTT Electrical Communication Laboratories, Tokyo, Japan
Volume
11
fYear
1986
fDate
31503
Firstpage
1991
Lastpage
1994
Abstract
A new speech analysis technique applicable to speech recognition is proposed considering the auditory mechanism of speech perception which emphasizes spectral dynamics and which compensates for the spectral undershoot associated with coarticulation. A speech wave is represented by the LPC cepstrum and logarithmic energy sequences, and the time sequences over short periods are expanded by the first- and second-order polynomial functions at every frame period. The dynamics of the cepstrum sequences are then emphasized by the linear combination of their polynomial expansion coefficients, that is, derivatives, and their instantaneous values. Speaker-independent word recognition experiments using time functions of the dynamics-emphasized cepstrum and the polynomial coefficient for energy indicate that the error rate can be largely reduced by this method.
Keywords
Auditory system; Automatic speech recognition; Cepstral analysis; Cepstrum; Humans; Laboratories; Linear predictive coding; Polynomials; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type
conf
DOI
10.1109/ICASSP.1986.1168654
Filename
1168654
Link To Document