DocumentCode
1066741
Title
The short-time modified coherence representation and noisy speech recognition
Author
Mansour, David ; Juang, Biing Hwang
Author_Institution
AT&T Bell Labs., Murray Hill, NJ, USA
Volume
37
Issue
6
fYear
1989
fDate
6/1/1989 12:00:00 AM
Firstpage
795
Lastpage
804
Abstract
A technique for robust spectral representation of all-pole sequences is proposed. It is shown that the autocorrelation of an all-pole sequence, obtained by passing white noise through an all-pole filter 1/A (z ), is an all-pole sequence of the form 1/ A 2(z ). A short-time modified coherence (SMC) representation is proposed that is an all-pole modeling of the autocorrelation sequence with a spectral shaper. The spectral shaper, essentially a square root operator in the frequency domain, compensates for the inherent spectral distortion introduced by the autocorrelation operation on the autocorrelation sequence of the signal. The properties of the SMC representation, especially its robustness to additive white noise, are analyzed. Initial implementation of the SMC in a speaker-dependent isolated word recognizer shows an improvement in recognition accuracy equivalent to an increase in input SNR of approximately 13 dB, as compared to the LPC recognizer
Keywords
correlation theory; spectral analysis; speech recognition; white noise; all-pole filter; all-pole sequences; autocorrelation sequence; frequency domain; noisy speech recognition; recognition accuracy; robust spectral representation; short-time modified coherence representation; speaker-dependent isolated word recognizer; spectral distortion compensation; spectral shaper; square root operator; white noise; Additive white noise; Autocorrelation; Coherence; Distortion; Filters; Frequency domain analysis; Noise robustness; Sliding mode control; Speech; White noise;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
0096-3518
Type
jour
DOI
10.1109/ASSP.1989.28053
Filename
28053
Link To Document