DocumentCode :
867575
Title :
Recognition of noisy speech using dynamic spectral subband centroids
Author :
Chen, Jingdong ; Huang, Yiteng Arden ; Li, Qi ; Paliwal, Kuldip K.
Author_Institution :
Bell Labs., Murray Hill, NJ, USA
Volume :
11
Issue :
2
fYear :
2004
Firstpage :
258
Lastpage :
261
Abstract :
Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.
Keywords :
cepstral analysis; channel bank filters; prediction theory; speech recognition; additive noise; cepstral coefficient; cepstrum; clean speech; dynamic centroid feature vector; dynamic spectral subband centroid; filter-bank; linear prediction analysis; mel-frequency cepstral coefficient; noisy environment; noisy speech recognition; robust speech recognition; spectral subband centroid; transitional spectral information; Automatic speech recognition; Band pass filters; Cepstral analysis; Hidden Markov models; Mel frequency cepstral coefficient; Noise robustness; Speech analysis; Speech recognition; Testing; Working environment noise;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2003.821689
Filename :
1261994
Link To Document :
بازگشت