DocumentCode
867575
Title
Recognition of noisy speech using dynamic spectral subband centroids
Author
Chen, Jingdong ; Huang, Yiteng Arden ; Li, Qi ; Paliwal, Kuldip K.
Author_Institution
Bell Labs., Murray Hill, NJ, USA
Volume
11
Issue
2
fYear
2004
Firstpage
258
Lastpage
261
Abstract
Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.
Keywords
cepstral analysis; channel bank filters; prediction theory; speech recognition; additive noise; cepstral coefficient; cepstrum; clean speech; dynamic centroid feature vector; dynamic spectral subband centroid; filter-bank; linear prediction analysis; mel-frequency cepstral coefficient; noisy environment; noisy speech recognition; robust speech recognition; spectral subband centroid; transitional spectral information; Automatic speech recognition; Band pass filters; Cepstral analysis; Hidden Markov models; Mel frequency cepstral coefficient; Noise robustness; Speech analysis; Speech recognition; Testing; Working environment noise;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2003.821689
Filename
1261994
Link To Document