• DocumentCode
    867575
  • Title

    Recognition of noisy speech using dynamic spectral subband centroids

  • Author

    Chen, Jingdong ; Huang, Yiteng Arden ; Li, Qi ; Paliwal, Kuldip K.

  • Author_Institution
    Bell Labs., Murray Hill, NJ, USA
  • Volume
    11
  • Issue
    2
  • fYear
    2004
  • Firstpage
    258
  • Lastpage
    261
  • Abstract
    Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.
  • Keywords
    cepstral analysis; channel bank filters; prediction theory; speech recognition; additive noise; cepstral coefficient; cepstrum; clean speech; dynamic centroid feature vector; dynamic spectral subband centroid; filter-bank; linear prediction analysis; mel-frequency cepstral coefficient; noisy environment; noisy speech recognition; robust speech recognition; spectral subband centroid; transitional spectral information; Automatic speech recognition; Band pass filters; Cepstral analysis; Hidden Markov models; Mel frequency cepstral coefficient; Noise robustness; Speech analysis; Speech recognition; Testing; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2003.821689
  • Filename
    1261994