• DocumentCode
    417143
  • Title

    Cepstral gain normalization for noise robust speech recognition

  • Author

    Yoshizawa, Shingo ; Hayasaka, Noboru ; Wada, Naoya ; Miyanaga, Yoshikazu

  • Author_Institution
    Graduate Sch. of Eng., Hokkaido Univ., Sapporo, Japan
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The paper describes a robust speech recognition technique which normalizes cepstral gains in order to remove effects of additive noise. We assume that the effects can be expressed by an approximate model which consists of gain and DC components in log-spectrum. Accordingly, we propose cepstral gain normalization (CGN) which normalizes the gains by means of calculating maximum and minimum values of cepstral coefficients in speech frames. The proposed method can extract noise robust features without a priori knowledge and environmental adaptation because it is applied to both training and testing data. We have evaluated recognition performance under noisy environments using the Noisex-92 database and a 100 Japanese city names task. The CGN provides improvements of recognition accuracy at various SNRs compared with combinations of conventional methods.
  • Keywords
    acoustic noise; cepstral analysis; random noise; speech recognition; additive noise; cepstral coefficients; cepstral gain normalization; recognition accuracy; robust speech recognition; speech frames; Additive noise; Cepstral analysis; Cities and towns; Data mining; Feature extraction; Noise robustness; Spatial databases; Speech recognition; Testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325959
  • Filename
    1325959