• DocumentCode
    417140
  • Title

    Higher order cepstral moment normalization (HOCMN) for robust speech recognition

  • Author

    Hsu, Chang-Wen ; Lee, Lin-shan

  • Author_Institution
    Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taiwan
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Cepstral mean subtraction (CMS) and cepstral normalization (CN) have been popularly used to normalize the first and the second moments of cepstral coefficients, and proved to be very helpful for robust speech recognition (Furui, S. 1981; Viikki, O. and Laurila, K., 1998). A unified formulation for higher order cepstral moment normalization (HOCMN) is developed by extending the concept of CMS and CN to orders much higher than three. A whole family of normalization techniques for different orders is thus proposed. Preliminary experimental results based on Aurora 2.0 showed that the recognition accuracy can be significantly improved with this approach under all noisy conditions. For example, HOCMN(1,5,100) (normalization of the first, fifth and 100th order cepstral moments) is shown to offer an error rate reduction of 32.83% as compared to the conventional CN with a full-utterance processing interval, or an error rate reduction of 20.78% as compared to CN with a segmental processing interval.
  • Keywords
    acoustic noise; cepstral analysis; error statistics; random noise; speech recognition; cepstral mean subtraction; cepstral normalization; error rate reduction; higher order cepstral moment normalization; robust speech recognition; Cepstral analysis; Collision mitigation; Error analysis; Low-frequency noise; Mel frequency cepstral coefficient; Neural networks; Noise reduction; Probability density function; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325956
  • Filename
    1325956