• DocumentCode
    178064
  • Title

    Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

  • Author

    Soonho Baek ; Hong-Goo Kang

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    1735
  • Lastpage
    1739
  • Abstract
    This paper presents the effect of mean normalization to various types of cepstral coefficients for robust speech recognition in noisy environments. Although the cepstral mean normalization (CMN) technique was originally designed to compensate channel distortion, it has also been proved that the CMN also improves recognition accuracy in additive noisy environment. However, no one has yet considered the interaction of CMN with spectral mapping functions required for extracting cepstral features. This paper investigates the impact of CMN to the speech recognition system depending on the types of spectral mapping function by mathematically analyzing the amount of spectral distortion between clean and noisy conditions. The analytic result is also confirmed by comparing the type of recognition error patterns in automatic speech recognition experiment with Aurora 2 database. Experimental results show that the performance improvement by adopting CMN becomes significant if the logarithmic function is replaced with the appropriate setting of fractional power mapping function. Especially, the deletion errors are dramatically reduced.
  • Keywords
    feature extraction; speech recognition; Aurora 2 database; CMN technique; cepstral feature extraction; fractional power mapping function; logarithmic function; mean normalization; noisy environment; power function based cepstral coefficient; recognition error patterns; spectral mapping function; speech recognition; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; Speech processing; Speech recognition; CMN; Robust speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853895
  • Filename
    6853895