• DocumentCode
    390663
  • Title

    Compensation of speech enhancement distortion for robust speech recognition

  • Author

    Pei, Ding ; Zhigang, Cao

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • Volume
    1
  • fYear
    2002
  • fDate
    28-31 Oct. 2002
  • Firstpage
    449
  • Abstract
    The performance of an automatic speech recognition (ASR) system will degrade dramatically in noisy environments because of the mismatch between testing and training. This paper presents an efficient robust method, which combines the minimum mean square error (MMSE) speech enhancement with cepstral mean normalization (CMN). In the front-end stage, the MMSE enhancement is adopted to suppress the intrusive noise to a lower level, but this process is usually at the expense of spectral variation of clean speech, which also severely affects the recognition. Thus, CMN is then used to compensate the distortion. including the spectral variation and the residual noise. Experimental evaluations show that the proposed robust method can significantly improve the recognition accuracy across a wide range of signal-to-noise ratios (SNR), especially in very noisy environments.
  • Keywords
    least mean squares methods; speech enhancement; speech recognition; automatic speech recognition; cepstral mean normalization; minimum mean square error speech enhancement; robust speech recognition; signal-to-noise ratios; speech enhancement distortion; Automatic speech recognition; Automatic testing; Degradation; Mean square error methods; Noise robustness; Signal to noise ratio; Speech enhancement; Speech recognition; System testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON '02. Proceedings. 2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering
  • Print_ISBN
    0-7803-7490-8
  • Type

    conf

  • DOI
    10.1109/TENCON.2002.1181310
  • Filename
    1181310