• DocumentCode
    1932534
  • Title

    Robust Technologies towards Automatic Speech Recognition in Car Noise Environments

  • Author

    Ding, Pei ; He, Lei ; Yan, Xiang ; Zhao, Rui ; Hao, Jie

  • Author_Institution
    Toshiba Res. & Dev. Center, Beijing
  • Volume
    1
  • fYear
    2006
  • fDate
    16-20 2006
  • Abstract
    This paper presents the research on robust automatic speech recognition (ASR) in car noise environments. In the front-end design, speech enhancement technologies are used to suppress the background noise in frequency domain, and then spectrum smoothing is implemented both in time and frequency index to compensate those spectrum components distorted by noise over-reduction. In acoustic model training, we propose to use an immunity learning scheme, in which pre-recorded car noises are artificially added to clean training utterances with different signal-to-noise ratios (SNR) to imitate the in-car environments. After analyzing the SNR and noise spectrum of real in-car utterances, we further refine the immunity training set by adjusting the distribution of SNR and increasing the proportion of training noises that has a similar characteristic. Evaluation results of isolated phrase recognition show that the ASR system with proposed technologies achieves the average error rate reduction (ERR) of 90.68% and 79.08% for artificial car noisy speech and real in-car speech respectively, when compared with the baseline system in which no robust technology is used
  • Keywords
    acoustic noise; learning (artificial intelligence); smoothing methods; speech enhancement; speech recognition; SNR; automatic speech recognition; car noise environments; immunity learning scheme; noise over-reduction; signal-to-noise ratios; spectrum smoothing; speech enhancement technologies; Acoustic noise; Automatic speech recognition; Background noise; Frequency domain analysis; Isolation technology; Noise robustness; Signal to noise ratio; Speech analysis; Speech enhancement; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, 2006 8th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7803-9736-3
  • Electronic_ISBN
    0-7803-9736-3
  • Type

    conf

  • DOI
    10.1109/ICOSP.2006.345538
  • Filename
    4128953