• DocumentCode
    734993
  • Title

    Local trajectory based speech enhancement for robust speech recognition with deep neural network

  • Author

    Yongbin You ; Yanmin Qian ; Kai Yu

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2015
  • fDate
    12-15 July 2015
  • Firstpage
    5
  • Lastpage
    9
  • Abstract
    Deep neural network(DNN) has achieved a great success in automatic speech recognition(ASR), and it can be regarded as a joint model combining the nonlinear feature transformation and the log-linear classifier. Recently DNN is adopted as a regression model to enhance the distorted feature in noisy condition and the enhanced feature is utilized to improve the performance of DNN based ASR. Previous work only predicts a single frame (log-spectrum) using the enhanced DNN and the final improvement of ASR is not big. In this paper, local trajectory, represented using multiple frames with dynamic features, is predicted instead to make the feature enhancement more stable. In addition, FBank features and long context window are used to better integrating the enhanced DNN into ASR DNN. Experiments on the Aurora4 corpus showed that, compared to the standard DNN baseline, the proposed approach can achieve 9.6% relative WER reduction and also significantly outperform the previously proposed DNN ASR system using the log-spectrum feature based enhancement.
  • Keywords
    feature extraction; neural nets; regression analysis; speech enhancement; speech recognition; ASR; DNN; automatic speech recognition; deep neural network; feature enhancement; local trajectory; log-linear classifier; nonlinear feature transformation; regression model; speech enhancement; Acoustics; Context; Hidden Markov models; Speech; Speech recognition; Training; Trajectory; deep neural network; dynamic feature; feature enhancement; noise robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
  • Conference_Location
    Chengdu
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2015.7230351
  • Filename
    7230351