• DocumentCode
    3124609
  • Title

    An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition

  • Author

    Duc Hoang Ha Nguyen ; Xiong Xiao ; Eng Siong Chng ; Haizhou Li

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    131
  • Lastpage
    135
  • Abstract
    In this paper, we investigate a feature conditioning method for the VTS-based model compensation. The VTS is a technique that predicts noisy acoustic model from clean acoustic model and noise model. It is noted that most of the previous studies use a single Gaussian noise model, which is unable to model noise statistics well, especially in non-stationary noisy environments. In this paper, we propose a combination of feature processing and VTS model compensation to handle non-stationary noise more efficiently. In the feature processing stage, the non-stationary characteristics of noise is reduced, hence the processed features is more suitable for VTS model compensation using single Gaussian noise model. Experimental analysis on the AURORA2 task shows that the proposed method has the potential to improve the performance of VTS method in non-stationary environments if good noise estimation is available.
  • Keywords
    acoustic signal processing; compensation; estimation theory; series (mathematics); speech recognition; statistical analysis; AURORA2 task; VTS model compensation; VTS-based model compensation; feature conditioning method; feature processing stage; noise estimation; noise statistics; noisy acoustic model; non-stationary characteristics; nonstationary environments; nonstationary noise; nonstationary noisy environments; single Gaussian noise model; speech recognition; vector Taylor series model compensation; Adaptation models; Estimation; Hidden Markov models; Noise; Noise measurement; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423503
  • Filename
    6423503