• DocumentCode
    1296263
  • Title

    Extended VTS for Noise-Robust Speech Recognition

  • Author

    van Dalen, R.C. ; Gales, M.J.F.

  • Author_Institution
    Eng. Dept., Cambridge Univ., Cambridge, UK
  • Volume
    19
  • Issue
    4
  • fYear
    2011
  • fDate
    5/1/2011 12:00:00 AM
  • Firstpage
    733
  • Lastpage
    743
  • Abstract
    Model compensation is a standard way of improving the robustness of speech recognition systems to noise. A number of popular schemes are based on vector Taylor series (VTS) compensation, which uses a linear approximation to represent the influence of noise on the clean speech. To compensate the dynamic parameters, the continuous time approximation is often used. This approximation uses a point estimate of the gradient, which fails to take into account that dynamic coefficients are a function of a number of consecutive static coefficients. In this paper, the accuracy of dynamic parameter compensation is improved by representing the dynamic features as a linear transformation of a window of static features. A modified version of VTS compensation is applied to the distribution of the window of static features and, importantly, their correlations. These compensated distributions are then transformed to distributions over standard static and dynamic features. With this improved approximation, it is also possible to obtain full-covariance corrupted speech distributions. This addresses the correlation changes that occur in noise. The proposed scheme outperformed the standard VTS scheme by 10% to 20% relative on a range of tasks.
  • Keywords
    approximation theory; compensation; parameter estimation; speech recognition; vectors; continuous time approximation; dynamic parameter compensation; extended VTS compensation; linear approximation; model compensation; noise-robust speech recognition; vector Taylor series; Model compensation; noise-robustness; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2061226
  • Filename
    5549894