• DocumentCode
    1523202
  • Title

    Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition

  • Author

    Birkenes, Øystein ; Matsui, Tomoko ; Tanabe, Kunio ; Siniscalchi, Sabato Marco ; Myrvoll, Tor André ; Johnsen, Magne Hallstein

  • Author_Institution
    Norwegian Univ. of Sci. & Technol., Trondheim, Norway
  • Volume
    18
  • Issue
    6
  • fYear
    2010
  • Firstpage
    1440
  • Lastpage
    1454
  • Abstract
    Hidden Markov models (HMMs) are powerful generative models for sequential data that have been used in automatic speech recognition for more than two decades. Despite their popularity, HMMs make inaccurate assumptions about speech signals, thereby limiting the achievable performance of the conventional speech recognizer. Penalized logistic regression (PLR) is a well-founded discriminative classifier with long roots in the history of statistics. Its classification performance is often compared with that of the popular support vector machine (SVM). However, for speech classification, only limited success with PLR has been reported, partially due to the difficulty with sequential data. In this paper, we present an elegant way of incorporating HMMs in the PLR framework. This leads to a powerful discriminative classifier that naturally handles sequential data. In this approach, speech classification is done using affine combinations of HMM log-likelihoods. We believe that such combinations of HMMs lead to a more accurate classifier than the conventional HMM-based classifier. Unlike similar approaches, we jointly estimate the HMM parameters and the PLR parameters using a single training criterion. The extension to continuous speech recognition is done via rescoring of N-best lists or lattices.
  • Keywords
    hidden Markov models; regression analysis; speech recognition; HMM log-likelihood regressors; HMM-based classifier; N-best rescoring; PLR; SVM; affine combinations; automatic speech recognition; hidden Markov models; penalized logistic regression; sequential data; speech classification; support vector machine; well-founded discriminative classifier; Automatic speech recognition (ASR); N-best and lattice rescoring; hidden Markov model (HMM); logistic regression; machine learning;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2035151
  • Filename
    5299048