• DocumentCode
    1343201
  • Title

    Training hidden Markov models with multiple observations-a combinatorial method

  • Author

    Li, Xiaolin ; Parizeau, Marc ; Plamondon, Rejean

  • Author_Institution
    CADLink Technol. Corp., Ottawa, Ont., Canada
  • Volume
    22
  • Issue
    4
  • fYear
    2000
  • fDate
    4/1/2000 12:00:00 AM
  • Firstpage
    371
  • Lastpage
    377
  • Abstract
    Hidden Markov models (HMM) are stochastic models capable of statistical learning and classification. They have been applied in speech recognition and handwriting recognition because of their great adaptability and versatility in handling sequential signals. On the other hand, as these models have a complex structure and also because the involved data sets usually contain uncertainty, it is difficult to analyze the multiple observation training problem without certain assumptions. For many years researchers have used the training equations of Levinson (1983) in speech and handwriting applications, simply assuming that all observations are independent of each other. This paper presents a formal treatment of HMM multiple observation training without imposing the above assumption. In this treatment, the multiple observation probability is expressed as a combination of individual observation probabilities without losing generality. This combinatorial method gives one more freedom in making different dependence-independence assumptions. By generalizing Baum´s auxiliary function into this framework and building up an associated objective function using the Lagrange multiplier method, it is proven that the derived training equations guarantee the maximization of the objective function. Furthermore, we show that Levinson´s training equations can be easily derived as a special case in this treatment
  • Keywords
    combinatorial mathematics; handwriting recognition; hidden Markov models; learning (artificial intelligence); speech recognition; HMM; Lagrange multiplier method; Levinson training equations; combinatorial method; dependence-independence assumptions; handling sequential signals; handwriting recognition; hidden Markov model training; multiple observation probability; multiple observations; speech recognition; statistical classification; statistical learning; Computer Society; Equations; Handwriting recognition; Hidden Markov models; Lagrangian functions; Maximum likelihood estimation; Speech recognition; Statistical learning; Stochastic processes; Uncertainty;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.845379
  • Filename
    845379