• DocumentCode
    3602257
  • Title

    Diversified Hidden Markov Models for Sequential Labeling

  • Author

    Maoying Qiao ; Wei Bian ; Da Xu, Richard Yi ; Dacheng Tao

  • Author_Institution
    Centre for Quantum Comput. & Intell. Syst., Univ. of Technol., Sydney, NSW, Australia
  • Volume
    27
  • Issue
    11
  • fYear
    2015
  • Firstpage
    2947
  • Lastpage
    2960
  • Abstract
    Labeling of sequential data is a prevalent meta-problem for a wide range of real world applications. While the first-order Hidden Markov Models (HMM) provides a fundamental approach for unsupervised sequential labeling, the basic model does not show satisfying performance when it is directly applied to real world problems, such as part-of-speech tagging (PoS tagging) and optical character recognition (OCR). Aiming at improving performance, important extensions of HMM have been proposed in the literatures. One of the common key features in these extensions is the incorporation of proper prior information. In this paper, we propose a new extension of HMM, termed diversified Hidden Markov Models (dHMM), which utilizes a diversity-encouraging prior over the statetransition probabilities and thus facilitates more dynamic sequential labellings. Specifically, the diversity is modeled by a continuous determinantal point process prior, which we apply to both unsupervised and supervised scenarios. Learning and inference algorithms for dHMM are derived. Empirical evaluations on benchmark datasets for unsupervised PoS tagging and supervised OCR confirmed the effectiveness of dHMM, with competitive performance to the state-of-the-art.
  • Keywords
    data analysis; hidden Markov models; inference mechanisms; learning (artificial intelligence); optical character recognition; OCR; PoS tagging; dHMM; diversified hidden Markov models; inference algorithms; learning; meta-problem; optical character recognition; part-of-speech tagging; sequential data; unsupervised sequential labeling; Hidden Markov models; Kernel; Labeling; Optical character recognition software; Probability; Tagging; Yttrium; Determinantal Point Processes (DPP); Determinantal point processes (DPP); Hidden Markov Models (HMM); hidden Markov models (HMM); sequential labeling;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2015.2433262
  • Filename
    7107991