Title :
Diversified Hidden Markov Models for Sequential Labeling
Author :
Maoying Qiao ; Wei Bian ; Da Xu, Richard Yi ; Dacheng Tao
Author_Institution :
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol., Sydney, NSW, Australia
Abstract :
Labeling of sequential data is a prevalent meta-problem for a wide range of real-world applications. While the first-order Hidden Markov Model (HMM) provides a fundamental approach for unsupervised sequential labeling, the basic model does not perform satisfactorily when it is directly applied to real-world problems, such as part-of-speech tagging (PoS tagging) and optical character recognition (OCR). To improve performance, important extensions of HMM have been proposed in the literature. A key feature common to these extensions is the incorporation of proper prior information. In this paper, we propose a new extension of HMM, termed diversified Hidden Markov Models (dHMM), which utilizes a diversity-encouraging prior over the state-transition probabilities and thus facilitates more dynamic sequential labelings. Specifically, the diversity is modeled by a continuous determinantal point process prior, which we apply to both unsupervised and supervised scenarios. Learning and inference algorithms for dHMM are derived. Empirical evaluations on benchmark datasets for unsupervised PoS tagging and supervised OCR confirm the effectiveness of dHMM, which performs competitively with the state of the art.
Keywords :
data analysis; hidden Markov models; inference mechanisms; learning (artificial intelligence); optical character recognition; OCR; PoS tagging; dHMM; diversified hidden Markov models; inference algorithms; learning; meta-problem; part-of-speech tagging; sequential data; unsupervised sequential labeling; Kernel; Labeling; Optical character recognition software; Probability; Tagging; Yttrium; Determinantal point processes (DPP); hidden Markov models (HMM); sequential labeling
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2015.2433262