DocumentCode
3602257
Title
Diversified Hidden Markov Models for Sequential Labeling
Author
Maoying Qiao ; Wei Bian ; Da Xu, Richard Yi ; Dacheng Tao
Author_Institution
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol., Sydney, NSW, Australia
Volume
27
Issue
11
fYear
2015
Firstpage
2947
Lastpage
2960
Abstract
Labeling of sequential data is a prevalent meta-problem for a wide range of real world applications. While the first-order Hidden Markov Models (HMM) provides a fundamental approach for unsupervised sequential labeling, the basic model does not show satisfying performance when it is directly applied to real world problems, such as part-of-speech tagging (PoS tagging) and optical character recognition (OCR). Aiming at improving performance, important extensions of HMM have been proposed in the literatures. One of the common key features in these extensions is the incorporation of proper prior information. In this paper, we propose a new extension of HMM, termed diversified Hidden Markov Models (dHMM), which utilizes a diversity-encouraging prior over the statetransition probabilities and thus facilitates more dynamic sequential labellings. Specifically, the diversity is modeled by a continuous determinantal point process prior, which we apply to both unsupervised and supervised scenarios. Learning and inference algorithms for dHMM are derived. Empirical evaluations on benchmark datasets for unsupervised PoS tagging and supervised OCR confirmed the effectiveness of dHMM, with competitive performance to the state-of-the-art.
Keywords
data analysis; hidden Markov models; inference mechanisms; learning (artificial intelligence); optical character recognition; OCR; PoS tagging; dHMM; diversified hidden Markov models; inference algorithms; learning; meta-problem; optical character recognition; part-of-speech tagging; sequential data; unsupervised sequential labeling; Hidden Markov models; Kernel; Labeling; Optical character recognition software; Probability; Tagging; Yttrium; Determinantal Point Processes (DPP); Determinantal point processes (DPP); Hidden Markov Models (HMM); hidden Markov models (HMM); sequential labeling;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2015.2433262
Filename
7107991
Link To Document