DocumentCode
29687
Title
Training Initialization of Hidden Markov Models in Human Action Recognition
Author
Moghaddam, Zia ; Piccardi, Massimo
Author_Institution
Fac. of Eng. & Inf. Technol., Univ. of Technol., Sydney, NSW, Australia
Volume
11
Issue
2
fYear
2014
fDate
Apr-14
Firstpage
394
Lastpage
408
Abstract
Human action recognition in video is often approached by means of sequential probabilistic models as they offer a natural match to the temporal dimension of the actions. However, effective estimation of the models´ parameters is critical if one wants to achieve significant recognition accuracy. Parameter estimation is typically performed over a set of training data by maximizing objective functions such as the data likelihood or the conditional likelihood. However, such functions are nonconvex in nature and subject to local maxima. This problem is major since any solution algorithm (expectation-maximization, gradient ascent, variational methods and others) requires an arbitrary initialization and can only find a corresponding local maximum. Exhaustive search is otherwise impossible since the number of local maxima is unknown. While no theoretical solutions are available for this problem, the only practicable mollification is to repeat training with different initializations until satisfactory cross-validation accuracy is attained. Such a process is overall empirical and highly time-consuming. In this paper, we propose two methods for one-off initialization of hidden Markov models achieving interesting tradeoffs between accuracy and training time. Experiments over three challenging human action video datasets (Weizmann, MuHAVi and Hollywood Human Actions) and with various feature sets measured from the frames (STIP descriptors, projection histograms, notable contour points) prove that the proposed one-off initializations are capable of achieving accuracy above the average of repeated random initializations and comparable to the best. In addition, the methods proposed are not restricted solely to human action recognition as they suit time series classification as a general problem.
Keywords
expectation-maximisation algorithm; gesture recognition; gradient methods; hidden Markov models; image classification; parameter estimation; probability; time series; variational techniques; video signal processing; STIP descriptor; arbitrary initialization; conditional likelihood; cross-validation accuracy; data likelihood; expectation-maximization; gradient ascent; hidden Markov models; human action recognition; human action video datasets; objective functions; parameter estimation; projection histogram; recognition accuracy; sequential probabilistic model; temporal dimension; time series classification; variational method; Accuracy; Graphical models; Hidden Markov models; Histograms; Time series analysis; Training; Video surveillance; HMM parameter initialization; expectation-maximization; human action recognition; maximum-likelihood estimation;
fLanguage
English
Journal_Title
Automation Science and Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1545-5955
Type
jour
DOI
10.1109/TASE.2013.2262940
Filename
6555968
Link To Document