DocumentCode
1434145
Title
Spatial Markov Kernels for Image Categorization and Annotation
Author
Lu, Zhiwu ; Ip, Horace H S
Author_Institution
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China
Volume
41
Issue
4
fYear
2011
Firstpage
976
Lastpage
989
Abstract
This paper presents a novel discriminative stochastic method for image categorization and annotation. We first divide the images into blocks on a regular grid and then generate visual keywords through quantizing the features of image blocks. The traditional Markov chain model is generalized to capture 2-D spatial dependence between visual keywords by defining the notion of “past” as what we have observed in a row-wise raster scan. The proposed spatial Markov chain model can be trained via maximum-likelihood estimation and then be used directly for image categorization. Since this is completely a generative method, we can further improve it through developing new discriminative learning. Hence, spatial dependence between visual keywords is incorporated into kernels in two different ways, for use with a support vector machine in a discriminative approach to the image categorization problem. Moreover, a kernel combination is used to handle rotation and multiscale issues. Experiments on several image databases demonstrate that our spatial Markov kernel method for image categorization can achieve promising results. When applied to image annotation, which can be considered as a multilabel image categorization process, our method also outperforms state-of-the-art techniques.
Keywords
Markov processes; image classification; learning (artificial intelligence); maximum likelihood estimation; support vector machines; visual databases; 2-D spatial dependence; discriminative learning; discriminative stochastic method; generative method; image annotation; image blocks; image categorization; image databases; kernel combination; maximum likelihood estimation; row-wise raster scan; spatial Markov chain model; support vector machine; visual keywords; Computational modeling; Feature extraction; Hidden Markov models; Kernel; Markov processes; Semantics; Visualization; Image annotation; Markov models; image categorization; kernel methods; visual keywords;
fLanguage
English
Journal_Title
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
1083-4419
Type
jour
DOI
10.1109/TSMCB.2010.2102749
Filename
5699931
Link To Document