DocumentCode
578383
Title
Object localization based on discriminative visual words
Author
Fang, Su-wen ; Qu, Yan-yun ; Chen, Cheng ; Song, Shu-yang
Author_Institution
Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
Volume
3
fYear
2012
fDate
15-17 July 2012
Firstpage
1111
Lastpage
1117
Abstract
This paper aims at learning discriminative visual words for object localization. These visual words are different from those learned from the generic object recognition which usually contain the negative visual words located on the background. For the purpose of object localization, the approach requires that the positive discriminative visual words mostly lie in the foreground object and the negative ones lie in the background. We firstly rank the visual words by the following three methods: the SVM classifier, the foreground likelihood ratio and the mutual information. Then, we integrate the three ranking results in an optimal combinational way and select the discriminative visual words by maximizing the object hit rate. Moreover, a coarse-to-fine detection framework to locate the object is designed. In the first stage, a branch-and-bound scheme combined with the discriminative visual words is implemented to find candidate object regions. In the second stage, a sliding window classifier is used to find the object location. The experimental results demonstrate that the approach is effective and efficient, and superior to Efficient Subwindow Search scheme.
Keywords
document image processing; learning (artificial intelligence); natural language processing; object recognition; support vector machines; SVM classifier; branch-and-bound scheme; coarse-to-fine detection framework; foreground likelihood ratio; generic object recognition; learning; mutual information; negative visual words; object localization; optimal combinational way; positive discriminative visual words; Abstracts; Argon; Cascade; Classifiers; Efficient subwindow search; Feature selection; Local features;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location
Xian
ISSN
2160-133X
Print_ISBN
978-1-4673-1484-8
Type
conf
DOI
10.1109/ICMLC.2012.6359510
Filename
6359510
Link To Document