• DocumentCode
    578383
  • Title

    Object localization based on discriminative visual words

  • Author

    Fang, Su-wen ; Qu, Yan-yun ; Chen, Cheng ; Song, Shu-yang

  • Author_Institution
    Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
  • Volume
    3
  • fYear
    2012
  • fDate
    15-17 July 2012
  • Firstpage
    1111
  • Lastpage
    1117
  • Abstract
    This paper aims at learning discriminative visual words for object localization. These visual words are different from those learned from the generic object recognition which usually contain the negative visual words located on the background. For the purpose of object localization, the approach requires that the positive discriminative visual words mostly lie in the foreground object and the negative ones lie in the background. We firstly rank the visual words by the following three methods: the SVM classifier, the foreground likelihood ratio and the mutual information. Then, we integrate the three ranking results in an optimal combinational way and select the discriminative visual words by maximizing the object hit rate. Moreover, a coarse-to-fine detection framework to locate the object is designed. In the first stage, a branch-and-bound scheme combined with the discriminative visual words is implemented to find candidate object regions. In the second stage, a sliding window classifier is used to find the object location. The experimental results demonstrate that the approach is effective and efficient, and superior to Efficient Subwindow Search scheme.
  • Keywords
    document image processing; learning (artificial intelligence); natural language processing; object recognition; support vector machines; SVM classifier; branch-and-bound scheme; coarse-to-fine detection framework; foreground likelihood ratio; generic object recognition; learning; mutual information; negative visual words; object localization; optimal combinational way; positive discriminative visual words; Abstracts; Argon; Cascade; Classifiers; Efficient subwindow search; Feature selection; Local features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
  • Conference_Location
    Xian
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4673-1484-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2012.6359510
  • Filename
    6359510