• DocumentCode
    3146973
  • Title
    Discriminative bag-of-visual phrase learning for landmark recognition
  • Author
    Chen, Tao ; Yap, Kim-Hui ; Zhang, Dajiang
  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    893
  • Lastpage
    896
  • Abstract
    Bag-of-visual phrase (BoP) has recently been proposed and developed for landmark recognition. However, existing BoP methods for landmark recognition have two major shortcomings: (i) they try to construct a universal phrase vocabulary for all object categories, which lacks specific descriptive capabilities for a particular category, and (ii) they often adopt a simple criterion, such as frequency information, to mine the visual phrases, which may cause the selected phrases to be less discriminative or representative for recognition. In view of this, this paper proposes a new discriminative BoP approach for landmark recognition. First, candidate visual phrases, defined as adjacent pairwise words, are selected for each category. A phrase-level similarity measure in the latent space is proposed to evaluate the semantic similarity between pairs of phrases. This measure is then integrated with the phrase frequency information to shortlist the discriminative phrases for each category through a proposed phrase ranking algorithm. Finally, the BoP and bag-of-words (BoW) histograms are combined through a pyramid matching method for recognition. Experimental results on two different datasets demonstrate that the proposed method is effective in landmark recognition.
  • Keywords
    image recognition; learning (artificial intelligence); natural language processing; adjacent pairwise words; bag-of-words histograms; descriptive capabilities; discriminative BoP approach; discriminative bag-of-visual phrase learning; landmark recognition; latent space; object categories; phrase frequency information; phrase-level similarity measure; pyramid matching; semantic similarity; simple criterion; universal phrase vocabulary; Computer vision; Conferences; Frequency measurement; Histograms; Semantics; Visualization; Vocabulary; BoP; BoW; discriminative visual phrases; landmark recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2012.6288028
  • Filename
    6288028