DocumentCode
3146973
Title
Discriminative bag-of-visual phrase learning for landmark recognition
Author
Chen, Tao ; Yap, Kim-Hui ; Zhang, Dajiang
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2012
fDate
25-30 March 2012
Firstpage
893
Lastpage
896
Abstract
Bag-of-visual phrase (BoP) has been proposed and developed for landmark recognition recently. However, existing BoP methods for landmark recognition have two major shortcomings: (i) they try to construct a universal phrase vocabulary for all object categories, which lacks specific descriptive capabilities for a particular category, and (ii) they often adopt simple criterion such as the frequency information to mine the visual phrases, which may cause the selected phrases to be less discriminative or representative for recognition. In view of this, this paper proposes a new discriminative BoP approach for landmark recognition. First, the candidate visual phrases defined as adjacent pairwise words are selected for each category. A phrase-level similarity measure at the latent space is proposed to evaluate the semantic similarity between pairwise phrases. This is then integrated with the phrase frequency information to shortlist the discriminative phrases for each category through a proposed phrase ranking algorithm. Finally, the BoP and bag-of-words (BoW) histograms are combined through a pyramid matching method for recognition. Experimental results on two different datasets demonstrate that the proposed method is effective in landmark recognition.
Keywords
image recognition; learning (artificial intelligence); natural language processing; adjacent pairwise words; bag-of-words histograms; descriptive capabilities; discriminative BoP approach; discriminative bag-of-visual phrase learning; landmark recognition; latent space; object categories; phrase frequency information; phrase-level similarity measure; pyramid matching; semantic similarity; simple criterion; universal phrase vocabulary; Computer vision; Conferences; Frequency measurement; Histograms; Semantics; Visualization; Vocabulary; BoP; BoW; discriminative visual phrases; landmark recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288028
Filename
6288028
Link To Document