DocumentCode :
2106628
Title :
From Bag-of-Visual-Words to Bag-of-Visual-Phrases Using n-Grams
Author :
Pedrosa, Glauco Vitor ; Traina, Agma J. M.
Author_Institution :
Inst. de Cienc. Mat. e de Comput. - ICMC, Univ. de Sao Paulo, Sao Paulo, Brazil
fYear :
2013
fDate :
5-8 Aug. 2013
Firstpage :
304
Lastpage :
311
Abstract :
The Bag-of-Visual-Words has emerged as an effective modeling approach to represent local image features. It describes local image features by assigning them a visual word according to a visual dictionary. The image representation is given by the frequency of each visual word in the image, as a similar representation used in textual documents. In this paper, we present a novel approach building a high-level description using a group of words (phrases) for representing an image. We introduce the use of n-grams for image representation, based on the idea of "Bag-of-Visual-Phrases". In the field of computational linguistics, an n-gram is a phrase formed by a sequence of n-consecutive words. As analogy, we represent an image by a combination of n-consecutive visual words. We made representative experiments using three public benchmark databases of textures and nature scenes and two medical databases to demonstrate an area that can benefit from the proposed technique. Our proposed Bag-of-Visual-Phrases approach improved up to 44% the retrieval precision and up to 33% the classification rate compared to the traditional Bag-of-Visual-Words, being a valuable asset for content-based image retrieval and image classification.
Keywords :
computational linguistics; content-based retrieval; feature extraction; image classification; image representation; image retrieval; text analysis; visual databases; bag-of-visual-phrase approach; bag-of-visual-words; computational linguistics; content-based image classification; content-based image retrieval; high-level description; local image feature representation; medical databases; modeling approach; n-consecutive visual words; n-grams; public benchmark texture databases; textual documents; visual dictionary; Biomedical imaging; Databases; Dictionaries; Feature extraction; Image representation; Semantics; Visualization; Bag-of-features; CBIR; Image description; SIFT;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Graphics, Patterns and Images (SIBGRAPI), 2013 26th SIBGRAPI - Conference on
Conference_Location :
Arequipa
ISSN :
1530-1834
Type :
conf
DOI :
10.1109/SIBGRAPI.2013.49
Filename :
6656200
Link To Document :
بازگشت