Learning to Annotate Clothes in Everyday Photos: Multi-modal, Multi-label, Multi-instance Approach

Author

Veloso, Adriano Alonso ; dos Santos, Jefersson A. ; Nogueira, Kenedy

Author_Institution

Dept. of Comput. Sci., Univ. Fed. de Minas Gerais, Belo Horizonte, Brazil

fYear

2014

fDate

26-30 Aug. 2014

Firstpage

327

Lastpage

334

Abstract

In this paper, we present an effective algorithm to automatically annotate clothes in everyday photos posted in online social networks, such as Facebook and Instagram. Specifically, clothing annotation can be informally stated as predicting, as accurately as possible, the garment items appearing in the target photo. This task not only poses interesting challenges for existing vision and recognition algorithms, but also brings huge opportunities for recommender and e-commerce systems. We formulate the annotation task as a multi-modal, multi-label and multi-instance classification problem: (i) both image and textual content (i.e., comments about the image) are available for learning classifiers, (ii) the classifiers must predict a set of labels (i.e., a set of garment items), and (iii) the decision on which labels to predict comes from a bag of instances that are used to build a function, which separates labels that should be predicted from those that should not be. Under this setting, we propose a classification algorithm which employs association rules in order to build a prediction model that combines image and textual information, and adopts an entropy-minimization strategy in order to the find the best set of labels to predict. We conducted a systematic evaluation of the proposed algorithm using everyday photos collected from two major fashion-related social networks, namely pose.com and chictopia.com. Our results show that the proposed algorithm provides improvements when compared to popular first choice multi-label algorithms that range from 2% to 40% in terms of accuracy.

Keywords

clothing; data mining; electronic commerce; entropy; image classification; learning (artificial intelligence); minimisation; recommender systems; social networking (online); Facebook; Instagram; annotation task; association rules; chictopia.com; classification algorithm; clothing annotation; e-commerce systems; entropy-minimization strategy; everyday photos; fashion-related social networks; garment items; image comments; image content; learning classifiers; multiinstance approach; multiinstance classification problem; multilabel approach; multilabel classification problem; multimodal approach; multimodal classification problem; online social networks; pose.com; prediction model; recognition algorithms; recommender systems; systematic evaluation; target photo; textual content; vision algorithms; Clothing; Entropy; Feature extraction; Image color analysis; Prediction algorithms; Social network services; Visualization; Automatic annotation; machine learning; multi-label classification;

fLanguage

English

Publisher

ieee

Conference_Titel

Graphics, Patterns and Images (SIBGRAPI), 2014 27th SIBGRAPI Conference on

Conference_Location

Rio de Janeiro

Type

conf

DOI

10.1109/SIBGRAPI.2014.37

Filename

6915325

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=261991