DocumentCode
1723639
Title
Unsupervised Generation of Context-Relevant Training-Sets for Visual Object Recognition Employing Multilinguality
Author
Schoeler, Markus ; Worgotter, Florentin ; Kulvicius, Tomas ; Papon, Jeremie
Author_Institution
III. Phys. Inst. - Biophys., Georg-August Univ. of Gottingen, Gottingen, Germany
fYear
2015
Firstpage
805
Lastpage
812
Abstract
Image based object classification requires clean training data sets. Gathering such sets is usually done manually by humans, which is time-consuming and laborious. On the other hand, directly using images from search engines creates very noisy data due to ambiguous noun-focused indexing. However, in daily speech nouns and verbs are always coupled. We use this for the automatic generation of clean data sets by the here-presented TRANSCLEAN algorithm, which through the use of multiple languages also solves the problem of polyesters (a single spelling with multiple meanings). Thus, we use the implicit knowledge contained in verbs, e.g. in an imperative such as "hit the nail", implicating a metal nail and not the fingernail. One type of reference application where this method can automatically operate is human-robot collaboration based on discourse. A second is the generation of clean image data sets, where tedious manual cleaning can be replaced by the much simpler manual generation of a single relevant verb-noun tuple. Here we show the impact of our improved training sets for several widely used and state-of-the-art classifiers including Multipath Hierarchical Matching Pursuit. All tested classifiers show a substantial boost of about +20% in recognition performance.
Keywords
image classification; indexing; iterative methods; natural language processing; object recognition; unsupervised learning; TRANSCLEAN algorithm; automatic clean data set generation; clean image data sets; fingernail; hit the nail; human-robot collaboration; image based object classification; metal nail; multilinguality; multipath hierarchical matching pursuit; noun-focused indexing; polysemes; search engines; unsupervised context-relevant training-set generation; verb-noun tuple; visual object recognition; Clutter; Context; Fasteners; Google; Nails; Search engines; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on
Conference_Location
Waikoloa, HI
Type
conf
DOI
10.1109/WACV.2015.112
Filename
7045966
Link To Document