Mining semantic affordances of visual object categories

Author

Yu-Wei Chao;Zhan Wang;Rada Mihalcea;Jia Deng

Author_Institution

Computer Science &

fYear

2015

fDate

6/1/2015 12:00:00 AM

Firstpage

4259

Lastpage

4267

Abstract

Affordances are fundamental attributes of objects. Affordances reveal the functionalities of objects and the possible actions that can be performed on them. Understanding affordances is crucial for recognizing human activities in visual data and for robots to interact with the world. In this paper we introduce the new problem of mining the knowledge of semantic affordance: given an object, determining whether an action can be performed on it. This is equivalent to connecting verb nodes and noun nodes in WordNet, or filling an affordance matrix encoding the plausibility of each action-object pair. We introduce a new benchmark with crowdsourced ground truth affordances on 20 PASCAL VOC object classes and 957 action classes. We explore a number of approaches including text mining, visual mining, and collaborative filtering. Our analyses yield a number of significant insights that reveal the most effective ways of collecting knowledge of semantic affordances.

Keywords

"Semantics","Visualization","Feeds","Videos","Crowdsourcing","Switches","Acceleration"

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on

Electronic_ISBN

1063-6919

Type

conf

DOI

10.1109/CVPR.2015.7299054

Filename

7299054

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3672511