DocumentCode :
2947357
Title :
Real-time full-body human gender recognition in (RGB)-D data
Author :
Linder, Timm ; Wehner, Sven ; Arras, Kai O.
Author_Institution :
Dept. of Comput. Sci., Univ. of Freiburg, Freiburg, Germany
fYear :
2015
fDate :
26-30 May 2015
Firstpage :
3039
Lastpage :
3045
Abstract :
Understanding social context is an important skill for robots that share a space with humans. In this paper, we address the problem of recognizing gender, a key piece of information when interacting with people and understanding human social relations and rules. Unlike previous work which typically considered faces or frontal body views in image data, we address the problem of recognizing gender in RGB-D data from side and back views as well. We present a large, gender-balanced, annotated, multi-perspective RGB-D dataset with full-body views of over a hundred different persons captured with both the Kinect v1 and Kinect v2 sensor. We then learn and compare several classifiers on the Kinect v2 data using a HOG baseline, two state-of-the-art deep-learning methods, and a recent tessellation-based learning approach. Originally developed for person detection in 3D data, the latter is able to learn the best selection, location and scale of a set of simple point cloud features. We show that for gender recognition, it outperforms the other approaches for both standing and walking people while being very efficient to compute with classification rates up to 150 Hz.
Keywords :
human-robot interaction; image classification; image sensors; learning (artificial intelligence); object detection; object recognition; HOG baseline; Kinect v1 sensor; Kinect v2 sensor; RGB-D data; annotated RGB-D dataset; deep-learning methods; gender-balanced RGB-D dataset; human social relations; human social rules; image data; multiperspective RGB-D dataset; person detection; point cloud features; real-time full-body human gender recognition; tessellation-based learning approach; Accuracy; Legged locomotion; Robot sensing systems; Support vector machines; Three-dimensional displays; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location :
Seattle, WA
Type :
conf
DOI :
10.1109/ICRA.2015.7139616
Filename :
7139616
Link To Document :
بازگشت