Title :
Activity Recognition from RGB-D Camera with 3D Local Spatio-temporal Features
Author :
Ming, Yue ; Ruan, Qiuqi ; Hauptmann, Alexander G.
Author_Institution :
Inst. of Inf. Sci., Beijing Jiaotong Univ., Beijing, China
Abstract :
Kinect, as a 3D digital capturing device, can collect the RGB and depth information of human activities rapidly. We study fusing the depth and RGB information for activity recognition. We introduce histogram color-based image thresholding to detect skin on human body, and use a GMM model to segment human hand areas. We design a new local descriptor, called a 3D Motion Scale-Invariant Feature Transform (3D MoSIFT), which can effectively detect interesting points based on both RGB and depth information, and consequently encode the visual and motion information from both to describe the interesting points. Experiments, based on a video dataset collected by a Kinect camera, show that adding depth information in the descriptor can distinctly improve the accuracy of human activity recognition. We introduce the F1-score measurement to evaluate and compare our performance with the other algorithms.
Keywords :
image colour analysis; image motion analysis; image recognition; 3D MoSIFT; 3D digital capturing device; 3D local spatio-temporal features; 3D motion scale-invariant feature transform; F1-score measurement; GMM model; Kinect camera; RGB information; RGB-D camera; depth information; histogram color-based image thresholding; human activities; human activity recognition; human body; human hand areas; local descriptor; motion information; visual information; Cameras; Feature extraction; Histograms; Humans; Image color analysis; Skin; Visualization; 3D MoSIFT; Activity recognition; F1-score; hand detection; histogram color-based image thresholding;
Conference_Titel :
Multimedia and Expo (ICME), 2012 IEEE International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4673-1659-0
DOI :
10.1109/ICME.2012.8