• DocumentCode
    1824760
  • Title

    Simultaneous categorical and spatio-temporal 3D gestures using Kinect

  • Author

    Bigdelou, A. ; Benz, T. ; Schwarz, L. ; Navab, N.

  • Author_Institution
    Comput. Aided Med. Procedures (CAMP), Tech. Univ. Munchen, Munich, Germany
  • fYear
    2012
  • fDate
    4-5 March 2012
  • Firstpage
    53
  • Lastpage
    60
  • Abstract
    Recent technological advances have led to an increasing popularity of 3D gesture-based interfaces, in particular in gaming and entertainment consoles. However, unlike 2D gestures, which have been successfully utilized in many multi-touch devices, developing a 3D gesture-based interface is not an easy endeavor. Reasons include the complexity of capturing human movements in 3D and the difficulties associated with recognizing gestures from human motion data. In this work, we target the latter problem by proposing a novel gesture recognition technique for skeletal input data that simultaneously allows for categorical and spatio-temporal gestures. In other words, it recognizes the gesture type and the relative pose within a gesture at the same time. Moreover, our method can learn gestures that are most appropriate for the user from examples. In order to avoid the need for user-specific training, we further propose and evaluate several types of feature representations for human pose data. We argue how our approach can facilitate the development of a customizable 3D gesture-based interface and explore possibilities in order to smoothly integrate the proposed recognition approach into available component-based user interface frameworks. Besides a quantitative evaluation, we present a user study in the scenario of a 3D gesture-based interface for an intra-operative medical image viewer. Our studies support the applicability of our method for developing 3D gesture-based interfaces in practice.
  • Keywords
    image recognition; image representation; interactive devices; learning (artificial intelligence); medical image processing; user interfaces; 3D gesture-based interface; Kinect; categorical 3D gesture; component-based user interface; entertainment console; gaming console; gesture recognition technique; human motion data; intra-operative medical image viewer; multitouch device; spatio-temporal 3D gesture; user-specific training; Feature extraction; Gesture recognition; Joints; Three dimensional displays; Training; Training data; Vectors; H.5.2 [Information Systems]: Information Interfaces and Presentation — User Interfaces;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    3D User Interfaces (3DUI), 2012 IEEE Symposium on
  • Conference_Location
    Costa Mesa, CA
  • Print_ISBN
    978-1-4673-1204-2
  • Type

    conf

  • DOI
    10.1109/3DUI.2012.6184184
  • Filename
    6184184