• DocumentCode
    3362133
  • Title

    Robust human action recognition using improved BOW and hybrid features

  • Author

    Viet Vo ; Ngoc Ly

  • Author_Institution
    Inf. Technol. Fac., Univ. Of Sci., Ho Chi Minh City, Vietnam
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Abstract
    Recognizing human action in video has many applications in computer vision and robotics. It is a challenging task not only because of the variations produced by general factors like illumination, background clutter, occlusion or intra-class variation, but also because of subtle behavioral patterns among interacting people or between people and objects in images as well as attracts many attentions in activity in recently years. However, these researches are not yet fully realized due to the lack of an effective feature to present human action. In this paper, we present a novel for human action representation based on hybrid features from local and global features. Firstly, a local-based feature descriptor is combined by motion and SURF. Secondly, improved BOW with kmeans++ and soft-weighting scheme are used to yield the histogram of word occurrences (HoWO) to present for action in video. Thirdly, HOG/HOF features are extracted from video for global features. Next, hybrid features is created by concatenating HoWO and HOG/HOF. Lastly, Support Vector Machine is used for classification on KTH, Weizmann and YouTube datasets. The experimental results also indicated that the extraction of features is effective and shows the feasibility of our proposal. In addition, compared with other approaches our approach is more robust, more flexible, easier to implement and simpler to comprehend.
  • Keywords
    feature extraction; image classification; image recognition; image representation; learning (artificial intelligence); support vector machines; video signal processing; HoWO; KTH dataset; SURF feature; Weizmann dataset; YouTube dataset; background clutter factor; bag-of-words feature; computer vision; feature extraction; histogram-of-word occurrences; human action representation; illumination factor; improved BOW feature; intra-class variation factor; kmeans++ scheme; local-based feature descriptor; motion feature; occlusion factor; robotics; robust human action recognition; soft-weighting scheme; speeded-up robust feature; support vector machine; Accuracy; Noise; Optical filters; Optical imaging; Robustness; Support vector machines; Vectors; BOW; HOF/HOG; Human Action Recognition; Hybrid features; Kmeans++; SURF; SVM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Information Technology (ISSPIT), 2012 IEEE International Symposium on
  • Conference_Location
    Ho Chi Minh City
  • Print_ISBN
    978-1-4673-5604-6
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2012.6621291
  • Filename
    6621291