Author_Institution :
Inf. Technol. Fac., Univ. Of Sci., Ho Chi Minh City, Vietnam
Abstract :
Recognizing human action in video has many applications in computer vision and robotics. It is a challenging task not only because of the variations produced by general factors like illumination, background clutter, occlusion or intra-class variation, but also because of subtle behavioral patterns among interacting people or between people and objects in images as well as attracts many attentions in activity in recently years. However, these researches are not yet fully realized due to the lack of an effective feature to present human action. In this paper, we present a novel for human action representation based on hybrid features from local and global features. Firstly, a local-based feature descriptor is combined by motion and SURF. Secondly, improved BOW with kmeans++ and soft-weighting scheme are used to yield the histogram of word occurrences (HoWO) to present for action in video. Thirdly, HOG/HOF features are extracted from video for global features. Next, hybrid features is created by concatenating HoWO and HOG/HOF. Lastly, Support Vector Machine is used for classification on KTH, Weizmann and YouTube datasets. The experimental results also indicated that the extraction of features is effective and shows the feasibility of our proposal. In addition, compared with other approaches our approach is more robust, more flexible, easier to implement and simpler to comprehend.
Keywords :
feature extraction; image classification; image recognition; image representation; learning (artificial intelligence); support vector machines; video signal processing; HoWO; KTH dataset; SURF feature; Weizmann dataset; YouTube dataset; background clutter factor; bag-of-words feature; computer vision; feature extraction; histogram-of-word occurrences; human action representation; illumination factor; improved BOW feature; intra-class variation factor; kmeans++ scheme; local-based feature descriptor; motion feature; occlusion factor; robotics; robust human action recognition; soft-weighting scheme; speeded-up robust feature; support vector machine; Accuracy; Noise; Optical filters; Optical imaging; Robustness; Support vector machines; Vectors; BOW; HOF/HOG; Human Action Recognition; Hybrid features; Kmeans++; SURF; SVM;