Title :
High-level spatial modeling in convolutional neural network with application to pedestrian detection
Author :
Feng Liu ; Yongzhen Huang ; Wankou Yang ; Changyin Sun
Author_Institution :
Sch. of Autom., Southeast Univ., Nanjing, China
Abstract :
Convolutional neural network (CNN) has achieved great success in many vision tasks. A key to this success is its ability to powerful automatically learns both high-level and low-level features. In general, low-level features have a small size of receptive fields and appear multiple times in different locations of objects, while high-level semantic features have a relatively large size of receptive fields and only appear once in a specific location of objects. However, traditional CNN treats these two kinds of features in the same manner, i.e, learning them by the convolution operation, which can be approximately considered as cumulating the probabilities that a feature appears in different locations. This strategy is reasonable for low-level features but not for high-level semantic ones, especially in the case of pedestrian detection, where a local feature can be shared by different locations but a semantic part, e.g, a head, only appears once for a human. To jointly model the spatial structure and appearance of high-level semantic features, we propose a new module to learn spatially weighted max pooling in CNN. The proposed method is evaluated on several pedestrian detection databases and the experimental results show that it achieves much better performance than traditional CNN.
Keywords :
neural nets; object detection; pedestrians; CNN; convolutional neural network; high-level semantic features; high-level spatial modeling; pedestrian detection; spatial structure; spatially weighted max pooling; Computational modeling; Convolution; Deformable models; Feature extraction; Kernel; Semantics; Training;
Conference_Titel :
Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on
Conference_Location :
Halifax, NS
Print_ISBN :
978-1-4799-5827-6
DOI :
10.1109/CCECE.2015.7129373