Title :
Beyond the Sum of Parts: Voting with Groups of Dependent Entities
Author :
Yarlagadda, Pradeep ; Ommer, Bjorn
Author_Institution :
Dept. of Math. & Comput. Sci., Univ. of Heidelberg, Heidelberg, Germany
Abstract :
The high complexity of multi-scale, category-level object detection in cluttered scenes is efficiently handled by Hough voting methods. However, the main shortcoming of the approach is that mutually dependent local observations are independently casting their votes for intrinsically global object properties such as object scale. Object hypotheses are then assumed to be a mere sum of their part votes. Popular representation schemes are, however, based on a dense sampling of semi-local image features, which are consequently mutually dependent. We take advantage of part dependencies and incorporate them into probabilistic Hough voting by deriving an objective function that connects three intimately related problems: i) grouping mutually dependent parts, ii) solving the correspondence problem conjointly for dependent parts, and iii) finding concerted object hypotheses using extended groups rather than based on local observations alone. Early commitments are avoided by not restricting parts to only a single vote for a locally best correspondence and we learn a weighting of parts during training to reflect their differing relevance for an object. Experiments successfully demonstrate the benefit of incorporating part dependencies through grouping into Hough voting. The joint optimization of groupings, correspondences, and votes not only improves the detection accuracy over standard Hough voting and a sliding window baseline, but it also reduces the computational complexity by significantly decreasing the number of candidate hypotheses.
Keywords :
Hough transforms; category theory; computational complexity; image representation; image sampling; object detection; optimisation; probability; Hough voting method; category-level object detection; cluttered scene; computational complexity; dense sampling; dependent entity; detection accuracy; joint optimization; object hypotheses; object property; popular representation scheme; probabilistic Hough voting; semi-local image feature; sliding window baseline; Computational modeling; Feature extraction; Joints; Object detection; Training; Transforms; Vectors; Grouping; Hough Voting; Object detection; Recognition; Visual learning; grouping; hough voting; recognition; visual learning;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2014.2363456