Authors:
Teichman, Alex; Levinson, Jesse; Thrun, Sebastian
Abstract:
Object recognition is a critical next step for autonomous robots, but a solution to the problem has remained elusive. Prior 3D-sensor-based work largely classifies individual point cloud segments or uses class-specific trackers. In this paper, we take the approach of classifying the tracks of all visible objects. Our new track classification method, based on a mathematically principled method of combining log odds estimators, is fast enough for real-time use, is non-specific to object class, and performs well (98.5% accuracy) on the task of classifying correctly tracked, well-segmented objects into car, pedestrian, bicyclist, and background classes. We evaluate the classifier's performance using the Stanford Track Collection, a new dataset of about 1.3 million labeled point clouds in about 14,000 tracks recorded from an autonomous vehicle research platform. This dataset, which we make publicly available, contains tracks extracted from about one hour of 360-degree, 10 Hz depth information recorded both while driving on busy campus streets and parked at busy intersections.
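The abstract describes combining per-frame log odds estimates into a single track-level classification. The following is a minimal sketch of that general idea, assuming per-frame log odds are simply summed and the highest-scoring class is chosen above a fixed threshold; the function names (track_log_odds, classify_track), the plain summation rule, and the threshold are illustrative assumptions, not the paper's exact formulation.

    import numpy as np

    def track_log_odds(frame_log_odds):
        # Combine per-frame log odds estimates for one class into a
        # single track-level log odds value by summation.
        return float(np.sum(frame_log_odds))

    def classify_track(frame_log_odds_per_class, threshold=0.0):
        # frame_log_odds_per_class: dict mapping class name -> list of
        # per-frame log odds for that class over the whole track.
        # Returns the class with the highest combined log odds, or
        # "background" if no class exceeds the threshold.
        combined = {c: track_log_odds(v)
                    for c, v in frame_log_odds_per_class.items()}
        best_class, best_score = max(combined.items(), key=lambda kv: kv[1])
        return best_class if best_score > threshold else "background"

    # Example: a short track scored against three foreground classes.
    frames = {
        "car":        [1.2, 0.8, 1.5, 0.9],
        "pedestrian": [-2.0, -1.7, -2.2, -1.9],
        "bicyclist":  [-0.5, -0.3, -0.6, -0.4],
    }
    print(classify_track(frames))  # -> "car"

Accumulating evidence across all frames of a track, rather than deciding per segment, is what lets the method exploit temporal consistency while remaining non-specific to object class.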
Keywords:
estimation theory; image classification; image segmentation; mobile robots; object recognition; 3D object recognition; 3D-sensor-based work; autonomous robot; class-specific tracker; log odds estimator; object track classification; point cloud segment; Boosting; Image segmentation; Object recognition; Sensors; Three dimensional displays; Training; Vehicle dynamics