Title :
SAGTA: Semi-automatic Ground Truth Annotation in crowd scenes
Author :
Shuang Wu ; Shibao Zheng ; Hua Yang ; Yawen Fan ; Longfei Liang ; Hang Su
Author_Institution :
Shanghai Key Lab. of Digital Media Process. & Transmissions, Jiao Tong Univ., Shanghai, China
Abstract :
Ground truth is crucial in the performance evaluation of algorithms. Nevertheless, it is a tedious and time-consuming task to annotate ground truth manually, especially in crowd scenes. In this paper, we propose a novel semi-automatic tool called SAGTA (Semi-automatic Ground Truth Annotation Tool), which can assist researchers to annotate pedestrians easily and quickly in crowd scenes. Firstly, users label pedestrians manually in a few key frames by drawing bounding boxes through the friendly GUI of SAGTA. Then, the annotations in the rest frames are coarsely estimated by automatically interpolating based on 3D linear motion assumption. Moreover, our tool refines the estimated annotations through using ORB feature matching. This coarse-to-fine method facilitates the annotation process efficiently. Afterwards, the refined annotations are manually verified and corrected to guarantee the accuracy of annotations. In addition, some extra information (such as density, trajectory and occlusion relationships) can be inferred automatically and visualized vividly. The proposed tool has been tested on PETS and real surveillance data sets. Experimental results demonstrate that SAGTA achieves superior performance in time cost than ViPER-GT, which is the widely used annotation tool.
Keywords :
computer vision; feature extraction; graphical user interfaces; image matching; image motion analysis; interpolation; pedestrians; video surveillance; 3D linear motion assumption; GUI; ORB feature matching; SAGTA; automatic interpolation; crowd scenes; drawing bounding boxes; graphical user interface; pedestrian annotation; semiautomatic ground truth annotation; vision based surveillance technologies; Algorithm design and analysis; Graphical user interfaces; Head; Interpolation; Surveillance; Three-dimensional displays; Trajectory; 3D interpolation; ORB feature matching; coarse-to-fine; ground truth; semi-automatic;
Conference_Titel :
Multimedia and Expo Workshops (ICMEW), 2014 IEEE International Conference on
Conference_Location :
Chengdu
DOI :
10.1109/ICMEW.2014.6890539