SAGTA: Semi-automatic Ground Truth Annotation in crowd scenes

Author

Shuang Wu ; Shibao Zheng ; Hua Yang ; Yawen Fan ; Longfei Liang ; Hang Su

Author_Institution

Shanghai Key Lab. of Digital Media Process. & Transmissions, Jiao Tong Univ., Shanghai, China

fYear

2014

fDate

14-18 July 2014

Firstpage

1

Lastpage

6

Abstract

Ground truth is crucial in the performance evaluation of algorithms. Nevertheless, it is a tedious and time-consuming task to annotate ground truth manually, especially in crowd scenes. In this paper, we propose a novel semi-automatic tool called SAGTA (Semi-automatic Ground Truth Annotation Tool), which can assist researchers to annotate pedestrians easily and quickly in crowd scenes. Firstly, users label pedestrians manually in a few key frames by drawing bounding boxes through the friendly GUI of SAGTA. Then, the annotations in the rest frames are coarsely estimated by automatically interpolating based on 3D linear motion assumption. Moreover, our tool refines the estimated annotations through using ORB feature matching. This coarse-to-fine method facilitates the annotation process efficiently. Afterwards, the refined annotations are manually verified and corrected to guarantee the accuracy of annotations. In addition, some extra information (such as density, trajectory and occlusion relationships) can be inferred automatically and visualized vividly. The proposed tool has been tested on PETS and real surveillance data sets. Experimental results demonstrate that SAGTA achieves superior performance in time cost than ViPER-GT, which is the widely used annotation tool.

Keywords

computer vision; feature extraction; graphical user interfaces; image matching; image motion analysis; interpolation; pedestrians; video surveillance; 3D linear motion assumption; GUI; ORB feature matching; SAGTA; automatic interpolation; crowd scenes; drawing bounding boxes; graphical user interface; pedestrian annotation; semiautomatic ground truth annotation; vision based surveillance technologies; Algorithm design and analysis; Graphical user interfaces; Head; Interpolation; Surveillance; Three-dimensional displays; Trajectory; 3D interpolation; ORB feature matching; coarse-to-fine; ground truth; semi-automatic;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo Workshops (ICMEW), 2014 IEEE International Conference on

Conference_Location

Chengdu

ISSN

1945-7871

Type

conf

DOI

10.1109/ICMEW.2014.6890539

Filename

6890539