• DocumentCode
    1799360
  • Title
    SAGTA: Semi-automatic Ground Truth Annotation in crowd scenes
  • Author
    Shuang Wu ; Shibao Zheng ; Hua Yang ; Yawen Fan ; Longfei Liang ; Hang Su
  • Author_Institution
    Shanghai Key Lab. of Digital Media Process. & Transmissions, Jiao Tong Univ., Shanghai, China
  • fYear
    2014
  • fDate
    14-18 July 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Ground truth is crucial for evaluating the performance of algorithms. However, annotating ground truth manually is tedious and time-consuming, especially in crowd scenes. In this paper, we propose a novel semi-automatic tool called SAGTA (Semi-automatic Ground Truth Annotation Tool), which helps researchers annotate pedestrians easily and quickly in crowd scenes. First, users label pedestrians manually in a few key frames by drawing bounding boxes in the user-friendly GUI of SAGTA. Then, the annotations in the remaining frames are coarsely estimated by automatic interpolation based on a 3D linear motion assumption. Next, the tool refines the estimated annotations using ORB feature matching. This coarse-to-fine method makes the annotation process more efficient. Afterwards, the refined annotations are manually verified and corrected to ensure their accuracy. In addition, extra information (such as density, trajectories, and occlusion relationships) can be inferred automatically and visualized. The proposed tool has been tested on PETS and real surveillance data sets. Experimental results demonstrate that SAGTA requires less annotation time than ViPER-GT, a widely used annotation tool.
  • Keywords
    computer vision; feature extraction; graphical user interfaces; image matching; image motion analysis; interpolation; pedestrians; video surveillance; 3D linear motion assumption; GUI; ORB feature matching; SAGTA; automatic interpolation; crowd scenes; drawing bounding boxes; graphical user interface; pedestrian annotation; semiautomatic ground truth annotation; vision based surveillance technologies; Algorithm design and analysis; Graphical user interfaces; Head; Interpolation; Surveillance; Three-dimensional displays; Trajectory; 3D interpolation; ORB feature matching; coarse-to-fine; ground truth; semi-automatic
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo Workshops (ICMEW), 2014 IEEE International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    1945-7871
  • Type
    conf
  • DOI
    10.1109/ICMEW.2014.6890539
  • Filename
    6890539
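
The coarse estimation step described in the abstract (boxes labeled in a few key frames, then interpolated across the frames in between) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses plain 2D linear interpolation in image coordinates as a simplification of the 3D linear motion assumption, and the name interpolate_boxes and the (x, y, width, height) box layout are hypothetical choices made for this example. The ORB refinement and manual verification steps are not shown.

```python
from typing import Dict, Tuple

# Hypothetical box layout for this sketch: (x, y, width, height) in pixels.
Box = Tuple[float, float, float, float]


def interpolate_boxes(key_frames: Dict[int, Box]) -> Dict[int, Box]:
    """Linearly interpolate one pedestrian's box between labeled key frames.

    Assumption: motion is linear in the image plane between consecutive
    key frames. This is a 2D simplification, not the 3D model of the paper.
    """
    frames = sorted(key_frames)
    result: Dict[int, Box] = {}
    # Walk over each pair of consecutive key frames.
    for start, end in zip(frames, frames[1:]):
        a, b = key_frames[start], key_frames[end]
        span = end - start
        for f in range(start, end):
            t = (f - start) / span  # 0.0 at start, approaching 1.0 at end
            result[f] = tuple(p + t * (q - p) for p, q in zip(a, b))
    # The last key frame is copied unchanged.
    if frames:
        result[frames[-1]] = key_frames[frames[-1]]
    return result
```

For example, with key frames 0 and 4 labeled as (0, 0, 10, 20) and (8, 4, 14, 28), the estimated box at frame 2 is the midpoint of the two. In a tool of this kind, such estimates would only be a starting point that is later refined and checked by the user.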