DocumentCode :
1114862
Title :
Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video: Data, Metrics, and Protocol
Author :
Kasturi, Rangachar ; Goldgof, Dmitry ; Soundararajan, Padmanabhan ; Manohar, Vasant ; Garofolo, John ; Bowers, Rachel ; Boonstra, Matthew ; Korzhova, Valentina ; Zhang, Jing
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL
Volume :
31
Issue :
2
fYear :
2009
Firstpage :
319
Lastpage :
336
Abstract :
Common benchmark data sets, standardized performance metrics, and baseline algorithms have demonstrated considerable impact on research and development in a variety of application domains. These resources provide both consumers and developers of technology with a common framework to objectively compare the performance of different algorithms and algorithmic improvements. In this paper, we present such a framework for evaluating object detection and tracking in video: specifically for face, text, and vehicle objects. This framework includes the source video data, ground-truth annotations (along with guidelines for annotation), performance metrics, evaluation protocols, and tools including scoring software and baseline algorithms. For each detection and tracking task and supported domain, we developed a 50-clip training set and a 50-clip test set. Each data clip is approximately 2.5 minutes long and has been completely spatially/temporally annotated at the I-frame level. Each task/domain, therefore, has an associated annotated corpus of approximately 450,000 frames. The scope of such annotation is unprecedented and was designed to begin to support the necessary quantities of data for robust machine learning approaches, as well as a statistically significant comparison of the performance of algorithms. The goal of this work was to systematically address the challenges of object detection and tracking through a common evaluation framework that permits a meaningful objective comparison of techniques, provides the research community with sufficient data for the exploration of automatic modeling techniques, encourages the incorporation of objective evaluation into the development process, and contributes useful lasting resources of a scale and magnitude that will prove to be extremely useful to the computer vision research community for years to come.
Keywords :
face recognition; learning (artificial intelligence); object detection; optical character recognition; protocols; vehicles; video signal processing; baseline algorithm; face detection; machine learning; object detection; object tracking; performance evaluation protocol; performance metrics; scoring software; text detection; vehicle detection; video data; Face detection; Guidelines; Measurement; Object detection; Protocols; Research and development; Software performance; Software tools; Vehicle detection; Vehicles; Miscellaneous; Performance evaluation; Tracking; Video analysis; baseline algorithms; face; object detection and tracking; performance evaluation; text; vehicle.; Algorithms; Artificial Intelligence; Automatic Data Processing; Face; Humans; Image Enhancement; Image Interpretation, Computer-Assisted; Motor Vehicles; Pattern Recognition, Automated; Sensitivity and Specificity; Subtraction Technique; Video Recording;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.57
Filename :
4479472
Link To Document :
بازگشت