Regional Information Center for Science and Technology - A multi-modal system for the retrieval of semantic video events

Title of article :

A multi-modal system for the retrieval of semantic video events

Author/Authors :

Amir, Arnon; Basu, Sankar; Iyengar, Giridharan; Lin, Ching-Yung; Naphade, Milind; Smith, John R.; Srinivasan, Savitha; Tseng, Belle

Issue Information :

Journal with consecutive issue numbering, year 2004

Pages :

From page :

216

To page :

236

Abstract :

A framework for event detection is proposed where events, objects, and other semantic concepts are detected from video using trained classifiers. These classifiers are used to automatically annotate video with semantic labels, which in turn are used to search for new, untrained types of events and semantic concepts. The novelty of the approach lies in (1) the semi-automatic construction of models of events from feature descriptors and (2) the integration of content-based and concept-based querying in the search process. Speech retrieval is applied independently, and combined results are produced. Results of applying these methods to the Search benchmark of the NIST TREC Video track 2001 are reported, and the experience gained and future work are discussed.

Keywords :

Multimedia indexing, Semantic video annotation, Event detection, Content-based video retrieval

Journal title :

Computer Vision and Image Understanding

Serial Year :

2004

Record number :

1694406

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1694406