Regional Information Center for Science and Technology - Video annotation using hierarchical Dirichlet process mixture model

Title of article :

Video annotation using hierarchical Dirichlet process mixture model

Author/Authors :

Wu, Roung-Shiunn and Li, Po-Chun

Issue Information :

Journal with serial number, year 2011

Pages :

From page :

3040

To page :

3048

Abstract :

Video annotation has become an important topic in support of multimedia information retrieval. Video content analysis based on low-level features alone cannot close the gap between those features and high-level semantic concepts. In this study, we propose an approach that combines visual features extracted from the visual track of a video with keywords extracted from speech transcripts of the audio track. We construct a predictive model using a hierarchical Dirichlet process mixture model. In the hierarchical model, one more layer is added to exploit the sharing of visual feature distributions among frames, and the shared information is used to enhance model learning. At the top level, the visual features in the groups are shared appropriately by imposing a prior correlation. At the bottom level, each visual feature and its associated annotation are modeled with mixture distributions. The learned predictive model allows us to compute a conditional likelihood over words, which is used to predict the most likely annotation words for a test sample. The model achieves higher accuracy in video annotation than the model without the hierarchy.
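
The prediction step described in the abstract (a conditional likelihood over words given a visual feature) can be illustrated with a much simpler stand-in. The sketch below assumes a finite mixture with fixed, hand-picked parameters, spherical Gaussian components for visual features, and one word distribution per component. It does not implement hierarchical Dirichlet process inference, and all names and numbers (`word_posterior`, `annotate`, the toy parameters) are hypothetical rather than taken from the article.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a spherical Gaussian (single shared variance) at point x."""
    d = len(x)
    sq_dist = sum((xi - mi) ** 2 for xi, mi in zip(x, mean))
    return math.exp(-0.5 * sq_dist / var) / ((2 * math.pi * var) ** (d / 2))


def word_posterior(x, weights, means, variances, word_probs):
    """Compute p(w | x) = sum_k p(k | x) * p(w | k) for a finite mixture."""
    # Unnormalized responsibilities: weight_k * N(x; mean_k, var_k).
    joint = [w * gaussian_pdf(x, m, v)
             for w, m, v in zip(weights, means, variances)]
    total = sum(joint)
    resp = [j / total for j in joint]  # p(k | x)
    vocab = word_probs[0].keys()
    return {w: sum(r * wp[w] for r, wp in zip(resp, word_probs))
            for w in vocab}


def annotate(x, weights, means, variances, word_probs, top_n=1):
    """Return the top_n most likely annotation words for feature vector x."""
    post = word_posterior(x, weights, means, variances, word_probs)
    return sorted(post, key=post.get, reverse=True)[:top_n]


# Hypothetical toy parameters: two components, 2-D visual features.
WEIGHTS = [0.5, 0.5]
MEANS = [[0.0, 0.0], [5.0, 5.0]]
VARIANCES = [1.0, 1.0]
WORD_PROBS = [
    {"news": 0.7, "sports": 0.2, "weather": 0.1},
    {"news": 0.1, "sports": 0.8, "weather": 0.1},
]

if __name__ == "__main__":
    print(annotate([4.8, 5.1], WEIGHTS, MEANS, VARIANCES, WORD_PROBS))
```

In a model of the kind the abstract describes, the mixture weights and component parameters would be learned from annotated frames and shared across groups instead of being fixed by hand as they are here.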

Keywords :

Visual feature, Dirichlet process, Speech transcripts, Video annotation, Hierarchical Dirichlet process mixture model

Journal title :

Expert Systems with Applications

Serial Year :

2011

Record number :

2348954

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2348954