مرکز منطقه ای اطلاع رساني علوم و فناوري - Topical Video Object Discovery from Key Frames by Modeling Word Co-occurrence Prior

DocumentCode :

3331569

Title :

Topical Video Object Discovery from Key Frames by Modeling Word Co-occurrence Prior

Author :

Gangqiang Zhao ; Junsong Yuan ; Gang Hua

Author_Institution :

Sch. of EEE, Nanyang Technol. Univ., Singapore, Singapore

fYear :

2013

fDate :

23-28 June 2013

Firstpage :

1602

Lastpage :

1609

Abstract :

A topical video object refers to an object that is frequently highlighted in a video. It could be, e.g., the product logo and the leading actor/actress in a TV commercial. We propose a topic model that incorporates a word co-occurrence prior for efficient discovery of topical video objects from a set of key frames. Previous work using topic models, such as Latent Dirichelet Allocation (LDA), for video object discovery often takes a bag-of-visual-words representation, which ignored important co-occurrence information among the local features. We show that such data driven co-occurrence information from bottom-up can conveniently be incorporated in LDA with a Gaussian Markov prior, which combines top down probabilistic topic modeling with bottom up priors in a unified model. Our experiments on challenging videos demonstrate that the proposed approach can discover different types of topical objects despite variations in scale, view-point, color and lighting changes, or even partial occlusions. The efficacy of the co-occurrence prior is clearly demonstrated when comparing with topic models without such priors.

Keywords :

Gaussian processes; Markov processes; document image processing; feature extraction; image colour analysis; probability; video signal processing; Gaussian Markov prior; LDA; TV commercial; bag-of-visual-words representation; color changes; data driven cooccurrence information; key frames; latent Dirichelet allocation; lighting changes; local features; partial occlusions; product logo; top down probabilistic topic modeling; topical video object discovery; view-point; word cooccurrence prior modeling; Computational modeling; Data mining; Feature extraction; Resource management; Vectors; Video sequences; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location :

Portland, OR

ISSN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2013.210

Filename :

6619054

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3331569