مرکز منطقه ای اطلاع رساني علوم و فناوري - Video-to-Shot Tag Propagation by Graph Sparse Group Lasso

DocumentCode :

1757608

Title :

Video-to-Shot Tag Propagation by Graph Sparse Group Lasso

Author :

Xiaofeng Zhu ; Zi Huang ; Jiangtao Cui ; Heng Tao Shen

Author_Institution :

Sch. of Inf. Technol. & Electr. Eng., Univ. of Queensland, Brisbane, QLD, Australia

Volume :

Issue :

fYear :

2013

fDate :

41365

Firstpage :

633

Lastpage :

646

Abstract :

Traditional approaches to video tagging are designed to propagate tags at the same level, such as assigning the tags of training videos (or shots) to the test videos (or shots), such as generating tags for the test video when the training videos are associated with the tags at the video-level or assigning tags to the test shot when given a collection of annotated shots. This paper focuses on automatical shot tagging given a collection of videos with the tags at the video-level. In other words, we aim to assign specific tags from the training videos to the test shot. The paper solves the V2S issue by assigning the test shot with the tags deriving from parts of the tags in a part of training videos. To achieve the goal, the paper first proposes a novel Graph Sparse Group Lasso (shorted for GSGL) model to linearly reconstruct the visual feature of the test shot with the visual features of the training videos, i.e., finding the correlation between the test shot and the training videos. The paper then proposes a new tagging propagation rule to assign the video-level tags to the test shot by the learnt correlations. Moreover, to effectively build the reconstruction model, the proposed GSGL simultaneously takes several constraints into account, such as the inter-group sparsity, the intra-group sparsity, the temporal-spatial prior knowledge in the training videos and the local structure of the test shot. Extensive experiments on public video datasets are conducted, which clearly demonstrate the effectiveness of the proposed method for dealing with the video-to-shot tag propagation.

Keywords :

feature extraction; graph theory; image reconstruction; video databases; video retrieval; GSGL; V2S tag propagation; annotated shot collection; automatic shot tagging; graph sparse group lasso; inter-group sparsity; intra-group sparsity; linear visual feature reconstruction; local test shot structure; public video datasets; tagging propagation rule; test shot assignment; training videos; video collection; video tagging; video-level tag assignment; video-to-shot tag propagation; visual features; Correlation; Educational institutions; Hidden Markov models; Manifolds; Tagging; Training; Visualization; Manifold learning; sparse coding; sparse group lasso; structure sparsity; video annotation; video tagging;

fLanguage :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

ieee

ISSN :

1520-9210

Type :

jour

DOI :

10.1109/TMM.2012.2233723

Filename :

6380623

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1757608