DocumentCode
1762199
Title
Scalable Mobile Video Retrieval with Sparse Projection Learning and Pseudo Label Mining
Author
Guan-Long Wu ; Yin-Hsi Kuo ; Tzu-Hsuan Chiu ; Hsu, W.H. ; Lexing Xie
Author_Institution
Nat. Taiwan Univ., Taipei, Taiwan
Volume
20
Issue
3
fYear
2013
fDate
July-Sept. 2013
Firstpage
47
Lastpage
57
Abstract
Retrieving relevant videos from a large corpus on mobile devices is a vital challenge. This article addresses two key issues for mobile search on user-generated videos. The first is the lack of good relevance measurement for learning semantically rich representations, due to the unconstrained nature of online videos. The second is the limited resources on mobile devices, stringent bandwidth, and delay requirement between the device and video server. The authors propose a knowledge-embedded sparse projection learning approach. To alleviate the need for expensive annotation in hash learning, they investigate varying approaches for pseudo label mining, where explicit semantic analysis leverages Wikipedia. In addition, they propose a novel sparse projection method to address the efficiency challenge by learning a discriminative compact representation that drastically reduces transmission costs. With less than 10 percent nonzero elements in the projection matrix, it also reduces computational and storage costs. The experimental results on 100,000 videos show that the proposed algorithm yields performance competitive with the prior state-of-the-art hashing methods, which are not applicable for mobiles and solely rely on costly manual annotations. The average query time for 100,000 videos was only 0.592 seconds.
Keywords
Web sites; cryptography; data mining; learning (artificial intelligence); matrix algebra; mobile computing; video retrieval; Wikipedia; computational cost reduction; delay requirement; hashing methods; knowledge-embedded sparse projection learning; mobile devices; mobile search; projection matrix; pseudo label mining; relevance measurement; scalable mobile video retrieval; semantic analysis; storage cost reduction; stringent bandwidth; transmission cost reduction; user-generated videos; Electronic publishing; Encyclopedias; Mobile communication; Mobile handsets; Semantics; Sparse matrices; Electronic publishing; Encyclopedias; Mobile communication; Mobile handsets; Semantics; Sparse matrices; content-based video search; explicit semantic analysis; hashing; mobile video retrieval; multimedia; multimedia applications; sparsity;
fLanguage
English
Journal_Title
MultiMedia, IEEE
Publisher
ieee
ISSN
1070-986X
Type
jour
DOI
10.1109/MMUL.2013.13
Filename
6482124
Link To Document