DocumentCode
1071403
Title
Multimodal Fusion for Video Search Reranking
Author
Wei, Shikui ; Zhao, Yao ; Zhu, Zhenfeng ; Liu, Nan
Author_Institution
Inst. of Inf. Sci., Beijing Jiaotong Univ., Beijing, China
Volume
22
Issue
8
fYear
2010
Firstpage
1191
Lastpage
1199
Abstract
Analysis on click-through data from a very large search engine log shows that users are usually interested in the top-ranked portion of returned search results. Therefore, it is crucial for search engines to achieve high accuracy on the top-ranked documents. While many methods exist for boosting video search performance, they either pay less attention to the above factor or encounter difficulties in practical applications. In this paper, we present a flexible and effective reranking method, called CR-Reranking, to improve the retrieval effectiveness. To offer high accuracy on the top-ranked results, CR-Reranking employs a cross-reference (CR) strategy to fuse multimodal cues. Specifically, multimodal features are first utilized separately to rerank the initial returned results at the cluster level, and then all the ranked clusters from different modalities are cooperatively used to infer the shots with high relevance. Experimental results show that the search quality, especially on the top-ranked results, is improved significantly.
Keywords
content-based retrieval; search engines; sensor fusion; video retrieval; CR-Reranking method; cross-reference strategy; multimodal fusion; search engine log; top-ranked documents; video retrieval; video search reranking; Clustering; image/video retrieval; multimedia databases.;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2009.145
Filename
5072222
Link To Document