DocumentCode
1167930
Title
A generic framework of user attention model and its application in video summarization
Author
Ma, Yu-Fei ; Hua, Xian-Sheng ; Lu, Lie ; Zhan, Hong-Jiang
Author_Institution
Microsoft Res. Asia, Beijing, China
Volume
7
Issue
5
fYear
2005
Firstpage
907
Lastpage
919
Abstract
Due to the information redundancy of video, automatically extracting essential video content is one of key techniques for accessing and managing large video library. In this paper, we present a generic framework of a user attention model, which estimates the attentions viewers may pay to video contents. As human attention is an effective and efficient mechanism for information prioritizing and filtering, user attention model provides an effective approach to video indexing based on importance ranking. In particular, we define viewer attention through multiple sensory perceptions, i.e. visual and aural stimulus as well as partly semantic understanding. Also, a set of modeling methods for visual and aural attentions are proposed. As one of important applications of user attention model, a feasible solution of video summarization, without fully semantic understanding of video content as well as complex heuristic rules, is implemented to demonstrate the effectiveness, robustness, and generality of the user attention model. The promising results from the user study on video summarization indicate that the user attention model is an alternative way to video understanding.
Keywords
indexing; multimedia computing; psychology; speech processing; user modelling; video databases; video signal processing; aural stimulus; importance ranking; information redundancy; multiple sensory perception; user attention model; video content analysis; video indexing; video summarization; visual stimulus; Asia; Content management; Data mining; Humans; Indexing; Information filtering; Information filters; Libraries; Robustness; Technology management; Attention modeling; video content analysis; video summarization;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2005.854410
Filename
1510638
Link To Document