Title :
Video saliency prediction through machine learning with semantic information
Author :
Xiaohui Fu ; Li Su ; Lei Qin
Author_Institution :
Univ. of Chinese Acad. of Sci., Beijing, China
Abstract :
Saliency prediction is valuable in many video applications, such as intelligent retrieval, advertisement design and delivering, video coding and video summarization generating. Although image saliency is well explored, less works have been done on videos. Compared to images, the semantic orientation is more obvious for video saliency. In this paper, we propose a method to predict video saliency by introducing semantic information. Different from existing approaches, we simultaneously consider the bottom-up and top-down factors in a machine learning framework and utilize a semantic object learning model to compute the semantic related saliency map. The proposed method is tested on two datasets. The experiment results show that the proposed method keeps higher consistent with human´s gaze tracks data on various video contents. Furthermore, the computation efficiency is also improved as we don´t need to process every pixel of each frame during prediction features extraction.
Keywords :
learning (artificial intelligence); semantic networks; video coding; advertisement design; bottom-up factor; feature extraction; image saliency; intelligent retrieval; machine learning; saliency map; semantic information; semantic object learning model; semantic orientation; top-down factor; video coding; video saliency prediction; video summarization; Decision support systems; Feature extraction; Indexes; Semantics; Training; Training data; Videos; bottom-up; machine learning; semantic orientation information; top-down; video saliency;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location :
Chengdu
DOI :
10.1109/ChinaSIP.2015.7230461