DocumentCode :
1498110
Title :
Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding
Author :
Li, Liang ; Jiang, Shuqiang ; Huang, Qingming
Author_Institution :
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
Volume :
14
Issue :
5
fYear :
2012
Firstpage :
1401
Lastpage :
1413
Abstract :
This paper proposes a new perspective-Vicept representation to solve the problem of visual polysemia and concept polymorphism in the large-scale semantic image understanding. Vicept characterizes the membership probability distribution between visual appearances and semantic concepts, and forms a hierarchical representation of image semantic from local to global. In the implementation, incorporating group sparse coding, visual appearance is encoded as a weighted sum of dictionary elements, which could obtain more accurate image representation with sparsity at the image level. To obtain discriminative Vicept descriptions with structural sparsity, mixed-norm regularization is adopted in the optimization problem for learning the concept membership distribution of visual appearance. Furthermore, we introduce a novel image distance measurement based on the hierarchical Vicept description, where different levels of Vicept distance are fused together by multi-level separability analysis. Finally, the wide applications of Vicept description are validated in our experiments, including large-scale semantic image search, image annotation, and semantic image re-ranking.
Keywords :
image representation; learning (artificial intelligence); optimisation; Vicept distance; Vicept representation; concept membership distribution; dictionary elements; discriminative Vicept descriptions; group sparse coding; hierarchical semantic description learning; image annotation; image level; image representation; image reranking; image search; large-scale semantic image understanding; membership probability distribution; mixed-norm regularization; multilevel separability analysis; optimization problem; structural sparsity; visual appearances; visual polysemia; weighted sum; Dictionaries; Electronic mail; Image representation; Information processing; Laboratories; Semantics; Visualization; Image representation; large-scale systems; pattern analysis; semantic web; statistical learning;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2012.2194993
Filename :
6185680
Link To Document :
بازگشت