DocumentCode
81984
Title
Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation
Author
Luming Zhang ; Yue Gao ; Zimmermann, Raphael ; Qi Tian ; Xuelong Li
Author_Institution
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
Volume
23
Issue
3
fYear
2014
fDate
Mar-14
Firstpage
1419
Lastpage
1429
Abstract
Photo aesthetic quality evaluation is a fundamental yet under addressed task in computer vision and image processing fields. Conventional approaches are frustrated by the following two drawbacks. First, both the local and global spatial arrangements of image regions play an important role in photo aesthetics. However, existing rules, e.g., visual balance, heuristically define which spatial distribution among the salient regions of a photo is aesthetically pleasing. Second, it is difficult to adjust visual cues from multiple channels automatically in photo aesthetics assessment. To solve these problems, we propose a new photo aesthetics evaluation framework, focusing on learning the image descriptors that characterize local and global structural aesthetics from multiple visual channels. In particular, to describe the spatial structure of the image local regions, we construct graphlets small-sized connected graphs by connecting spatially adjacent atomic regions. Since spatially adjacent graphlets distribute closely in their feature space, we project them onto a manifold and subsequently propose an embedding algorithm. The embedding algorithm encodes the photo global spatial layout into graphlets. Simultaneously, the importance of graphlets from multiple visual channels are dynamically adjusted. Finally, these post-embedding graphlets are integrated for photo aesthetics evaluation using a probabilistic model. Experimental results show that: 1) the visualized graphlets explicitly capture the aesthetically arranged atomic regions; 2) the proposed approach generalizes and improves four prominent aesthetic rules; and 3) our approach significantly outperforms state-of-the-art algorithms in photo aesthetics prediction.
Keywords
computer vision; graph theory; image coding; image fusion; probability; adjacent atomic regions; computer vision; embedding algorithm; global spatial arrangement; global structural aesthetics; graphlets small-sized connected graphs; image descriptors; image processing fields; local spatial arrangement; local structural aesthetics; multichannel global structural cue fusion; multichannel local cue fusion; photo aesthetic quality evaluation; photo aesthetics assessment; photo aesthetics evaluation framework; photo aesthetics prediction; photo global spatial layout encoding; post-embedding graphlets; probabilistic model; spatial structure; spatially adjacent graphlets; visual channels; visual cues; visualized graphlets; Graphical models; Image color analysis; Layout; Probabilistic logic; Training; Vectors; Visualization; Multi-channel; aesthetic evaluation; probabilistic model; structural cues;
fLanguage
English
Journal_Title
Image Processing, IEEE Transactions on
Publisher
ieee
ISSN
1057-7149
Type
jour
DOI
10.1109/TIP.2014.2303650
Filename
6728663
Link To Document