Title :
Mining salient images from a large-scale blogosphere
Author :
Xian Chen ; Meilian Chen ; Hyoseop Shin ; Eun Yi Kim
Author_Institution :
Internet Multi-Media Dept., Konkuk Univ., Seoul, South Korea
Abstract :
User-generated images are now prevalent across social media platforms, such as Facebook, Twitter, and various blogospheres. These images can be categorized and ranked based on their relevant topics. In this paper, we present and compare candidate schemes for mining salient images related to a specific topic or object among a large number of images from a blogosphere. Identifying salient images consists of several steps: calculating the similarity between images, k-means clustering images, and ranking images. In each step, we propose a set of alternatives and as a result, present an optimal combination scheme by conducting an empirical comparison of the performance of each scheme. Furthermore, to address scalability, we also present a distributed version of the schemes and experimental results based on MapReduce on top of a Hadoop environment.
Keywords :
Internet; data mining; image processing; pattern clustering; social networking (online); Facebook; Hadoop environment; MapReduce; Twitter; k-means clustering images; large-scale blogosphere; mining salient images; optimal combination; ranking images; social media platforms; user generated images; Blogs; Feature extraction; Image color analysis; Color histogram; Hadoop and MapReduce; Image clustering; Image ranking; SIFT; Salient images;
Conference_Titel :
Internet Technology and Secured Transactions (ICITST), 2013 8th International Conference for
Conference_Location :
London
DOI :
10.1109/ICITST.2013.6750177