• DocumentCode
    3409241
  • Title

    Mining salient images from a large-scale blogosphere

  • Author

    Xian Chen ; Meilian Chen ; Hyoseop Shin ; Eun Yi Kim

  • Author_Institution
    Internet Multi-Media Dept., Konkuk Univ., Seoul, South Korea
  • fYear
    2013
  • fDate
    9-12 Dec. 2013
  • Firstpage
    132
  • Lastpage
    136
  • Abstract
    User-generated images are now prevalent across social media platforms, such as Facebook, Twitter, and various blogospheres. These images can be categorized and ranked based on their relevant topics. In this paper, we present and compare candidate schemes for mining salient images related to a specific topic or object among a large number of images from a blogosphere. Identifying salient images consists of several steps: calculating the similarity between images, k-means clustering images, and ranking images. In each step, we propose a set of alternatives and as a result, present an optimal combination scheme by conducting an empirical comparison of the performance of each scheme. Furthermore, to address scalability, we also present a distributed version of the schemes and experimental results based on MapReduce on top of a Hadoop environment.
  • Keywords
    Internet; data mining; image processing; pattern clustering; social networking (online); Facebook; Hadoop environment; MapReduce; Twitter; k-means clustering images; large-scale blogosphere; mining salient images; optimal combination; ranking images; social media platforms; user generated images; Blogs; Feature extraction; Image color analysis; Color histogram; Hadoop and MapReduce; Image clustering; Image ranking; SIFT; Salient images;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Internet Technology and Secured Transactions (ICITST), 2013 8th International Conference for
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1109/ICITST.2013.6750177
  • Filename
    6750177