• DocumentCode
    2063530
  • Title

    Mining visual web knowledge utilizing multiple classifier architecture

  • Author

    Saad, A.A. ; Ismail, M.A. ; Basiouny, S.A. ; Ramadan, A.A.

  • Author_Institution
    Comput. & Syst. Eng. Dept., Alexandria Univ., Alexandria, Egypt
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 1 2010
  • Firstpage
    598
  • Lastpage
    603
  • Abstract
    Inspite of the huge amounts of image data on the web, mining image data from the web is paid less attention than mining text data, since treating the semantics of images is much more difficult. This paper introduces a new system to mine visual knowledge on the web that aims to build a Domain Oriented Image Directory by using the Earth Mover´s Distance and Color signatures. Instead of using a flat classifier to combine text and image classification, the system suggests dividing the classification task into smaller classification problems corresponding to the branches in the classification hierarchy. Thus a multiple classifier system is presented. This paper illustrates the suggested system and discusses each of its components. Extensive experiments were conducted to test the system and also to compare it with commercial search engines. By the experiments we show that the proposed system accuracy outperforms the mostly used commercial search engines.
  • Keywords
    Internet; data mining; image classification; search engines; color signatures; commercial search engines; domain oriented image directory; earth mover distance; flat classifier; image classification; multiple classifier architecture utilization; text classification; text data mining; visual web knowledge mining; Color signature; Content based Image Retrieval (CBIR); Decision Tree classifier; Earth Mover´s Distance; K Nearest Neighbor Classifier; Naïve Bayes Classifier;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-8134-7
  • Type

    conf

  • DOI
    10.1109/ISDA.2010.5687198
  • Filename
    5687198