• DocumentCode
    3073598
  • Title

    Multi-class Enhanced Image Mining of Heterogeneous Textual Images Using Multiple Image Features

  • Author

    Chitrakala, S. ; Shamini, P. ; Manjula, D.

  • Author_Institution
    Easwari Eng. Coll., Anna Univ., Chennai
  • fYear
    2009
  • fDate
    6-7 March 2009
  • Firstpage
    496
  • Lastpage
    501
  • Abstract
    Image mining deals with the extraction of implicit knowledge, image data relationship, or other patterns not explicitly stored in the images. This paper proposes an enhanced image classifier to extract patterns from images containing text using a combination of features. Image containing text can be divided into the following types: scene text image, caption text image and document image. A total of eight features including intensity histogram features and GLCM texture features are used to classify the images. In the first level of classification, the histogram features are extracted from grayscale images to separate document image from the others. In the second stage, the GLCM features are extracted from binary images to classify scene text and caption text images. In both stages, the decision tree classifier (DTC) is used for the classification. Experimental results have been obtained for a dataset of about 60 images of different types. This technique of classification has not been attempted before and its applications include preprocessing for indexing of images, for simplifying and speeding up content based image retrieval (CBIR) techniques and in areas of machine vision.
  • Keywords
    data mining; decision trees; document image processing; feature extraction; image classification; image enhancement; image texture; statistical analysis; text analysis; GLCM texture feature extraction; binary image; caption text image; decision tree classifier; document image; grayscale image; image classification; image data relationship; intensity histogram feature extraction; knowledge extraction; multiclass enhanced heterogeneous textual image mining; scene text image; Classification tree analysis; Content based retrieval; Data mining; Decision trees; Feature extraction; Gray-scale; Histograms; Image retrieval; Indexing; Layout; GLCM features; caption text; decision tree; image classification; image features; scene text;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advance Computing Conference, 2009. IACC 2009. IEEE International
  • Conference_Location
    Patiala
  • Print_ISBN
    978-1-4244-2927-1
  • Electronic_ISBN
    978-1-4244-2928-8
  • Type

    conf

  • DOI
    10.1109/IADCC.2009.4809061
  • Filename
    4809061