DocumentCode
3073598
Title
Multi-class Enhanced Image Mining of Heterogeneous Textual Images Using Multiple Image Features
Author
Chitrakala, S. ; Shamini, P. ; Manjula, D.
Author_Institution
Easwari Eng. Coll., Anna Univ., Chennai
fYear
2009
fDate
6-7 March 2009
Firstpage
496
Lastpage
501
Abstract
Image mining deals with the extraction of implicit knowledge, image data relationship, or other patterns not explicitly stored in the images. This paper proposes an enhanced image classifier to extract patterns from images containing text using a combination of features. Image containing text can be divided into the following types: scene text image, caption text image and document image. A total of eight features including intensity histogram features and GLCM texture features are used to classify the images. In the first level of classification, the histogram features are extracted from grayscale images to separate document image from the others. In the second stage, the GLCM features are extracted from binary images to classify scene text and caption text images. In both stages, the decision tree classifier (DTC) is used for the classification. Experimental results have been obtained for a dataset of about 60 images of different types. This technique of classification has not been attempted before and its applications include preprocessing for indexing of images, for simplifying and speeding up content based image retrieval (CBIR) techniques and in areas of machine vision.
Keywords
data mining; decision trees; document image processing; feature extraction; image classification; image enhancement; image texture; statistical analysis; text analysis; GLCM texture feature extraction; binary image; caption text image; decision tree classifier; document image; grayscale image; image classification; image data relationship; intensity histogram feature extraction; knowledge extraction; multiclass enhanced heterogeneous textual image mining; scene text image; Classification tree analysis; Content based retrieval; Data mining; Decision trees; Feature extraction; Gray-scale; Histograms; Image retrieval; Indexing; Layout; GLCM features; caption text; decision tree; image classification; image features; scene text;
fLanguage
English
Publisher
ieee
Conference_Titel
Advance Computing Conference, 2009. IACC 2009. IEEE International
Conference_Location
Patiala
Print_ISBN
978-1-4244-2927-1
Electronic_ISBN
978-1-4244-2928-8
Type
conf
DOI
10.1109/IADCC.2009.4809061
Filename
4809061
Link To Document