DocumentCode :
3429235
Title :
Image patches analysis for text block identification
Author :
Zhong, Guoqiang ; Cheriet, Mohamed
Author_Institution :
Synchromedia Lab. for Multimedia Commun. in Telepresence, Ecole de Technol. Super., Montréal, QC, Canada
fYear :
2012
fDate :
2-5 July 2012
Firstpage :
1241
Lastpage :
1246
Abstract :
In this paper, we propose a novel text block identification method for ancient document understanding. Unlike traditional top-down and bottom-up approaches, our method is based on supervised learning on the patches of document images, which can be considered as an intermediate level method but integrates essential advantages of both the top-down and the bottom-up strategies. In our method, the document images are firstly partitioned into small patches, and then positive and negative patches are selected to form an active training set. Gabor features are extracted on each patch, while multi-linear discriminant analysis (MDA) is employed to reduce the dimensionality of the data. To deal with unseen documents, a random forest classifier is learned on the new representations of the patches. Compared to traditional approaches, our method can not only capture local texture features of each patch, but also preserve the global information of the training images. Furthermore, MDA is guaranteed to learn a low dimensional tensor subspace, which significantly avoids the curse of dimensionality dilemma. Moreover, the random forest classifier can automatically select useful features and deliver satisfactory identification results. Extensive experiments on some scripts of ancient document images demonstrated the effectiveness of our method.
Keywords :
Gabor filters; character recognition; document image processing; feature extraction; history; image classification; image texture; learning (artificial intelligence); Gabor feature extraction; active training set; ancient document image; bottom-up strategy; data dimensionality reduction; document image partitioning; document understanding; image patches analysis; intermediate level method; local texture feature; multilinear discriminant analysis; random forest classifier; supervised learning; tensor subspace; text block identification; top-down strategy; Algorithm design and analysis; Feature extraction; Layout; Tensile stress; Text analysis; Training; Vegetation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-0381-1
Electronic_ISBN :
978-1-4673-0380-4
Type :
conf
DOI :
10.1109/ISSPA.2012.6310482
Filename :
6310482
Link To Document :
بازگشت