DocumentCode
2599570
Title
Script Identification Based on Morphological Reconstruction in Document Images
Author
Dhandra, B.V. ; Nagabhushan, P. ; Hangarge, Mallikarjun ; Hegadi, Ravindra ; Malemath, V.S.
Author_Institution
Dept. of P.G. Studies & Res. in Comput. Sci., Gulbarga Univ., Karnataka
Volume
2
fYear
0
fDate
0-0 0
Firstpage
950
Lastpage
953
Abstract
This paper studies script identification for printed document images based on morphological reconstruction. The system is developed using 609 scanned document images representing the English, Hindi, Kannada, and Urdu scripts, and consists of a feature extractor and a classifier. The feature extractor works in two stages. In the first stage, morphological erosion and opening by reconstruction are applied to a document image in the horizontal, vertical, right-diagonal, and left-diagonal directions using a line structuring element whose length is fixed from the average height of all connected components in the image. In the second stage, the average pixel distribution is computed for each of the resulting images. New documents are classified by nearest neighbor analysis. Classification accuracy averaged 97% across the four scripts. The method is robust to noise and to variations in font size and style.
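The two-stage feature extractor and the nearest neighbor step described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. It assumes a binary image stored as a list of rows of 0/1 values, takes "average pixel distribution" to mean the fraction of foreground pixels that survive each directional opening by reconstruction, rounds the average component height to obtain the line length, and uses Euclidean distance for the nearest neighbor search. All function names are hypothetical.

```python
from collections import deque

# Unit steps (row, col) for the four line directions named in the abstract.
DIRECTIONS = {"horizontal": (0, 1), "vertical": (1, 0),
              "right_diagonal": (-1, 1), "left_diagonal": (1, 1)}
NEIGHBORS = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1) if (dy, dx) != (0, 0)]


def components(img):
    """Yield each 8-connected foreground component as a list of (row, col)."""
    rows, cols = len(img), len(img[0])
    seen = set()
    for r in range(rows):
        for c in range(cols):
            if img[r][c] and (r, c) not in seen:
                seen.add((r, c))
                queue, comp = deque([(r, c)]), []
                while queue:
                    y, x = queue.popleft()
                    comp.append((y, x))
                    for dy, dx in NEIGHBORS:
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and img[ny][nx] and (ny, nx) not in seen):
                            seen.add((ny, nx))
                            queue.append((ny, nx))
                yield comp


def average_component_height(img):
    """Mean bounding-box height over all connected components."""
    heights = [max(y for y, _ in comp) - min(y for y, _ in comp) + 1
               for comp in components(img)]
    return sum(heights) / len(heights) if heights else 0.0


def erode_line(img, step, length):
    """Erode with a line of `length` pixels along `step`, anchored at its first pixel.

    Pixels outside the image count as background.
    """
    rows, cols = len(img), len(img[0])
    dy, dx = step
    return [[int(all(0 <= r + i * dy < rows and 0 <= c + i * dx < cols
                     and img[r + i * dy][c + i * dx] for i in range(length)))
             for c in range(cols)] for r in range(rows)]


def reconstruct(marker, mask):
    """Binary reconstruction: keep every component of `mask` that the marker touches."""
    out = [[0] * len(mask[0]) for _ in mask]
    for comp in components(mask):
        if any(marker[y][x] for y, x in comp):
            for y, x in comp:
                out[y][x] = 1
    return out


def script_features(img):
    """One feature per direction: share of foreground kept by opening by reconstruction."""
    length = max(2, round(average_component_height(img)))  # assumed rounding rule
    total = sum(map(sum, img)) or 1
    return [sum(map(sum, reconstruct(erode_line(img, step, length), img))) / total
            for step in DIRECTIONS.values()]


def nearest_neighbor(query, training):
    """Return the label of the training vector closest to `query` (Euclidean)."""
    return min(training,
               key=lambda item: sum((a - b) ** 2 for a, b in zip(query, item[0])))[1]
```

In use, `script_features` would be computed once per labelled training image, and `nearest_neighbor` would then be called with the feature vector of a new document and the list of `(features, label)` pairs. Anchoring the line at its first pixel rather than its centre does not change the reconstructed result, because reconstruction restores the whole component that contains any surviving marker pixel.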
Keywords
document image processing; feature extraction; image classification; image reconstruction; natural languages; English script; Hindi script; Kannada script; Urdu script; average pixel distribution; feature classifier; feature extractor; morphological erosion; morphological reconstruction; nearest neighbor analysis; printed document image; script identification; Character recognition; Computer science; Feature extraction; Gabor filters; Image reconstruction; Natural languages; Nearest neighbor searches; Optical character recognition software; Optical noise; Pixel;
fLanguage
English
Publisher
IEEE
Conference_Titel
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location
Hong Kong
ISSN
1051-4651
Print_ISBN
0-7695-2521-0
Type
conf
DOI
10.1109/ICPR.2006.1030
Filename
1699363