DocumentCode :
2400418
Title :
An enhanced algorithm for Character Segmentation in document image processing
Author :
Manikandan, V. ; Venkatachalam, V. ; Kirthiga, M. ; Harini, K. ; Devarajan, N.
Author_Institution :
Dept. of Electr. & Electron. Eng., Coimbatore Inst. of Technol., Coimbatore, India
fYear :
2010
fDate :
28-29 Dec. 2010
Firstpage :
1
Lastpage :
5
Abstract :
Optical Character Recognition consists of various steps like skew detection, segmentation of columns, lines, words, and characters before feeding the isolated character to an optical character recognition system. Several methodologies are followed to perform these steps using conventional Hough Transformation. In this paper, a new algorithm is proposed to perform all those steps involved in document image processing. The algorithm is implemented for skew detection, column and line segmentation and Character Segmentation. This can be extended to all other steps like character recognition. The novelty of this approach lies in “the consideration of any image, as one formed by several black and white lines of various lengths and at various angles”. The pixel values of the binary image are stored in an array. All the pixel values in the array are compared with their horizontally adjacent pixel values, row by row, for the presence of collinear points (i.e., a line). It is done by detecting the continuity of either the white or black pixels accordingly. Once the continuity is detected, the starting and end coordinates are displayed as an intermediate result. A new image will be generated as a result, which indicates the pixel area of line, identified from the input image. The algorithm is applied for English and other regional languages.
Keywords :
Hough transforms; document image processing; image segmentation; object detection; optical character recognition; Hough transformation; character segmentation; column segmentation; document image processing; line segmentation; optical character recognition system; skew detection; Character recognition; Image segmentation; Optical character recognition software; Optical imaging; Pixel; Text analysis; Transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Computing Research (ICCIC), 2010 IEEE International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4244-5965-0
Electronic_ISBN :
978-1-4244-5967-4
Type :
conf
DOI :
10.1109/ICCIC.2010.5705728
Filename :
5705728
Link To Document :
بازگشت