DocumentCode :
1634900
Title :
Handwritten Text Line Identification in Indian Scripts
Author :
Chaudhuri, Bidyut B. ; Bera, Sumedha
Author_Institution :
CVPR Unit, Indian Stat. Inst., Kolkata, India
fYear :
2009
Firstpage :
636
Lastpage :
640
Abstract :
Preprocessing in handwritten text OCR involves line, word, and character segmentation. This paper deals with text line identification in handwritten Indian scripts, especially Bangla, as well as in English, Hindi, Malayalam, etc. A new dual method based on the interdependency between text lines and inter-line gaps is proposed. The method draws curves simultaneously through the text-line and inter-line gap points found from strip-wise histogram peaks and inter-peak valleys. The curves start from the left and move right, with points of one type guiding the curves of the other type so that the curves do not intersect. The curves are then allowed to evolve iteratively so that the text-line curves cross more character strokes and the inter-line curves cross fewer, while all curves stay as straight as possible. After several iterations, the curves stabilize and define the final text lines and inter-line gaps. The approach works well on text in different scripts with various geometric layouts, including poetry.
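The first stage mentioned in the abstract, collecting candidate points from strip-wise histogram peaks and inter-peak valleys, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `strip_profiles` and `peaks_and_valleys`, the `strip_width` and `threshold` parameters, and the rule of taking one peak per run of inked rows are all assumptions made for illustration. The curve drawing and iterative evolution steps are not covered.

```python
import numpy as np


def strip_profiles(binary, strip_width):
    """Split a binary image (1 = ink) into vertical strips and return
    the row-wise ink count (horizontal projection) of each strip."""
    _, width = binary.shape
    return [binary[:, x0:x0 + strip_width].sum(axis=1)
            for x0 in range(0, width, strip_width)]


def peaks_and_valleys(profile, threshold=1):
    """Hypothetical peak/valley rule, assumed for this sketch:
    - a peak is the row of maximum ink inside each run of rows whose
      ink count is at least `threshold` (a text-line candidate);
    - a valley is the row of minimum ink between two consecutive
      peaks (an inter-line gap candidate)."""
    profile = np.asarray(profile)
    inked = profile >= threshold
    peaks = []
    y, n = 0, len(profile)
    while y < n:
        if inked[y]:
            start = y
            while y < n and inked[y]:
                y += 1
            # First row with the highest ink count in this run.
            peaks.append(start + int(np.argmax(profile[start:y])))
        else:
            y += 1
    valleys = [a + int(np.argmin(profile[a:b + 1]))
               for a, b in zip(peaks, peaks[1:])]
    return peaks, valleys
```

Calling `peaks_and_valleys` on each profile returned by `strip_profiles` gives, per strip, the text-line and gap points through which curves of the kind the abstract describes could then be drawn from left to right.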
Keywords :
document image processing; handwritten character recognition; image segmentation; natural languages; optical character recognition; text analysis; Indian script; OCR; character segmentation; handwritten text line identification; strip-wise histogram peak; Character recognition; Fuzzy sets; Handwriting recognition; Histograms; Optical character recognition software; Particle separators; Robustness; Strips; Text analysis; Text recognition; Handwritten line segmentation; Indian script processing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
ISSN :
1520-5363
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2009.69
Filename :
5277570