Title :
Segmentation of Bangla handwritten text into characters by recursive contour following
Author :
Bishnu, A. ; Chaudhuri, Bidyut B.
Author_Institution :
Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Calcutta, India
Abstract :
Segmentation of handwritten words into characters is an important component of handwritten text OCR. In this paper we put forward a method for segmenting handwritten Bangla (an Indo-Bangladeshi language) text into characters. Based on certain characteristics of Bangla writing, different zones across the height of the word are detected. These zones provide structural information about the constituent characters of the word. In handwritten Bangla text, the rectangular hulls of successive characters often overlap, so the characters are seldom vertically separable. We therefore propose a method of recursive contour following in one of the zones across the height of the word to find the extents within which the main portion of each character lies. If successive characters do not touch in the zone of contour following, the algorithm gives fairly good results.
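The idea in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes a binary image given as a list of rows (1 = ink) and a zone given by hypothetical row bounds `top` and `bottom`; the name `zone_extents`, the 8-neighbour tracing rule, and the toy image are all assumptions made for illustration. Two slanted strokes share columns 3 and 4, so no vertical cut separates them, yet following each contour recursively inside the zone still yields one column extent per stroke.

```python
def zone_extents(image, top, bottom):
    """Column extent (min_col, max_col) of each stroke traced in rows top..bottom-1.

    Illustrative only: a simple recursive tracing of contour pixels,
    assumed here as a stand-in for the paper's contour following.
    """
    width = len(image[0])
    visited = set()

    def is_ink(r, c):
        return top <= r < bottom and 0 <= c < width and image[r][c] == 1

    def is_contour(r, c):
        # Ink pixel with a 4-neighbour that is background or outside the zone.
        return is_ink(r, c) and any(
            not is_ink(r + dr, c + dc)
            for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1))
        )

    def follow(r, c, extent):
        # Recursively visit 8-connected contour pixels, widening the extent.
        visited.add((r, c))
        extent[0] = min(extent[0], c)
        extent[1] = max(extent[1], c)
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                nr, nc = r + dr, c + dc
                if (nr, nc) not in visited and is_contour(nr, nc):
                    follow(nr, nc, extent)

    extents = []
    for c in range(width):            # scan left to right
        for r in range(top, bottom):
            if (r, c) not in visited and is_contour(r, c):
                extent = [c, c]
                follow(r, c, extent)
                extents.append(tuple(extent))
    return extents


# Toy word: two slanted strokes whose bounding boxes overlap in columns 3-4.
image = [[int(ch) for ch in row] for row in (
    "00001001",
    "00010010",
    "00100100",
    "01001000",
    "10010000",
)]

# Every column contains ink, so a vertical projection finds no cut point.
column_has_ink = [any(row[c] for row in image) for c in range(len(image[0]))]

print(all(column_has_ink))          # True
print(zone_extents(image, 0, 5))    # [(0, 4), (3, 7)]
print(zone_extents(image, 3, 5))    # [(0, 1), (3, 4)]
```

Restricting the trace to the lower zone (rows 3 to 4) gives extents that no longer overlap, which is the kind of separation the zone choice is meant to provide. As in the abstract, the sketch fails when strokes touch inside the zone: they are then traced as a single contour. Deep recursion on large images would also need an explicit stack instead.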
Keywords :
document image processing; feature extraction; handwritten character recognition; image segmentation; optical character recognition; Bangla handwritten text segmentation; Bangla writing method; certain structural information; constituent characters; handwritten text OCR; handwritten words; rectangular hulls; recursive contour following; zones; Computer vision; Feature extraction; Hidden Markov models; Image analysis; Image segmentation; Lattices; Linear programming; Optical character recognition software; Pattern recognition; Writing
Conference_Title :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
DOI :
10.1109/ICDAR.1999.791809