Title :
Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy multifactorial analysis
Author :
Garain, U. ; Chaudhuri, B.B.
Author_Institution :
Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Calcutta, India
fDate :
6/23/1905 12:00:00 AM
Abstract :
Existence of touching characters in scanned documents is a major problem in designing an effective character segmentation procedure for OCR systems. In this paper, new techniques are presented for identification and segmentation of touching characters. The techniques are based on fuzzy multifactorial analysis. A predictive algorithm is developed for effectively selecting cut-points to segment touching characters. Initially, our proposed method has been applied for segmenting touching characters that appear in Devnagari (Hindi) and Bangla, two major scripts in the Indian sub-continent. The results obtained from a test-set of considerable size show that a high recognition rate can be achieved with a reasonable amount of computations
Keywords :
fuzzy set theory; image segmentation; optical character recognition; Bangla; Devnagari; OCR systems; character segmentation; fuzzy multifactorial analysis; identification; scanned documents; segmentation; touching characters; Character recognition; Computer vision; Decision making; Degradation; Fuzzy systems; Optical character recognition software; Pattern analysis; Pattern recognition; Prediction algorithms; Testing;
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953899