DocumentCode :
1583450
Title :
Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy multifactorial analysis
Author :
Garain, U. ; Chaudhuri, B.B.
Author_Institution :
Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Calcutta, India
fYear :
2001
fDate :
2001
Firstpage :
805
Lastpage :
809
Abstract :
The existence of touching characters in scanned documents is a major problem in designing an effective character segmentation procedure for OCR systems. In this paper, new techniques are presented for the identification and segmentation of touching characters. The techniques are based on fuzzy multifactorial analysis. A predictive algorithm is developed for effectively selecting cut-points to segment touching characters. As an initial application, the proposed method has been used to segment touching characters that appear in Devnagari (Hindi) and Bangla, two major scripts of the Indian sub-continent. The results obtained from a test set of considerable size show that a high recognition rate can be achieved with a reasonable amount of computation.
Keywords :
fuzzy set theory; image segmentation; optical character recognition; Bangla; Devnagari; OCR systems; character segmentation; fuzzy multifactorial analysis; identification; scanned documents; segmentation; touching characters; Character recognition; Computer vision; Decision making; Degradation; Fuzzy systems; Optical character recognition software; Pattern analysis; Pattern recognition; Prediction algorithms; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953899
Filename :
953899