DocumentCode :
2426245
Title :
Recognition of Multi-oriented Touching Characters in Graphical Documents
Author :
Roy, Partha Pratim ; Pal, Umapada ; Llados, Josep
Author_Institution :
Comput. Vision Center, Univ. Autonoma De Barcelona, Barcelona
fYear :
2008
fDate :
16-19 Dec. 2008
Firstpage :
297
Lastpage :
304
Abstract :
Touching characters are major problem of achieving higher recognition rate in optical character recognition (OCR). Present OCR systems do not perform well when adjacent characters touch. If characters are touched in graphical documents (e.g. map) then such touching string recognition is more difficult because in such documents touching characters appear in multi-oriented direction. In this paper, we present a scheme towards the recognition of English two-character multi-oriented touching strings. When two or more characters touch, they generate a big cavity region at the background portion and we used this background information in our scheme. To handle the background information, convex hull is used. In this scheme, at first, a set of initial segmentation points is predicted based on the concave residues of the convex hull of the touching characters. Next, based on the initial points, we select some candidate segmentation lines. Finally the recognition confidence of two sub-images of a touching string, obtained from each candidate segmentation line is computed. The candidate segmentation line from which we get optimum confidence is the actual segmentation line and the corresponding characters in favour of which the two segmentation parts show optimum confidence is the recognition result of the touching string. To compute the recognition confidence, SVM classifier is used. The features used in the SVM are invariant to character orientation. Circular ring and convex hull ring based approach has been used along with angular information of the contour pixels of the character to make the feature rotation invariant. From the experiment we obtained encouraging result.
Keywords :
computational geometry; document image processing; feature extraction; image classification; image segmentation; optical character recognition; support vector machines; OCR system; SVM classifier; candidate segmentation line; circular ring based approach; concave residue; contour pixel; convex hull ring based approach; graphical document; multioriented touching English character recognition; optical character recognition; rotation invariant feature extraction; touching string recognition; Character generation; Character recognition; Degradation; Frequency; Image recognition; Image segmentation; Optical character recognition software; Pattern recognition; Support vector machine classification; Support vector machines; OCR; Segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, Graphics & Image Processing, 2008. ICVGIP '08. Sixth Indian Conference on
Conference_Location :
Bhubaneswar
Print_ISBN :
978-0-7695-3476-3
Electronic_ISBN :
978-0-7695-3476-3
Type :
conf
DOI :
10.1109/ICVGIP.2008.26
Filename :
4756085
Link To Document :
بازگشت