Title :
A High Performance European OCR System
Author :
Wang, Kai ; Wang, Qingren
Author_Institution :
Nankai Univ., Tianjin
Abstract :
The construction of Latin based European OCR system is studied in this paper. Compared with English, other Latin based European languages use more characters, which is called European special characters in this paper to be distinct from English letters. To construct a European system with high performance, the key is the recognition of the European special characters. In this paper, the European special characters are automatically divided into three subsets by the different handwritten position. And two solutions are proposed, one solution in which is used to recognize "i", "j " and the European special characters in subset 1, while another solution is used to recognize other English characters, digits and the European special character in other subsets. Experiment shows, the new system is more effective than the old one, which provides an experimental support for our research work.
Keywords :
handwritten character recognition; natural language processing; optical character recognition; European OCR system; European special character; handwritten position; optical character recognition; Character recognition; Entropy; Machine intelligence; Natural languages; Optical character recognition software; Text analysis; Typesetting; Uncertainty;
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4378710