DocumentCode :
2483968
Title :
Recognition of books by verification and retraining
Author :
Neeba, N.V. ; Jawahar, C.V.
Author_Institution :
Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad
fYear :
2008
fDate :
8-11 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
The problem of character recognition in a book should be formulated significantly differently from that of a single page or word. An ideal approach to designing such a recognizer is to adapt the classifier to the font and style of the collection. In this paper, we propose a learning-based adaptation framework for recognizing characters in a book. In the proposed system, a post-processor verifies the output of the recognition module; the verified output is then used for retraining, which improves performance over iterations. Experiments are conducted on about 500,000 annotated symbols from five books in Malayalam (an Indian language). We achieve an average improvement of 14% in classification accuracy.
Keywords :
document image processing; image classification; learning (artificial intelligence); optical character recognition; adaptation framework; book recognition; learning framework; verification module; Books; Character recognition; Dictionaries; Image converters; Image recognition; Information technology; Natural languages; Optical character recognition software; Pattern recognition; Software libraries
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
ISSN :
1051-4651
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2008.4761538
Filename :
4761538