Title :
A feature learning method for scene text recognition
Author :
Duong Ho Vu ; Ly Quoc Ngoc
Author_Institution :
Outsourceit Int. AS, Vietnam
Abstract :
Reading text in scene images is a challenging task and is still an active research nowadays. The difficulties come from low resolution, complex background, non uniform lightning or blurring effects of scene images. This paper focuses on recognizing characters in scene images based on the feature learning method proposed in [6] and the conclusion on comparison between sparse coding and vector quantization in [8] to build better feature representations before training the model by using SVM. We asset the performance of the proposed method on some popular scene image datasets such as ICDAR 20033 and Chars74k4. Experimental results show that our proposed system has reached an encouraging recognition rate for both ICDAR 2003 and Chars74k datasets. More specially, our system archived 83.8% (62-class problem), 87% (36-class problem) of recognition rate on ICDAR 2003 Sample subset (698 images), and 73.8% accuracy on GoodImg subset (7705 images) of Chars74K dataset. In this work, our contribution is that we applied the ideas as well as the conclusions in [8] for scene text recognition problem and the experimental results show that our system outperforms other existing methods.
Keywords :
character recognition; feature extraction; image coding; image representation; learning (artificial intelligence); support vector machines; text detection; vector quantisation; 36-class problem; 62-class problem; Chars74k datasets; GoodImg subset; ICDAR 2003 datasets; SVM; character recognition; feature learning method; feature representations; scene image datasets; scene text recognition; sparse coding; text reading; vector quantization; Data preprocessing; Educational institutions; Feature extraction; Image recognition; Image resolution; Optical character recognition software; Text recognition; Scene text recognition; character recognition in real-world images; feature learning; sparse coding;
Conference_Titel :
Signal Processing and Information Technology (ISSPIT), 2012 IEEE International Symposium on
Conference_Location :
Ho Chi Minh City
Print_ISBN :
978-1-4673-5604-6
DOI :
10.1109/ISSPIT.2012.6621282