• DocumentCode
    3361969
  • Title

    A feature learning method for scene text recognition

  • Author

    Duong Ho Vu ; Ly Quoc Ngoc

  • Author_Institution
    Outsourceit Int. AS, Vietnam
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Abstract
    Reading text in scene images is a challenging task and is still an active research nowadays. The difficulties come from low resolution, complex background, non uniform lightning or blurring effects of scene images. This paper focuses on recognizing characters in scene images based on the feature learning method proposed in [6] and the conclusion on comparison between sparse coding and vector quantization in [8] to build better feature representations before training the model by using SVM. We asset the performance of the proposed method on some popular scene image datasets such as ICDAR 20033 and Chars74k4. Experimental results show that our proposed system has reached an encouraging recognition rate for both ICDAR 2003 and Chars74k datasets. More specially, our system archived 83.8% (62-class problem), 87% (36-class problem) of recognition rate on ICDAR 2003 Sample subset (698 images), and 73.8% accuracy on GoodImg subset (7705 images) of Chars74K dataset. In this work, our contribution is that we applied the ideas as well as the conclusions in [8] for scene text recognition problem and the experimental results show that our system outperforms other existing methods.
  • Keywords
    character recognition; feature extraction; image coding; image representation; learning (artificial intelligence); support vector machines; text detection; vector quantisation; 36-class problem; 62-class problem; Chars74k datasets; GoodImg subset; ICDAR 2003 datasets; SVM; character recognition; feature learning method; feature representations; scene image datasets; scene text recognition; sparse coding; text reading; vector quantization; Data preprocessing; Educational institutions; Feature extraction; Image recognition; Image resolution; Optical character recognition software; Text recognition; Scene text recognition; character recognition in real-world images; feature learning; sparse coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Information Technology (ISSPIT), 2012 IEEE International Symposium on
  • Conference_Location
    Ho Chi Minh City
  • Print_ISBN
    978-1-4673-5604-6
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2012.6621282
  • Filename
    6621282