• DocumentCode
    1791395
  • Title

    Scene text character recognition based on image-to-class distance metric learning

  • Author

    Xiao Wang ; Chunheng Wang ; Baihua Xiao ; Cunzhao Shi ; Song Gao

  • Author_Institution
    State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
  • fYear
    2014
  • fDate
    14-16 Oct. 2014
  • Firstpage
    721
  • Lastpage
    725
  • Abstract
    With the increasing needs from a variety of real world applications like context retrieval and aid reading, scene text recognition is attracting more and more attention from the computer vision community. Scene text character (STC) recognition plays an important role in this task. However, recognition of STC is a challenging task due to a series of problems, like different illumination conditions, heavy occlusions and complex backgrounds. STC recognition can be further divided into two stages: feature representation and multi-class classification. Most work have been done to find better feature representation. In this paper, we focus on the stage of multi-class classification and propose a novel method names KNN based image-to-class distance metric learning (T2CDML). We first implement a global histogram of oriented gradients (GHOG) descriptor for feature representation. Then the KNN based I2CDML method is introduced to deal with the STC classification. Our KNN based I2CDML is robust to high intra-class variations. Besides, the proposed method supports incremental learning. To better evaluate our method, we conduct a series of experiments on two benchmark datasets, CHARS74K and ICDAR2003CH. Compared with other existing methods, our experimental results on these two datasets demonstrate very promising performance of the proposed method on recognizing characters in complex natural scenes.
  • Keywords
    character recognition; computer vision; feature extraction; image classification; image representation; learning (artificial intelligence); text detection; CHARS74K dataset; GHOG descriptor; ICDAR2003CH dataset; KNN based I2CDML method; KNN based image-to-class distance metric learning; STC classification; STC recognition; T2CDML; complex backgrounds; computer vision community; feature representation; global histogram of oriented gradients descriptor; heavy occlusions; illumination conditions; incremental learning; intra-class variations; multiclass classification; scene text character recognition; Character recognition; Computer vision; Context; Linear programming; Measurement; Shape; Text recognition; Distance Metric Learning; Image-to-Class; KNN; Recognition; Scene Text Character;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing (CISP), 2014 7th International Congress on
  • Conference_Location
    Dalian
  • Type

    conf

  • DOI
    10.1109/CISP.2014.7003872
  • Filename
    7003872