• DocumentCode
    1791387
  • Title

    Character recognition in low quality document images using local and global features

  • Author

    Chenqiang Gao ; Xiaoming Huang

  • Author_Institution
    State Key Lab. of Digital Publishing Technol., Beijing, China
  • fYear
    2014
  • fDate
    14-16 Oct. 2014
  • Firstpage
    676
  • Lastpage
    680
  • Abstract
    Although a great success has been achieved for the situation of high quality images during the past decades, Character recognition in low quality images still remains a challenge. To tackle this challenge, in this paper a novel method in the SVM framework is proposed to recognize the characters in low quality document images by using local and global features. Firstly, a multi-scale sliding window strategy with a pruning method of character traits is adopted to generate potential character sub-regions. Then, the conventional global feature and state-of-art local feature, namely histogram of oriented gradients (HOG), are extracted to form the representation of the potential character sub-region. Finally, the Support Vector Machine (SVM) is used to recognize characters with a late fusion strategy. Experimental results show that the proposed method has a better performance even in the situation of existing touched and broken characters situation compared to the conventional method.
  • Keywords
    document image processing; feature extraction; image fusion; image representation; optical character recognition; support vector machines; HOG; SVM framework; character trait; global feature; high quality image fusion; histogram of oriented gradient; low quality document image; multiscale sliding window strategy; optical character recognition; potential character subregion representation; pruning method; state-of-art local feature; support vector machine; Accuracy; Character recognition; Feature extraction; Optical character recognition software; Support vector machines; Text recognition; Training; character recognition; global feature; local feature; low quality images; sliding windows;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing (CISP), 2014 7th International Congress on
  • Conference_Location
    Dalian
  • Type

    conf

  • DOI
    10.1109/CISP.2014.7003864
  • Filename
    7003864