DocumentCode :
3488550
Title :
Feature Representations for Scene Text Character Recognition: A Comparative Study
Author :
Chucai Yi ; Xiaodong Yang ; YingLi Tian
Author_Institution :
Dept. of Comput. Sci., City Univ. of New York, New York, NY, USA
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
907
Lastpage :
911
Abstract :
Recognizing text character from natural scene images is a challenging problem due to background interferences and multiple character patterns. Scene Text Character (STC) recognition, which generally includes feature representation to model character structure and multi-class classification to predict label and score of character class, mostly plays a significant role in word-level text recognition. The contribution of this paper is a complete performance evaluation of image-based STC recognition, by comparing different sampling methods, feature descriptors, dictionary sizes, coding and pooling schemes, and SVM kernels. We systematically analyze the impact of each option in the feature representation and classification. The evaluation results on two datasets CHARS74K and ICDAR2003 demonstrate that Histogram of Oriented Gradient (HOG) descriptor, soft-assignment coding, max pooling, and Chi-Square Support Vector Machines (SVM) obtain the best performance among local sampling based feature representations. To improve STC recognition, we apply global sampling feature representation. We generate Global HOG (GHOG) by computing HOG descriptor from global sampling. GHOG enables better character structure modeling and obtains better performance than local sampling based feature representations. The GHOG also outperforms existing methods in the two benchmark datasets.
Keywords :
character recognition; image classification; image representation; sampling methods; support vector machines; CHARS74K dataset; HOG descriptor; ICDAR2003 dataset; SVM kernels; character class label; character class score; character structure; chi-square support vector machines; coding schemes; dictionary sizes; feature descriptors; feature representation; histogram-of-oriented gradient; image-based STC recognition; max pooling; multiclass classification; natural scene image; pooling schemes; sampling methods; scene text character recognition; soft-assignment coding; support vector machines; word-level text recognition; Character recognition; Dictionaries; Encoding; Feature extraction; Support vector machines; Text recognition; Visualization; Global HOG; coding-pooling; dictionary of visual words; feature descriptors; performance evaluation; scene text character recognition; text feature representation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.185
Filename :
6628750
Link To Document :
بازگشت