DocumentCode
2013188
Title
An SVM-Based High-accurate Recognition Approach for Handwritten Numerals by Using Difference Features
Author
Huang, Kaizhu ; Sun, Jun ; Hotta, Yoshinobu ; Fujimoto, Katsuhito ; Naoi, Satoshi
Author_Institution
Fujitsu R&D Center, Beijing
Volume
2
fYear
2007
fDate
23-26 Sept. 2007
Firstpage
589
Lastpage
593
Abstract
Handwritten numeral recognition is an important pattern recognition task. It can be widely used in various domains, e.g., bank money recognition, which requires a very high recognition rate. As a state-of-the-art classifier, support vector machine (SVM), has been extensively used in this area. Typically, SVM is trained in a batch model, i.e., all data points are simultaneously input for training the classification boundary. However, some slightly exceptional data, only accounting for a small proportion, are critical for the recognition rates. Training a classifier among all the data may possibly treat such legal but slightly exceptional samples as "noise ". In this paper, we propose a novel approach to attack this problem. This approach exploits a two-stage framework by using difference features. In the first stage, a regular SVM is trained on all the training data; in the second stage, only the samples misclassified in the first stage are specially considered. Therefore, the performance can be lifted. The number of misclassifications is often small because of the good performance of SVM. This will present difficulties in training an accurate SVM engine only for these misclassified samples. We then further propose a multi-way to binary approach using difference features. This approach successfully transforms multi-category classification to binary classification and expands the training samples greatly. In order to evaluate the proposed method, experiments are performed on 10,000 handwritten numeral samples extracted from real banks forms. This new algorithm achieves 99.0% accuracy. In comparison, the traditional SVM only gets 98.4%.
Keywords
handwritten character recognition; image classification; support vector machines; banks forms; binary classification; classifier training; difference features; handwritten numeral recognition; multicategory classification; pattern recognition; support vector machine; Engines; Handwriting recognition; Law; Legal factors; Pattern recognition; Performance evaluation; Sun; Support vector machine classification; Support vector machines; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location
Parana
ISSN
1520-5363
Print_ISBN
978-0-7695-2822-9
Type
conf
DOI
10.1109/ICDAR.2007.4376983
Filename
4376983
Link To Document