DocumentCode :
1106738
Title :
Effects of classifier structures and training regimes on integrated segmentation and recognition of handwritten numeral strings
Author :
Liu, Cheng-Lin ; Sako, Hiroshi ; Fujisawa, Hiromichi
Author_Institution :
Central Res. Lab., Hitachi Ltd., Tokyo, Japan
Volume :
26
Issue :
11
fYear :
2004
Firstpage :
1395
Lastpage :
1407
Abstract :
In integrated segmentation and recognition of character strings, the underlying classifier is trained to be resistant to noncharacters. We evaluate the performance of state-of-the-art pattern classifiers of this kind. First, we build a baseline numeral string recognition system with simple but effective presegmentation. The classification scores of the candidate patterns generated by presegmentation are combined to evaluate the segmentation paths and the optimal path is found using the beam search strategy. Three neural classifiers, two discriminative density models, and two support vector classifiers are evaluated. Each classifier has some variations depending on the training strategy: maximum likelihood, discriminative learning both with and without noncharacter samples. The string recognition performances are evaluated on the numeral string images of the NIST special database 19 and the zipcode images of the CEDAR CDROM-1. The results show that noncharacter training is crucial for neural classifiers and support vector classifiers, whereas, for the discriminative density models, the regularization of parameters is important. The string recognition results compare favorably to the best ones reported in the literature though we totally ignored the geometric context. The best results were obtained using a support vector classifier, but the neural classifiers and discriminative density models show better trade-off between accuracy and computational overhead.
Keywords :
handwritten character recognition; image classification; image segmentation; learning (artificial intelligence); maximum likelihood estimation; multilayer perceptrons; support vector machines; CEDAR CDROM-1; NIST special database 19; beam search strategy; character string recognition; classification scores; classifier structures; discriminative density models; discriminative learning; geometric context; handwritten numeral strings; integrated segmentation; maximum likelihood method; neural classifiers; noncharacter training; numeral string recognition system; parameter regularization; performance evaluation; state of the art pattern classifiers; support vector classifiers; training regimes; zipcode images; Character generation; Character recognition; Cost function; Handwriting recognition; Image databases; Image recognition; Image segmentation; NIST; Performance evaluation; Target recognition; Index Terms- Numeral string recognition; character classification; discriminative density models; integrated segmentation and recognition; neural classifiers; noncharacter resistance; support vector classifiers.; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Face; Humans; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Models, Biological; Models, Statistical; Neural Networks (Computer); Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Regression Analysis; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2004.104
Filename :
1335447
Link To Document :
بازگشت