Title :
An incremental and hierarchical k-NN classifier for handwritten characters
Author :
Rodriguez, C. ; Boto, F. ; Soraluze, I. ; Pérez, A.
Author_Institution :
Comput. Archit. & Technol. Dept., UPV/EHU, San Sebastian, Spain
Abstract :
This paper analyses the application of hierarchical classifiers based on the k-NN rule to the automatic classification of handwritten characters. The discriminating capacity of a k-NN classifier increases with the size of the reference pattern set (RPS). This poses a problem for k-NN classifiers in real applications: the computational cost is high when the RPS is large. To speed up the calculation of the distance to each pattern of the RPS, some authors propose condensing techniques, which try to reduce the size of the RPS without losing classification power. Our alternative proposal is based on incremental learning and hierarchical classifiers with rejection techniques that reduce the computational cost of the classifier. We have used 133,944 characters (72,105 upper-case characters and 61,839 lower-case characters) from the NIST Special Data Bases 3 and 7 as the experimental data set. The binary image of each character is transformed to a gray image. The best non-hierarchical classifier achieves a hit rate of 94.92% (upper-case) and 87.884% (lower-case). The hierarchical classifier achieves the same hit rate at one third of the computational cost of the best non-hierarchical classifier found in our experiments, and at 14% less cost than Hart's (1968) algorithm.
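The idea of a hierarchical k-NN classifier with rejection can be illustrated with a minimal sketch. The abstract does not specify the rejection rule or how the levels are built, so everything here is an assumption for illustration: the function names (knn_vote, hierarchical_knn), the Euclidean distance, the vote threshold min_votes, and the ordering of levels from smallest to largest reference set. A pattern is accepted at the first level whose k nearest neighbours agree strongly enough; otherwise it is rejected and passed to the next, more expensive level.

```python
from collections import Counter
import math


def knn_vote(x, refs, k):
    """Return (label, votes) for the majority class among the k nearest references.

    refs is a list of (feature_vector, label) pairs; Euclidean distance is assumed.
    """
    nearest = sorted(refs, key=lambda r: math.dist(x, r[0]))[:k]
    label, votes = Counter(lbl for _, lbl in nearest).most_common(1)[0]
    return label, votes


def hierarchical_knn(x, levels, k=3, min_votes=3):
    """Classify x with a cascade of k-NN classifiers (hypothetical rejection rule).

    levels: reference sets ordered from smallest (cheapest) to largest.
    A level accepts x when the winning class gets at least min_votes of the
    k votes; otherwise x is rejected and passed on. The last level always decides.
    Returns (label, index_of_deciding_level).
    """
    for depth, refs in enumerate(levels):
        label, votes = knn_vote(x, refs, k)
        if votes >= min_votes or depth == len(levels) - 1:
            return label, depth


# Toy data: a small first-level set and a larger second-level set.
small = [((0, 0), "a"), ((0, 1), "a"), ((1, 0), "a"),
         ((5, 5), "b"), ((5, 6), "b"), ((6, 5), "b")]
large = small + [((3, 3), "b"), ((2.5, 3.0), "b"), ((3.0, 2.5), "b")]

easy = hierarchical_knn((0.2, 0.2), [small, large])   # decided at level 0
hard = hierarchical_knn((2.6, 2.6), [small, large])   # rejected, decided at level 1
```

In this toy setup the easy pattern only pays for distances to the small set, which is where a cascade of this kind would save computation; the actual savings reported in the abstract come from the authors' own design, not from this sketch.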
Keywords :
document image processing; handwritten character recognition; image classification; learning (artificial intelligence); NIST Special Data Bases; binary image; computational cost; condensing algorithm; gray image; handwritten character classification; handwritten character recognition; hierarchical k-NN classifier; incremental learning; reference pattern set; Acceleration; Application software; Character recognition; Computational efficiency; Computer architecture; Computer science; Electronic mail; Handwriting recognition; Spatial databases; Testing;
Conference_Title :
Pattern Recognition, 2002. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-1695-X
DOI :
10.1109/ICPR.2002.1047804