Title :
A comparison of clustering methods for writer identification and verification
Author :
Bulacu, Marius ; Schomaker, Lambert
Author_Institution :
Artificial Intelligence Inst., Groningen Univ., Netherlands
fDate :
29 Aug.-1 Sept. 2005
Abstract :
An effective method for writer identification and verification is based on assuming that each writer acts as a stochastic generator of ink-trace fragments, or graphemes. The probability distribution of these simple shapes in a given handwriting sample is characteristic for the writer and is computed using a common codebook of graphemes obtained by clustering. In previous studies we used contours to encode the graphemes, in the current paper we explore a complementary shape representation using normalized bitmaps. The most important aim of the current work is to compare three different clustering methods for generating the grapheme codebook: k-means Kohonen SOM 1D and 2D. Large scale computational experiments show that the proposed method is robust to the underlying shape representation used (whether contours or normalized bitmaps), to the size of codebook used (stable performance for sizes from 102 to 2.5 × 103) and to the clustering method used to generate the codebook (essentially the same performance was obtained for all three clustering methods).
Keywords :
handwriting recognition; pattern clustering; clustering methods; grapheme codebook; graphemes; handwriting sample; ink-trace fragments; k-means Kohonen SOM 1D; k-means Kohonen SOM 2D; normalized bitmaps; probability distribution; shape representation; stochastic generator; writer identification; writer verification; Artificial intelligence; Clustering methods; Databases; Distributed computing; Forensics; Large-scale systems; Probability distribution; Robustness; Shape; Stochastic processes;
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
Print_ISBN :
0-7695-2420-6
DOI :
10.1109/ICDAR.2005.4