DocumentCode
3024010
Title
A comparison of clustering methods for writer identification and verification
Author
Bulacu, Marius ; Schomaker, Lambert
Author_Institution
Artificial Intelligence Inst., Groningen Univ., Netherlands
fYear
2005
fDate
29 Aug.-1 Sept. 2005
Firstpage
1275
Abstract
An effective method for writer identification and verification is based on assuming that each writer acts as a stochastic generator of ink-trace fragments, or graphemes. The probability distribution of these simple shapes in a given handwriting sample is characteristic for the writer and is computed using a common codebook of graphemes obtained by clustering. In previous studies we used contours to encode the graphemes, in the current paper we explore a complementary shape representation using normalized bitmaps. The most important aim of the current work is to compare three different clustering methods for generating the grapheme codebook: k-means Kohonen SOM 1D and 2D. Large scale computational experiments show that the proposed method is robust to the underlying shape representation used (whether contours or normalized bitmaps), to the size of codebook used (stable performance for sizes from 102 to 2.5 × 103) and to the clustering method used to generate the codebook (essentially the same performance was obtained for all three clustering methods).
Keywords
handwriting recognition; pattern clustering; clustering methods; grapheme codebook; graphemes; handwriting sample; ink-trace fragments; k-means Kohonen SOM 1D; k-means Kohonen SOM 2D; normalized bitmaps; probability distribution; shape representation; stochastic generator; writer identification; writer verification; Artificial intelligence; Clustering methods; Databases; Distributed computing; Forensics; Large-scale systems; Probability distribution; Robustness; Shape; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN
1520-5263
Print_ISBN
0-7695-2420-6
Type
conf
DOI
10.1109/ICDAR.2005.4
Filename
1575747
Link To Document