DocumentCode :
183281
Title :
A New Method for Writer Identification Based on Histogram Symbolic Representation
Author :
Alaei, Alireza ; Roy, Partha Pratim
Author_Institution :
Lab. d´Inf. (LI EA6300), Univ. Francois-Rabelais de Tours, Tours, France
fYear :
2014
fDate :
1-4 Sept. 2014
Firstpage :
216
Lastpage :
221
Abstract :
In this paper, a new model-based writer identification scheme using histogram symbolic representation approach is proposed. In the proposed scheme, initially, some pre-processing techniques are employed to enhance image quality and extract text-lines from each handwritten document image. For each extracted text-line, a set of 92 features are computed based on analysis of connected component, enclosed region, lower and upper contours, fractal code, and Curve let. Considering the extracted feature vectors, a histogram is created for each feature of every writer as a histogram-valued symbolic data. This process results in a handwriting style model for each individual that consists of a set of histograms. To evaluate the proposed scheme, two different handwritten datasets written in two different scripts (Kannada as an Indian based script and English) were used. The first dataset contains 228 pages written in Kannada by 57 people. The other one is the dataset used in SigWiComp2013 composed of 330 document pages written in English by 55 individuals. The same criteria used in the SigWiComp2013 were followed in our evaluation strategy. Concerning the Kannada dataset, an F-measure of 92.79% was obtained when 114 documents were used in learning stage and the rest (114) were used for testing. For the SigWiComp2013 dataset an F-measure of 26.67% was obtained that is fairly comparable to the best result reported in the literature.
Keywords :
curvelet transforms; document image processing; feature extraction; handwriting recognition; image enhancement; image representation; learning (artificial intelligence); English; F-measure; Indian based script; Kannada; SigWiComp2013; connected component analysis; contour; curvelet; enclosed region; feature vector extraction; fractal code; handwriting style model; handwritten document image; histogram symbolic representation approach; histogram-valued symbolic data; image quality enhancement; learning; model-based writer identification; text-line extraction; Computational modeling; Data models; Feature extraction; Frequency modulation; Hidden Markov models; Histograms; Training; English and Kannada Handwritten Documents; Histogram Symbolic Representation; Similarity Measure; Writer Identification/Verification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
ISSN :
2167-6445
Print_ISBN :
978-1-4799-4335-7
Type :
conf
DOI :
10.1109/ICFHR.2014.44
Filename :
6981023
Link To Document :
بازگشت