DocumentCode :
178461
Title :
Data Sufficiency for Online Writer Identification: A Comparative Study of Writer-Style Space vs. Feature Space Models
Author :
Shivram, A. ; Ramaiah, C. ; Govindaraju, V.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. at Buffalo - SUNY, Buffalo, NY, USA
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
3121
Lastpage :
3125
Abstract :
A key factor in building effective writer identification/verification systems is the amount of data required to build the underlying models. In this research we systematically examine data sufficiency bounds for two broad approaches to online writer identification -- feature space models vs. writer-style space models. We report results from 40 experiments conducted on two publicly available datasets and also test identification performance for the target models using two different feature functions. Our findings show that the writer-style space model gives higher identification performance for a given level of data and further, achieves high performance levels with lesser data costs. This model appears to require as less as 20 words per page to achieve identification performance close to 80% and reaches more than 90% accuracy with higher levels of data enrollment.
Keywords :
data analysis; formal verification; text analysis; data sufficiency; feature space model; online writer identification; writer identification-verification system; writer-style space model; Buildings; Data models; Fasteners; Feature extraction; Standards; Text analysis; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.538
Filename :
6977250
Link To Document :
بازگشت