DocumentCode :
2895779
Title :
Discriminating the Machine-Printed and Hand-Written Words Based on Legibility
Author :
Akbarpour, Shahin ; Sulaiman, Md Nasir Bin ; Mustapha, Norwati ; Rahmat, Rahmita Wirza
Author_Institution :
Dept of Comput. & Math., Islamic Azad Univ. of Shabestar, Shabestar, Iran
fYear :
2010
fDate :
12-14 April 2010
Firstpage :
364
Lastpage :
369
Abstract :
Discrimination of machine-printed and hand-written words is deemed as a major problem in the recognition of the mixed texts. To present a new method to distinguish between machine-printed words and hand-written words using a novel statistical feature on base legibility and discriminator threshold are objectives of this study. Because of the hand trembling, sudden uncontrollable movement of hand and sudden pen shift on the paper, machine-printed words are more legible than hand-written words. The feature is extracted using the Freeman chain code as they are focused on measurement of words legibility. The obtained quantity, which is introduced in this work for the first time, could be a distinguishing criterion for machine-printed words from hand-written. Practically, our method is applied to a mixed and unrefined Farsi database which includes the two above typologies of words. Removing machine-printed words from database and constructing a pure hand-written Farsi words is the other objective. Determining the threshold level, the accuracy rate of the method employed was calculated to be over 96.02%.
Keywords :
database management systems; handwritten character recognition; natural language processing; text analysis; word processing; Farsi database; Freeman chain code; discriminator threshold; hand trembling; handwritten words; machine-printed words; text recognition; word legibility measurement; Computer science; Databases; Feature extraction; Frequency domain analysis; Information technology; Mathematics; Natural languages; Noise level; Postal services; Text recognition; Discriminating the Machine-printed and Hand-written Words; Freeman Chain cods; Legibility of word; Word recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: New Generations (ITNG), 2010 Seventh International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-6270-4
Type :
conf
DOI :
10.1109/ITNG.2010.187
Filename :
5501701
Link To Document :
بازگشت