DocumentCode
2895779
Title
Discriminating the Machine-Printed and Hand-Written Words Based on Legibility
Author
Akbarpour, Shahin ; Sulaiman, Md Nasir Bin ; Mustapha, Norwati ; Rahmat, Rahmita Wirza
Author_Institution
Dept of Comput. & Math., Islamic Azad Univ. of Shabestar, Shabestar, Iran
fYear
2010
fDate
12-14 April 2010
Firstpage
364
Lastpage
369
Abstract
Discrimination of machine-printed and hand-written words is deemed as a major problem in the recognition of the mixed texts. To present a new method to distinguish between machine-printed words and hand-written words using a novel statistical feature on base legibility and discriminator threshold are objectives of this study. Because of the hand trembling, sudden uncontrollable movement of hand and sudden pen shift on the paper, machine-printed words are more legible than hand-written words. The feature is extracted using the Freeman chain code as they are focused on measurement of words legibility. The obtained quantity, which is introduced in this work for the first time, could be a distinguishing criterion for machine-printed words from hand-written. Practically, our method is applied to a mixed and unrefined Farsi database which includes the two above typologies of words. Removing machine-printed words from database and constructing a pure hand-written Farsi words is the other objective. Determining the threshold level, the accuracy rate of the method employed was calculated to be over 96.02%.
Keywords
database management systems; handwritten character recognition; natural language processing; text analysis; word processing; Farsi database; Freeman chain code; discriminator threshold; hand trembling; handwritten words; machine-printed words; text recognition; word legibility measurement; Computer science; Databases; Feature extraction; Frequency domain analysis; Information technology; Mathematics; Natural languages; Noise level; Postal services; Text recognition; Discriminating the Machine-printed and Hand-written Words; Freeman Chain cods; Legibility of word; Word recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: New Generations (ITNG), 2010 Seventh International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4244-6270-4
Type
conf
DOI
10.1109/ITNG.2010.187
Filename
5501701
Link To Document