• DocumentCode
    2895779
  • Title

    Discriminating the Machine-Printed and Hand-Written Words Based on Legibility

  • Author

    Akbarpour, Shahin ; Sulaiman, Md Nasir Bin ; Mustapha, Norwati ; Rahmat, Rahmita Wirza

  • Author_Institution
    Dept of Comput. & Math., Islamic Azad Univ. of Shabestar, Shabestar, Iran
  • fYear
    2010
  • fDate
    12-14 April 2010
  • Firstpage
    364
  • Lastpage
    369
  • Abstract
    Discrimination of machine-printed and hand-written words is deemed as a major problem in the recognition of the mixed texts. To present a new method to distinguish between machine-printed words and hand-written words using a novel statistical feature on base legibility and discriminator threshold are objectives of this study. Because of the hand trembling, sudden uncontrollable movement of hand and sudden pen shift on the paper, machine-printed words are more legible than hand-written words. The feature is extracted using the Freeman chain code as they are focused on measurement of words legibility. The obtained quantity, which is introduced in this work for the first time, could be a distinguishing criterion for machine-printed words from hand-written. Practically, our method is applied to a mixed and unrefined Farsi database which includes the two above typologies of words. Removing machine-printed words from database and constructing a pure hand-written Farsi words is the other objective. Determining the threshold level, the accuracy rate of the method employed was calculated to be over 96.02%.
  • Keywords
    database management systems; handwritten character recognition; natural language processing; text analysis; word processing; Farsi database; Freeman chain code; discriminator threshold; hand trembling; handwritten words; machine-printed words; text recognition; word legibility measurement; Computer science; Databases; Feature extraction; Frequency domain analysis; Information technology; Mathematics; Natural languages; Noise level; Postal services; Text recognition; Discriminating the Machine-printed and Hand-written Words; Freeman Chain cods; Legibility of word; Word recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: New Generations (ITNG), 2010 Seventh International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4244-6270-4
  • Type

    conf

  • DOI
    10.1109/ITNG.2010.187
  • Filename
    5501701