• DocumentCode
    2722671
  • Title

    Global and Local Features Based Handwritten Text Words and Numerals Script Identification

  • Author

    Dhandra, B.V. ; Hangarge, Mallikarjun

  • Volume
    2
  • fYear
    2007
  • fDate
    13-15 Dec. 2007
  • Firstpage
    471
  • Lastpage
    475
  • Abstract
    This paper aims at the script identification problem of handwritten document images, which facilitates many important applications such as sorting, transcription of multilingual documents and indexing of large collection of such images, or as a precursor to optical character recognition (OCR). The script identification scheme proposed in this paper has two phases. First phase reports the script identification of text words using global and local features, extracted by morphological filters and regional descriptors of three major Indian languages/scripts: Kannada, Roman and Devnagari. In the second phase Kannada and Roman handwritten numerals script identification is carried out. For classification of text words and numerals, a K nearest neighbour algorithm is used. The proposed algorithm achieves an average maximum recognition accuracy is 96.05% and 99% respectively for text words and numerals with five fold cross validation test. The data set containing 3000 text words and 400 numerals collected from 250 writers. The novelty of the proposed algorithm is robust for noise, writer style, size and ink etc.
  • Keywords
    Character recognition; Feature extraction; Indexing; Ink; Noise robustness; Optical character recognition software; Optical filters; Sorting; Testing; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Conference on Computational Intelligence and Multimedia Applications, 2007. International Conference on
  • Conference_Location
    Sivakasi, Tamil Nadu
  • Print_ISBN
    0-7695-3050-8
  • Type

    conf

  • DOI
    10.1109/ICCIMA.2007.125
  • Filename
    4426742