• DocumentCode
    2722396
  • Title
    Wavelet Feature Based Confusion Character Sets for Gujarati Script
  • Author
    Dholakia, Jignesh ; Yajnik, Archit ; Negi, Atul
  • Author_Institution
    M. S. Univ. of Baroda, Gujarat
  • Volume
    2
  • fYear
    2007
  • fDate
    13-15 Dec. 2007
  • Firstpage
    366
  • Lastpage
    370
  • Abstract
    Indic script recognition is a difficult task because of the large number of symbols that result from the concatenation of vowel modifiers to basic consonants and the conjunction of consonants with modifiers. Recognition of Gujarati script is a less studied area, and no attempt has been made so far to construct confusion sets of Gujarati glyphs. In this paper, we present confusion sets of glyphs in printed Gujarati. Feature vectors made up of Daubechies D4 wavelet coefficients were subjected to two different classifiers, giving more than 96% accuracy for a larger set of symbols. A novel application of the general regression neural network (GRNN) architecture allows fast building of a classifier for the large character data set. The combined approach of wavelet feature extraction and GRNN classification has given the highest recognition accuracy reported on this script.
  • Keywords
    character sets; feature extraction; natural language processing; neural net architecture; optical character recognition; pattern classification; wavelet transforms; GRNN classification; Gujarati glyph; Gujarati script; Indic script recognition; confusion character sets; feature vector; wavelet coefficient; wavelet feature extraction; Buildings; Character recognition; Computational intelligence; Nearest neighbor searches; Optical character recognition software; Optical design; Robustness; Speech recognition; Wavelet coefficients
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    International Conference on Computational Intelligence and Multimedia Applications, 2007
  • Conference_Location
    Sivakasi, Tamil Nadu
  • Print_ISBN
    0-7695-3050-8
  • Type
    conf
  • DOI
    10.1109/ICCIMA.2007.230
  • Filename
    4426723