Wavelet Feature Based Confusion Character Sets for Gujarati Script

Author

Dholakia, Jignesh ; Yajnik, Archit ; Negi, Atul

Author_Institution

M. S. Univ. of Baroda, Gujarat

Volume

2

fYear

2007

fDate

13-15 Dec. 2007

Firstpage

366

Lastpage

370

Abstract

Indic script recognition is a difficult task due to the large number of symbols that result from concatenation of vowel modifiers to basic consonants and the conjunction of consonants with modifiers etc. Recognition of Gujarati script is a less studied area and no attempt is made so far to constitute confusion sets of Gujarati glyphs. In this paper, we present confusion sets of glyphs in printed Gujarati. Feature vector made up of Daubechies D4 wavelet coefficients were subjected to two different classifiers, giving more than 96% accuracy for a larger set of symbols. Novel application of GR neural-net architecture allows for fast building of a classifier for the large character data set. The combined approach of wavelet feature extraction and GRNN classification has given the highest recognition accuracy reported on this script.

Keywords

character sets; feature extraction; natural language processing; neural net architecture; optical character recognition; pattern classification; wavelet transforms; GRNN classification; Gujarati glyph; Gujarati script; Indic script recognition; confusion character sets; feature vector; neural net architecture; optical character recognition; wavelet coefficient; wavelet feature extraction; Buildings; Character recognition; Computational intelligence; Feature extraction; Nearest neighbor searches; Optical character recognition software; Optical design; Robustness; Speech recognition; Wavelet coefficients;

fLanguage

English

Publisher

ieee

Conference_Titel

Conference on Computational Intelligence and Multimedia Applications, 2007. International Conference on

Conference_Location

Sivakasi, Tamil Nadu

Print_ISBN

0-7695-3050-8

Type

conf

DOI

10.1109/ICCIMA.2007.230

Filename

4426723