DocumentCode
2722396
Title
Wavelet Feature Based Confusion Character Sets for Gujarati Script
Author
Dholakia, Jignesh ; Yajnik, Archit ; Negi, Atul
Author_Institution
M. S. Univ. of Baroda, Gujarat
Volume
2
fYear
2007
fDate
13-15 Dec. 2007
Firstpage
366
Lastpage
370
Abstract
Indic script recognition is a difficult task due to the large number of symbols that result from concatenation of vowel modifiers to basic consonants and the conjunction of consonants with modifiers etc. Recognition of Gujarati script is a less studied area and no attempt is made so far to constitute confusion sets of Gujarati glyphs. In this paper, we present confusion sets of glyphs in printed Gujarati. Feature vector made up of Daubechies D4 wavelet coefficients were subjected to two different classifiers, giving more than 96% accuracy for a larger set of symbols. Novel application of GR neural-net architecture allows for fast building of a classifier for the large character data set. The combined approach of wavelet feature extraction and GRNN classification has given the highest recognition accuracy reported on this script.
Keywords
character sets; feature extraction; natural language processing; neural net architecture; optical character recognition; pattern classification; wavelet transforms; GRNN classification; Gujarati glyph; Gujarati script; Indic script recognition; confusion character sets; feature vector; neural net architecture; optical character recognition; wavelet coefficient; wavelet feature extraction; Buildings; Character recognition; Computational intelligence; Feature extraction; Nearest neighbor searches; Optical character recognition software; Optical design; Robustness; Speech recognition; Wavelet coefficients;
fLanguage
English
Publisher
ieee
Conference_Titel
Conference on Computational Intelligence and Multimedia Applications, 2007. International Conference on
Conference_Location
Sivakasi, Tamil Nadu
Print_ISBN
0-7695-3050-8
Type
conf
DOI
10.1109/ICCIMA.2007.230
Filename
4426723
Link To Document