DocumentCode :
3141597
Title :
Gujarati character recognition
Author :
Antani, Sameer ; Agnihotri, Lalitha
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear :
1999
fDate :
20-22 Sep 1999
Firstpage :
418
Lastpage :
421
Abstract :
This paper describes the classification of a subset of printed or digitized Gujarati characters. Gujarati belongs to the genre of Devanagri scripts from the Indian subcontinent. Very little work is found in the literature for recognition of Indian language scripts. For this paper a subset of similar appearing Gujarati characters was chosen and subjected to classification by different classifiers. The sample and test images for the characters were obtained from digital images available on the Internet and from scanned images of printed Gujarati text. For their classification, the Euclidean Minimum Distance and the k-Nearest Neighbor classifiers were used with regular and invariant moments. The characters were also classified in the binary feature space using Hamming Distance classifier. The paper presents the recognition rates for these classifiers. A recognition rate of 67% is achieved
Keywords :
document image processing; image classification; optical character recognition; Devanagri script; Euclidean Minimum Distance; Gujarati character recognition; Hamming Distance classifier; Indian language script; Internet; character classification; k-Nearest Neighbor classifiers; scanned images; Character recognition; Computer science; Digital images; Hamming distance; Hidden Markov models; Humans; Internet; Natural languages; Optical character recognition software; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
Type :
conf
DOI :
10.1109/ICDAR.1999.791813
Filename :
791813
Link To Document :
بازگشت