DocumentCode :
3211515
Title :
Zone-based hybrid feature extraction algorithm for handwritten numeral recognition of two popular Indian scripts
Author :
Rajashekararadhya, S.V. ; Ranjan, Vanaja P.
Author_Institution :
Dept. of Electr. Eng., Anna Univ., Chennai, India
fYear :
2009
fDate :
9-11 Dec. 2009
Firstpage :
526
Lastpage :
530
Abstract :
India is a multi-lingual multi-script country, where eighteen official scripts are accepted and there are over hundred regional languages. In this paper we propose a zone-based hybrid feature extraction algorithm scheme towards the recognition of off-line handwritten numerals of two popular south-Indian scripts. The character centroid is computed and the character/numeral image (50 × 50) is further divided in to 25 equal zones (10 × 10). An average distance from the character centroid to the pixels present in the zone column is computed. This procedure is sequentially repeated for all the zone/grid/box columns present in the zone (10 features). There could be some zone column having empty foreground pixels. Hence feature value of such zone column in the feature vector is zero. This procedure is sequentially repeated for the entire zone present in the numeral image (250 features). Similarly we extract zone centroid coordinates as features. The numeral image is divided into 50 equal zones (5 × 10). The zone centroid is computed. This procedure is sequentially repeated for the entire zone present in the numeral image (100 features). There could be some zone having empty foreground pixels. Hence feature value of such zone in the feature vector is zero. Finally 350 such features are extracted for classification and recognition. The nearest neighbor and the support vector machine classifiers are used for subsequent classification and recognition purpose. We obtained 97.75% and 93.9% of recognition rate for Kannada and Tamil numerals respectively using nearest neighbor classifier. We obtained 98.2% and 94.9% of recognition rate for Kannada and Tamil numerals respectively using support vector machine classifier.
Keywords :
feature extraction; handwritten character recognition; image classification; image resolution; natural language processing; support vector machines; character centroid; empty foreground pixels; feature vector; handwritten character recognition; handwritten numeral recognition; multilingual multiscript country; nearest neighbor classifier; regional languages; south-Indian scripts; support vector machine classifier; zone based hybrid feature extraction algorithm; Character recognition; Feature extraction; Handwriting recognition; Image processing; Image recognition; Image segmentation; Nearest neighbor searches; Pattern recognition; Support vector machine classification; Support vector machines; Feature extraction; Handwriiten character recognition; Image processing; Indian scripts; Nearest neighbor classifier; Support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Nature & Biologically Inspired Computing, 2009. NaBIC 2009. World Congress on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4244-5053-4
Type :
conf
DOI :
10.1109/NABIC.2009.5393386
Filename :
5393386
Link To Document :
بازگشت