Title :
A New Approach to Feature Selection in Handwritten Farsi/Arabic Character Recognition
Author :
Shayegan, M.A. ; Chee Seng Chan
Author_Institution :
Dept. of Artificial Intell., Univ. of Malaya, Kuala Lumpur, Malaysia
Abstract :
Feature extraction and feature selection are very important steps in pattern recognition systems. However, finding an optimal, effective, and robust feature set is usually a difficult task. In this paper, with the use of a comprehensive study on offline handwritten Farsi/Arabic digit recognition systems, a set of well-known features were extracted. Then, by employing one- and two-dimensional spectrum diagrams for standard deviation and minimum to maximum distributions, an optimal subset of initial features set was selected automatically. Experimental results, according to one of the biggest standard handwritten Farsi digit datasets, the HODA, had shown 95.70% accuracy with the proposed method. The achieved results showed a salient improvement in system precision in comparison to using other state-of-the-art approaches.
Keywords :
feature extraction; handwritten character recognition; natural language processing; optical character recognition; set theory; HODA; feature selection; handwritten Farsi-Arabic character recognition; offline handwritten Farsi-Arabic digit recognition systems; one-dimensional spectrum diagrams; optimal subset; pattern recognition systems; robust feature set; standard handwritten Farsi digit datasets; two-dimensional spectrum diagrams; Farsi/Arabic Handwritten OCR; Feature Extraction and Selection; Principal Component Analysis; Spectrum Diagram;
Conference_Titel :
Advanced Computer Science Applications and Technologies (ACSAT), 2012 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4673-5832-3
DOI :
10.1109/ACSAT.2012.77