Title :
A Self Organizing Map based Urdu Nasakh character recognition
Author :
Hussain, Syed Afaq ; Zaman, Safdar ; Ayub, Muhammad
Author_Institution :
Dept. of Comput. Sci., Air Univ. Islamabad, Islamabad, Pakistan
Abstract :
Research in the field of character recognition for Urdu script faces challenges mainly due to its characteristics, like cursive nature, multiple fonts and context dependent shapes of characters and their position with respect to the base line. This paper addresses problems recognizing Nasakh script of Urdu Language. The proposed system takes segmented character as input and recognizes them in two steps. In the first step the different shapes of each character are classifies into 33 categories using Kohonen Self-organizing Map (SOM) by auto clustering similar ligatures for initial classification. During the Feature Extraction phase more than twenty five different features are extracted from each character which are further processed for final character recognition.
Keywords :
feature extraction; optical character recognition; self-organising feature maps; Kohonen self-organizing map; Nasakh script recognition; Urdu Nasakh character recognition; Urdu language; feature extraction; similar ligatures clustering; Character recognition; Computer science; Feature extraction; Image converters; Neural networks; Optical character recognition software; Optical sensors; Organizing; Shape; Writing; Clustering; Neural Network; Offline Character Recognition; Self Organizing Map (SOM); Urdu Nasakh;
Conference_Titel :
Emerging Technologies, 2009. ICET 2009. International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4244-5630-7
Electronic_ISBN :
978-1-4244-5631-4
DOI :
10.1109/ICET.2009.5353161