DocumentCode
2674693
Title
A Self Organizing Map based Urdu Nasakh character recognition
Author
Hussain, Syed Afaq ; Zaman, Safdar ; Ayub, Muhammad
Author_Institution
Dept. of Comput. Sci., Air Univ. Islamabad, Islamabad, Pakistan
fYear
2009
fDate
19-20 Oct. 2009
Firstpage
267
Lastpage
273
Abstract
Character recognition research for Urdu script faces challenges that stem mainly from the script's characteristics: its cursive nature, multiple fonts, context-dependent character shapes, and the position of characters relative to the baseline. This paper addresses the problem of recognizing the Nasakh script of the Urdu language. The proposed system takes segmented characters as input and recognizes them in two steps. In the first step, the different shapes of each character are classified into 33 categories using a Kohonen Self-Organizing Map (SOM), which automatically clusters similar ligatures for initial classification. During the feature extraction phase, more than twenty-five different features are extracted from each character and processed further for final character recognition.
Keywords
feature extraction; optical character recognition; self-organising feature maps; Kohonen self-organizing map; Nasakh script recognition; Urdu Nasakh character recognition; Urdu language; feature extraction; similar ligatures clustering; Character recognition; Computer science; Feature extraction; Image converters; Neural networks; Optical character recognition software; Optical sensors; Organizing; Shape; Writing; Clustering; Neural Network; Offline Character Recognition; Self Organizing Map (SOM); Urdu Nasakh;
fLanguage
English
Publisher
IEEE
Conference_Titel
Emerging Technologies, 2009. ICET 2009. International Conference on
Conference_Location
Islamabad
Print_ISBN
978-1-4244-5630-7
Electronic_ISBN
978-1-4244-5631-4
Type
conf
DOI
10.1109/ICET.2009.5353161
Filename
5353161