DocumentCode :
2199263
Title :
Word-Wise Handwritten Persian and Roman Script Identification
Author :
Roy, Kaushik ; Alaei, Alireza ; Pal, Umapada
Author_Institution :
Dept. of Comput. Sci., West Bengal State Univ., Kolkata, India
fYear :
2010
fDate :
16-18 Nov. 2010
Firstpage :
628
Lastpage :
633
Abstract :
Most of the countries use bi-script documents. This is because every country uses its own national language and English as second/foreign language. Therefore, bi-lingual document with one language being the English and other being the national language is very common. Postal documents are a very good example of such bi-lingual/script document. This paper deals with word-wise handwritten script identification from bi-script documents written in Persian and Roman. In the proposed scheme, simple but fast computable set of 12 features based on fractal dimension, position of small component, topology etc. are used and a set of classifiers are employed for script identification experiments. We tested our scheme on a dataset of 5000 handwritten Persian and English words and 99.20% of correct script identification is obtained.
Keywords :
document image processing; handwritten character recognition; natural language processing; pattern classification; bi-lingual document; fractal dimension; national language; postal documents; word-wise handwritten Persian script identification; word-wise handwritten Roman script identification; Fractal dimension; Handwritten script identification; Persian handwritten Recognition; Word-wise script identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-8353-2
Type :
conf
DOI :
10.1109/ICFHR.2010.103
Filename :
5693634
Link To Document :
بازگشت