DocumentCode :
1809158
Title :
A system for word-wise handwritten script identification for Indian postal automation
Author :
Roy, K. ; Banerjee, A. ; Pal, U.
Author_Institution :
Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Kolkata, India
fYear :
2004
fDate :
20-22 Dec. 2004
Firstpage :
266
Lastpage :
271
Abstract :
Postal automation has been a topic of research for the last few years. Many works address postal automation in the USA, UK, Japan and Australia, but there is no significant work on Indian postal automation. This paper deals with word-wise handwritten script identification for Indian postal automation. In the proposed scheme, document skew is first detected and corrected. Non-text parts are then segmented from the document using the run length smoothing algorithm (RLSA). Next, using a piece-wise projection method, the destination address block (DAB) is segmented first into lines and then the lines into words. Using the water reservoir concept, the busy-zone of each word is computed. Finally, using features based on the matra/Shirorekha, the water reservoir concept, etc., a tree classifier is generated for word-wise identification of Bangla/Devnagari and English scripts.
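The RLSA step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a binary image stored as lists of 0 (background) and 1 (foreground), uses the common variant that fills only background runs bounded by foreground on both sides, and the names `smooth_row`, `rlsa`, `h_threshold` and `v_threshold` are hypothetical.

```python
def smooth_row(row, threshold):
    """Fill background runs (0s) of length <= threshold that lie
    between two foreground pixels (1s). Assumed variant of RLSA."""
    out = list(row)
    last_fg = None  # index of the most recent foreground pixel
    for i, pixel in enumerate(row):
        if pixel == 1:
            gap = i - last_fg - 1 if last_fg is not None else 0
            if 0 < gap <= threshold:
                for j in range(last_fg + 1, i):
                    out[j] = 1
            last_fg = i
    return out


def rlsa(image, h_threshold, v_threshold):
    """Smooth rows and columns separately, then AND the two results."""
    horizontal = [smooth_row(r, h_threshold) for r in image]
    # Transpose, smooth each column as if it were a row, transpose back.
    columns = [list(c) for c in zip(*image)]
    smoothed_columns = [smooth_row(c, v_threshold) for c in columns]
    vertical = [list(r) for r in zip(*smoothed_columns)]
    return [
        [h & v for h, v in zip(h_row, v_row)]
        for h_row, v_row in zip(horizontal, vertical)
    ]
```

With a threshold of 2, the row `[1, 0, 0, 1, 0, 0, 0, 1]` has its first gap filled and its second, longer gap left open. Suitable thresholds depend on image resolution and are not given in this record.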
Keywords :
document image processing; feature extraction; government data processing; handwriting recognition; handwritten character recognition; natural languages; office automation; postal services; Australia; Bangla-Devnagari script; DAB; English scripts identification; Indian postal automation; Japan; RLSA; UK; USA; concept based feature; destination address block; document skew detection; matra-Shirorekha water reservoir; piece-wise projection method; run length smoothing algorithm; word-wise handwritten script identification; Australia; Automation; Feature extraction; Histograms; Natural languages; Optical character recognition software; Reservoirs; Seals; Smoothing methods; Water resources;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
India Annual Conference, 2004. Proceedings of the IEEE INDICON 2004. First
Print_ISBN :
0-7803-8909-3
Type :
conf
DOI :
10.1109/INDICO.2004.1497753
Filename :
1497753