DocumentCode :
592004
Title :
A Semi-automatic Annotation Scheme for Bangla Online Mixed Cursive Handwriting Samples
Author :
Bhattacharya, Ujjwal ; Banerjee, Rohan ; Baral, S. ; De, Rishika ; Parui, Swapan K.
Author_Institution :
CVPR Unit, Indian Statistical Institute, Kolkata, India
fYear :
2012
fDate :
18-20 Sept. 2012
Firstpage :
680
Lastpage :
685
Abstract :
The need for annotated handwriting samples in developing recognition algorithms is well established. Annotated databases of unconstrained handwriting exist for several scripts of a few languages, but not for any of the scripts of India. For Indian scripts, a few databases of handwritten isolated characters are publicly available, covering both online and offline handwriting; however, no publicly available database of unconstrained handwriting exists for any of them. Moreover, unconstrained handwriting in Bangla, the second most popular of the Indian scripts, is mixed cursive in nature, unlike that in the other scripts of India, so annotation of Bangla unconstrained handwriting samples needs special consideration. Over the last few years, our group at the Indian Statistical Institute, Kolkata, has been working towards the development of a large annotated database of online Bangla handwriting samples. We have developed a GUI-based semi-automatic scheme for annotating these samples at the character boundary level, together with a scheme for XML representation of the annotated data. The system, implemented for unconstrained Bangla handwriting, may easily be customized for other scripts, and is currently in use for annotating a large database of Bangla unconstrained online handwriting.
Keywords :
XML; graphical user interfaces; handwritten character recognition; image recognition; image retrieval; natural language processing; Bangla online mixed-cursive unconstrained handwriting sample annotation; GUI-based semiautomatic annotation scheme; Indian Statistical Institute; Indian language scripts; Kolkata; XML representation; character boundary level annotation; offline handwriting samples; publicly available handwritten isolated character database; Character recognition; Data collection; Databases; Handwriting recognition; Hidden Markov models; Shape; XML; Bangla online handwriting sample; Online handwriting; annotation of handwriting samples; semi-automatic annotation scheme;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
Type :
conf
DOI :
10.1109/ICFHR.2012.168
Filename :
6424475