Title :
Recognition of open vocabulary, online handwritten pages in Tamil script
Author :
Urala, K. Bhargava ; Ramakrishnan, A.G. ; Mohamed, Salina
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
Abstract :
In this work, we describe a system that recognises open vocabulary, isolated, online handwritten Tamil words, and extend it to recognise a paragraph of writing. We explain in detail each step involved in the process: segmentation, preprocessing, feature extraction, classification and bigram-based post-processing. On our database of 45,000 handwritten words collected on a tablet PC, we obtain symbol level accuracies of 78.5% without and 85.3% with post-processing based on symbol level language models. The corresponding word level accuracies are 40.1% and 59.6%. We also propose a line and word level segmentation strategy, which gives promising results of 100% line segmentation accuracy and 98.1% word segmentation accuracy in our initial trials on 40 handwritten paragraphs. The two modules have been combined to obtain a full-fledged page recognition system for online handwritten Tamil data. To the knowledge of the authors, this is the first attempt at recognition of open vocabulary, online handwritten paragraphs in any Indian language.
Keywords :
document image processing; handwritten character recognition; image segmentation; natural language processing; notebook computers; Indian language; Tamil script; bigram-based post-processing; feature extraction; full-fledged page recognition system; handwritten paragraphs; online handwritten Tamil words; online handwritten pages; open vocabulary recognition; symbol level accuracy; tablet PC; word level segmentation strategy; Accuracy; Databases; Feature extraction; Handwriting recognition; Hidden Markov models; Support vector machines; Vectors;
Conference_Title :
Signal Processing and Communications (SPCOM), 2014 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-4666-2
DOI :
10.1109/SPCOM.2014.6984002