DocumentCode
861499
Title
Online handwritten script recognition
Author
Namboodiri, Anoop M. ; Jain, Anil K.
Author_Institution
Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Volume
26
Issue
1
fYear
2004
Firstpage
124
Lastpage
130
Abstract
Automatic identification of handwritten script facilitates many important applications such as automatic transcription of multilingual documents and search for documents on the Web containing a particular script. The increase in usage of handheld devices which accept handwritten input has created a growing demand for algorithms that can efficiently analyze and retrieve handwritten data. This paper proposes a method to classify words and lines in an online handwritten document into one of the six major scripts: Arabic, Cyrillic, Devnagari, Han, Hebrew, or Roman. The classification is based on 11 different spatial and temporal features extracted from the strokes of the words. The proposed system attains an overall classification accuracy of 87.1 percent at the word level with 5-fold cross validation on a data set containing 13,379 words. The classification accuracy improves to 95 percent as the number of words in the test sample is increased to five, and to 95.5 percent for complete text lines consisting of an average of seven words.
Keywords
document image processing; feature extraction; handwriting recognition; online operation; Arabic script; Cyrillic script; Devnagari script; Han script; Hebrew script; Roman script; automatic identification; automatic transcription; feature extraction; handheld devices; handwritten data analysis; handwritten data retrieval; multilingual documents; online handwritten script recognition; Algorithm design and analysis; Data mining; Feature extraction; Handheld computers; Handwriting recognition; Information retrieval; Natural languages; Personal digital assistants; Text recognition; Writing; Algorithms; Artificial Intelligence; Automatic Data Processing; Computer Graphics; Computer Simulation; Handwriting; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique; User-Computer Interface;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2004.1261096
Filename
1261096
Link To Document