DocumentCode
478628
Title
Grouping Text Lines in Online Handwritten Japanese Documents by Combining Temporal and Spatial Information
Author
Zhou, Xiang-Dong ; Wang, Da-Han ; Liu, Cheng-Lin
fYear
2008
fDate
16-19 Sept. 2008
Firstpage
61
Lastpage
68
Abstract
We present an effective approach for grouping text lines in online handwritten Japanese documents by combining temporal and spatial information. Initially, strokes are grouped into text line strings according to off-stroke distances. Each text line string is segmented into text lines by dynamic programming (DP) optimizing a cost function trained by the minimum classification error (MCE) method. Over-segmented text lines are then merged with a support vector machine (SVM) classifier for making merge/non-merge decisions, and last, a spatial merge module corrects the segmentation errors caused by delayed strokes. In experiments on the TUAT Kondate database, the proposed approach achieves the Entity Detection Metric (EDM) rate of 0.8816, the Edit-Distance Rate (EDR) of 0.1234, which demonstrates the superiority of our approach.
Keywords
Engines; Feature extraction; Graphical models; Image analysis; Image representation; Image segmentation; Information analysis; Optical character recognition software; Text analysis; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on
Conference_Location
Nara, Japan
Print_ISBN
978-0-7695-3337-7
Type
conf
DOI
10.1109/DAS.2008.15
Filename
4669946
Link To Document