DocumentCode :
3020352
Title :
Tree structure for word extraction from handwritten text lines
Author :
Varga, Tamás ; Bunke, Horst
Author_Institution :
Inst. fur Informatik und Angewandte Math., Bern Univ., Switzerland
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
352
Abstract :
Word extraction from handwritten text lines usually involves the calculation of a line specific threshold which separates the gaps between words from the gaps inside the words in that line. We show that this approach can be improved if the decision about a gap is not only made in terms of a threshold, but also depends on the context of that gap, i.e. if the relative sizes of the surrounding gaps are taken into consideration. For this purpose, we propose to build a structure tree of the text line, whose nodes represent possible word candidates. Such a tree is traversed in a top-down manner to find the nodes that correspond to words of the text line. Experiments with different gap metrics as well as threshold types show that the new method can yield significant improvements over conventional word extraction methods.
Keywords :
handwritten character recognition; text analysis; tree data structures; handwritten text lines; threshold based method; tree structure; word extraction; Neural networks; Text recognition; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.245
Filename :
1575568
Link To Document :
بازگشت