Title :
Handwritten word spotting based on a hybrid optimal distance
Author :
Peng Wang ; Eglin, Veronique ; Largeron, Christine ; Garcia, Christophe
Author_Institution :
LIRIS, INSA-Lyon, Villeurbanne, France
Abstract :
In this paper, we develop a comprehensive representation model for handwriting, which contains both morphological and topological information. An adapted Shape Context descriptor built on structural points is employed to describe the contour of the text. Graphs are first constructed by using the structural points as nodes and the skeleton of the strokes as edges. Based on graphs, Topological Node Features (TNFs) of n-neighbourhood are extracted. Bag-of-Words representation model based on the TNFs is employed to depict the topological characteristics of word images. Moreover, a novel approach for word spotting application by using the proposed model is presented. The final distance is a weighted mixture of the SC cost, and the TNF distribution comparison. Linear Discriminant Analysis (LDA) is used to learn the optimal weight for each part of the distance with the consideration of writing styles. The evaluation of the proposed approach shows the significance of combining the properties of the handwriting from different aspects.
Keywords :
handwriting recognition; image representation; statistical analysis; LDA; TNF; bag-of-words representation model; handwritten word spotting; hybrid optimal distance; linear discriminant analysis; morphological information; shape context descriptor; topological information; topological node features; word images; Adaptation models; Context; Hidden Markov models; Image edge detection; Shape; Skeleton; Writing; LDA; Shape Context; Topological Node Feature; handwritten document; word spotting;
Conference_Titel :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
DOI :
10.1109/ICIP.2014.7025522