Title :
Alpha-Numerical Sequences Extraction in Handwritten Documents
Author :
Thomas, Simon ; Chatelain, Clément ; Heutte, Laurent ; Paquet, Thierry
Author_Institution :
LITIS, Univ. de Rouen, St. Etienne du Rouvray, France
Abstract :
In this paper, we introduce an alpha-numerical sequences extraction system (keywords, numerical fields or alpha-numerical sequences) in unconstrained handwritten documents. Contrary to most of the approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information : i) the relevant information and ii) the irrelevant information represented by a shallow parsing model. The shallow parsing of isolated text lines allows quick information extraction in any document while rejecting at the same time irrelevant information. Results on a public french incoming mails database show the efficiency of the approach.
Keywords :
feature extraction; handwriting recognition; information retrieval; alpha numerical sequences extraction; handwriting line model; handwritten documents; information extraction; irrelevant information representation; isolated text lines; literature; shallow parsing model;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-8353-2
DOI :
10.1109/ICFHR.2010.44