DocumentCode :
1633803
Title :
Unconstrained Handwritten Document Layout Extraction Using 2D Conditional Random Fields
Author :
Montreuil, Florent ; Grosicki, Emmanuèle ; Heutte, Laurent ; Nicolas, Stéphane
Author_Institution :
DGA, Centre d´´Expertise Parisien, Arcueil, France
fYear :
2009
Firstpage :
853
Lastpage :
857
Abstract :
The paper describes a new approach using a conditional random fields (CRFs) to extract physical and logical layouts in unconstrained handwritten letters such as those sent by individuals to companies. In this approach, the extraction of the layouts is considered as a labeling task consisting in assigning a label to each pixel of the document image. This label is chosen among a set of labels depicting the layout elements. The CRF-based method models two stochastic processes : the first one corresponds to the association between pixels and labels, the second one to the relationship of one label with respect to its neighboring labels. The CRF model gives access to the global conditional probability of a given labeling of the image according to image features and some prior knowledge about the structure of the document. This global probability is computed by means of local conditional probabilities at each pixel. To find the best label field, a key point of our model is the implementation of the optimal inference 2D dynamic programming method. Experiments have been performed on 1250 handwritten letters of the RIMES database. Good results have been reported showing the capacity of our approach to extract simultaneously the physical and logical layouts.
Keywords :
Markov processes; document image processing; dynamic programming; feature extraction; handwritten character recognition; image classification; learning (artificial intelligence); probability; random processes; support vector machines; 2D conditional random field; CRF; Markov process; RIMES database; SVM classifier; document image processing; global conditional probability; label assignment; local conditional probability; logical layout extraction; optimal inference 2D dynamic programming method; physical layout extraction; stochastic process; unconstrained handwritten document layout extraction; Data mining; Dissolved gas analysis; Handwriting recognition; Image analysis; Image segmentation; Labeling; Pixel; Stochastic processes; Tellurium; Text analysis; Conditional Random Fields; Layout Extraction; Page Segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
ISSN :
1520-5363
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2009.132
Filename :
5277530
Link To Document :
بازگشت