Title of article :
Stochastic language models for style-directed layout analysis of document images
Author/Authors :
Kanungo، نويسنده , , T.، نويسنده , , Song Mao ، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2003
Pages :
14
From page :
583
To page :
596
Abstract :
Image segmentation is an important component of any document image analysis system. While many segmentation algorithms exist in the literature, very few i) allowusers to specify the physical style, and ii) incorporate user-specified style information into the algorithm’s objective function that is to be minimized.We describe a segmentation algorithm that models a document’s physical structure as a hierarchical structure where each node describes a region of the document using a stochastic regular grammar. The exact form of the hierarchy and the stochastic language is specified by the user, while the probabilities associated with the transitions are estimated from groundtruth data.We demonstrate the segmentation algorithm on images of bilingual dictionaries.
Keywords :
duration hidden Markovmodels , bilingual dictionaries , physical layout analysis , Stochastic regular grammar , style-directed analysis.
Journal title :
IEEE TRANSACTIONS ON IMAGE PROCESSING
Serial Year :
2003
Journal title :
IEEE TRANSACTIONS ON IMAGE PROCESSING
Record number :
396857
Link To Document :
بازگشت