Title of article :
Stochastic language models for style-directed layout analysis of document images
Author/Authors :
Kanungo، نويسنده , , T.، نويسنده , , Song Mao
، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2003
Abstract :
Image segmentation is an important component of
any document image analysis system. While many segmentation algorithms
exist in the literature, very few i) allowusers to specify the
physical style, and ii) incorporate user-specified style information
into the algorithm’s objective function that is to be minimized.We
describe a segmentation algorithm that models a document’s physical
structure as a hierarchical structure where each node describes
a region of the document using a stochastic regular grammar. The
exact form of the hierarchy and the stochastic language is specified
by the user, while the probabilities associated with the transitions
are estimated from groundtruth data.We demonstrate the segmentation
algorithm on images of bilingual dictionaries.
Keywords :
duration hidden Markovmodels , bilingual dictionaries , physical layout analysis , Stochastic regular grammar , style-directed analysis.
Journal title :
IEEE TRANSACTIONS ON IMAGE PROCESSING
Journal title :
IEEE TRANSACTIONS ON IMAGE PROCESSING