Title :
Text Segmentation for MRC Document Compression
Author :
Haneda, Eri ; Bouman, Charles A.
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
fDate :
6/1/2011 12:00:00 AM
Abstract :
The mixed raster content (MRC) standard (ITU-T T.44) specifies a framework for document compression which can dramatically improve the compression/quality tradeoff as compared to traditional lossy image compression algorithms. The key to MRC compression is the separation of the document into foreground and background layers, represented as a binary mask. Therefore, the resulting quality and compression ratio of a MRC document encoder is highly dependent upon the segmentation algorithm used to compute the binary mask. In this paper, we propose a novel multiscale segmentation scheme for MRC document encoding based upon the sequential application of two algorithms. The first algorithm, cost optimized segmentation (COS), is a blockwise segmentation algorithm formulated in a global cost optimization framework. The second algorithm, connected component classification (CCC), refines the initial segmentation by classifying feature vectors of connected components using an Markov random field (MRF) model. The combined COS/CCC segmentation algorithms are then incorporated into a multiscale framework in order to improve the segmentation accuracy of text with varying size. In comparisons to state-of-the-art commercial MRC products and selected segmentation algorithms in the literature, we show that the new algorithm achieves greater accuracy of text detection but with a lower false detection rate of nontext features. We also demonstrate that the proposed segmentation algorithm can improve the quality of decoded documents while simultaneously lowering the bit rate.
Keywords :
Markov processes; data compression; image classification; image coding; image segmentation; MRC document compression; Markov random field model; binary mask; blockwise segmentation algorithm; compression-quality tradeoff; connected component classification; cost optimized segmentation; encoder; false detection rate; feature vector classification; lossy image compression algorithm; mixed raster content standard; multiscale segmentation scheme; text detection; text segmentation; Classification algorithms; Hidden Markov models; Image coding; Image color analysis; Image segmentation; Markov processes; Pixel; Document compression; MRC compression; Markov random fields; Multiscale image analysis; image segmentation; Algorithms; Automatic Data Processing; Data Compression; Documentation; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Pattern Recognition, Automated;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2010.2101611