Title :
JPEG-matched MRC compression of compound documents
Author :
Mukherjee, Debargha ; Memon, Nasir ; Said, Amir
Author_Institution :
Compression & Multimedia Technol. Group, Hewlett Packard Labs., Palo Alto, CA, USA
fDate :
6/23/1905 12:00:00 AM
Abstract :
Mixed raster content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document, and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features
Keywords :
arithmetic codes; code standards; data compression; decoding; document image processing; image coding; image representation; image segmentation; interference suppression; noise; ITU document compression standard; JPEG-matched MRC compression; T.44; arithmetic coding; background image layers; binary mask layer; compound documents; decomposition method; electronic document; foreground image layers; image layers; integrated noise removal routines; layer encoders; mixed raster content; multilayer representation; page segmenter; resolution scalable decoding features; scanned documents; segmentation mask; standardized coders; Arithmetic; Code standards; Codecs; Decoding; Document handling; Image coding; Image segmentation; Nonhomogeneous media; Optimization methods; Transform coding;
Conference_Titel :
Image Processing, 2001. Proceedings. 2001 International Conference on
Conference_Location :
Thessaloniki
Print_ISBN :
0-7803-6725-1
DOI :
10.1109/ICIP.2001.958144