Title :
JPEG2000-matched MRC compression of compound documents
Author :
Mukherjee, Debargha ; Chrysafis, Christos ; Said, Amir
Author_Institution :
Compression & Multimedia Technol. Group, Hewlett Packard Labs., Palo Alto, CA, USA
Abstract :
The mixed raster content (MRC) ITU document compression standard (T.44) specifies a multilayer decomposition model for compound documents into two contone image layers and a binary mask layer for independent compression. While T.44 does not recommend any procedure for decomposition, it does specify a set of allowable layer codecs to be used after decomposition. While T.44 only allows older standardized codecs such as JPEG/JBIG/G3/G4, higher compression could be achieved if newer contone and bi-level compression standards such as JPEG2000/JBIG2 were used instead. We present an MRC compound document codec using JPEG2000 as the image layer codec and a layer decomposition scheme matched to JPEG2000 for efficient compression. JBIG still codes the mask. Noise removal routines enable efficient coding of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The segmentation mask, obtained from layer decomposition, serves to separate text and other features.
Keywords :
data compression; document image processing; feature extraction; image coding; image segmentation; ITU document compression standard; JBIG; JPEG2000; MRC compression; bi-level compression; binary mask layer; compound documents; contone image layers; decoding; electronic documents; feature separation; mixed raster content; multilayer decomposition; resolution scalable features; scanned documents; segmentation mask; Code standards; Codecs; Decoding; Graphics; Image coding; Image segmentation; Laboratories; Nonhomogeneous media; Standards development; Transform coding;
Conference_Titel :
Image Processing. 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7622-6
DOI :
10.1109/ICIP.2002.1038906