• DocumentCode
    1673395
  • Title
    JPEG-matched MRC compression of compound documents
  • Author
    Mukherjee, Debargha ; Memon, Nasir ; Said, Amir
  • Author_Institution
    Compression & Multimedia Technol. Group, Hewlett Packard Labs., Palo Alto, CA, USA
  • Volume
    3
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    434
  • Abstract
    Mixed raster content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high-performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution-scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features
  • Keywords
    arithmetic codes; code standards; data compression; decoding; document image processing; image coding; image representation; image segmentation; interference suppression; noise; ITU document compression standard; JPEG-matched MRC compression; T.44; arithmetic coding; background image layers; binary mask layer; compound documents; decomposition method; electronic document; foreground image layers; image layers; integrated noise removal routines; layer encoders; mixed raster content; multilayer representation; page segmenter; resolution scalable decoding features; scanned documents; segmentation mask; standardized coders; Arithmetic; Code standards; Codecs; Decoding; Document handling; Image coding; Image segmentation; Nonhomogeneous media; Optimization methods; Transform coding;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing, 2001. Proceedings. 2001 International Conference on
  • Conference_Location
    Thessaloniki
  • Print_ISBN
    0-7803-6725-1
  • Type
    conf
  • DOI
    10.1109/ICIP.2001.958144
  • Filename
    958144
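
The three-layer model described in the abstract (two image layers plus a binary mask) can be illustrated with a minimal Python sketch of the reconstruction step. This is not the authors' codec and not the T.44 decoding procedure: it assumes all three layers have already been decoded to the same size, and the function name `compose_mrc` and the convention that a mask value of 1 selects the foreground are assumptions made for illustration only.

```python
def compose_mrc(foreground, background, mask):
    """Rebuild a page from three equally sized layers.

    Assumption (illustrative only): mask value 1 selects the foreground
    pixel, mask value 0 selects the background pixel.
    """
    if not (len(foreground) == len(background) == len(mask)):
        raise ValueError("layers must have the same number of rows")
    page = []
    for fg_row, bg_row, m_row in zip(foreground, background, mask):
        if not (len(fg_row) == len(bg_row) == len(m_row)):
            raise ValueError("layers must have the same row width")
        # Per-pixel selection between the two image layers.
        page.append([fg if m else bg for fg, bg, m in zip(fg_row, bg_row, m_row)])
    return page


# Toy 2x3 grayscale example: dark "text" pixels over a light background.
foreground = [[0, 0, 0], [0, 0, 0]]
background = [[255, 250, 255], [250, 255, 250]]
mask = [[1, 0, 0], [0, 1, 1]]
page = compose_mrc(foreground, background, mask)
```

Because the mask alone carries the sharp text edges, the two image layers can stay smooth, which is the property that makes it attractive to match the decomposition to the image-layer coder, as the abstract argues.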