• DocumentCode
    1593213
  • Title

    Digipaper: a versatile color document image representation

  • Author

    Huttenlocher, Daniel ; Felzenszwalb, Pedro ; Rucklidge, William

  • Author_Institution
    Dept. of Comput. Sci., Cornell Univ., Ithaca, NY, USA
  • Volume
    1
  • fYear
    1999
  • fDate
    6/21/1905 12:00:00 AM
  • Firstpage
    219
  • Abstract
    We describe a segmentation method and associated file format for storing images of color documents. We separate each page of the document into three layers, containing the background (usually one or more photographic images), the text, and the color of the text. Each of these layers has different properties, making it desirable to use different compression methods to represent the three layers. The background layers are compressed using any method designed for photographic images, the text layers are compressed using a token-based representation, and the text color layers are compressed by augmenting the representation used for the text layers. We also describe an algorithm for segmenting images into these three layers. This representation and algorithm can produce very highly-compressed document files that nonetheless retain excellent image quality
  • Keywords
    image representation; image segmentation; Digipaper; associated file format; background layers; image quality; image segmentation; photographic images; token-based representation; versatile color document image representation; Computer science; Design methodology; Discrete cosine transforms; Image coding; Image quality; Image representation; Image segmentation; Pixel; Quantization; Transform coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 1999. ICIP 99. Proceedings. 1999 International Conference on
  • Conference_Location
    Kobe
  • Print_ISBN
    0-7803-5467-2
  • Type

    conf

  • DOI
    10.1109/ICIP.1999.821601
  • Filename
    821601