• DocumentCode
    3413865
  • Title

    Text segmentation using linear transforms

  • Author

    Chaddha, Navin ; Gupta, Anoop

  • Author_Institution
    Comput. Syst. Lab., Stanford Univ., CA, USA
  • Volume
    2
  • fYear
    1995
  • fDate
    Oct. 30 1995-Nov. 1 1995
  • Firstpage
    1447
  • Abstract
    Block-based linear transforms have found widespread use in image and video compression. However popular compression algorithms using such transforms, such as JPEG, which are very effective in compressing continuous tone images, do not perform well on mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we propose algorithms for text segmentation based on a variety of linear transforms. We analyze the algorithms based on the accuracy and robustness of segmentation. Our results show that any of the popular linear transforms (DCT, DHT, DFT, WHT, DWT) can be used for accurate and robust text segmentation. An important practical implication of our results is that system designers can now use the same transform for both segmentation and compression, thus obtaining substantial savings in computational cost while improving quality.
  • Keywords
    data compression; DCT; DFT; DHT; DWT; WHT; block-based linear transforms; classification; image compression; mixed-mode images; segmentation; text segmentation; video compression; visual quality; Algorithm design and analysis; Compression algorithms; Discrete cosine transforms; Facsimile; Image coding; Image segmentation; Robustness; Software libraries; Transform coding; Video compression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 1995. 1995 Conference Record of the Twenty-Ninth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA, USA
  • ISSN
    1058-6393
  • Print_ISBN
    0-8186-7370-2
  • Type

    conf

  • DOI
    10.1109/ACSSC.1995.540937
  • Filename
    540937