DocumentCode
3413865
Title
Text segmentation using linear transforms
Author
Chaddha, Navin ; Gupta, Anoop
Author_Institution
Comput. Syst. Lab., Stanford Univ., CA, USA
Volume
2
fYear
1995
fDate
Oct. 30 1995-Nov. 1 1995
Firstpage
1447
Abstract
Block-based linear transforms have found widespread use in image and video compression. However, popular compression algorithms that use such transforms, such as JPEG, are very effective at compressing continuous-tone images but do not perform well on mixed-mode images that have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries, and educational videos, there are advantages in being able to classify each block as either text or continuous tone. With such a classification, different compression parameters or even different algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we propose algorithms for text segmentation based on a variety of linear transforms. We analyze the algorithms in terms of the accuracy and robustness of segmentation. Our results show that any of the popular linear transforms (DCT, DHT, DFT, WHT, DWT) can be used for accurate and robust text segmentation. An important practical implication of our results is that system designers can use the same transform for both segmentation and compression, thus obtaining substantial savings in computational cost while improving quality.
Keywords
data compression; DCT; DFT; DHT; DWT; WHT; block-based linear transforms; classification; image compression; mixed-mode images; segmentation; text segmentation; video compression; visual quality; Algorithm design and analysis; Compression algorithms; Discrete cosine transforms; Facsimile; Image coding; Image segmentation; Robustness; Software libraries; Transform coding; Video compression
fLanguage
English
Publisher
IEEE
Conference_Titel
Signals, Systems and Computers, 1995. 1995 Conference Record of the Twenty-Ninth Asilomar Conference on
Conference_Location
Pacific Grove, CA, USA
ISSN
1058-6393
Print_ISBN
0-8186-7370-2
Type
conf
DOI
10.1109/ACSSC.1995.540937
Filename
540937