DocumentCode
1902348
Title
Text segmentation in mixed-mode images
Author
Chaddha, Navin ; Sharma, Rosen ; Agrawal, Avneesh ; Gupta, Anoop
Author_Institution
Comput. Syst. Lab., Stanford Univ., CA, USA
Volume
2
fYear
1994
fDate
31 Oct-2 Nov 1994
Firstpage
1356
Abstract
Block based algorithms have found widespread use in image and video compression. However, popular algorithms such as JPEG, which are very effective in compressing continuous tone images, do not perform well with mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we analyze and compare four methods for block classification in mixed mode images, namely variance, absolute-deviation, edge, and DCT based methods. Our evaluation of each scheme is based on the accuracy of segmentation, robustness across different types of images and sensitivity to the threshold used for segmentation. Our results show that DCT based segmentation offers the best accuracy and robustness. Another advantage of DCT is that it is compatible with standards like JPEG, MPEG and H.261
Keywords
data compression; discrete cosine transforms; edge detection; image classification; image coding; image segmentation; transform coding; visual databases; DCT based methods; H.261; JPEG; MPEG; absolute-deviation; block based algorithms; color facsimile; compression parameters; continuous tone images; digital libraries; edge based methods; edge detection; educational videos; image compression; image database; mixed-mode images; segmentation accuracy; standards compatibility; text segmentation; threshold sensitivity; variance; video compression; Analysis of variance; Discrete cosine transforms; Facsimile; Image analysis; Image coding; Image segmentation; Robustness; Software libraries; Transform coding; Video compression;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers, 1994. 1994 Conference Record of the Twenty-Eighth Asilomar Conference on
Conference_Location
Pacific Grove, CA
ISSN
1058-6393
Print_ISBN
0-8186-6405-3
Type
conf
DOI
10.1109/ACSSC.1994.471679
Filename
471679
Link To Document