DocumentCode :
1225636
Title :
Algorithms for compressing compound document images with large text/background overlap
Author :
Wu, B.-F. ; Chiu, C.-C. ; Chen, Y.-L.
Author_Institution :
Dept. of Electr. & Control Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume :
151
Issue :
6
fYear :
2004
Firstpage :
453
Lastpage :
459
Abstract :
Two algorithms are presented for compressing image documents, with a high compression ratio for both colour and monochromatic compound document images. The proposed algorithms apply a new method of segmentation to separate the text from the image in a compound document in which the text overlaps the background. The segmentation method classifies document images into three planes: the text plane, the background (non-text) plane and the text´s colour plane, each of which are processed using different compression techniques. The text plane is compressed using the pattern matching technique, called JB2. Wavelet transform and zerotree coding are used to compress the background plane and the text´s colour plane. Assigning bits for different planes yields high-quality compound document images with both a high compression ratio and well presented text. The proposed algorithms greatly outperform two well known image compression methods, JPEG and DjVu, and enable the effective extraction of the text from a complex background, achieving a high compression ratio for compound document images.
Keywords :
data compression; document image processing; feature extraction; image classification; image coding; image colour analysis; image matching; image segmentation; wavelet transforms; DjVu; JPEG; background plane; compound document image compression; image segmentation method; monochromatic compound document image; pattern matching technique; text colour plane; text plane; text-background overlap; wavelet transform; zerotree coding;
fLanguage :
English
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
Publisher :
iet
ISSN :
1350-245X
Type :
jour
DOI :
10.1049/ip-vis:20040805
Filename :
1389216
Link To Document :
بازگشت