DocumentCode
3627603
Title
Improving HTML Compression
Author
Przemyslaw Skibinski
Author_Institution
Univ. of Wroclaw, Wroclaw
fYear
2008
Firstpage
545
Lastpage
545
Abstract
In this work, we describe a lossless HTML transform that, combined with the widely used LZ77 and PPM compression algorithms, attains high compression ratios. Its core is a fully reversible transform featuring substitution of words in an HTML document using a static or semi-static dictionary, and effective encoding of dictionary indices and numbers. The test results show that the proposed transform improves the HTML compression efficiency of general-purpose compressors on average by 17% in the case of Deflate and 8% in the case of PPMVC.
Keywords
"HTML","Compression algorithms","Dictionaries","Internet","Data compression","Web pages","Compressors","XML","Image coding","Computer science"
Publisher
IEEE
Conference_Titel
Data Compression Conference, 2008. DCC 2008
ISSN
1068-0314
Print_ISBN
978-0-7695-3121-2
Type
conf
DOI
10.1109/DCC.2008.74
Filename
4483372