• DocumentCode
    3627603
  • Title

    Improving HTML Compression

  • Author

    Przemyslaw Skibinski

  • Author_Institution
    Univ. of Wroclaw, Wroclaw
  • fYear
    2008
  • Firstpage
    545
  • Lastpage
    545
  • Abstract
    In this work, we describe a lossless HTML transform which, combined with generally used LZ77 and PPM compression algorithms, allows to attain high compression ratios. Its core is a fully reversible transform featuring substitution of words in an HTML document using a static dictionary or a semi-static dictionary, effective encoding of dictionary indices and numbers.The test results show the proposed transform to improve the HTML compression efficiency of general purpose compressors on average by 17% in case of Deflate and 8% in case of PPMVC.
  • Keywords
    "HTML","Compression algorithms","Dictionaries","Internet","Data compression","Web pages","Compressors","XML","Image coding","Computer science"
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference, 2008. DCC 2008
  • ISSN
    1068-0314
  • Print_ISBN
    978-0-7695-3121-2
  • Type

    conf

  • DOI
    10.1109/DCC.2008.74
  • Filename
    4483372