• DocumentCode
    3287400
  • Title

    Observations on Compressing Text Files of Varying Length

  • Author

    Btoush, Mohammad Hjouj ; Siddiqi, Jawed ; Akhgar, Babak ; Dawahdeh, Ziad

  • Author_Institution
    Sheffield Hallam Univ., Sheffield
  • fYear
    2008
  • fDate
    7-9 April 2008
  • Firstpage
    224
  • Lastpage
    228
  • Abstract
    The paper compares different data compression algorithms of text files: LZW, Huffman, fixed-length code (FLC), and Huffman after using fixed-length code (HFLC). We compare these algorithms on different text files of different sizes in terms of compression scales of: size, ratio, time (speed), and entropy. Our evaluation reveals that initially for smaller size files the simplest algorithm namely LZW performs worst for first two scales than the more complex Huffman algorithm but as the size of the text increases interestingly the position is reversed. Moreover for the scales time and entropy LZW performs better than Huffmans but for larger files once again the position is reversed.
  • Keywords
    data compression; text analysis; data compression algorithms; fixed-length code; text files; Binary trees; Compression algorithms; Compressors; Data compression; Encoding; Entropy; Image coding; Image storage; Probability; Video compression; Data Compression; Huffman Coding; LZW; Text size;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: New Generations, 2008. ITNG 2008. Fifth International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    0-7695-3099-0
  • Type

    conf

  • DOI
    10.1109/ITNG.2008.61
  • Filename
    4492483