DocumentCode
3287400
Title
Observations on Compressing Text Files of Varying Length
Author
Btoush, Mohammad Hjouj ; Siddiqi, Jawed ; Akhgar, Babak ; Dawahdeh, Ziad
Author_Institution
Sheffield Hallam Univ., Sheffield
fYear
2008
fDate
7-9 April 2008
Firstpage
224
Lastpage
228
Abstract
The paper compares data compression algorithms for text files: LZW, Huffman, fixed-length code (FLC), and Huffman after fixed-length code (HFLC). We compare these algorithms on text files of different sizes using four compression measures: size, ratio, time (speed), and entropy. Our evaluation reveals that for smaller files the simplest algorithm, LZW, performs worse than the more complex Huffman algorithm on the first two measures (size and ratio), but as the text size increases the position is reversed. Moreover, on the time and entropy measures LZW initially performs better than Huffman, but for larger files the position is once again reversed.
Keywords
data compression; text analysis; data compression algorithms; fixed-length code; text files; Binary trees; Compression algorithms; Compressors; Encoding; Entropy; Image coding; Image storage; Probability; Video compression; Huffman Coding; LZW; Text size
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Technology: New Generations, 2008. ITNG 2008. Fifth International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
0-7695-3099-0
Type
conf
DOI
10.1109/ITNG.2008.61
Filename
4492483