Title :
XAdap: An Adaptive Huffman Coding on Markup Languages
Author :
Cherukuri, Kishore ; Agarwal, Suneeta
Author_Institution :
MNNIT, Allahabad
Abstract :
XML documents are used to exchange data and to store large amounts of data on the Web. These documents are extremely verbose and require specialized compression for efficient transformation. In this paper we analyze various existing compressors and propose a new method, XAdap, which uses adaptive Huffman coding. It is based on the principle of extracting the data from the document and grouping it by semantics. The document is encoded as a sequence of integers, while the data is grouped by XML tag, attribute, or comment. The reorganized data is then compressed with adaptive Huffman coding. We compare the proposed method with existing tools that use Huffman coding. Performance evaluation shows that XAdap outperforms previously proposed XML-specific compression tools.
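The separation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `separate`, the `END` marker, the `@` prefix for attribute names, and the container layout are all assumptions made for illustration. The sketch ignores comments and tail text, so it is not a reversible encoding, and the final adaptive Huffman stage is omitted.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

END = 0  # hypothetical marker meaning "close the current element"


def separate(xml_text):
    """Split an XML string into an integer structure stream and per-name data groups.

    Illustrative only: comments and tail text are ignored, and no entropy
    coding is applied to either output.
    """
    root = ET.fromstring(xml_text)
    tag_ids = {}                    # tag or attribute name -> small integer
    structure = []                  # document skeleton as a list of integers
    containers = defaultdict(list)  # name -> text values found under that name

    def name_id(name):
        # Assign the next free integer the first time a name is seen.
        if name not in tag_ids:
            tag_ids[name] = len(tag_ids) + 1
        return tag_ids[name]

    def walk(elem):
        structure.append(name_id(elem.tag))
        for attr, value in elem.attrib.items():
            key = "@" + attr        # assumed convention to tell attributes from tags
            structure.append(name_id(key))
            containers[key].append(value)
        text = (elem.text or "").strip()
        if text:
            containers[elem.tag].append(text)
        for child in elem:
            walk(child)
        structure.append(END)

    walk(root)
    return tag_ids, structure, dict(containers)


if __name__ == "__main__":
    doc = "<lib><book id='1'><title>A</title></book><book id='2'><title>B</title></book></lib>"
    ids, skeleton, groups = separate(doc)
    print(ids)
    print(skeleton)
    print(groups)
```

Grouping values by name places similar strings next to each other (all titles together, all ids together), which is the property a downstream entropy coder such as adaptive Huffman coding would be expected to exploit.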
Keywords :
Huffman codes; XML; data compression; XAdap; XML documents; adaptive Huffman coding; markup languages; Application software; Compressors; Computational intelligence; Computer science; Decoding; Encoding; Huffman coding; Statistics
Conference_Title :
International Conference on Computational Intelligence and Multimedia Applications, 2007
Conference_Location :
Sivakasi, Tamil Nadu
Print_ISBN :
0-7695-3050-8
DOI :
10.1109/ICCIMA.2007.156