DocumentCode
3764982
Title
Knowledge engineering perspective of text compression
Author
C. Oswald;Anirban I Ghosh;B. Sivaselvan
Author_Institution
Department of Computer Engineering, IIITDM Kancheepuram, Chennai - 127, India
fYear
2015
Firstpage
1
Lastpage
6
Abstract
The paper focuses on an engineering perspective of Data Mining, specifically using it as a tool for efficient data compression. Huffman encoding, a lossless compression technique is refined to incorporate frequent itemset mining, an important phase of Association Rule Mining. The research exploits the principle of assigning shorter codes to frequently occurring patterns(sequence of characters) in relation to single character based code assignment approach of Huffman encoding. Simulation results indicate the benefits of the Data Mining approach to compression, resulting in an efficient data compression algorithm.
Keywords
"Data mining","Itemsets","Channel coding","Data compression","Dictionaries"
Publisher
ieee
Conference_Titel
India Conference (INDICON), 2015 Annual IEEE
Electronic_ISBN
2325-9418
Type
conf
DOI
10.1109/INDICON.2015.7443683
Filename
7443683
Link To Document