• DocumentCode
    3764982
  • Title

    Knowledge engineering perspective of text compression

  • Author

    C. Oswald;Anirban I Ghosh;B. Sivaselvan

  • Author_Institution
    Department of Computer Engineering, IIITDM Kancheepuram, Chennai - 127, India
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The paper focuses on an engineering perspective of Data Mining, specifically using it as a tool for efficient data compression. Huffman encoding, a lossless compression technique is refined to incorporate frequent itemset mining, an important phase of Association Rule Mining. The research exploits the principle of assigning shorter codes to frequently occurring patterns(sequence of characters) in relation to single character based code assignment approach of Huffman encoding. Simulation results indicate the benefits of the Data Mining approach to compression, resulting in an efficient data compression algorithm.
  • Keywords
    "Data mining","Itemsets","Channel coding","Data compression","Dictionaries"
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2015 Annual IEEE
  • Electronic_ISBN
    2325-9418
  • Type

    conf

  • DOI
    10.1109/INDICON.2015.7443683
  • Filename
    7443683