• DocumentCode
    1729340
  • Title

    A Simple Approach to Optimized Text Compression's Performance

  • Author

    Wichaiwong, Tanakorn ; Koonsanit, Kitti ; Juruskulchai, C.

  • Author_Institution
    Dept. Of Comput. Sci., Kasetsart Univ., Bangkok
  • fYear
    2008
  • Firstpage
    66
  • Lastpage
    70
  • Abstract
    Most data exchanged on the World Wide Web is still represented as text, for example in data exchange between Web Services based on XML technology and in data stored in relational databases. Unfortunately, the attractive properties of text data come at the expense of data transfer performance. One way to improve performance is compression. In this paper we present a new compression algorithm that uses capitalization. The mechanism has three steps: first, remove white space; second, compress the data into the UpperCamelCase capitalization style; and lastly, decompress the compressed data. Our experiments show significant gains, including a reduction in data size of up to 22% while preserving data integrity. In addition, the compressed data is easy to read and understand, like the naming conventions of several programming languages.
  • Keywords
    Internet; XML; data compression; data integrity; data reduction; electronic data interchange; information retrieval; text analysis; UpperCamelCase capitalization style; Web services; World Wide Web; XML technology; data decompression; data exchange; data integrity; data size reduction; data transfer; information retrieval; naming convention; optimized text compression algorithm; programming language; relational database; storage data; Computer languages; Data compression; Indexing; Information retrieval; Probability; Relational databases; Web services; Web sites; White spaces; XML; Capitalization Styles; Compression; Information Retrieval; Performance;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Next Generation Web Services Practices, 2008. NWESP '08. 4th International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-0-7695-3455-8
  • Electronic_ISBN
    978-0-7695-3455-8
  • Type

    conf

  • DOI
    10.1109/NWeSP.2008.12
  • Filename
    4700383
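
The three steps named in the abstract (remove white space, convert to UpperCamelCase, decompress) can be illustrated with a minimal sketch. This is not the authors' implementation, which the record does not describe. It assumes the input consists of lowercase alphabetic words separated by single spaces, so that an uppercase letter can serve as the only word-boundary marker. The function names `compress`, `decompress`, and `saving` are hypothetical.

```python
import re


def compress(text: str) -> str:
    """Drop whitespace and mark each word start with an uppercase letter.

    Assumption: words are lowercase and alphabetic, so capitalization
    carries no information other than the word boundary.
    """
    return "".join(word[0].upper() + word[1:] for word in text.split())


def decompress(packed: str) -> str:
    """Split before each uppercase letter, lowercase, and rejoin with spaces."""
    words = re.findall(r"[A-Z][^A-Z]*", packed)
    return " ".join(word.lower() for word in words)


def saving(text: str) -> float:
    """Fraction of characters removed by compress(); 0.0 for empty input."""
    if not text:
        return 0.0
    return 1 - len(compress(text)) / len(text)


if __name__ == "__main__":
    sample = "data exchange in web services"
    packed = compress(sample)
    print(packed)
    print(decompress(packed) == sample)
    print(round(saving(sample), 3))
```

Under these assumptions the round trip is lossless, and the saving equals the share of characters that were spaces. Text that already contains uppercase letters, digits, punctuation, or repeated spaces would need extra handling (for example an escape marker) to keep the data intact, which this sketch does not attempt.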