• DocumentCode
    2815767
  • Title

    Modified LZW algorithm for efficient compressed text retrieval

  • Author

    Zhang, Nan ; Tao, Tao ; Satya, Ravi Vijaya ; Mukherjee, Amar

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Central Florida, Orlando, FL, USA
  • Volume
    2
  • fYear
    2004
  • fDate
    5-7 April 2004
  • Firstpage
    224
  • Abstract
    With increasing amount of text data being stored in the compressed format, efficient information retrieval in the compressed domain has become a major concern. Being able to randomly access the compressed data is highly desirable for efficient retrieval and is required in many applications. For example, in a library information retrieval system, only the records that are relevant to the query are displayed. We present modified LZW algorithms that support fast random access to the compressed text. Instead of fully decompressing the text and outputting the results selectively, we allow random access and partial decoding of the compressed text and displaying the relevant portion. The compression ratio can also be improved using the modified LZW algorithm. Preliminary results on the time and storage performance are given.
  • Keywords
    data compression; information retrieval systems; pattern matching; query processing; information retrieval; library information retrieval system; modified LZW algorithm; partial decoding text; random access text; text retrieval compression; Computer science; Databases; Decoding; Delay; Frequency; Information retrieval; Internet; Libraries; Pattern matching; Search engines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
  • Print_ISBN
    0-7695-2108-8
  • Type

    conf

  • DOI
    10.1109/ITCC.2004.1286636
  • Filename
    1286636