• DocumentCode
    2579129
  • Title

    Answering Multiple Queries in Compressed Texts

  • Author

    Wang, Bin ; Yu, Minghe ; Yang, Xiaochun ; Wang, Guoren

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
  • fYear
    2012
  • fDate
    16-18 Nov. 2012
  • Firstpage
    61
  • Lastpage
    66
  • Abstract
    With the exponential increment of data, compression technology becomes an important tool in the field of data management, especially in text management. An increasing pressing challenge is how to efficiently query these massive amounts of sequence data in their compressed format. In this paper we study the problem of answering subsequence-search queries on LZ78 format of texts. We propose the concept of conditional common sub strings of queries to improve query performance. We present a techniques to find minimal conditional common sub strings in compressed text and a local uncompressing technique to verify and locate positions of answers in text. Finally, the experimental results over real data demonstrate the efficiency of our algorithm.
  • Keywords
    data compression; query processing; LZ78 format; compressed text; compression technology; conditional common substring; data management; local uncompressing technique; multiple queries; subsequence-search queries; text management; Color; Greedy algorithms; Information science; Query processing; Silicon; Variable speed drives; common substring; multiple similar queries; string matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Information Systems and Applications Conference (WISA), 2012 Ninth
  • Conference_Location
    Haikou
  • Print_ISBN
    978-1-4673-3054-1
  • Type

    conf

  • DOI
    10.1109/WISA.2012.55
  • Filename
    6385184