• DocumentCode
    451206
  • Title

    Compressing Inverted Files in Scalable Information Systems by Binary Decision Diagram Encoding

  • Author

    Lai, Chung-Hung ; Chen, Tien-Fu

  • Author_Institution
    National Chung Cheng University
  • fYear
    2001
  • fDate
    10-16 Nov. 2001
  • Firstpage
    36
  • Lastpage
    36
  • Abstract
    One of the key challenges of managing very huge volumes of data in scalable Information retrieval systems is providing fast access through keyword searches. The major data structure in the information retrieval system is an inverted file, which records the positions of each term in the documents. When the information set substantially grows, the number of terms and documents are significantly increased as well as the size of the inverted files. Approaches to reduce the inverted file without sacri.cing the query efficiency are important to the success of scalable information systems. In this paper, we propose a compression approach by using Binary Decision Diagram Encoding (BDD) so that all possible ordering correlation among large amount of documents will be extracted to minimize the posting representation. Another advantage of using BDD is that BDD expressions can e.ciently perform Boolean queries, which are very common in retrieval systems. Experiment results show that the compression ratios of the inverted files have been improved signi.cantly by the BDD scheme.
  • Keywords
    BDD; Information Retrieval; Inverted File; Scalable Information Systems; Binary decision diagrams; Boolean functions; Computer science; Data structures; Encoding; Information retrieval; Information systems; Keyword search; Management information systems; Permission; BDD; Information Retrieval; Inverted File; Scalable Information Systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, ACM/IEEE 2001 Conference
  • Print_ISBN
    1-58113-293-X
  • Type

    conf

  • DOI
    10.1109/SC.2001.10019
  • Filename
    1592812