DocumentCode
451206
Title
Compressing Inverted Files in Scalable Information Systems by Binary Decision Diagram Encoding
Author
Lai, Chung-Hung ; Chen, Tien-Fu
Author_Institution
National Chung Cheng University
fYear
2001
fDate
10-16 Nov. 2001
Firstpage
36
Lastpage
36
Abstract
One of the key challenges of managing very large volumes of data in scalable information retrieval systems is providing fast access through keyword searches. The major data structure in an information retrieval system is the inverted file, which records the positions of each term in the documents. When the information set grows substantially, the numbers of terms and documents increase significantly, and so does the size of the inverted files. Approaches that reduce the size of the inverted file without sacrificing query efficiency are important to the success of scalable information systems. In this paper, we propose a compression approach using Binary Decision Diagram (BDD) encoding, so that the possible ordering correlations among large numbers of documents are extracted to minimize the posting representation. Another advantage of using BDDs is that BDD expressions can efficiently perform Boolean queries, which are very common in retrieval systems. Experimental results show that the compression ratios of the inverted files are improved significantly by the BDD scheme.
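As a rough illustration of the data structure the abstract refers to, the sketch below builds a small inverted file and answers Boolean queries over it. It is not the paper's BDD encoding: postings are plain sorted lists of document IDs, recorded at document level rather than per position, and the names `build_inverted_file`, `boolean_and`, `boolean_or`, and the sample `docs` are hypothetical, chosen only for this example.

```python
from collections import defaultdict

def build_inverted_file(docs):
    """Map each term to the sorted list of IDs of documents containing it."""
    postings = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}

def boolean_and(index, a, b):
    """Documents that contain both terms (set intersection of postings)."""
    return sorted(set(index.get(a, [])) & set(index.get(b, [])))

def boolean_or(index, a, b):
    """Documents that contain at least one of the terms (set union)."""
    return sorted(set(index.get(a, [])) | set(index.get(b, [])))

# Hypothetical sample collection; document IDs are the list positions.
docs = [
    "inverted file compression",
    "binary decision diagram",
    "inverted file boolean query",
]
index = build_inverted_file(docs)
```

Each term's posting list can be read as a Boolean function over document IDs, which is the kind of object a BDD represents compactly; the plain lists here serve only as an uncompressed baseline for comparison.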
Keywords
BDD; Information Retrieval; Inverted File; Scalable Information Systems; Binary decision diagrams; Boolean functions; Computer science; Data structures; Encoding; Information retrieval; Information systems; Keyword search; Management information systems; Permission;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, ACM/IEEE 2001 Conference
Print_ISBN
1-58113-293-X
Type
conf
DOI
10.1109/SC.2001.10019
Filename
1592812