DocumentCode :
451206
Title :
Compressing Inverted Files in Scalable Information Systems by Binary Decision Diagram Encoding
Author :
Lai, Chung-Hung ; Chen, Tien-Fu
Author_Institution :
National Chung Cheng University
fYear :
2001
fDate :
10-16 Nov. 2001
Firstpage :
36
Lastpage :
36
Abstract :
One of the key challenges of managing very huge volumes of data in scalable Information retrieval systems is providing fast access through keyword searches. The major data structure in the information retrieval system is an inverted file, which records the positions of each term in the documents. When the information set substantially grows, the number of terms and documents are significantly increased as well as the size of the inverted files. Approaches to reduce the inverted file without sacri.cing the query efficiency are important to the success of scalable information systems. In this paper, we propose a compression approach by using Binary Decision Diagram Encoding (BDD) so that all possible ordering correlation among large amount of documents will be extracted to minimize the posting representation. Another advantage of using BDD is that BDD expressions can e.ciently perform Boolean queries, which are very common in retrieval systems. Experiment results show that the compression ratios of the inverted files have been improved signi.cantly by the BDD scheme.
Keywords :
BDD; Information Retrieval; Inverted File; Scalable Information Systems; Binary decision diagrams; Boolean functions; Computer science; Data structures; Encoding; Information retrieval; Information systems; Keyword search; Management information systems; Permission; BDD; Information Retrieval; Inverted File; Scalable Information Systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, ACM/IEEE 2001 Conference
Print_ISBN :
1-58113-293-X
Type :
conf
DOI :
10.1109/SC.2001.10019
Filename :
1592812
Link To Document :
بازگشت