Title :
Fast ranking in limited space
Author :
Moffet, A. ; Zobel, Justin
Author_Institution :
Dept. of Comput. Sci., Melbourne Univ., Parkville, Vic., Australia
Abstract :
Ranking techniques have long been suggested as alternatives to conventional Boolean methods for searching document collections. The cost of computing a ranking is, however, greater than the cost of performing a Boolean search, in terms of both memory space and processing time. The authors consider the resources required by the cosine method of ranking, and show that, with a careful application of indexing and selection techniques, both the space and the time required by ranking can be substantially reduced. The methods described in the paper have been used to build a retrieval system with which it is possible to process ranked queries of 40 terms in about 5% of the space required by previous implementations; in as little as 25% of the time; and without measurable degradation in retrieval effectiveness
Keywords :
computational complexity; database theory; query processing; Boolean methods; cosine method; document collections; fast ranking; indexing; memory space; processing time; ranking; retrieval system; Computer science; Costs; Databases; Degradation; Indexing; Query processing; Sun; Testing; Time measurement;
Conference_Titel :
Data Engineering, 1994. Proceedings.10th International Conference
Conference_Location :
Houston, TX
Print_ISBN :
0-8186-5402-3
DOI :
10.1109/ICDE.1994.283064