DocumentCode :
3372911
Title :
Search and ranking algorithms for locating resources on the World Wide Web
Author :
Yuwono, Budi ; Lee, Dik L.
Author_Institution :
Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
fYear :
1996
fDate :
26 Feb-1 Mar 1996
Firstpage :
164
Lastpage :
171
Abstract :
Applying information retrieval techniques to the World Wide Web (WWW) environment is a challenge, mostly because of its hypertext/hypermedia nature and the richness of the meta-information it provides. We present four keyword-based search and ranking algorithms for locating relevant WWW pages with respect to user queries. The first algorithm, Boolean Spreading Activation, extends the notion of word occurrence in the Boolean retrieval model by propagating the occurrence of a query word in a page to other pages linked to it. The second algorithm, Most-cited, uses the number of citing hyperlinks between potentially relevant WWW pages to increase the relevance scores of the referenced pages over the referencing pages. The third algorithm, TFxIDF vector space model, is based on word distribution statistics. The last algorithm, Vector Spreading Activation, combines TFxIDF with the spreading activation model. We conducted an experiment to evaluate the retrieval effectiveness of these algorithms. From the results of the experiment, we draw conclusions regarding the nature of the WWW environment with respect to document ranking strategies.
Keywords :
Internet; file servers; hypermedia; indexing; information retrieval systems; query processing; statistics; Boolean Spreading Activation algorithm; Boolean retrieval model; Most-cited algorithm; TFxIDF vector space model algorithm; Vector Spreading Activation algorithm; WWW page location; World Wide Web; citing hyperlinks; document ranking strategies; hypermedia; hypertext; information retrieval techniques; keyword-based ranking algorithms; keyword-based search algorithms; meta-information; referenced pages; referencing pages; resource location; retrieval effectiveness; user queries; word distribution statistics; word occurrence; Indexes; Indexing; Information retrieval; Internet; Multimedia databases; Robots; Web pages; Web server; Web sites; World Wide Web;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Engineering, 1996. Proceedings of the Twelfth International Conference on
Conference_Location :
New Orleans, LA
ISSN :
1063-6382
Print_ISBN :
0-8186-7240-4
Type :
conf
DOI :
10.1109/ICDE.1996.492102
Filename :
492102
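Illustrative sketch :
The abstract describes the first two algorithms only in outline, so the following Python sketch is one possible reading and not the authors' implementation. The function names, the representation of pages as word sets, the treatment of links as undirected for spreading, and the weights 1.0 and 0.5 are all assumptions made for illustration.

```python
from collections import defaultdict


def most_cited(pages, links, query_words):
    """Rank candidate pages by citations received from other candidates.

    pages: dict mapping page id -> set of words on that page (assumed format)
    links: list of (source, target) hyperlink pairs (assumed format)
    """
    query = set(query_words)
    # Assumption: a page is "potentially relevant" if it contains
    # at least one query word.
    candidates = {p for p, words in pages.items() if words & query}
    scores = {p: 0 for p in candidates}
    for src, dst in links:
        if src in candidates and dst in candidates and src != dst:
            scores[dst] += 1  # the referenced page gains over the referencing page
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


def boolean_spreading_activation(pages, links, query_words,
                                 direct=1.0, propagated=0.5):
    """Score pages by query-word occurrence, spread across hyperlinks.

    Assumption: a query word found on a linked page counts with a smaller
    weight than one found on the page itself; both weights are hypothetical.
    """
    query = set(query_words)
    neighbours = defaultdict(set)
    for src, dst in links:
        # Assumption: occurrence spreads in both link directions.
        neighbours[src].add(dst)
        neighbours[dst].add(src)
    scores = {}
    for page, words in pages.items():
        nearby = set()
        for other in neighbours[page]:
            nearby |= pages.get(other, set())
        score = 0.0
        for word in query:
            if word in words:
                score += direct
            elif word in nearby:
                score += propagated
        scores[page] = score
    return scores


# Toy data, invented for this example only.
pages = {
    "a": {"web", "search"},
    "b": {"web"},
    "c": {"ranking"},
    "d": {"cats"},
}
links = [("a", "b"), ("c", "b"), ("d", "b"), ("b", "a")]
query = ["web", "ranking"]

cited = most_cited(pages, links, query)
spread = boolean_spreading_activation(pages, links, query)
print(cited)
print(spread)
```

On this toy data, page "b" ranks first under the Most-cited sketch because two other candidate pages link to it, while page "d" receives a nonzero spreading score only through its link to "b".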